Mirror of https://github.com/ikawrakow/ik_llama.cpp.git (synced 2026-04-21 23:19:22 +00:00)
qwen3next: add absolute sanity guards to fused regression
@@ -104,3 +104,5 @@ Relative (`ik` vs mainline):
- Also integrated into the broader eval harness:
  - `scripts/qwen3next-eval.sh --with-gpu --with-fused-regression ...`
  - Results are surfaced in `SUMMARY.md` under `IK Fused Delta Regression`.
- Fused regression now enforces absolute non-fused sanity too:
  - mode0 decode/prefill PPL must stay below configurable thresholds (defaults: `10.0` / `10.0`).
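
A relative comparison alone can pass when both sides are equally wrong, which is presumably why an absolute bound on the non-fused (mode0) perplexity is useful. The following is a minimal sketch of such a guard, not the script's actual implementation: the function name, constant names, and the non-finite check are assumptions made for illustration; only the `10.0` / `10.0` defaults and the "must stay below" rule come from the text above.

```python
import math

# Hypothetical names; the values mirror the documented defaults (10.0 / 10.0).
DEFAULT_DECODE_PPL_MAX = 10.0
DEFAULT_PREFILL_PPL_MAX = 10.0


def check_mode0_sanity(decode_ppl, prefill_ppl,
                       decode_max=DEFAULT_DECODE_PPL_MAX,
                       prefill_max=DEFAULT_PREFILL_PPL_MAX):
    """Return a list of failure messages; an empty list means the guard passed."""
    failures = []
    checks = (
        ("decode", decode_ppl, decode_max),
        ("prefill", prefill_ppl, prefill_max),
    )
    for name, value, limit in checks:
        if not math.isfinite(value):
            # Assumption: NaN/inf perplexity is treated as a failure.
            failures.append(f"mode0 {name} PPL is not finite: {value}")
        elif value >= limit:
            # "Must stay below" the threshold, so equality also fails.
            failures.append(f"mode0 {name} PPL {value} is not below {limit}")
    return failures


if __name__ == "__main__":
    # Illustrative values only, not measured results.
    for message in check_mode0_sanity(decode_ppl=12.5, prefill_ppl=6.0):
        print("FAIL:", message)
```

Returning the failures as a list, rather than raising on the first one, lets a harness report both the decode and prefill violations in a single run.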