juuso-oskari
374536f19a
CK-UA: checkpoint FA4 pipeline + int64 Q/O base-offset fix
Working state before the pipeline cleanup/refactor:
* FA4 matrix-softmax warp-group overlap pipeline (UA_FA4_PIPELINE=1).
* Widen per-CTA query/output base offsets to long_index_t so that a large
  total_q (big-batch prefill) cannot overflow int32 and fault on the
  output store. The existing cache_ptr_int32_overflow_possible check
  only covers K/V.
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-03 08:47:43 +00:00
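
The base-offset widening described in the commit message can be illustrated with a minimal host-side sketch. It is not the kernel code from this commit: the [total_q, num_heads, head_dim] row-major layout, the function names q_base_offset and fits_in_int32, and the head counts used below are assumptions for illustration only. long_index_t is modeled here as a plain 64-bit alias.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Assumption: stand-ins for the 32-bit and 64-bit index types named in the commit.
using index_t      = std::int32_t;
using long_index_t = std::int64_t;

// Hypothetical helper: element offset of the first query row handled by a CTA,
// assuming Q/O are laid out as [total_q, num_heads, head_dim] in row-major order.
long_index_t q_base_offset(index_t q_start, index_t num_heads, index_t head_dim)
{
    assert(q_start >= 0 && num_heads > 0 && head_dim > 0);
    // Widen the first operand before multiplying so that the whole product
    // is evaluated in 64 bits instead of wrapping in 32 bits.
    return static_cast<long_index_t>(q_start) * num_heads * head_dim;
}

// Hypothetical helper: true if the offset could still be held in a 32-bit index.
bool fits_in_int32(long_index_t offset)
{
    return offset >= std::numeric_limits<index_t>::min() &&
           offset <= std::numeric_limits<index_t>::max();
}
```

With the assumed 64 heads and head dimension 128, each query row spans 8192 elements, so q_base_offset(1000, 64, 128) is 8,192,000 and still fits in int32, while q_base_offset(300000, 64, 128) is 2,457,600,000 and does not. Under these assumed shapes the 32-bit limit is crossed at 262,144 query tokens, which is the kind of big-batch prefill size the commit message refers to.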