ik_llama.cpp/ggml
Kawrakow 02ae22388f Apply offset to KQ_max in CUDA flash attention (#1196)
* Apply offset to KQ_max in CUDA flash attention

* Forgot to add to fattn-common.h
2026-01-29 07:27:53 +02:00