ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-06 12:00:29 +00:00

Files

Kawrakow 02ae22388f Apply offfset to KQ_max in CUDA flash attention (#1196 )

* Apply offfset to KQ_max in CUDA flash attention

* Forgot to add to fattn-common.h

2026-01-29 07:27:53 +02:00

2024-07-27 07:55:01 +02:00

2026-01-22 13:20:23 +02:00

2026-01-29 07:27:53 +02:00

.gitignore

2024-07-27 07:55:01 +02:00

CMakeLists.txt

2026-01-22 13:20:23 +02:00