Files
ik_llama.cpp/ggml
Kawrakow 2fddc45a02 Vulkan: flash attention for DeepSeek models (#584)
* vulkan: support mixed/deepseekR1 FA head sizes (#14509)

* vulkan: better parameterize FA by head sizes

* vulkan: support mixed/deepseekR1 FA head sizes

* Fix the FA cherry-pick

---------

Co-authored-by: Jeff Bolz <jbolz@nvidia.com>
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2025-07-05 15:14:12 +02:00
..
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00