🐛 #35 - Fix Zen4 Flash Attention

Author ikawrakow
State Closed
Created 2024-09-02
Updated 2024-09-02

Description

Closes #34

Funnily enough, the bug was not in the FA implementation itself but in the way I was calling `iqk_flash_attn_noalibi` from ggml.