ik_llama.cpp/ggml
Iwan Kawrakow 5ccd33ea04 Bitnet: use the standard llm_build_kv to build self attention
My main motivation was to enable flash attention (FA). But FA does not work anyway
because head size is 100 for the Bitnet ternary models
(and I had forgotten this little detail).
2024-10-24 16:29:26 +03:00