mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-04-28 18:32:04 +00:00
Broadcast src0 into src1 across dimensions 2 and 3 when needed. This is required for models that use GQA.
697 KiB
697 KiB