mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-04-24 16:39:45 +00:00
Broadcast src0 into src1 across dimensions 2 and 3 when needed. This is required for models that use GQA.
68 KiB
68 KiB