ik_llama.cpp/llama.cpp at 51e9d02599336e62948d29f1d6c05addeb921ac2

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-21 15:09:40 +00:00

fairydreaming 27b040691c llama : use n_embd_head_v when reshaping kqv (#7327)

* llama : use n_embd_head_v instead of n_embd_head_k when reshaping kqv

* llama : use n_embd_v_gqa and n_embd_head_v instead of n_embd_k_gqa and n_embd_head_k when making a view of cached value vectors.

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

2024-05-17 14:24:38 +03:00

728 KiB
