ik_llama.cpp/llama.cpp at d8426bb13e5925deee7dbb0c9827b6389471f058

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-12 06:50:08 +00:00

Files

postmasters 9a92a78e47 llama : add gemma model (#5631 )

There are couple things in this architecture:

1. Shared input and output embedding parameters.
2. Key length and value length are not derived from `n_embd`.

More information about the models can be found at
https://ai.google.dev/gemma. GGUFs can be downloaded from
https://huggingface.co/google.

2024-02-21 15:08:22 +02:00

503 KiB

Raw Blame History

View Raw

503 KiB Raw Blame History

503 KiB

Raw Blame History