Files
ik_llama.cpp/ggml
Iwan Kawrakow 3c974f5076 Make q8_0_r4 work with tensor row sizes that are not a multiple of 128
They still need to be divisible by 32.
2025-01-28 17:22:49 +02:00
..
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00
2024-10-04 14:43:26 +03:00