mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-03-13 23:40:09 +00:00
I always wanted to know whether transposing the model tensors might improve quantization. If, for whatever reason, weights in different rows but at the same position within a row were correlated, a transposed version of the tensor would quantize better. This commit tried it and, nope, no luck.
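The idea can be sketched as follows: block quantization groups consecutive weights along a row into blocks that share a scale, so quantizing the transposed tensor makes each block run down a column instead. This is a minimal illustrative sketch (not the repo's actual quantization code): round-to-nearest 4-bit quantization with a per-block absmax scale, comparing reconstruction error on a matrix and its transpose. The function name and block size are made up for the example.

```python
import numpy as np

def block_quantize_rmse(w, block=32, bits=4):
    """Round-to-nearest block quantization with a per-block absmax scale;
    returns the RMS reconstruction error. Blocks run along rows, so
    passing w.T quantizes blocks that run down the columns of w."""
    flat = w.reshape(-1, block)                      # each row is one quantization block
    scale = np.abs(flat).max(axis=1, keepdims=True) / (2**(bits - 1) - 1)
    scale[scale == 0] = 1.0                          # avoid division by zero for all-zero blocks
    q = np.clip(np.round(flat / scale), -2**(bits - 1), 2**(bits - 1) - 1)
    return float(np.sqrt(np.mean((q * scale - flat) ** 2)))

rng = np.random.default_rng(0)
w = rng.standard_normal((256, 256)).astype(np.float32)

err_rows = block_quantize_rmse(w)           # blocks along rows (the usual layout)
err_cols = block_quantize_rmse(w.T.copy())  # blocks along columns (transposed)
print(err_rows, err_cols)
```

For i.i.d. random weights the two errors come out essentially identical, which matches the commit's finding: without a real cross-row correlation, transposing buys nothing.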