ik_llama.cpp/ggml-cuda/convert.cu at bd1871fa2b1bf8a081b43ba9bc85f8ffd46fac46

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-28 10:21:48 +00:00

Files

DAN™ e00b4a8f81 Fix more int overflow during quant (PPL/CUDA). (#6563 )

* Fix more int overflow during quant.

* Fix some more int overflow in softmax.

* Revert back to int64_t.

2024-04-29 00:38:44 +02:00

View Raw