ik_llama.cpp/ggml.c at e40aa5185e2c6d654790ced3c412bc855a81c3ea

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-28 02:11:50 +00:00

Files

Georgi Gerganov e40aa5185e ggml : adjust mul_mat_f16 work memory (#1226 )

* llama : minor - remove explicity int64_t cast

* ggml : reduce memory buffer for F16 mul_mat when not using cuBLAS

* ggml : add asserts to guard for incorrect wsize

2023-04-29 18:43:28 +03:00

408 KiB

Raw Blame History

View Raw

408 KiB Raw Blame History

408 KiB

Raw Blame History