ik_llama.cpp/ggml-cuda/common.cuh at 40b7feb8e2b421cdd60db0f89bc2b6c8f7210b52

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-04-28 18:32:04 +00:00

Files

Johannes Gäßler b02a859891 CUDA: generalize FP16 fattn vec kernel (#7061 )

* CUDA: generalize FP16 fattn vec kernel

* disable unsupported head sizes for AMD in test

* try AMD fix

* fix batch size 2-8

* partially revert changes

2024-05-09 14:32:02 +02:00

23 KiB

Raw Blame History

View Raw

23 KiB Raw Blame History

23 KiB

Raw Blame History