mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-01-26 17:20:01 +00:00
* Adding Ling/Ring (a.k.a., Bailing-MoE2)
* Add expert group selection (not working, so turned off)
* BailingMoE2 conversion
* WIP
* Bits and pieces
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
4 lines
98 B
Plaintext
#include "common.cuh"

void ggml_cuda_argmax(ggml_backend_cuda_context & ctx, ggml_tensor * dst);
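The header only declares the entry point, so the behaviour of the kernel is not visible here. As a rough illustration, a row-wise argmax over a contiguous float matrix could be sketched on the host as below. This is a minimal CPU reference under stated assumptions, not the CUDA implementation: `argmax_rows_ref` is a hypothetical helper that does not exist in the repository, the input is assumed to be row-major `float` data with one `int32_t` index produced per row, and the choice to return the first maximum on ties is an assumption that the GPU kernel may or may not share.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of ik_llama.cpp): CPU reference for a row-wise argmax.
// `src` holds nrows * ncols contiguous floats in row-major order.
// Returns one index per row; an empty vector if ncols is not positive.
std::vector<int32_t> argmax_rows_ref(const std::vector<float> & src, int64_t ncols) {
    std::vector<int32_t> out;
    if (ncols <= 0) {
        return out;
    }
    const int64_t nrows = (int64_t) src.size() / ncols;
    out.reserve(nrows);
    for (int64_t r = 0; r < nrows; ++r) {
        const float * row = src.data() + r * ncols;
        int32_t best = 0;
        for (int64_t c = 1; c < ncols; ++c) {
            // Strict comparison: on ties the earliest index is kept (an assumption).
            if (row[c] > row[best]) {
                best = (int32_t) c;
            }
        }
        out.push_back(best);
    }
    return out;
}
```

A reference like this can be useful for checking a GPU result on small inputs, provided the tie-breaking rule is compared against what the actual kernel does.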