Files
ik_llama.cpp/github-data/pull_requests/111-Use fused mul - unary op also for MoE models.md
2025-07-22 18:18:40 +02:00

335 B

🔀 #111 - Use fused mul - unary op also for MoE models

Author ikawrakow
State Closed
Created 2024-10-26
Updated 2024-10-26

Description

This gives us a ~1% speedup for MoE models on CUDA and Metal.