ikawrakow/ik_llama.cpp
mirror of https://github.com/ikawrakow/ik_llama.cpp.git, synced 2026-02-07 15:00:11 +00:00
Add GitHub data: filename sanitization (#640)
Thomas, 2025-07-23 13:31:53 +02:00, committed by GitHub
parent 3600d82e98
commit eaa2510a28
626 changed files with 0 additions and 0 deletions
github-data/discussions/100-New argument _ env variable for GGML_SCHED_MAX_COPIES_.md → github-data/discussions/100 - New argument _ env variable for GGML_SCHED_MAX_COPIES_.md
github-data/discussions/104-Convenience improvements for llama-quantize.md → github-data/discussions/104 - Convenience improvements for llama-quantize.md
github-data/discussions/140-Questions about weight[j].md → github-data/discussions/140 - Questions about weight_j_.md
github-data/discussions/15-Will LQER improve k- and i-quants_.md → github-data/discussions/15 - Will LQER improve k- and i-quants_.md
github-data/discussions/164-Latest CPU performance comparison with llama.cpp.md → github-data/discussions/164 - Latest CPU performance comparison with llama.cpp.md
github-data/discussions/165-Norm RMS Epsilon.md → github-data/discussions/165 - Norm RMS Epsilon.md
github-data/discussions/166-Learning more LLM quantization.md → github-data/discussions/166 - Learning more LLM quantization.md
github-data/discussions/18-CPU beating GPU in token generation speed.md → github-data/discussions/18 - CPU beating GPU in token generation speed.md
github-data/discussions/201-What is the NUMA situation _.md → github-data/discussions/201 - What is the NUMA situation _.md
github-data/discussions/211-help me create an importance matrix primer.md → github-data/discussions/211 - help me create an importance matrix primer.md
github-data/discussions/223-Recent performance testing with DeepSeek R1.md → github-data/discussions/223 - Recent performance testing with DeepSeek R1.md
github-data/discussions/242-Switching from llama.cpp_ktransformers, seeking advice_guidance.md → github-data/discussions/242 - Switching from llama.cpp_ktransformers_ seeking advice_guidance.md
github-data/discussions/25-CPU prompt processing speed for large contexts.md → github-data/discussions/25 - CPU prompt processing speed for large contexts.md
github-data/discussions/256-Diverging from llama.cpp.md → github-data/discussions/256 - Diverging from llama.cpp.md
github-data/discussions/258-Quick-start Guide coming over from llama.cpp and ktransformers!.md → github-data/discussions/258 - Quick-start Guide coming over from llama.cpp and ktransformers_.md
github-data/discussions/266-Benchmarking DeepSeek R1 - 16x3090.md → github-data/discussions/266 - Benchmarking DeepSeek R1 - 16x3090.md
github-data/discussions/286-Testing `deepseek-ai_DeepSeek-V3-0324` model support..md → github-data/discussions/286 - Testing _deepseek-ai_DeepSeek-V3-0324_ model support..md
github-data/discussions/288-On @compilade's PR 12557 and @jukofyork's quantization ideas.md → github-data/discussions/288 - On _compilade_s PR 12557 and _jukofyork_s quantization ideas.md
github-data/discussions/316-Mainline is now copying stuff from ik_llama.cpp.md → github-data/discussions/316 - Mainline is now copying stuff from ik_llama.cpp.md
github-data/discussions/319-KTransformers copying ik_llama.cpp.md → github-data/discussions/319 - KTransformers copying ik_llama.cpp.md
github-data/discussions/323-Is there an easy way to repack an existing GGUF so it could be used without --run-time-repack (thus enabling mmap).md → github-data/discussions/323 - Is there an easy way to repack an existing GGUF so it could be used wit.md
github-data/discussions/334-`iq4_ks` performs great on gemma-3-27b-it-qat-q4_0-unquantized.md → github-data/discussions/334 - _iq4_ks_ performs great on gemma-3-27b-it-qat-q4_0-unquantized.md
github-data/discussions/350-Maverick slow prompt with gpu.md → github-data/discussions/350 - Maverick slow prompt with gpu.md
github-data/discussions/354-Not all MLAs are born equal.md → github-data/discussions/354 - Not all MLAs are born equal.md
github-data/discussions/357-Qwen3 - early performance comparisons.md → github-data/discussions/357 - Qwen3 - early performance comparisons.md
github-data/discussions/359-Qwen3 quantization experiments.md → github-data/discussions/359 - Qwen3 quantization experiments.md
github-data/discussions/372-multy gpu.md → github-data/discussions/372 - multy gpu.md
github-data/discussions/384-ik_llama.cpp issues on an old workstation.md → github-data/discussions/384 - ik_llama.cpp issues on an old workstation.md
github-data/discussions/385-Qwen3 235B performance on Intel Xeon Scalable processor.md → github-data/discussions/385 - Qwen3 235B performance on Intel Xeon Scalable processor.md
github-data/discussions/393-Creating quantized models.md → github-data/discussions/393 - Creating quantized models.md
github-data/discussions/395-Why does imatrix not tokenize special tokens_.md → github-data/discussions/395 - Why does imatrix not tokenize special tokens_.md
github-data/discussions/396-Best settings for Maverick - Dual CPU Xeon 8480+ - RTX 3090.md → github-data/discussions/396 - Best settings for Maverick - Dual CPU Xeon 8480_ - RTX 3090.md
github-data/discussions/397-KV split while using `-sm row`.md → github-data/discussions/397 - KV split while using _-sm row_.md
github-data/discussions/399-Qwen 30b.A3b IK_LCPP comparisons on lowspec machine.md → github-data/discussions/399 - Qwen 30b.A3b IK_LCPP comparisons on lowspec machine.md
github-data/discussions/401-install bitnet (or other cpu models) on a fresh termux aarch64.md → github-data/discussions/401 - install bitnet _or other cpu models_ on a fresh termux aarch64.md
github-data/discussions/403-Tool Calling and Structured Response (Json Mode) support.md → github-data/discussions/403 - Tool Calling and Structured Response _Json Mode_ support.md
github-data/discussions/434-Quant Cookers Basic Guide.md → github-data/discussions/434 - Quant Cookers Basic Guide.md
github-data/discussions/451-Context reuse _ context shift for long prompts.md → github-data/discussions/451 - Context reuse _ context shift for long prompts.md
github-data/discussions/459-qwen3 metrics on ancient hardware (2x xeon Vs 2x P100).md → github-data/discussions/459 - qwen3 metrics on ancient hardware _2x xeon Vs 2x P100_.md
github-data/discussions/466-A curiosity..md → github-data/discussions/466 - A curiosity..md
github-data/discussions/477-DeepSeek-R1-0528 ik quants!.md → github-data/discussions/477 - DeepSeek-R1-0528 ik quants_.md
github-data/discussions/491--rtr actually hurts prompt t_s for large ubatch_.md → github-data/discussions/491 - -rtr actually hurts prompt t_s for large ubatch_.md
github-data/discussions/519-Android Build.md → github-data/discussions/519 - Android Build.md
github-data/discussions/526-Partial requant feature to save compute and time during tests..md → github-data/discussions/526 - Partial requant feature to save compute and time during tests..md
github-data/discussions/532-Guidance on GPU Layer Offloading Strategy in ik_llama.cpp for Multi GPU Rig (2x5090 + 2x4090).md → github-data/discussions/532 - Guidance on GPU Layer Offloading Strategy in ik_llama.cpp for Multi GPU.md
github-data/discussions/543-dots.llm1 support and thanks.md → github-data/discussions/543 - dots.llm1 support and thanks.md
github-data/discussions/545-Vulkan support_.md → github-data/discussions/545 - Vulkan support_.md
github-data/discussions/548-Poor performance with bf16 model on Qwen3 30B-A3B.md → github-data/discussions/548 - Poor performance with bf16 model on Qwen3 30B-A3B.md
github-data/discussions/556-ik_llama.cpp for Armv8.0.md → github-data/discussions/556 - ik_llama.cpp for Armv8.0.md
github-data/discussions/562-AMD GPU Vulkan & ROCm_HIP Discussion.md → github-data/discussions/562 - AMD GPU Vulkan _ ROCm_HIP Discussion.md
github-data/discussions/564-Maybe an interesting CUDA PR here..md → github-data/discussions/564 - Maybe an interesting CUDA PR here..md
github-data/discussions/586-Slow KV cache rm operation.md → github-data/discussions/586 - Slow KV cache rm operation.md
github-data/discussions/590-How important is Vulkan back-end development_.md → github-data/discussions/590 - How important is Vulkan back-end development_.md
github-data/discussions/591-I dont see any speed improvement in generation, so want to understand if i am missing something.md → github-data/discussions/591 - I dont see any speed improvement in generation_ so want to understand i.md
github-data/discussions/594-Is AVX2 a hard requirement on x64_.md → github-data/discussions/594 - Is AVX2 a hard requirement on x64_.md
github-data/discussions/599-mla matrix absorbtion.md → github-data/discussions/599 - mla matrix absorbtion.md
github-data/discussions/613-Pathological Quant_CUDA combinations -- How to know what works_.md → github-data/discussions/613 - Pathological Quant_CUDA combinations -- How to know what works_.md
github-data/discussions/619-gpu p2p utilization.md → github-data/discussions/619 - gpu p2p utilization.md
github-data/discussions/621-Deepseek v3_r1 poisoned prompt_.md → github-data/discussions/621 - Deepseek v3_r1 poisoned prompt_.md
github-data/discussions/623-Quantizing panels_bundles instead of blocks_.md → github-data/discussions/623 - Quantizing panels_bundles instead of blocks_.md
github-data/discussions/63-LLaMA-3.2 quantization evaluation.md → github-data/discussions/63 - LLaMA-3.2 quantization evaluation.md
github-data/discussions/8-New quantization types IQ2_K, IQ3_K, IQ4_K, IQ5_K.md → github-data/discussions/8 - New quantization types IQ2_K_ IQ3_K_ IQ4_K_ IQ5_K.md
github-data/discussions/82-4bpw GGML TYPE_.md → github-data/discussions/82 - 4bpw GGML TYPE_.md
github-data/discussions/95-Bitnet.md → github-data/discussions/95 - Bitnet.md
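
The renames listed above appear to follow one rule: the leading number is separated from the title with " - ", characters such as `[`, `]`, `,`, `!`, `@`, `'`, `(`, `)`, `+`, `&` and backticks become `_`, and long names are cut short. The sketch below reproduces that pattern in Python. It is inferred only from the rename list, not taken from the commit's actual script: the function name `sanitize_filename`, the allowed-character set, and the 80-character cap on the whole file name are all assumptions (the three truncated names shown are each 80 characters long, which a 71-character cap on the title alone would explain equally well).

```python
import re

# Assumption: cap inferred from the three truncated names in the rename list.
MAX_NAME_LEN = 80
EXT = ".md"

def sanitize_filename(old_name: str, max_len: int = MAX_NAME_LEN) -> str:
    """Hypothetical helper: map 'NNN-Title.md' to 'NNN - Sanitized title.md'."""
    if not old_name.endswith(EXT):
        raise ValueError(f"expected a {EXT} file, got {old_name!r}")
    stem = old_name[: -len(EXT)]
    # Split only on the first hyphen so titles that start with '-' survive.
    number, title = stem.split("-", 1)
    # Assumption: keep letters, digits, space, '.', '_' and '-'; replace the rest.
    title = re.sub(r"[^A-Za-z0-9 ._-]", "_", title)
    prefix = f"{number} - "
    room = max_len - len(prefix) - len(EXT)
    return prefix + title[:room] + EXT

if __name__ == "__main__":
    print(sanitize_filename("140-Questions about weight[j].md"))
```

Applied to the old names in the list, this sketch yields the new names shown for the cases checked (for example discussions 140, 323, 396 and 491); names whose numbers are not three digits long contain no truncated examples, so the cap is untested there.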