mcm007
|
dbcbfdb0ef
|
Ik llama swap in container step by step guide (#1249)
* Create README.md
* Add container files and llama-swap configs
* Update main README.md
* Build without GGML_IQK_FA_ALL_QUANTS
Otherwise the build fails with CUDA_DOCKER_ARCH=default
* Mention GGML_IQK_FA_ALL_QUANTS usage
* Make first step more explicit
|
2026-02-07 18:30:19 +02:00 |
|