diff --git a/README.md b/README.md
index ea98f291..5fc4491d 100644
--- a/README.md
+++ b/README.md
@@ -8,6 +8,8 @@ This repository is a fork of [llama.cpp](https://github.com/ggerganov/llama.cpp)
 
 >[!IMPORTANT]
 >Do not use quantized models from Unsloth that have `_XL` in their name. These are likely to not work with `ik_llama.cpp`.
+>
+>The above has caused some stir, so to clarify: the Unsloth `_XL` models that are likely to not work are those that contain `f16` tensors (which is never a good idea in the first place). All others are fine.
 
 ## Quickstart
 