ik_llama.cpp/github-data/issues/507 - Compatible gguf models _.md at ik/debug_issue_721 - ik_llama.cpp

ikawrakow/ik_llama.cpp

Fork 0

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-03-05 03:20:00 +00:00

Files

Thomas 0451f10a42 Add GitHub data: filename sanitization (#640 )

2025-07-23 13:31:53 +02:00

3.5 KiB

Raw Permalink Blame History

📝 #507 - Compatible gguf models ?

Author	`lbarasc`
State	❌ Closed
Created	2025-06-09
Updated	2025-06-14

Description

Hi,

I want to use some compatible 1bit gguf models, like microsoft bitnet 1b or falcon 1b with your software. Where can i found these models ? can you send me links to download ?

Thank you for your help.

💬 Conversation

👤 ikawrakow commented the 2025-06-09 at 12:23:07:

See #401

👤 lbarasc commented the 2025-06-09 at 16:47:49:

Here is my command under win10 64bits (with latest ik_lama with xeon e5 and rtx 3060 cuda :

the result :

D:\ik_lama>

I have no error but nothing at all ! Please help me.

👤 lbarasc commented the 2025-06-09 at 16:47:49:

Here is my command (with latest ik_lama with xeon e5 and rtx 3060 cuda :

D:\ik_lama>

I have no error but nothing at all ! Please help me.

👤 ikawrakow commented the 2025-06-09 at 16:53:40:

You need to convert the i2_s model to ik_llama.cpp quants as described in #401. You missed this step:

./build/bin/llama-quantize --allow-requantize ./models/ggml-model-i2_s.gguf ./models/bitnet.gguf iq2_bn_r4

Then your server command should use the newly created file, not the i2_s file.

👤 lbarasc commented the 2025-06-09 at 17:09:08:

I do this : D:\ik_lama>llama-quantize --allow-requantize ggml-model-i2_s.gguf bitnet.gguf iq2_bn_r4

the result is main: build = 1 (02272cd) main: built with MSVC 19.29.30159.0 for main: quantizing 'ggml-model-i2_s.gguf' to 'bitnet.gguf' as IQ2_BN_R4

but i cannot retrieve bitnet.gguf file ?

👤 saood06 commented the 2025-06-11 at 07:00:39:

Not sure why the requantize didn't work for you, but I have provided pre-converted models you can use here.

👤 ikawrakow commented the 2025-06-14 at 12:02:29:

Nothing more that we can do here.

3.5 KiB Raw Permalink Blame History

📝 #507 - Compatible gguf models ?

Description

💬 Conversation

3.5 KiB

Raw Permalink Blame History