21 Commits

Author SHA1 Message Date
Shane Zarechian
4ad39e0ca6 Fix extra parenthesis typo (#559) 2024-07-22 14:18:04 +02:00
Kevin Yin
69cd2b1df8 Remove stray ) in code example (#549) 2024-07-12 15:29:08 +02:00
turboderp
675450d845 Add Q6 and Q8 cache options to eval scripts 2024-06-09 02:13:06 +02:00
turboderp
6030517a6f Option to resume conversion job with no other args 2024-06-08 22:15:41 +02:00
turboderp
de05ac696b Add more sh tags 2024-06-08 20:41:34 +02:00
turboderp
5ca51dd5d8 Dynamic generator writeup 2024-06-08 20:26:52 +02:00
turboderp
e4ef7cfef2 Docs for eval scripts 2024-06-08 15:48:20 +02:00
rohitanshu
b5f110d215 Fixed minor typo in convert.md doc (#463)
changed '64 GB or RAM' to '64 GB of RAM'
2024-05-26 23:26:17 +02:00
turboderp
e6f230bf06 Update README.md 2024-05-25 22:50:36 +02:00
turboderp
f99f7894a7 typo 2024-04-12 23:41:33 +02:00
turboderp
324404ebe4 Q4 cache: Add groupwise Hadamard transform 2024-04-12 20:06:25 +02:00
turboderp
46b7e6ea47 Fix layout 2024-03-09 06:01:44 +01:00
turboderp
f7c89f4c51 Add Q4 test results (draft) 2024-03-09 05:59:26 +01:00
turboderp
89587d13df Update convert.py instructions 2023-12-16 22:03:25 +01:00
turboderp
02e2cb4d4a Update convert.py instructions 2023-12-16 21:51:35 +01:00
turboderp
09b981fa57 Add RoPE arguments to quantizer script 2023-11-21 05:13:37 +01:00
turboderp
95207ec848 Fix typos 2023-09-19 11:09:59 +02:00
turboderp
0494261f3e Update README.md, some documentation for convert.py 2023-09-18 19:49:33 +02:00
turboderp
ed878a8d94 Update screenshots 2023-09-12 12:11:27 +02:00
turboderp
c240eb0b70 Update README.md 2023-09-12 06:41:36 +02:00
turboderp
ddaf503e98 Add README.md 2023-09-10 14:16:53 +02:00