Commit Graph

11 Commits

Author SHA1 Message Date
turboderp
0b05686e76 Refactor, clean up and consolidate architecture logic 2024-03-06 02:46:47 +01:00
turboderp
dce84866e1 Support for StarCoder2, initial 2024-03-05 21:20:29 +01:00
turboderp
d36077cf92 Fix converter 2023-12-28 10:11:45 +01:00
turboderp
d2753a29b8 Mixtral EXL2 support, initial 2023-12-16 16:50:50 +01:00
turboderp
39fd07083a Add error norm 2023-12-13 02:20:25 +01:00
turboderp
0d63d6479c Rework quantization and optimization 2023-12-13 01:00:11 +01:00
turboderp
2e91239571 New quant optimization procedure 2023-12-08 20:19:57 +01:00
turboderp
c7d1bc7ef0 TODO items 2023-10-11 23:44:04 +02:00
turboderp
a4f2663e31 Fix edge case in optimize.py 2023-09-18 20:06:39 +02:00
turboderp
47418c0c78 Allow optimizer to include larger matrices when minimizing the max error results in a bitrate < target 2023-09-18 18:24:30 +02:00
turboderp
bb83469574 Initial commit 2023-08-30 11:05:23 +02:00