Commit Graph

13 Commits

Author SHA1 Message Date
turboderp
0b05686e76 Refactor, clean up and consolidate architecture logic 2024-03-06 02:46:47 +01:00
turboderp
dce84866e1 Support for StarCoder2, initial 2024-03-05 21:20:29 +01:00
turboderp
2044f8a31c Set inference_mode when compiling model 2024-02-22 10:48:44 +01:00
turboderp
0e9d9c1010 Prevent tensors passed to save_file from sharing memory 2024-02-01 10:14:36 +01:00
turboderp
2707e28165 Skip .bin files when compiling full model 2024-01-22 17:34:24 +01:00
turboderp
7a9d12ae4c Add non-RMS layernorm, support for Orion 2024-01-22 17:21:01 +01:00
turboderp
1f71d17b89 Use .union() for Python 3.8 compatibility 2024-01-20 06:22:14 +01:00
turboderp
d2753a29b8 Mixtral EXL2 support, initial 2023-12-16 16:50:50 +01:00
turboderp
2b0da96de7 Fix edge case if last layer doesn't fit in last shard 2023-09-23 21:23:23 +02:00
turboderp
2a3ff14af2 Remove repeated console output 2023-09-20 09:54:43 +02:00
turboderp
6fd006b9d0 More options for converter to facilitate scripting 2023-09-18 18:25:30 +02:00
turboderp
af1398ff16 Conversion: ability to save sharded models (addresses OoM when compiling output file) 2023-09-16 11:44:07 +02:00
turboderp
bb83469574 Initial commit 2023-08-30 11:05:23 +02:00