turboderp
|
0b05686e76
|
Refactor, clean up and consolidate architecture logic
|
2024-03-06 02:46:47 +01:00 |
|
turboderp
|
dce84866e1
|
Support for StarCoder2, initial
|
2024-03-05 21:20:29 +01:00 |
|
turboderp
|
cedeb616ce
|
Support Qwen2
|
2024-02-15 20:50:24 +01:00 |
|
turboderp
|
0e9d9c1010
|
Prevent tensors passed to save_file from sharing memory
|
2024-02-01 10:14:36 +01:00 |
|
turboderp
|
8a0cb9e01d
|
Add last saved checkpoint to status box
|
2024-02-01 04:56:33 +01:00 |
|
turboderp
|
4c93ce852f
|
Fix remaining time estimate
|
2024-02-01 04:56:00 +01:00 |
|
turboderp
|
735807e800
|
Use os.replace to swap checkpoint states in measure.py as well
|
2024-02-01 04:39:34 +01:00 |
|
turboderp
|
1e70113de3
|
Don't print avg accuracy, clarify "completed" -> "measured"
|
2024-02-01 04:24:10 +01:00 |
|
Ben Gorlick
|
56a0d6d995
|
Adding graceful exit signal handling and status box for estimating time remaining in quantization process
|
2024-01-30 17:33:54 -08:00 |
|
turboderp
|
7a9d12ae4c
|
Add non-RMS layernorm, support for Orion
|
2024-01-22 17:21:01 +01:00 |
|
turboderp
|
48b3211d9c
|
Fix for #281
|
2024-01-17 06:38:52 +01:00 |
|
turboderp
|
6e214f59c7
|
Optimize conversion kernels
|
2024-01-08 03:40:40 +01:00 |
|
turboderp
|
41b15dd1c3
|
Refactor to consolidate attn params
|
2024-01-04 04:52:49 +01:00 |
|
turboderp
|
37a1322096
|
Fix mistake in MLP measure
|
2023-12-16 20:30:25 +01:00 |
|
turboderp
|
d2753a29b8
|
Mixtral EXL2 support, initial
|
2023-12-16 16:50:50 +01:00 |
|
turboderp
|
0d63d6479c
|
Rework quantization and optimization
|
2023-12-13 01:00:11 +01:00 |
|