Iwan Kawrakow
86237d0555
POC: per row scale
This is a POC how to work around opinionated ggml to
have scales per row rather than per block.
Only implemened for Zen4 and only for iq2_tn.
2024-09-25 13:10:33 +03:00
..
2024-08-12 15:14:32 +02:00
2024-09-14 20:02:32 +03:00
2024-08-12 15:14:32 +02:00
2024-09-25 13:10:33 +03:00
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00
2024-08-12 15:14:32 +02:00
2024-09-17 14:31:29 +03:00
2024-08-12 15:14:32 +02:00
2024-07-27 07:55:01 +02:00
2024-08-12 15:14:32 +02:00
2024-07-27 07:55:01 +02:00
2024-08-12 15:14:32 +02:00
2024-08-12 15:14:32 +02:00
2024-08-12 15:14:32 +02:00
2024-09-25 13:10:33 +03:00
2024-09-14 20:02:32 +03:00
2024-08-12 15:14:32 +02:00
2024-08-12 15:14:32 +02:00
2024-09-17 10:54:42 +03:00
2024-09-25 13:08:55 +03:00
2024-09-25 13:10:33 +03:00
2024-08-12 15:14:32 +02:00
2024-08-12 15:14:32 +02:00
2024-08-12 15:14:32 +02:00
2024-08-12 15:14:32 +02:00
2024-09-25 13:10:33 +03:00