Kawrakow
9e1d14f9c3
WIP GLM4.5 - this works
...
PP is already better than split mode layer, but TG for zero context
is kind of low - 60 vs 92 t/s. TG becomes better than split mode layer
at around 20k tokens. PP at 26k tokens is 1.55X of sm layer.
2025-11-30 18:05:13 +00:00
..
2025-10-11 16:01:13 +03:00
2025-11-24 06:55:14 +01:00
2025-11-24 06:55:14 +01:00
2025-11-30 18:05:13 +00:00
2025-11-30 18:05:13 +00:00
2025-11-30 18:05:12 +00:00
2025-11-14 06:58:19 +02:00
2025-11-30 18:45:38 +01:00
2025-11-30 18:45:38 +01:00
2025-11-29 07:27:15 +01:00
2025-11-29 07:27:15 +01:00
2025-11-30 18:05:12 +00:00
2025-11-30 18:05:13 +00:00
2025-08-15 09:18:07 +03:00
2025-08-15 09:18:07 +03:00
2025-11-07 07:11:23 +02:00
2025-10-30 10:49:48 +02:00
2025-11-16 12:12:41 +02:00
2025-11-30 18:05:13 +00:00
2025-11-20 11:50:09 +01:00
2025-11-30 18:45:38 +01:00
2025-06-19 10:24:53 +03:00
2025-11-29 07:27:15 +01:00
2025-11-29 07:27:15 +01:00
2025-11-30 18:05:13 +00:00
2024-07-27 07:55:01 +02:00
2024-07-27 07:55:01 +02:00
2025-08-15 09:18:07 +03:00
2025-08-15 09:18:07 +03:00