turboderp-org/exllamav2
mirror of https://github.com/turboderp-org/exllamav2.git synced 2026-03-15 00:07:26 +00:00
008a0bb7774d9eef428d14f58a74c1eb0b73bd10
exllamav2/tests
turboderp 23fc4737ae Fast safetensors mode with direct IO and pinned buffer
2024-01-18 20:11:53 +01:00
test_alloc.py          34B testing                                                          2023-09-10 06:15:33 +02:00
test_autosplit.py      Fix unhandled OoM condition when loading GPTQ model with auto split  2023-10-28 20:08:39 +02:00
test_batch_latency.py  Batch latency test script                                            2023-12-23 22:04:40 +01:00
test_fasttensors.py    Fast safetensors mode with direct IO and pinned buffer               2024-01-18 20:11:53 +01:00
test_gemv.py           Instrumentation etc.                                                 2023-12-10 17:36:40 +01:00
test_hgemm.py          Tests for half GEMM kernels                                          2023-11-25 11:54:53 +01:00
test_humaneval.py      Add script to compare quantized and unquantized model                2023-12-23 02:57:13 +01:00
test_mmlu.py           Add script to compare quantized and unquantized model                2023-12-23 02:57:13 +01:00
test_tokenizer.py      Fix some tokenization edge cases                                     2023-12-03 22:03:23 +01:00
test.py                Optimizer batched sampling                                           2023-12-23 22:04:10 +01:00