turboderp
724060b058
Dependencies: Update exllamav3
2026-03-13 23:14:09 +01:00
turboderp
761e26a137
Dependencies: Update exllamav3
2026-03-05 18:09:34 +01:00
turboderp
41511f56c6
Dependencies: Update exllamav3
2026-02-09 22:54:29 +01:00
turboderp
8a824cb127
Dependencies: Update exllamav3
2026-01-20 18:52:44 +01:00
kingbri
84bb1ce9fd
Dependencies: Fix FA2 wheels
...
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-12-19 16:52:05 -05:00
kingbri
5627f4d69e
Dependencies: Update to torch 2.9
...
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-12-19 15:59:40 -05:00
turboderp
f04fc6eb25
Dependencies: Update exllamav3
2025-12-16 12:58:31 +01:00
turboderp
8b6b793bfc
Dependencies: Update exllamav3
2025-11-25 21:17:31 +01:00
turboderp
f50015af5e
Dependencies: Update exllamav3
2025-11-23 23:27:26 +01:00
turboderp
368e87eb7d
Fix exllamav3 URL
2025-11-03 12:35:13 +01:00
turboderp
c6bf59063d
Dependencies: Update exllamav3
2025-11-02 23:45:34 +01:00
turboderp
996bc8dbe1
Dependencies: Update exllamav3
2025-10-17 23:41:44 +02:00
turboderp
2539acf800
Dependencies: Update exllamav3
2025-10-15 16:01:57 +02:00
turboderp
ec50ad17ea
Merge branch 'main_seq'
2025-10-14 02:58:00 +02:00
turboderp
f73e88e9e9
Dependencies: update exllamav3
2025-10-14 00:58:14 +02:00
turboderp
01a5915a7b
Dependencies: Pin Pydantic to version 2.11.0
...
For now. There appear to be breaking changes in 2.12.0 that affect both Formatron and FastAPI.
2025-10-08 20:43:26 +02:00
kingbri
7a0dddcbd9
Dependencies: Update exllamav3
...
v0.0.7
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-09-30 17:34:02 -04:00
kingbri
30a3cd75cf
Start: Migrate options from cu121/118 to cu12
...
This encapsulates more cuda versions and makes install easier for
new users.
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-08-19 22:56:58 -04:00
kingbri
f2a39e3a61
Dependencies: Update exllama, torch, and flash attention
...
Torch: 2.8
ExllamaV2: v0.3.2 torch 2.8
ExllamaV3: v0.0.6 torch 2.8
FA: v2.8.3
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-08-17 21:19:23 -04:00
kingbri
ab04a6ed60
Dependencies: Bump ExllamaV3
...
v0.0.5
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-07-18 22:56:35 -04:00
kingbri
bf936f5c39
Dependencies: Update exllamav2
...
v0.3.2
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-07-13 23:33:12 -04:00
turboderp
d357f100d0
Dependencies: Bump ExllamaV3
2025-06-15 19:12:45 +02:00
turboderp
691a080ac7
Dependencies: Bump ExllamaV3 and ExllamaV2
2025-05-31 23:55:04 +02:00
kingbri
fa534fe551
Dependencies: Update Ruff
...
v0.11.10
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-05-17 00:46:25 -04:00
kingbri
c9dc0b2aa4
Dependencies: Bump ExllamaV3 and ExllamaV2
...
v0.0.2 and v0.3.0 respectively
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-05-12 15:29:31 -04:00
kingbri
33ac016023
Dependencies: Add ExllamaV3
...
v0.0.1
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-05-09 21:42:07 -04:00
kingbri
2b3ed3fc79
Dependencies: Switch back to official exl2 wheels
...
These wheels are built properly and have the correct version and
filename.
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-26 21:27:28 -04:00
kingbri
eb435f79e3
Dependencies (TEMP): Use my wheels for exl2
...
Use these until exl2 updates its wheels to have the version equal the
filename.
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-26 02:11:33 -04:00
kingbri
136c8139f9
Dependencies: Update PyTorch, Exllamav2, and FA2
...
PyTorch: v2.7.0 on cuda 128 + ROCm 6.3
Exllamav2: v0.2.9
FA2: v2.7.4.post1 on cuda 128
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-24 21:52:48 -04:00
kingbri
9834c7f99b
Dependencies: Ungate numpy
...
numpy v2 now works with Torch
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-21 23:14:14 -04:00
kingbri
0dcbb7a722
Dependencies: Update torch, exllamav2, and flash-attn
...
Torch - 2.6.0
ExllamaV2 - 0.2.8
Flash-attn - 2.7.4.post1
Cuda wheels are now 12.4 instead of 12.1, feature names need to be
migrated over.
Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-02-09 01:27:48 -05:00
Jakub Filo
f8d9cfb5fd
Bump formatron to 0.4.11
2025-01-08 00:48:25 +01:00
kingbri
cfb439c0e6
Dependencies: Update exllamav2 and pytorch for ROCm
...
Exllama v0.2.7, pytorch v2.5.1 across all cards.
AMD now requires ROCm 6.2
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2025-01-01 16:22:10 -05:00
kingbri
fa8035ef72
Dependencies: Update sse-starlette and formatron
...
Also pin newer versions of dependencies and fix an import from sse-starlette
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2024-12-21 23:14:55 -05:00
kingbri
bc3c154c96
Dependencies: Pin tokenizers
...
Use a version greater than 0.20.0 for newer model support.
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2024-12-13 00:58:25 -05:00
kingbri
f25ac4b833
Dependencies: Update ExllamaV2
...
v0.2.6
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2024-12-13 00:47:29 -05:00
kingbri
8ccd7a12a2
Merge branch 'main' into formatron
2024-12-05 23:01:22 -05:00
kingbri
ac85e34356
Dependencies: Update Torch, FA2, and Exl2
...
Torch: 2.5, FA2 2.7.0.post2, Exl2 v0.2.5
Don't update torch for rocm as exl2 isn't built for rocm 6.2
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2024-12-03 22:57:00 -05:00
kingbri
ca86ab5477
Dependencies: Remove CUDA 11.8
...
Most software has moved to CUDA 12 and cards that aren't supported by
11.8 don't use tabby anyways.
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2024-12-03 22:37:03 -05:00
kingbri
3c4211c963
Dependencies: Ensure updated kbnf
...
Signed-off-by: kingbri <8082010+bdashore3@users.noreply.github.com>
2024-12-02 15:10:20 -05:00
DocShotgun
0836a9317f
Grammar: Initial Formatron regex and JSON schema implementation
...
* Replace LMFE's regex and JSON schema filters with Formatron's
* Remove Outlines EBNF filter in preparation for Formatron KBNF filter
* TODO: Implement Formatron KBNF filter
2024-11-23 10:27:37 -08:00
kingbri
9cd7fcaf99
Pyproject: Add pillow to deps
...
Signed-off-by: kingbri <bdashore3@proton.me>
2024-11-22 17:48:56 -05:00
kingbri
0fadb1e5e8
Merge branch 'main' into vision
2024-11-19 21:19:21 -05:00
DocShotgun
dd41eec8a4
OAI: Initial vision support in OAI chat completions
...
* Support image_url inputs containing URLs or base64 strings following OAI vision spec
* Use async lru cache for image embeddings
* Add generic wrapper class for multimodal embeddings
2024-11-17 21:23:09 -08:00
kingbri
69838e92ca
Dependencies: Update ExllamaV2
...
v0.2.4
Signed-off-by: kingbri <bdashore3@proton.me>
2024-11-13 22:16:11 -05:00
kingbri
6726014d35
Dependencies: Update ExllamaV2
...
v0.2.3
Signed-off-by: kingbri <bdashore3@proton.me>
2024-09-30 00:17:12 -04:00
kingbri
b4cda78bcc
Dependencies: Update Ruff
...
v0.6.5
Signed-off-by: kingbri <bdashore3@proton.me>
2024-09-19 22:39:08 -04:00
kingbri
c616b3b1ee
Dependencies: Update PyTorch
...
v2.4.1 and update all associated wheels to use their 2.4 versions.
Signed-off-by: kingbri <bdashore3@proton.me>
2024-09-19 22:32:23 -04:00
TerminalMan
948fcb7f5b
migrate to ruamel.yaml
2024-09-18 01:06:34 +01:00
turboderp
318c425d84
Bump exllamav2 to 0.2.2
2024-09-14 21:43:26 +02:00