Wenyao Gao
|
4dfc8e1c3f
|
VLM: support passing --mm-process-config for all models (#18467)
|
2026-04-12 17:08:05 +08:00 |
|
Aditya Sharma
|
f6e85676b5
|
model: support qwen3-asr (#22073)
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
|
2026-04-07 13:27:05 +08:00 |
|
Piotr Mazurek
|
b5e8c4b9e3
|
model: support LFM2-VL (Liquid Foundation Model 2 Vision-Language) (#21230)
Co-authored-by: Piotr Mazurek <piotr.mazurek@liquid.ai>
|
2026-04-04 16:36:04 +08:00 |
|
Артем Савкин
|
27071e0a43
|
[NPU] Update quantization&CI documentation (#21100)
Co-authored-by: Tamir Baydasov <41994229+TamirBaydasov@users.noreply.github.com>
|
2026-03-28 21:42:21 +03:00 |
|
Nave Assaf
|
77872a8d55
|
Update Nemotron Example docs to include Super v3 and Nano 4B (#21416)
Signed-off-by: Nave Assaf <nassaf@nvidia.com>
|
2026-03-25 12:03:19 -04:00 |
|
Jiaxin(Jackson) Deng
|
c4db64c16b
|
Add Lychee Doc Links Check to Local and CI (#19742)
Co-authored-by: Zijie Xia <zijie_xia@icloud.com>
Co-authored-by: Zijie Xia <zijiexia@users.noreply.github.com>
Co-authored-by: zijiexia <37504505+zijiexia@users.noreply.github.com>
|
2026-03-24 13:48:26 -07:00 |
|
Rabinovich
|
3798a8c88d
|
docs: add out-of-tree model integration guide (#21050)
Co-authored-by: Yixiao Zeng <yixiao.zeng@xiaopeng.com>
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
|
2026-03-20 20:07:46 -07:00 |
|
Matt Van Horn
|
6c5bf53a36
|
[Doc] Clarify that --chat-template is required for Qwen3-Reranker (#20596)
Co-authored-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-03-14 23:43:48 +00:00 |
|
Thomas
|
05e40922b3
|
[Doc] Fix wrong link and cmd description (#20365)
|
2026-03-11 11:30:45 -04:00 |
|
Xuhao Zhang
|
57b093dc34
|
[NPU]MindSpore backend support eagle3 (#17098)
Co-authored-by: wangtiance <tiancew@qq.com>
Co-authored-by: Tiance Wang <wangtiance@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: ronnie_zheng <zl19940307@163.com>
|
2026-03-11 09:11:19 +03:00 |
|
rakesh
|
a710b7d791
|
[Sarvam] Add inference support for Sarvam MoE LLMs (#18938)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-03-04 15:28:00 -08:00 |
|
Michael
|
6b8e62f94f
|
[AMD] [Qwen 3.5 Day 0] Add Qwen 3.5 nightly accuracy tests (#19479)
|
2026-03-02 19:42:42 -08:00 |
|
Michael
|
403195d59d
|
[AMD] [MiniMax-M2.5 Day 0] Add MiniMax-M2.5 nightly accuracy test (#19443)
|
2026-02-27 02:39:33 -08:00 |
|
chengshuang18
|
295bc17576
|
Feature/sdar support (#19044)
Co-authored-by: root <root@gpu-lg-cmc-h-h200-3047.host.h.pjlab.org.cn>
Co-authored-by: chengshuang <chengshuang@pjlab.org.cn>
Co-authored-by: 赵晨阳 <zhaochen20@outlook.com>
|
2026-02-19 21:58:15 -08:00 |
|
Cheng Wan
|
73a7f0d049
|
Revert "Add SDAR model support" (#19032)
|
2026-02-19 16:03:56 -08:00 |
|
chengshuang18
|
44ab752b7a
|
Add SDAR model support (#18318)
Co-authored-by: root <root@gpu-lg-cmc-h-h200-3047.host.h.pjlab.org.cn>
Co-authored-by: chengshuang <chengshuang@pjlab.org.cn>
Co-authored-by: 赵晨阳 <zhaochen20@outlook.com>
|
2026-02-19 11:20:32 -08:00 |
|
Bhavneek Singh
|
1ce3420784
|
Model: Support IBM Granite (Dense/Mamba + MoE) (#18040)
|
2026-02-15 11:24:41 +08:00 |
|
qianyue76
|
f06ab17a73
|
[diffusion] docs: consolidate diffusion documentation into docs (#18095)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: JiaxinD <djx2048@gmail.com>
|
2026-02-11 16:55:07 -08:00 |
|
brimon
|
ddbcfbaaab
|
feature: support bidirectional attention for Gemma-3 (#10707)
|
2026-02-09 23:17:45 +08:00 |
|
Junlin Zhou
|
14652243bd
|
[DLLM] Add JointThreshold algorithm for joint M2T and T2T decoding (#18171)
Signed-off-by: Junlin Zhou <zhoujunlin.zjl@antgroup.com>
Co-authored-by: Tiwei Bie <tiwei.btw@antgroup.com>
|
2026-02-09 14:20:45 +08:00 |
|
Rishit Shivam
|
c850a8a41a
|
[Docs] Add Falcon H1, Hunyuan-Large, Qwen3-Omni support and update Diffusion usage (#17888)
Co-authored-by: Rishitshivam <164783543+Rishitshivam@users.noreply.github.com>
Co-authored-by: Ratish P <114130421+Ratish1@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Adarsh Shirawalmath <114558126+adarshxs@users.noreply.github.com>
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
|
2026-02-06 13:17:51 -08:00 |
|
rinbaro
|
de6a03260f
|
[docs] fix misspellings & typos (#18276)
|
2026-02-05 03:35:29 +00:00 |
|
Tiance Wang
|
f6a4ff718f
|
doc update for CANN version (#18014)
Co-authored-by: wangtiance <tiancew@qq.com>
|
2026-01-30 21:10:23 -05:00 |
|
Wenchen Lo
|
046b29be16
|
GPTJForCausalLM Support (#7839)
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
|
2026-01-29 21:00:04 -08:00 |
|
baonudesifeizhai
|
84ab611af8
|
model: support DeepSeek-OCR-2 (#17897)
|
2026-01-30 09:49:51 +08:00 |
|
Yuxuan Zhang
|
7106f6c8e1
|
[GLM-OCR] Support GLM-OCR Model (#17582)
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
|
2026-01-26 22:24:00 -08:00 |
|
CSWYF3634076
|
1a19b3987d
|
[Model] Add Ernie4.5 VL model support (#15679)
Signed-off-by: CSWYF3634076 <wangyafeng@baidu.com>
Signed-off-by: wangyafeng <wangyafeng@baidu.com>
|
2026-01-25 22:36:29 -08:00 |
|
Lingjun Wen
|
cf89351691
|
[new-model] Add support for Cohere2ForCausalLM behind Command-A and Command-R Models (#16927)
|
2026-01-21 12:28:33 -08:00 |
|
Yi Zhong
|
ec9b48ea96
|
Add olmo3 in supported docs (#13672)
Signed-off-by: Vincent Zhong <207368749+vincentzed@users.noreply.github.com>
|
2026-01-16 12:18:16 -05:00 |
|
Raghav Ravishankar
|
daea51385d
|
Add AFMoE model implementation (#13216)
|
2026-01-16 20:35:42 +08:00 |
|
Adarsh Shirawalmath
|
7c39ea68f3
|
[diffusion] model: support flux Klein (#17173)
|
2026-01-16 16:16:17 +08:00 |
|
Yi Zhong
|
d1110e1c3e
|
docs only add kimi k2 thinking and kimi linear (#15789)
|
2026-01-15 12:09:52 -05:00 |
|
shuwenn
|
de94d793ad
|
feat: support qwen3(-VL) rerank scoring&chat template (#16403)
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
|
2026-01-15 00:45:46 +08:00 |
|
Adarsh Shirawalmath
|
aab906a3d4
|
[docs] sync diffusion docs to main docs (#16932)
|
2026-01-12 14:49:55 +08:00 |
|
Netanel Haber
|
bebd625ba1
|
EVS Framework: Support NemotronH_Nano_VL_V2 (#14051)
|
2026-01-05 16:18:07 +08:00 |
|
Roger Young
|
5c64a20da7
|
Update MiniMax-M2 ToolCall and add MiniMax-M2.1 in Docs (#15538)
Co-authored-by: xuebi <xuebi@minimaxi.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
|
2025-12-23 15:11:52 -08:00 |
|
Zehuan Li
|
76743a983e
|
[DLLM] Add documentation for diffusion LLMs (#14358)
Co-authored-by: Tiwei Bie <tiwei.btw@antgroup.com>
Co-authored-by: Jinwei Yao <jinweiy@illinois.edu>
|
2025-12-11 20:29:51 -08:00 |
|
Tiance Wang
|
624725cb5e
|
Move and update MindSpore docs, make it appear on the online documentation (#14861)
Co-authored-by: wangtiance <tiancew@qq.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-12-10 23:03:50 -08:00 |
|
Simo Lin
|
49dfa1d891
|
[model-gateway] change sgl-router to sgl-model-gateway (#14312)
|
2025-12-05 12:04:48 -08:00 |
|
Lianmin Zheng
|
bc3d2a85af
|
[Minor] update docs (#14212)
|
2025-12-01 02:33:58 -08:00 |
|
Netanel Haber
|
082b54c689
|
Support nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16 (and nvidia/C-RADIOv2-H) (#12277)
|
2025-11-26 16:28:52 -07:00 |
|
wingedge
|
f1be8aa0f2
|
chore: add an unified server arg for multimodal inputs preprocess config(#12149)
Co-authored-by: bianfeng <bianfeng@pinduoduo.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
|
2025-11-18 12:18:50 +08:00 |
|
Zijian Zhang
|
aa8ecbda7a
|
model: support JetVLM (#13289)
|
2025-11-18 12:02:03 +08:00 |
|
Netanel Haber
|
9f011f617f
|
fix generative_models.md table - remove newlines (#13385)
|
2025-11-16 10:33:38 -08:00 |
|
Praneth Paruchuri
|
665f43bdd8
|
model: support teleflm (#10573)
|
2025-11-15 03:14:49 +08:00 |
|
Praneth Paruchuri
|
a53f2d6c12
|
Support orion (#10665)
|
2025-11-15 03:08:32 +08:00 |
|
Zijian Zhang
|
3633f8b0cf
|
Add Jet-Nemotron (#12448)
|
2025-11-09 01:32:47 -08:00 |
|
Amit Prakash
|
8e1d6756d5
|
docs: document video-capable multimodal models (#12565)
|
2025-11-06 00:06:19 -08:00 |
|
satyamk7054
|
9fc3e8aac7
|
Add support for Matryoshka embeddings (#126) (#11142)
Co-authored-by: Satyam Kumar <satyamk@linkedin.com>
|
2025-10-28 02:49:36 +08:00 |
|
赵晨阳
|
7ebc28f5d6
|
[WIP] support MiniMax M2 model (#12129)
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Signed-off-by: xuebi <xuebi@minimaxi.com>
Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: Roger Young <42564206+rogeryoungh@users.noreply.github.com>
Co-authored-by: xuebi <xuebi@minimaxi.com>
|
2025-10-26 13:58:54 -07:00 |
|