Commit Graph

8 Commits

Author SHA1 Message Date
Ma Mingfei
af62bd9486 [CPU] Implement MXFP4 Gemm kernels for intel AMX to support GPT OSS series. (#14385) 2026-03-29 23:44:12 -07:00
blzheng
cd22aa27a9 [CPU] Add FP8 Bmm support (#9744)
Co-authored-by: Fan Yin <1106310035@qq.com>
2026-03-18 22:19:48 -07:00
Ma Mingfei
46f21a5956 use faster covnersion from float8_e4m3fn to bfloat16 (#12316) 2025-12-04 12:34:05 +08:00
Ma Mingfei
5ad296bda1 Optimize prefill performance on cpu backend (#8750) 2025-08-28 17:21:55 -07:00
YanbingJiang
fcde67b016 CPU: map changes from developing branch in sgl-kernel (#6833)
Co-authored-by: mingfeima <mingfei.ma@intel.com>
2025-06-10 01:08:15 -07:00
blzheng
cfe48c5902 [CPU] Fix build issue (#6419) 2025-05-21 11:17:10 -07:00
Chunyuan WU
5dd62c3a6f Add fp8 shared_expert kernel for CPU in sgl-kernel and add UT (#6339)
Co-authored-by: Jiang, Yanbing <yanbing.jiang@intel.com>
Co-authored-by: mingfeima <mingfei.ma@intel.com>
2025-05-18 12:42:15 -07:00
Chunyuan WU
fb4959b2c5 Add fp8 gemm kernel for CPU in sgl-kernel and add gemm UT (#6216)
Co-authored-by: YanbingJiang <yanbing.jiang@intel.com>
Co-authored-by: mingfeima <mingfei.ma@intel.com>
2025-05-15 09:10:40 -07:00