Commit Graph

8 Commits

Author SHA1 Message Date
blzheng
0dcfae5553 [CPU] Add gemma4_rmsnorm_cpu kernel (#22842)
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>
2026-04-17 13:03:16 +08:00
Zaili Wang
2f4babe32b [CPU] support LayerNorm with 3D shape (#15075)
Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>
2026-03-18 22:15:24 -07:00
blzheng
d16ff357db [CPU] Add Gemma3RMSNorm kernel in sgl-kernel and add ut (#9324) 2025-12-15 00:24:02 -08:00
Zaili Wang
d6bd2d1126 [CPU] layernorm & fused add-layernorm kernels (#14074) 2025-12-11 16:58:23 -08:00
YanbingJiang
acde21d8d5 Add fused_rmsnorm_gated_cpu kernel for CPU to support Qwen3-Next (#11577) 2025-11-21 01:33:31 +08:00
YanbingJiang
fcde67b016 CPU: map changes from developing branch in sgl-kernel (#6833)
Co-authored-by: mingfeima <mingfei.ma@intel.com>
2025-06-10 01:08:15 -07:00
jianan-gu
ff00895c46 Add CPU optimized kernels for topk and rope fusions (#6456) 2025-06-02 17:37:34 -07:00
Ma Mingfei
a73c4df438 Add optimized native kernels in sgl-kernel (#5150)
Co-authored-by: Chunyuan WU <chunyuan.wu@intel.com>
Co-authored-by: YanbingJiang <yanbing.jiang@intel.com>
Co-authored-by: blzheng <beilei.zheng@intel.com>
2025-04-08 09:37:46 -07:00