Commit Graph

3 Commits

Author SHA1 Message Date
Chunyuan WU
6c89214584 [CPU][sgl-kernel] extend_attention_cpu and flash_attn_varlen_func: fix nan for large seq (#22434)
Co-authored-by: Ma Mingfei <mingfei.ma@intel.com>
2026-04-17 13:01:01 +08:00
blzheng
c2b01bd2fc [CPU] fix bug in AVX512 implementation of flash_attn_softmax (#20220)
Co-authored-by: Wu, Chunyuan <chunyuan.wu@intel.com>
2026-03-18 22:18:47 -07:00
Ma Mingfei
88f7759402 [CPU] optimize flash_attn_varlen_func (#15708)
2026-01-29 22:07:05 -08:00