rocking
|
5948dbffe4
|
Support fp8 dynamic quantization for fmha (#3206)
* Support qscale for dynamic quant, remove static quant
* Support hdim=256
* Remove bias test case for fp8
---------
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
Co-authored-by: asleepzzz <hanwen.chang@amd.com>
|
2025-11-24 16:28:25 +08:00 |
|