[FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models.
V0.7.2 dev custom See merge request dcutoolkit/deeplearing/vllm!90
[fix]修复fused_moe.py中fused_moe接口未初始化moe_ep_size导致的deekseek等模型报错 See merge request dcutoolkit/deeplearing/vllm!89
V0.7.2 dev yangql See merge request dcutoolkit/deeplearing/vllm!85
V0.7.2 dev deepseek v3/r1 block-int8量化支持 See merge request dcutoolkit/deeplearing/vllm!83
[feat]添加BW deepseek_v3 fused_moe configs See merge request dcutoolkit/deeplearing/vllm!80
[feat]优化fused_moe tuning,上传K100_AI的configs See merge request dcutoolkit/deeplearing/vllm!79