"vllm/model_executor/models/terratorch.py" did not exist on "e189b50f53e333814d41278c5e5be66240c99018"
-
zhuwenwen authored
add VLLM_USE_FUSED_CACHE_QUANT_BMM_MLA to use fused rmsnorm + contiguous + rope(for dpsk-v3) + concat_and_cache_mla + q quant, control bmm(todo) + cat +mla (fp8)
9dd70f0e