"vllm/vscode:/vscode.git/clone" did not exist on "06c20c9904644d8f65523bb747756b2eae706b8e"
  • fanwl's avatar
    Add FA Unified Attention 2D · eb35ba1b
    fanwl authored
    - Add VLLM_V1_USE_FA_UNIFIED_ATTN_2D 环境变量
    - 0: Triton attention, 1: FA unified attention
    eb35ba1b
config.py 26.3 KB