Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
49bfe4cb5f365a532f9ee383c963bbdedcd15e02
Switch branch/tag
vllm_cscc
vllm
benchmarks
11 Feb, 2025
1 commit
add latency
· 49bfe4cb
zhuwenwen
authored
Feb 11, 2025
49bfe4cb
06 Jan, 2025
1 commit
update qwen2-moe layout and benchmark_throughput.py
· 220e6456
zhuwenwen
authored
Jan 06, 2025
220e6456
05 Dec, 2024
1 commit
change AMD GPU and dcu to hcu
· 3f78216a
zhuwenwen
authored
Dec 05, 2024
3f78216a
10 Oct, 2024
1 commit
update benchmarks
· 00d3d196
zhuwenwen
authored
Oct 10, 2024
00d3d196
26 Sep, 2024
1 commit
support head_dim 160 and update benchmark_throughput.py
· 93872128
zhuwenwen
authored
Sep 26, 2024
93872128
20 Sep, 2024
1 commit
add benchmarks to vllm whl
· 5b62725e
zhuwenwen
authored
Sep 20, 2024
5b62725e
10 Aug, 2024
2 commits
Revert feat:optimize act_and_mul_kernel
· 880b2e41
zhuwenwen
authored
Aug 10, 2024
880b2e41
add benchmarks to vllm and add env of VLLM_USE_FLASH_ATTN_AUTO
· efb2f75f
zhuwenwen
authored
Aug 10, 2024
efb2f75f