[Kernel][RFC] Refactor the punica kernel based on Triton (#5036)
Showing
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
vllm/lora/ops/__init__.py
0 → 100644
vllm/lora/ops/bgmv_expand.py
0 → 100644
This diff is collapsed.
This diff is collapsed.
vllm/lora/ops/bgmv_shrink.py
0 → 100644
This diff is collapsed.
vllm/lora/ops/sgmv_expand.py
0 → 100644
This diff is collapsed.
Please register or sign in to comment