[torch.compile] Refactor Attention Quant Fusion Pass and Remove Boilerplate (#37373)
Signed-off-by: BadrBasowid <badr.basowid@gmail.com>
Co-authored-by: vllmellm <vllm.ellm@embeddedllm.com>