[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756)
Signed-off-by:Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com>
Showing
Please register or sign in to comment
Signed-off-by:Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com>