"tests/nn/vscode:/vscode.git/clone" did not exist on "e6acdcc3a75a7ca20022ceb09434be181cd53a43"
Commit f03f72de authored by Casper Hansen's avatar Casper Hansen
Browse files

Falcon: Enable fused attention

parent ac770875
......@@ -7,7 +7,7 @@ class FalconAWQForCausalLM(BaseAWQForCausalLM):
@staticmethod
def fuse_layers(model: FalconForCausalLM, quant_config:dict):
fuser = FalconFuser(model)
# fuser.fuse_transformer()
fuser.fuse_transformer()
@staticmethod
def get_model_layers(model: FalconForCausalLM):
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment