Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
AutoAWQ
Commits
09c73fb2c5a0e7cd476d9f12875a0645d8b128ad
Switch branch/tag
autoawq
awq
modules
linear.py
14 Nov, 2023
1 commit
Fix multi-GPU loading and inference (#190)
· 09c73fb2
Casper
authored
Nov 14, 2023
09c73fb2
28 Oct, 2023
1 commit
[`core`] Support fp32 / bf16 inference (#121)
· 18712d00
Younes Belkada
authored
Oct 28, 2023
18712d00
19 Sep, 2023
1 commit
Use GEMM v2 kernel for context processing
· f3a71d1d
Casper Hansen
authored
Sep 19, 2023
f3a71d1d
08 Sep, 2023
2 commits
Implement GEMM/GEMV in quantize function and fused modules
· 5db43a7f
Casper Hansen
authored
Sep 08, 2023
5db43a7f
Refactor modules, create separate GEMM and GEMV
· fe314160
Casper Hansen
authored
Sep 08, 2023
fe314160