- 21 Feb, 2026 1 commit
-
-
Lucas Wilkinson authored
Revert "[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers " (#34997)
-
- 19 Feb, 2026 1 commit
-
-
Eldar Kurtić authored
[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers (#34471)
Signed-off-by: Your Name <you@example.com>
Co-authored-by: Your Name <you@example.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 11 Feb, 2026 1 commit
-
-
Eldar Kurtić authored
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for rope (int8, fp8) (#34243)
Signed-off-by: Your Name <you@example.com>
Co-authored-by: Your Name <you@example.com>
-
- 27 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 23 Jan, 2026 1 commit
-
-
baonudesifeizhai authored
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 1 commit
-
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
-
- 11 Dec, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 26 Nov, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 20 Nov, 2025 1 commit
-
-
Dezhan authored
Co-authored-by: Dezhan Tu <dztu@meta.com>
-
- 19 Nov, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 05 Nov, 2025 1 commit
-
-
Ilya Markov authored
Signed-off-by: ilmarkov <markovilya197@gmail.com>
Signed-off-by: Sage Moore <sage@neuralmagic.com>
Co-authored-by: Sage Moore <sage@neuralmagic.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 12 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 09 Oct, 2025 1 commit
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
- 05 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Sep, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
- 03 Sep, 2025 1 commit
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
- 19 Aug, 2025 1 commit
-
-
qizixi authored
Signed-off-by: qizixi <qizixi@meta.com>
Co-authored-by: Lu Fang <30275821+houseroad@users.noreply.github.com>
-
- 13 Aug, 2025 1 commit
-
-
Po-Han Huang (NVIDIA) authored
Signed-off-by: Po-Han Huang <pohanh@nvidia.com>
-
- 07 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 31 Jul, 2025 1 commit
-
-
zhiweiz authored
Signed-off-by: morgendave <morgendave@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.me>
-
- 30 Jul, 2025 1 commit
-
-
Po-Han Huang (NVIDIA) authored
Signed-off-by: Po-Han Huang <pohanh@nvidia.com>
-
- 12 Jul, 2025 1 commit
-
-
Zhiyu authored
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
-
- 25 Jun, 2025 1 commit
-
-
Brayden Zhong authored
Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca>
-
- 12 Jun, 2025 1 commit
-
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 15 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 14 May, 2025 1 commit
-
-
bnellnm authored
-
- 29 Apr, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by: Lucia Fang <fanglu@fb.com>
-
- 18 Apr, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 12 Apr, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 11 Apr, 2025 1 commit
-
-
Yong Hoon Shin authored
Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
-
- 09 Apr, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by: Lu Fang <fanglu@fb.com>
-
- 07 Apr, 2025 1 commit
-
-
Lu Fang authored
Signed-off-by: Aston Zhang <22279212+astonzhang@users.noreply.github.com>
Signed-off-by: Chris Thi <chris.c.thi@gmail.com>
Signed-off-by: drisspg <drisspguessous@gmail.com>
Signed-off-by: Jon Swenson <jmswen@gmail.com>
Signed-off-by: Keyun Tong <tongkeyun@gmail.com>
Signed-off-by: Lu Fang <fanglu@meta.com>
Signed-off-by: Xiaodong Wang <xdwang@meta.com>
Signed-off-by: Yang Chen <yangche@fb.com>
Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
Signed-off-by: Lu Fang <lufang@fb.com>
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <fanglu@fb.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: Lu Fang <fanglu@fb.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-