- 23 Sep, 2024 2 commits
-
-
Jani Monoses authored
[VLM] Fix paligemma, fuyu and persimmon with transformers 4.45 : use config.text_config.vocab_size (#8707)
-
Yanyi Liu authored
-
- 22 Sep, 2024 4 commits
-
-
Lily Liu authored
-
litianjian authored
Co-authored-by:
litianjian <litianjian@bytedance.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
-
Isotr0py authored
-
- 21 Sep, 2024 3 commits
-
-
Divakar Verma authored
-
rasmith authored
[Kernel][Triton][AMD] Remove tl.atomic_add from awq_gemm_kernel, 2-5x speedup MI300, minor improvement for MI250 (#8646)
-
Cyrus Leung authored
-
- 20 Sep, 2024 3 commits
-
-
zyddnys authored
-
Niklas Muennighoff authored
-
Amit Garg authored
-
- 19 Sep, 2024 2 commits
-
-
盏一 authored
-
Roger Wang authored
-
- 18 Sep, 2024 5 commits
-
-
Tyler Michael Smith authored
-
Gregory Shtrasberg authored
Co-authored-by:
Alexei-V-Ivanov-AMD <156011006+Alexei-V-Ivanov-AMD@users.noreply.github.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
Geun, Lim authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Cyrus Leung authored
-
- 17 Sep, 2024 6 commits
-
-
Tyler Michael Smith authored
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
chenqianfzh authored
-
sroy745 authored
-
Roger Wang authored
-
Simon Mo authored
-
- 16 Sep, 2024 2 commits
-
-
Luka Govedič authored
-
ElizaWszola authored
Co-authored-by:Dipika <dipikasikka1@gmail.com>
-
- 15 Sep, 2024 2 commits
- 14 Sep, 2024 2 commits
-
-
youkaichao authored
-
ywfang authored
Co-authored-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 13 Sep, 2024 3 commits
-
-
Kunshang Ji authored
Co-authored-by:Yan Ma <yan.ma@intel.com>
-
Jee Jee Li authored
-
Dipika Sikka authored
-
- 12 Sep, 2024 6 commits
-
-
Wenxiang authored
-
Patrick von Platen authored
-
Roger Wang authored
[Hotfix][Core][VLM] Disable chunked prefill by default and prefix caching for multimodal models (#8425)
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Isotr0py authored
Co-authored-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
-