- 13 Mar, 2025 13 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Cyrus Leung authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Szymon Ożóg authored
Signed-off-by:SzymonOzog <szymon.ozog@aleph-alpha.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Siyuan Liu authored
Signed-off-by:Siyuan Liu <lsiyuan@google.com>
-
Mathis Felardos authored
[Config][Disaggregated] Add timeout configuration for the torch.store and add KVTransferConfig.kv_connector_extra_config (#14367) Signed-off-by:Mathis Felardos <mathis@mistral.ai>
-
TY-AMD authored
Signed-off-by:TianyuanWu <Tianyuan.Wu@amd.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Richard Liu authored
Signed-off-by: <ricliu@google.com> Signed-off-by:Richard Liu <ricliu@google.com>
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
- 12 Mar, 2025 19 commits
-
-
Kevin H. Luu authored
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
TJian authored
[FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models. (#14664) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
ameyanjarlekar authored
Signed-off-by:ameyanjarlekar <aanjarlekar@nvidia.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
Roger Wang <ywang@roblox.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
Roger Wang <ywang@roblox.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Li, Jiang authored
Signed-off-by:
jiang1.li <jiang1.li@intel.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Benjamin Chislett authored
[V1][Bugfix][Spec Decode] Fix incorrect outputs in V1 speculative decoding due to batch indexing (#14645) Signed-off-by:Benjamin Chislett <benjamin.chislett@centml.ai>
-
Szymon Ożóg authored
Signed-off-by:SzymonOzog <szymon.ozog@aleph-alpha.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
Farzad Abdolhosseini authored
Signed-off-by:Farzad Abdolhosseini <farzad@fixie.ai>
-
Jennifer Zhao authored
Signed-off-by:
Jennifer Zhao <7443418+JenZhao@users.noreply.github.com> Co-authored-by:
Jennifer Zhao <7443418+JenZhao@users.noreply.github.com> Co-authored-by:
Roger Wang <136131678+ywang96@users.noreply.github.com>
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Randy Chen authored
Signed-off-by:
Randy Chen <acad.randyjhc@gmail.com> Signed-off-by:
Cody Yu <hao.yu.cody@gmail.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
Kevin H. Luu authored
-
- 11 Mar, 2025 8 commits
-
-
Cody Yu authored
Signed-off-by:Cody Yu <hao.yu.cody@gmail.com>
-
iefgnoix authored
-
Richard Liu authored
Signed-off-by: <ricliu@google.com> Signed-off-by:Richard Liu <ricliu@google.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Yang.Tao authored
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-