"tests/models/language/generation/test_models.py" did not exist on "909fdaf152aed44186174b6b8e876bb0c3990b3c"
- 12 Apr, 2026 5 commits
-
-
Animesh Jain authored
Signed-off-by:Animesh Jain <anijain@umich.edu>
-
Le Yang authored
-
Nicolò Lucchesi authored
The number of features supported by the connector has grown substantially and the `nixl_connector.py` file has accumulated a lot of code. Creates a separate directory and isolates connector/scheduler code in the hope of improving clarity and maintainability. Further refactor of components aimed at improving clarity and simplifying code will follow soon. Signed-off-by:NickLucche <nlucches@redhat.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Martin Hickey authored
Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
- 11 Apr, 2026 4 commits
-
-
EdalatiAli authored
Signed-off-by:EdalatiAli <aliedalati@cohere.com>
-
Wei Zhao authored
Signed-off-by:wzhao18 <wzhao18.sz@gmail.com>
-
ShubyM authored
Signed-off-by:ShubyM <shubymishra20@gmail.com>
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 10 Apr, 2026 14 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
yzong-rh authored
Signed-off-by:Yifan Zong <yzong@redhat.com>
-
zhanqiuhu authored
Signed-off-by:ZhanqiuHu <zhu@redhat.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
jatseng-ai authored
Signed-off-by:
jatseng-ai <jatseng@amd.com> Signed-off-by:
jatseng-ai <janet.tseng@amd.com> Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com> Co-authored-by:
Claude <noreply@anthropic.com> Co-authored-by:
Matthias Gehre <matthias.gehre@amd.com>
-
PatchyTIS authored
Signed-off-by:
PatchouliTaisa <patchychen@tencent.com> Co-authored-by:
PatchouliTaisa <patchychen@tencent.com>
-
Lalithnarayan C authored
Signed-off-by:
Lalithnarayan C <Lalithnarayan.C@amd.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
Injae Ryou authored
Signed-off-by:
Injae Ryou <injaeryou@gmail.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-
Kyungmin Lee authored
Signed-off-by:
lkm2835 <lkm2835@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 09 Apr, 2026 13 commits
-
-
Ekagra Ranjan authored
Signed-off-by:Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
PikaPikachu authored
Signed-off-by:kangletian <Letian.Kang@amd.com>
-
Lucas Kabela authored
[Performance Improvement] Update `batched_count_greater_than` to handle batch size 1 without recompile (#38933) Signed-off-by:
Lucas Kabela <lucaskabela@meta.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
lalit10 authored
Signed-off-by:
Lalit Laxminarayan Bangad <lalitbangad@gmail.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
sihao_li authored
Signed-off-by:
sihao.li <sihao.li@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <chendi.xue@intel.com>
-
noobHappylife authored
Signed-off-by:
noobhappylife <aratar1991@hotmail.com> Co-authored-by:
OpenAI Codex <codex@openai.com>
-
Ilya Boytsov authored
Signed-off-by:Ilya Boytsov <ilyaboytsov1805@gmail.com>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Maral authored
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (#33892) Signed-off-by:
maral <maralbahari.98@gmail.com> Signed-off-by:
Maral <maralbahari.98@gmail.com>
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 08 Apr, 2026 4 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
triangleXIV authored
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102) Signed-off-by:
triangle14 <y1019026570@gmail.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Rishi Puri authored
Signed-off-by:
Rishi Puri <riship@nvidia.com> Signed-off-by:
Rishi Puri <puririshi98@berkeley.edu> Signed-off-by:
sfeng33 <4florafeng@gmail.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
Flora Feng <4florafeng@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-