- 28 Jan, 2026 2 commits
-
-
ramos authored
Signed-off-by: ramos <49182011+nemoramo@users.noreply.github.com>
Signed-off-by: mayufeng <mayufeng@example.com>
Co-authored-by: mayufeng <mayufeng@example.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Jan, 2026 10 commits
-
-
Richard Zou authored
Signed-off-by: Richard Zou <zou3519@gmail.com>
-
danielafrimi authored
Signed-off-by: <dafrimi@nvidia.com>
Signed-off-by: Daniel Afrimi <dafrimi@nvidia.com>
Signed-off-by: root <dafrimi@nvidia.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
omerpaz95 authored
Added query and hit metrics for the Offloading Connector. Also added timing metrics for store and load operations, reporting the average per-token time to load or store. The metrics are available from Prometheus and from the StatLogger.
Signed-off-by: omerpaz95 <omerpaz95@gmail.com>
Co-authored-by: Omer Paz <Omer.Paz@ibm.com>
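The per-token averages and hit metrics described in this commit can be illustrated with a small sketch. This is not the Offloading Connector's code: the class name `OffloadStats`, its fields, and the choice to count queries and hits in tokens are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class OffloadStats:
    """Hypothetical counters; names and units are illustrative assumptions."""

    queries: int = 0          # tokens looked up in the offloaded cache
    hits: int = 0             # tokens found there
    load_seconds: float = 0.0
    load_tokens: int = 0
    store_seconds: float = 0.0
    store_tokens: int = 0

    def record_lookup(self, queried: int, hit: int) -> None:
        self.queries += queried
        self.hits += hit

    def record_load(self, seconds: float, tokens: int) -> None:
        self.load_seconds += seconds
        self.load_tokens += tokens

    def record_store(self, seconds: float, tokens: int) -> None:
        self.store_seconds += seconds
        self.store_tokens += tokens

    @property
    def hit_rate(self) -> float:
        # Guard against division by zero before any lookup is recorded.
        return self.hits / self.queries if self.queries else 0.0

    @property
    def avg_load_time_per_token(self) -> float:
        return self.load_seconds / self.load_tokens if self.load_tokens else 0.0

    @property
    def avg_store_time_per_token(self) -> float:
        return self.store_seconds / self.store_tokens if self.store_tokens else 0.0
```

Accumulating total seconds and total tokens separately, then dividing on read, gives a token-weighted average rather than an average of per-operation averages.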
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Roger Wang authored
Signed-off-by: wanglinian <wanglinian@stu.pku.edu.cn>
Signed-off-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: wanglinian <wanglinian@stu.pku.edu.cn>
Co-authored-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
wangln19 authored
Signed-off-by: wanglinian <wanglinian@stu.pku.edu.cn>
Signed-off-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Amir Klein <203507526+amirkl94@users.noreply.github.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: amirkl94 <203507526+amirkl94@users.noreply.github.com>
-
- 26 Jan, 2026 8 commits
-
-
Jared Wen authored
Add a new CLI option --disable-access-log-for-endpoints to suppress uvicorn access logs for specified endpoints (e.g., /health, /metrics, /ping). This reduces log noise in production environments, where load balancers and monitoring systems poll health check endpoints frequently and generate excessive log entries that obscure meaningful request logs.
Fixes #29982
Signed-off-by: JaredforReal <w13431838023@gmail.com>
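One common way to suppress access-log entries for selected paths is a `logging.Filter` attached to the `uvicorn.access` logger. The sketch below shows that general technique and is not taken from this commit: the class name `EndpointAccessLogFilter` and the helper `install_filter` are hypothetical, and the code assumes uvicorn passes the request path as the third element of the access log record's `args` tuple.

```python
import logging


class EndpointAccessLogFilter(logging.Filter):
    """Drop access-log records whose request path is in an excluded set."""

    def __init__(self, excluded_paths):
        super().__init__()
        self.excluded_paths = frozenset(excluded_paths)

    def filter(self, record: logging.LogRecord) -> bool:
        args = record.args
        # Assumed layout: (client_addr, method, full_path, http_version, status).
        if isinstance(args, tuple) and len(args) >= 3:
            path = str(args[2]).split("?", 1)[0]  # ignore the query string
            return path not in self.excluded_paths
        # Unknown record shape: keep the record rather than hide it.
        return True


def install_filter(excluded_paths):
    """Attach the filter to uvicorn's access logger and return it."""
    log_filter = EndpointAccessLogFilter(excluded_paths)
    logging.getLogger("uvicorn.access").addFilter(log_filter)
    return log_filter
```

Calling `install_filter(["/health", "/metrics", "/ping"])` before the server starts would leave other requests logged as usual while dropping records for those three paths.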
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Yuxuan Zhang authored
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Alex Brooks authored
Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
- 25 Jan, 2026 2 commits
-
-
JJJYmmm authored
Signed-off-by: JJJYmmm <92386084+JJJYmmm@users.noreply.github.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Roberto L. Castro authored
Signed-off-by: LopezCastroRoberto <rocastro@redhat.com>
-
- 24 Jan, 2026 5 commits
-
-
Joshua Deng authored
Signed-off-by: Joshua Deng <joshuakdeng@gmail.com>
Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
7. Sun authored
Signed-off-by: 7. Sun <jhao.sun@gmail.com>
-
7. Sun authored
Signed-off-by: 7. Sun <jhao.sun@gmail.com>
-
7. Sun authored
Signed-off-by: 7. Sun <jhao.sun@gmail.com>
-
ElizaWszola authored
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Luka Govedič <luka.govedic@gmail.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Varun Sundar Rabindranath <varunsundar08@gmail.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Luka Govedič <luka.govedic@gmail.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Luka Govedič <lgovedic@redhat.com>
-
- 23 Jan, 2026 13 commits
-
-
dolpm authored
Signed-off-by: dolpm <34420038+dolpm@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Harry Huang authored
Signed-off-by: huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
-
Luka Govedič authored
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
-
sangbumlikeagod authored
Signed-off-by: sangbumlikeagod <oironese@naver.com>
Signed-off-by: sangbumlikeagod <98077576+sangbumlikeagod@users.noreply.github.com>
-
Matt authored
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>
-
Xin Yang authored
Signed-off-by: Xin Yang <xyangx@amazon.com>
-
Fadi Arafeh authored
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
-
wang.yuqi authored
[Frontend][3/n] Make pooling entrypoints request schema consensus | EmbedRequest & ClassifyRequest (#32905)
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Patrick von Platen authored
Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Karan Bansal authored
Signed-off-by: Karan Bansal <karanb192@gmail.com>
-
Luka Govedič authored
[torch.compile] Compile `CustomOp.forward_native` for `SiluAndMul` and `QuantFP8` to avoid raw torch ops inside opaque custom ops (#32806)
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-