- 25 Aug, 2025 12 commits
-
-
Chenguang Zheng authored
[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711) Signed-off-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Yu Guo authored
Signed-off-by:Yu Guo <yuguo@meta.com>
-
LIYIFAN_liyifan authored
[Bugfix] Fix Dense module loading for sentence-transformers embedding models (simplified V2) (#23408) Signed-off-by:FFFfff1FFFfff <yifanli0919@gmail.com>
-
Benji Beck authored
Signed-off-by:Benji Beck <benjibeck@meta.com>
-
Benji Beck authored
Signed-off-by:Benji Beck <benjibeck@meta.com>
-
Benji Beck authored
Signed-off-by:Benji Beck <benjibeck@meta.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
zifeitong authored
Signed-off-by:Zifei Tong <zifeitong@gmail.com>
-
Noam Gat authored
Signed-off-by:
Noam Gat <noamgat@gmail.com> Co-authored-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
Didier Durand authored
Signed-off-by:
Didier Durand <durand.didier@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 24 Aug, 2025 11 commits
-
-
Lucia Fang authored
Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
Lucia (Lu) Fang <fanglu@meta.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
汪志鹏 authored
Signed-off-by:汪志鹏 <wangzhipeng628@gmail.com>
-
TeeKen Lau authored
Signed-off-by:teekenl <teekenlau@gmail.com>
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
Benji Beck authored
Signed-off-by:Benji Beck <benjibeck@meta.com>
-
22quinn authored
Signed-off-by:
22quinn <33176974+22quinn@users.noreply.github.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Eric Marcus <eric.marcus@kaiko.ai> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
Benji Beck authored
Signed-off-by:Benji Beck <benjibeck@meta.com>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
- 23 Aug, 2025 10 commits
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Xu Wenqing authored
Signed-off-by:Xu Wenqing <xuwq1993@qq.com>
-
Aziz authored
Signed-off-by:AzizCode92 <azizbenothman76@gmail.com>
-
Cyrus Leung authored
Revert "[PERF] Use faster way of decode in tokenizer: avoid useless list-to-list conversion (#20000)" (#23396) Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Chenxi Yang authored
Co-authored-by:Chenxi Yang <cxyang@meta.com>
-
Daifeng Li authored
Signed-off-by:
feng <fengli1702@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
WeiQing Chen authored
Signed-off-by:
ycyaw66 <497410282@qq.com> Co-authored-by:
ycyaw66 <497410282@qq.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 22 Aug, 2025 7 commits
-
-
elvischenv authored
Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
rasmith authored
[BugFix][AMD][Quantization] Fix torch.compile issue where wvSplitKQ not being called when it should when using quantized FP8 model (#22281) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
Ilya Markov authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
ilmarkov <imarkov@redhat.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Zhewen Li authored
Co-authored-by:Simon Mo <simon.mo@hey.com>
-
Shiyan Deng authored
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Isotr0py authored
Signed-off-by:
汪志鹏 <wangzhipeng628@gmail.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
汪志鹏 <wangzhipeng628@gmail.com>
-