- 13 Sep, 2024 1 commit
-
-
zhuwenwen authored
-
- 11 Sep, 2024 16 commits
-
-
Simon Mo authored
-
Patrick von Platen authored
Co-authored-by: Roger Wang <ywang@roblox.com>
-
Lily Liu authored
Co-authored-by: youkaichao <youkaichao@126.com>
-
Aarni Koskela authored
-
bnellnm authored
Co-authored-by: Sage Moore <sage@neuralmagic.com>
-
Cyrus Leung authored
-
Alexey Kondratiev(AMD) authored
-
Li, Jiang authored
-
Yang Fan authored
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
zhuwenwen authored
-
Pooya Davoodi authored
-
Yangshen⚡Deng authored
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Pavani Majety authored
-
zhuwenwen authored
-
Isotr0py authored
-
Jee Jee Li authored
-
- 10 Sep, 2024 15 commits
-
-
Tyler Michael Smith authored
-
William Lin authored
-
Alexander Matveev authored
[Bugfix] Ensure multistep lookahead allocation is compatible with cuda graph max capture (#8340)
-
Cody Yu authored
[MISC] Keep chunked prefill enabled by default with long context when prefix caching is enabled (#8342)
-
Prashant Gupta authored
-
Kevin Lin authored
-
sumitd2 authored
-
Alexey Kondratiev(AMD) authored
-
Cyrus Leung authored
-
Daniele authored
-
zhuwenwen authored
-
Cyrus Leung authored
-
Simon Mo authored
-
Dipika Sikka authored
-
zhuwenwen authored
-
- 09 Sep, 2024 7 commits
-
-
Kyle Sayers authored
-
Vladislav Kruglikov authored
-
Adam Lugowski authored
Co-authored-by: Adam Lugowski <adam.lugowski@parasail.io>
-
Kyle Mistele authored
[Bugfix] Streamed tool calls now more strictly follow OpenAI's format; ensures Vercel AI SDK compatibility (#8272)
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
- 08 Sep, 2024 1 commit
-
-
Alexander Matveev authored
-