- 25 Jun, 2024 1 commit
-
-
drbh authored
* feat: first draft load multiple lora
* feat: load weights within layer and refactor lora pass
* fix: refactor and reduce lora math
* feat: baseline impl single request multi lora support
* feat: prefer lorax implementation and port loading logic
* fix: prefer adapter_data and refactors
* feat: prefer lorax's custom punica kernels and add mlp loras
* fix: adjust batch for bgmv
* fix: adjust adapter_segments logic when in batch
* fix: refactor and move changes to v3 proto
* fix: pass model_id for all flash causal lms
* fix: pass model_id for all causal and seq2seq lms
* fix: add model_id to model test
* feat: add lora support to mistral and refactors
* feat: prefer model id in request
* fix: include rust code for adapter id
* feat: bump launcher and add new lora docs
* feat: support base model generation and refactors
* fix: rename doc to retry ci build
* feat: support if vlm models
* fix: add adapter_data param and avoid missing layers
* fix: add adapter_data param to phi and neox
* fix: update all models forwards to include adapter_data
* fix: add model_id to IdeficsCausalLM
* Update lora.md Fixed a typo
* Update lora.md Fixing spam image
* fix: add lora kernel to dockerfile, support running without kernels and refactors
* fix: avoid dockerfile conflict
* fix: refactors and adjust flash llama lora logic
* fix: skip llama test due to CI issue (temp)
* fix: skip llama test CI (temp) 2
* fix: revert skips and prefer updated ci token for tests
* fix: refactors and helpful comments
* fix: add noop in TensorParallelAdapterRowLinear too
* fix: exit early if no adapter_data
---------
Co-authored-by: Derek <datavistics@gmail.com>
-
- 05 Jun, 2024 1 commit
-
-
OlivierDehaene authored
Close #2007
-
- 04 Jun, 2024 1 commit
-
-
OlivierDehaene authored
- Refactor code to allow supporting multiple versions of the generate.proto at the same time
- Add v3/generate.proto (ISO to generate.proto for now but allow for future changes without impacting v2 backends)
- Add Schedule trait to abstract queuing and batching mechanisms that will be different in the future
- Add SchedulerV2/V3 impl
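The idea of hiding queuing and batching behind a trait, as this commit message describes, can be sketched roughly as follows. This is an illustration only: the trait name `Scheduler`, its methods `enqueue` and `next_batch`, the `FifoScheduler` type, and the use of plain `String` requests are all hypothetical and are not taken from the repository.

```rust
use std::collections::VecDeque;

// Hypothetical trait: the commit message does not show the real trait's methods.
trait Scheduler {
    // Add a request to the scheduler's internal queue.
    fn enqueue(&mut self, request: String);
    // Remove and return up to `max_batch_size` queued requests.
    fn next_batch(&mut self, max_batch_size: usize) -> Vec<String>;
}

// One possible implementation: a simple first-in, first-out queue.
struct FifoScheduler {
    queue: VecDeque<String>,
}

impl Scheduler for FifoScheduler {
    fn enqueue(&mut self, request: String) {
        self.queue.push_back(request);
    }

    fn next_batch(&mut self, max_batch_size: usize) -> Vec<String> {
        // Never try to drain more items than are currently queued.
        let n = max_batch_size.min(self.queue.len());
        self.queue.drain(..n).collect()
    }
}

// Callers depend only on the trait, so another implementation with a
// different batching policy could be swapped in without changing this code.
fn run_once(scheduler: &mut dyn Scheduler, max_batch_size: usize) -> Vec<String> {
    scheduler.next_batch(max_batch_size)
}
```

Because `run_once` takes `&mut dyn Scheduler`, a second implementation for a newer protocol version could coexist with the first, which is the kind of separation the commit message points at.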
-
- 09 Feb, 2024 1 commit
-
-
OlivierDehaene authored
Some hardware requires a maximum batch size.
-
- 22 Jan, 2024 1 commit
-
-
drbh authored
This PR just bumps to the latest Rust version and makes clippy happy

```bash
cargo clippy --all -- -D warnings
# Finished dev [unoptimized + debuginfo] target(s) in 0.10s
```
-
- 14 Dec, 2023 1 commit
-
-
OlivierDehaene authored
-
- 20 Oct, 2023 1 commit
-
-
OlivierDehaene authored
See #1049
---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Wang, Yi <yi.a.wang@intel.com>
-
- 24 Jul, 2023 1 commit
-
-
OlivierDehaene authored
Close #673
-
- 19 Jul, 2023 1 commit
-
-
OlivierDehaene authored
-
- 30 Jun, 2023 1 commit
-
-
OlivierDehaene authored
Closes #478
-
- 24 May, 2023 1 commit
-
-
OlivierDehaene authored
Closes #307 #308
-
- 10 May, 2023 1 commit
-
-
OlivierDehaene authored
-
- 27 Apr, 2023 1 commit
-
-
Ehsan M. Kermani authored
-
- 26 Apr, 2023 1 commit
-
-
Nicolas Patry authored
Co-authored-by: OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
-
- 24 Apr, 2023 1 commit
-
-
OlivierDehaene authored
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
-
- 21 Apr, 2023 1 commit
-
-
OlivierDehaene authored
-
- 09 Apr, 2023 1 commit
-
-
OlivierDehaene authored
-
- 28 Mar, 2023 1 commit
-
-
OlivierDehaene authored
-
- 13 Feb, 2023 1 commit
-
-
OlivierDehaene authored
-
- 31 Jan, 2023 3 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
Reverts huggingface/text-generation-inference#36
-
OlivierDehaene authored
Add token streaming using Server-Sent Events (SSE). The signature of the SSE events is:

```rust
struct Details {
    finish_reason: String,
    generated_tokens: u32,
    seed: Option<u64>,
}

struct StreamResponse {
    token: Token,
    generated_text: Option<String>,
    details: Option<Details>,
}

struct ErrorResponse {
    error: String,
}
```
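As a rough illustration of how such an event could be framed on the wire, the sketch below serializes a simplified `StreamResponse` into an SSE `data:` frame. It is not the server's implementation. The `Token` fields (`id`, `text`), the hand-written JSON encoding, and the helper names `escape_json`, `to_json`, and `sse_frame` are assumptions made for the example, and `details` is left out to keep it short. A real server would most likely rely on a JSON library instead.

```rust
// Hypothetical, simplified mirror of the structs in the commit message.
// The fields of `Token` are an assumption; the commit does not list them.
struct Token {
    id: u32,
    text: String,
}

struct StreamResponse {
    token: Token,
    generated_text: Option<String>,
}

// Escape the characters that must not appear raw inside a JSON string.
fn escape_json(s: &str) -> String {
    let mut out = String::with_capacity(s.len());
    for c in s.chars() {
        match c {
            '"' => out.push_str("\\\""),
            '\\' => out.push_str("\\\\"),
            '\n' => out.push_str("\\n"),
            '\r' => out.push_str("\\r"),
            '\t' => out.push_str("\\t"),
            c if (c as u32) < 0x20 => out.push_str(&format!("\\u{:04x}", c as u32)),
            c => out.push(c),
        }
    }
    out
}

// Hand-rolled JSON encoding of the simplified response.
fn to_json(r: &StreamResponse) -> String {
    let generated_text = match &r.generated_text {
        Some(t) => format!("\"{}\"", escape_json(t)),
        None => "null".to_string(),
    };
    format!(
        "{{\"token\":{{\"id\":{},\"text\":\"{}\"}},\"generated_text\":{}}}",
        r.token.id,
        escape_json(&r.token.text),
        generated_text
    )
}

// An SSE event is one or more `data:` lines terminated by a blank line.
fn sse_frame(json: &str) -> String {
    format!("data: {}\n\n", json)
}
```

A client reading the stream would split on blank lines, strip the `data: ` prefix, and decode each payload as JSON.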
-
- 27 Oct, 2022 1 commit
-
-
OlivierDehaene authored
-
- 22 Oct, 2022 1 commit
-
-
OlivierDehaene authored
-
- 20 Oct, 2022 1 commit
-
-
Olivier Dehaene authored
-
- 17 Oct, 2022 1 commit
-
-
Olivier Dehaene authored
-
- 11 Oct, 2022 1 commit
-
-
Olivier Dehaene authored
Added validation logic
-
- 08 Oct, 2022 1 commit
-
-
Olivier Dehaene authored
-