- 27 Mar, 2024 1 commit
-
-
Insu Jang authored
* Use self.[distribute_layers|get_stage_index] to exploit custom layer distribution * Change static methods for t5 layer distribution to member functions * Change static methods for whisper layer distribution to member functions * Replace whisper policy usage with self one * Fix test case to use non-static layer distribution methods * fix: fix typo --------- Co-authored-by:Wenhao Chen <cwher@outlook.com>
-
- 25 Mar, 2024 1 commit
-
-
Wenhao Chen authored
* fix: simplify merge_batch * fix: use return_outputs=False to eliminate extra memory consumption * feat: add return_outputs warning * style: remove `return_outputs=False` as it is the default value
-
- 20 Mar, 2024 1 commit
-
-
binmakeswell authored
* [doc] update open-sora demo * [doc] update open-sora demo * [doc] update open-sora demo
-
- 12 Mar, 2024 1 commit
-
-
digger yu authored
-
- 11 Mar, 2024 1 commit
-
-
Camille Zhong authored
-
- 07 Mar, 2024 1 commit
-
-
Camille Zhong authored
* add stream chat for chat version * remove os.system clear * modify function name
-
- 05 Mar, 2024 3 commits
-
-
hugo-syn authored
Signed-off-by:hugo-syn <hugo.vincent@synacktiv.com>
-
Dongruixuan Li authored
-
binmakeswell authored
* [doc] sora release * [doc] sora release * [doc] sora release * [doc] sora release
-
- 01 Mar, 2024 1 commit
-
-
Camille Zhong authored
-
- 28 Feb, 2024 1 commit
-
-
Tong Li authored
-
- 19 Feb, 2024 2 commits
-
-
CZYCW authored
Co-authored-by:binmakeswell <binmakeswell@gmail.com>
-
Hongxin Liu authored
* [llama] refactor inference example to fit sft * [llama] fix training script to fit gemini * [llama] fix inference script
-
- 07 Feb, 2024 6 commits
-
-
Hongxin Liu authored
-
Hongxin Liu authored
-
Hongxin Liu authored
-
Hongxin Liu authored
* [moe] add mixtral block for single expert * [moe] mixtral block fwd support uneven ep * [moe] mixtral block bwd support uneven ep * [moe] add mixtral moe layer * [moe] simplify replace * [meo] support save sharded mixtral * [meo] support load sharded mixtral * [meo] support save sharded optim * [meo] integrate moe manager into plug * [meo] fix optimizer load * [meo] fix mixtral layer
-
Hongxin Liu authored
* [moe] top2 allow uneven input * [moe] update capacity computing * [moe] remove debug info * [moe] update capacity computing * [moe] update capacity computing
-
Xuanlei Zhao authored
-
- 06 Feb, 2024 3 commits
-
-
Hongxin Liu authored
* [llama] fix memory issue * [llama] add comment
-
Hongxin Liu authored
-
Camille Zhong authored
-
- 05 Feb, 2024 4 commits
-
-
Camille Zhong authored
-
Hongxin Liu authored
-
Hongxin Liu authored
* [llama] update training script * [doc] polish docstr
-
Hongxin Liu authored
* [plugin] refactor prepare dataloader * [plugin] update train script
-
- 01 Feb, 2024 1 commit
-
-
YeAnbang authored
* fix script * fix script * fix chat nan * fix chat nan
-
- 25 Jan, 2024 1 commit
-
-
李文军 authored
[NFC] polish applications/Colossal-LLaMA-2/colossal_llama2/tokenizer/init_tokenizer.py code style (#5228)
-
- 22 Jan, 2024 1 commit
-
-
Desperado-Jia authored
-
- 18 Jan, 2024 1 commit
-
-
Michelle authored
-
- 11 Jan, 2024 1 commit
-
-
digger yu authored
-
- 10 Jan, 2024 1 commit
-
-
digger yu authored
-
- 09 Jan, 2024 1 commit
-
-
Hongxin Liu authored
* update accelerator * fix timer * fix amp * update * fix * update bug * add error raise * fix autocast * fix set device * remove doc accelerator * update doc * update doc * update doc * use nullcontext * update cpu * update null context * change time limit for example * udpate * update * update * update * [npu] polish accelerator code --------- Co-authored-by:
Xuanlei Zhao <xuanlei.zhao@gmail.com> Co-authored-by:
zxl <43881818+oahzxl@users.noreply.github.com>
-
- 08 Jan, 2024 1 commit
-
-
binmakeswell authored
* [doc] SwiftInfer release * [doc] SwiftInfer release * [doc] SwiftInfer release * [doc] SwiftInfer release * [doc] SwiftInfer release
-
- 07 Jan, 2024 2 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
binmakeswell authored
* [doc] add Colossal-LLaMA-2-13B * [doc] add Colossal-LLaMA-2-13B * [doc] add Colossal-LLaMA-2-13B
-
- 06 Jan, 2024 1 commit
-
-
Camille Zhong authored
* Update README.md * Update README.md
-
- 05 Jan, 2024 1 commit
-
-
Tong Li authored
* update readme * update readme * update link * update * update readme * update * update * update * update title * update example * update example * fix content * add conclusion * add license * update * update * update version * fix minor
-
- 22 Dec, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Xu <yuanchen.xu00@gmail.com>
-
- 20 Dec, 2023 1 commit
-
-
BlueRum authored
-