".github/vscode:/vscode.git/clone" did not exist on "03e52ecba3b60b04b552d82809043e5642509005"
- 12 Dec, 2023 1 commit
-
-
Yuanchen authored
* Support GSM, Data Leakage Evaluation and Tensor Parallel * remove redundant code and update inference.py in examples/gpt_evaluation --------- Co-authored-by:Xu Yuanchen <yuanchen.xu00@gmail.com>
-
- 11 Dec, 2023 1 commit
-
-
Michelle authored
* fix pangu api * add comment
-
- 07 Dec, 2023 1 commit
-
-
Yuanchen authored
* Add finetuning Colossal-Llama-2 example * Add finetuning Colossal-Llama-2 example 2 * Add finetuning Colossal-Llama-2 example and support NEFTuning * Add inference example and refine neftune * Modify readme file * update the imports --------- Co-authored-by:
Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by:
Camille Zhong <44392324+Camille7777@users.noreply.github.com>
-
- 01 Dec, 2023 1 commit
-
-
Michelle authored
* fix doc * modify doc
-
- 30 Nov, 2023 1 commit
-
-
Michelle authored
* refactor server and webui & add new feature * add requirements * modify readme and ui
-
- 29 Nov, 2023 2 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
digger yu authored
-
- 28 Nov, 2023 1 commit
-
-
Zian(Andy) Zheng authored
* add safetybench and cvalues(responsibility) eval dataset * Modify code according to review suggestions --------- Co-authored-by:Orion-Zheng <zhengzian@u.nus.edu>
-
- 27 Nov, 2023 1 commit
-
-
digger yu authored
-
- 23 Nov, 2023 1 commit
-
-
YeAnbang authored
* add langchain * add langchain * Add files via upload * add langchain * fix style * fix style: remove extra space * add pytest; modified retriever * add pytest; modified retriever * add tests to build_on_pr.yml * fix build_on_pr.yml * fix build on pr; fix environ vars * seperate unit tests for colossalqa from build from pr * fix container setting; fix environ vars * commented dev code * add incremental update * remove stale code * fix style * change to sha3 224 * fix retriever; fix style; add unit test for document loader * fix ci workflow config * fix ci workflow config * add set cuda visible device script in ci * fix doc string * fix style; update readme; refactored * add force log info * change build on pr, ignore colossalqa * fix docstring, captitalize all initial letters * fix indexing; fix text-splitter * remove debug code, update reference * reset previous commit * update LICENSE update README add key-value mode, fix bugs * add files back * revert force push * remove junk file * add test files * fix retriever bug, add intent classification * change conversation chain design * rewrite prompt and conversation chain * add ui v1 * ui v1 * fix atavar * add header * Refactor the RAG Code and support Pangu * Refactor the ColossalQA chain to Object-Oriented Programming and the UI demo. * resolved conversation. tested scripts under examples. web demo still buggy * fix ci tests * Some modifications to add ChatGPT api * modify llm.py and remove unnecessary files * Delete applications/ColossalQA/examples/ui/test_frontend_input.json * Remove OpenAI api key * add colossalqa * move files * move files * move files * move files * fix style * Add Readme and fix some bugs. * Add something to readme and modify some code * modify a directory name for clarity * remove redundant directory * Correct a type in llm.py * fix AI prefix * fix test_memory.py * fix conversation * fix some erros and typos * Fix a missing import in RAG_ChatBot.py * add colossalcloud LLM wrapper, correct issues in code review --------- Co-authored-by:
YeAnbang <anbangy2@outlook.com> Co-authored-by:
Orion-Zheng <zheng_zian@u.nus.edu> Co-authored-by:
Zian(Andy) Zheng <62330719+Orion-Zheng@users.noreply.github.com> Co-authored-by:
Orion-Zheng <zhengzian@u.nus.edu>
-
- 14 Nov, 2023 1 commit
-
-
Orion-Zheng authored
-
- 09 Nov, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Xu Yuanchen <yuanchen.xu00@gmail.com>
-
- 31 Oct, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Xu Yuanchen <yuanchen.xu00@gmail.com>
-
- 17 Oct, 2023 1 commit
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 16 Oct, 2023 1 commit
-
-
Zian(Andy) Zheng authored
To be compatible with the new change in the Transformers library, where a new argument 'padding_mask' was added to forward function of attention layer. https://github.com/huggingface/transformers/pull/25598
-
- 10 Oct, 2023 3 commits
-
-
Camille Zhong authored
-
Camille Zhong authored
-
Camille Zhong authored
add modelscope link
-
- 28 Sep, 2023 1 commit
-
-
Tong Li authored
-
- 27 Sep, 2023 3 commits
-
-
binmakeswell authored
-
Yuanchen authored
Co-authored-by:Xu Yuanchen <yuanchen.xu00@gmail.com>
-
flybird11111 authored
* [chat] fix gemini strategy * [chat] fix gemini strategy * [chat] fix gemini strategy * [chat] fix gemini strategy * g# This is a combination of 2 commits. [chat] fix gemini strategy fox * [chat] fix gemini strategy update llama2 example [chat] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * fix * fix * fix * fix * fix * Update train_prompts.py
-
- 26 Sep, 2023 3 commits
-
-
Chandler-Bing authored
change filename: pretraining.py -> trainin.py there is no file named pretraing.py. wrong writing
-
Tong Li authored
-
Tong Li authored
-
- 25 Sep, 2023 1 commit
-
-
binmakeswell authored
* [doc] add llama2 domain-specific solution news
-
- 24 Sep, 2023 2 commits
-
-
Yuanchen authored
* Add ColossalEval * Delete evaluate in Chat --------- Co-authored-by:
Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by:
Tong Li <tong.li352711588@gmail.com>
-
Tong Li authored
-
- 21 Sep, 2023 1 commit
-
-
Wenhao Chen authored
* feat: modify lora merge weights fn * feat: add lora merge weights config
-
- 20 Sep, 2023 1 commit
-
-
Wenhao Chen authored
* feat: modify forward fn of critic and reward model * feat: modify calc_action_log_probs * to: add wandb in sft and rm trainer * feat: update train_sft * feat: update train_rm * style: modify type annotation and add warning * feat: pass tokenizer to ppo trainer * to: modify trainer base and maker base * feat: add wandb in ppo trainer * feat: pass tokenizer to generate * test: update generate fn tests * test: update train tests * fix: remove action_mask * feat: remove unused code * fix: fix wrong ignore_index * fix: fix mock tokenizer * chore: update requirements * revert: modify make_experience * fix: fix inference * fix: add padding side * style: modify _on_learn_batch_end * test: use mock tokenizer * fix: use bf16 to avoid overflow * fix: fix workflow * [chat] fix gemini strategy * [chat] fix * sync: update colossalai strategy * fix: fix args and model dtype * fix: fix checkpoint test * fix: fix requirements * fix: fix missing import and wrong arg * fix: temporarily skip gemini test in stage 3 * style: apply pre-commit * fix: temporarily skip gemini test in stage 1&2 --------- Co-authored-by:Mingyan Jiang <1829166702@qq.com>
-
- 19 Sep, 2023 1 commit
-
-
Hongxin Liu authored
* [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format
-
- 15 Sep, 2023 1 commit
-
-
digger yu authored
Co-authored-by:flybird11111 <1829166702@qq.com>
-
- 30 Aug, 2023 1 commit
-
-
Ying Liu authored
-
- 29 Aug, 2023 1 commit
-
-
yingliu-hpc authored
* update configuration of chatglm and add support in coati * add unit test & update chatglm default config & fix bos index issue * remove chatglm due to oom * add dataset pkg in requirement-text * fix parameter issue in test_models * add ref in tokenize & rm unnessary parts * separate source & target tokenization in chatglm * add unit test to chatglm * fix test dataset issue * update truncation of chatglm * fix Colossalai version * fix colossal ai version in test
-
- 21 Aug, 2023 1 commit
-
-
Michelle authored
* update config and prompt * update config --------- Co-authored-by:Qianran Ma <qianranm@luchentech.com>
-
- 16 Aug, 2023 1 commit
-
-
Hongxin Liu authored
* [test] remove cpu marker * [test] remove gpu marker * [test] update pytest markers * [ci] update unit test ci
-
- 14 Aug, 2023 1 commit
-
-
Wenhao Chen authored
* style: apply formatter * fix: add outdated warnings * docs: add dataset format and polish * docs: polish README * fix: fix json format * fix: fix typos * revert: revert 7b example
-
- 02 Aug, 2023 1 commit
-
-
Wenhao Chen authored
* style: rename replay buffer Experience replay is typically for off policy algorithms. Use this name in PPO maybe misleading. * fix: fix wrong zero2 default arg * test: update experience tests * style: rename zero_pad fn * fix: defer init in CycledDataLoader * test: add benchmark test * style: rename internal fn of generation * style: rename internal fn of lora * fix: remove unused loss fn * fix: remove unused utils fn * refactor: remove generate_with_actor fn * fix: fix type annotation * test: add models tests * fix: skip llama due to long execution time * style: modify dataset * style: apply formatter * perf: update reward dataset * fix: fix wrong IGNORE_INDEX in sft dataset * fix: remove DataCollatorForSupervisedDataset * test: add dataset tests * style: apply formatter * style: rename test_ci to test_train * feat: add llama in inference * test: add inference tests * test: change test scripts directory * fix: update ci * fix: fix typo * fix: skip llama due to oom * fix: fix file mod * style: apply formatter * refactor: remove duplicated llama_gptq * style: apply formatter * to: update rm test * feat: add tokenizer arg * feat: add download model script * test: update train tests * fix: modify gemini load and save pretrained * test: update checkpoint io test * to: modify nproc_per_node * fix: do not remove existing dir * fix: modify save path * test: add random choice * fix: fix sft path * fix: enlarge nproc_per_node to avoid oom * fix: add num_retry * fix: make lora config of rm and critic consistent * fix: add warning about lora weights * fix: skip some gpt2 tests * fix: remove grad ckpt in rm and critic due to errors * refactor: directly use Actor in train_sft * test: add more arguments * fix: disable grad ckpt when using lora * fix: fix save_pretrained and related tests * test: enable zero2 tests * revert: remove useless fn * style: polish code * test: modify test args
-
- 01 Aug, 2023 1 commit
-
-
Wenhao Chen authored
-
- 28 Jul, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Yuanchen Xu <yuanchen.xu00@gmail.com>
-