- 20 Dec, 2023 1 commit
-
-
BlueRum authored
-
- 15 Dec, 2023 2 commits
-
-
flybird11111 authored
* fix aaa fix fix fix * fix * fix * test ci * fix ci fix * update pytorch version in documents
-
Yuanchen authored
Co-authored-by:Xu Yuanchen <yuanchen.xu00@gmail.com>
-
- 12 Dec, 2023 2 commits
-
-
flybird11111 authored
* fix aaa fix fix fix * fix * fix * test ci * fix ci fix * llama support dist-cross fix fix fix fix fix fix fix fix * fix * fix * fix fix * test ci * test ci * fix * [Colossal-Llama-2] Add finetuning Colossal-Llama-2 example (#4878) * Add finetuning Colossal-Llama-2 example * Add finetuning Colossal-Llama-2 example 2 * Add finetuning Colossal-Llama-2 example and support NEFTuning * Add inference example and refine neftune * Modify readme file * update the imports --------- Co-authored-by:
Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by:
Camille Zhong <44392324+Camille7777@users.noreply.github.com> * llama support dist-cross fix fix fix fix fix fix fix fix * fix * fix * fix fix * test ci * test ci * fix * fix ci * fix ci --------- Co-authored-by:
Yuanchen <70520919+chengeharrison@users.noreply.github.com> Co-authored-by:
Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by:
Camille Zhong <44392324+Camille7777@users.noreply.github.com>
-
Yuanchen authored
* Support GSM, Data Leakage Evaluation and Tensor Parallel * remove redundant code and update inference.py in examples/gpt_evaluation --------- Co-authored-by:Xu Yuanchen <yuanchen.xu00@gmail.com>
-
- 11 Dec, 2023 1 commit
-
-
Michelle authored
* fix pangu api * add comment
-
- 08 Dec, 2023 1 commit
-
-
flybird11111 authored
* fix aaa fix fix fix * fix * fix * test ci * fix ci fix
-
- 07 Dec, 2023 1 commit
-
-
Yuanchen authored
* Add finetuning Colossal-Llama-2 example * Add finetuning Colossal-Llama-2 example 2 * Add finetuning Colossal-Llama-2 example and support NEFTuning * Add inference example and refine neftune * Modify readme file * update the imports --------- Co-authored-by:
Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by:
Camille Zhong <44392324+Camille7777@users.noreply.github.com>
-
- 05 Dec, 2023 1 commit
-
-
flybird11111 authored
fix
-
- 01 Dec, 2023 1 commit
-
-
Michelle authored
* fix doc * modify doc
-
- 30 Nov, 2023 2 commits
-
-
Michelle authored
* refactor server and webui & add new feature * add requirements * modify readme and ui
-
flybird11111 authored
* fix 3d checkpoint load when booster boost without optimizer fix 3d checkpoint load when booster boost without optimizer * test ci * revert ci * fix fix
-
- 29 Nov, 2023 5 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
digger yu authored
-
Frank Lee authored
-
- 28 Nov, 2023 4 commits
-
-
binmakeswell authored
* [doc] add moe news * [doc] add moe news * [doc] add moe news
-
Wenhao Chen authored
* [shardformer] implement policy for all GPT-J models and test * [shardformer] support interleaved pipeline parallel for bert finetune * [shardformer] shardformer support falcon (#4883) * [shardformer]: fix interleaved pipeline for bert model (#5048) * [hotfix]: disable seq parallel for gptj and falcon, and polish code (#5093) * Add Mistral support for Shardformer (#5103) * [shardformer] add tests to mistral (#5105) --------- Co-authored-by:
Pengtai Xu <henryxu880@gmail.com> Co-authored-by:
ppt0011 <143150326+ppt0011@users.noreply.github.com> Co-authored-by:
flybird11111 <1829166702@qq.com> Co-authored-by:
eric8607242 <e0928021388@gmail.com>
-
アマデウス authored
-
Zian(Andy) Zheng authored
* add safetybench and cvalues(responsibility) eval dataset * Modify code according to review suggestions --------- Co-authored-by:Orion-Zheng <zhengzian@u.nus.edu>
-
- 27 Nov, 2023 1 commit
-
-
digger yu authored
-
- 24 Nov, 2023 1 commit
-
-
digger yu authored
-
- 23 Nov, 2023 2 commits
-
-
Xuanlei Zhao authored
-
YeAnbang authored
* add langchain * add langchain * Add files via upload * add langchain * fix style * fix style: remove extra space * add pytest; modified retriever * add pytest; modified retriever * add tests to build_on_pr.yml * fix build_on_pr.yml * fix build on pr; fix environ vars * seperate unit tests for colossalqa from build from pr * fix container setting; fix environ vars * commented dev code * add incremental update * remove stale code * fix style * change to sha3 224 * fix retriever; fix style; add unit test for document loader * fix ci workflow config * fix ci workflow config * add set cuda visible device script in ci * fix doc string * fix style; update readme; refactored * add force log info * change build on pr, ignore colossalqa * fix docstring, captitalize all initial letters * fix indexing; fix text-splitter * remove debug code, update reference * reset previous commit * update LICENSE update README add key-value mode, fix bugs * add files back * revert force push * remove junk file * add test files * fix retriever bug, add intent classification * change conversation chain design * rewrite prompt and conversation chain * add ui v1 * ui v1 * fix atavar * add header * Refactor the RAG Code and support Pangu * Refactor the ColossalQA chain to Object-Oriented Programming and the UI demo. * resolved conversation. tested scripts under examples. web demo still buggy * fix ci tests * Some modifications to add ChatGPT api * modify llm.py and remove unnecessary files * Delete applications/ColossalQA/examples/ui/test_frontend_input.json * Remove OpenAI api key * add colossalqa * move files * move files * move files * move files * fix style * Add Readme and fix some bugs. * Add something to readme and modify some code * modify a directory name for clarity * remove redundant directory * Correct a type in llm.py * fix AI prefix * fix test_memory.py * fix conversation * fix some erros and typos * Fix a missing import in RAG_ChatBot.py * add colossalcloud LLM wrapper, correct issues in code review --------- Co-authored-by:
YeAnbang <anbangy2@outlook.com> Co-authored-by:
Orion-Zheng <zheng_zian@u.nus.edu> Co-authored-by:
Zian(Andy) Zheng <62330719+Orion-Zheng@users.noreply.github.com> Co-authored-by:
Orion-Zheng <zhengzian@u.nus.edu>
-
- 22 Nov, 2023 5 commits
-
-
Xuanlei Zhao authored
* llama 3d * update * fix autocast
-
flybird11111 authored
* fix flash attn * fix fix
-
Zhongkai Zhao authored
* hotfix/Fix get model policy strategy in ShardFormer * fix bug in auto policy
-
flybird11111 authored
[gemini]fix gemini optimzer, saving Shardformer in Gemini got list assignment index out of range (#5085)
-
digger yu authored
-
- 21 Nov, 2023 3 commits
-
-
digger yu authored
-
Jun Gao authored
-
Hongxin Liu authored
* [setup] refactor infer setup * [hotfix] fix infenrece behavior on 1 1 gpu * [exmaple] refactor inference examples
-
- 20 Nov, 2023 7 commits
-
-
Bin Jia authored
* fix init model with random parameters * fix example
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
Xu Kai authored
* update examples and engine * fix choices * update example
-
Bin Jia authored
-
Hongxin Liu authored
* [npu] setup device utils (#5047) * [npu] add npu device support * [npu] support low level zero * [test] update npu zero plugin test * [hotfix] fix import * [test] recover tests * [npu] gemini support npu (#5052) * [npu] refactor device utils * [gemini] support npu * [example] llama2+gemini support npu * [kernel] add arm cpu adam kernel (#5065) * [kernel] add arm cpu adam * [optim] update adam optimizer * [kernel] arm cpu adam remove bf16 support
-
Hongxin Liu authored
-
Cuiqing Li (李崔卿) authored
* added flash-decoidng of triton based on lightllm kernel * add req * clean * clean * delete build.sh --------- Co-authored-by:cuiqing.li <lixx336@gmail.com>
-