1. 12 Dec, 2023 1 commit
  2. 11 Dec, 2023 1 commit
  3. 08 Dec, 2023 1 commit
  4. 07 Dec, 2023 1 commit
  5. 05 Dec, 2023 1 commit
  6. 01 Dec, 2023 1 commit
  7. 30 Nov, 2023 2 commits
  8. 29 Nov, 2023 5 commits
  9. 28 Nov, 2023 4 commits
  10. 27 Nov, 2023 1 commit
  11. 24 Nov, 2023 1 commit
  12. 23 Nov, 2023 2 commits
    • remove duplicate import (#5100) · 68fcaa22
      Xuanlei Zhao authored
    • [Feature] Add document retrieval QA (#5020) · e53e729d
      YeAnbang authored
      
      
      * add langchain
      
      * add langchain
      
      * Add files via upload
      
      * add langchain
      
      * fix style
      
      * fix style: remove extra space
      
      * add pytest; modified retriever
      
      * add pytest; modified retriever
      
      * add tests to build_on_pr.yml
      
      * fix build_on_pr.yml
      
      * fix build on pr; fix environ vars
      
      * separate unit tests for colossalqa from build from pr
      
      * fix container setting; fix environ vars
      
      * commented dev code
      
      * add incremental update
      
      * remove stale code
      
      * fix style
      
      * change to sha3 224
      
      * fix retriever; fix style; add unit test for document loader
      
      * fix ci workflow config
      
      * fix ci workflow config
      
      * add set cuda visible device script in ci
      
      * fix doc string
      
      * fix style; update readme; refactored
      
      * add force log info
      
      * change build on pr, ignore colossalqa
      
      * fix docstring, capitalize all initial letters
      
      * fix indexing; fix text-splitter
      
      * remove debug code, update reference
      
      * reset previous commit
      
      * update LICENSE update README add key-value mode, fix bugs
      
      * add files back
      
      * revert force push
      
      * remove junk file
      
      * add test files
      
      * fix retriever bug, add intent classification
      
      * change conversation chain design
      
      * rewrite prompt and conversation chain
      
      * add ui v1
      
      * ui v1
      
      * fix avatar
      
      * add header
      
      * Refactor the RAG Code and support Pangu
      
      * Refactor the ColossalQA chain to Object-Oriented Programming and the UI demo.
      
      * resolved conversation. tested scripts under examples. web demo still buggy
      
      * fix ci tests
      
      * Some modifications to add ChatGPT api
      
      * modify llm.py and remove unnecessary files
      
      * Delete applications/ColossalQA/examples/ui/test_frontend_input.json
      
      * Remove OpenAI api key
      
      * add colossalqa
      
      * move files
      
      * move files
      
      * move files
      
      * move files
      
      * fix style
      
      * Add Readme and fix some bugs.
      
      * Add something to readme and modify some code
      
      * modify a directory name for clarity
      
      * remove redundant directory
      
      * Correct a type in  llm.py
      
      * fix AI prefix
      
      * fix test_memory.py
      
      * fix conversation
      
      * fix some errors and typos
      
      * Fix a missing import in RAG_ChatBot.py
      
      * add colossalcloud LLM wrapper, correct issues in code review
      
      ---------
      Co-authored-by: YeAnbang <anbangy2@outlook.com>
      Co-authored-by: Orion-Zheng <zheng_zian@u.nus.edu>
      Co-authored-by: Zian(Andy) Zheng <62330719+Orion-Zheng@users.noreply.github.com>
      Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>
  13. 22 Nov, 2023 5 commits
  14. 21 Nov, 2023 3 commits
  15. 20 Nov, 2023 7 commits
  16. 19 Nov, 2023 1 commit
    • [inference] Refactor inference architecture (#5057) · fd6482ad
      Xu Kai authored
      
      
      * [inference] support only TP (#4998)
      
      * support only tp
      
      * enable tp
      
      * add support for bloom (#5008)
      
      * [refactor] refactor gptq and smoothquant llama (#5012)
      
      * refactor gptq and smoothquant llama
      
      * fix import error
      
      * fix linear import torch-int
      
      * fix smoothquant llama import error
      
      * fix import accelerate error
      
      * fix bug
      
      * fix import smooth cuda
      
      * fix smoothcuda
      
      * [Inference Refactor] Merge chatglm2 with pp and tp (#5023)
      
      merge chatglm with pp and tp
      
      * [Refactor] remove useless inference code (#5022)
      
      * remove useless code
      
      * fix quant model
      
      * fix test import bug
      
      * mv original inference legacy
      
      * fix chatglm2
      
      * [Refactor] refactor policy search and quant type controlling in inference (#5035)
      
      * [Refactor] refactor policy search and quant type controlling in inference
      
      * [inference] update readme (#5051)
      
      * update readme
      
      * update readme
      
      * fix architecture
      
      * fix table
      
      * fix table
      
      * [inference] update example (#5053)
      
      * update example
      
      * fix run.sh
      
      * fix rebase bug
      
      * fix some errors
      
      * update readme
      
      * add some features
      
      * update interface
      
      * update readme
      
      * update benchmark
      
      * add requirements-infer
      
      ---------
      Co-authored-by: Bin Jia <45593998+FoolPlayer@users.noreply.github.com>
      Co-authored-by: Zhongkai Zhao <kanezz620@gmail.com>
  17. 18 Nov, 2023 1 commit
  18. 17 Nov, 2023 1 commit
  19. 16 Nov, 2023 1 commit