"vscode:/vscode.git/clone" did not exist on "40e53d65cbb8b609a6ff8e977d2318044d0f0ee0"
  1. 17 Jan, 2024 1 commit
    • Mo Li's avatar
      Added support for multi-needle testing in needle-in-a-haystack test (#802) · acae5609
      Mo Li authored
      
      
      * Add NeedleInAHaystack Test
      
      * Apply pre-commit formatting
      
      * Update configs/eval_hf_internlm_chat_20b_cdme.py
      Co-authored-by: default avatarSongyang Zhang <tonysy@users.noreply.github.com>
      
      * add needle in haystack test
      
      * update needle in haystack test
      
      * update plot function in tools_needleinahaystack.py
      
      * optimizing needleinahaystack dataset generation strategy
      
      * modify minor formatting issues
      
      * add English version support
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * fix needleinahaystack test eval bug
      
      * fix needleinahaystack config bug
      
      * Added support for multi-needle testing in needle-in-a-haystack test
      
      * Optimize the code for plotting in the needle-in-a-haystack test.
      
      * Correct the typo in the dataset parameters.
      
      * update needleinahaystack test docs
      
      ---------
      Co-authored-by: default avatarSongyang Zhang <tonysy@users.noreply.github.com>
      acae5609
  2. 25 Dec, 2023 1 commit
  3. 08 Dec, 2023 1 commit
  4. 23 Nov, 2023 1 commit
  5. 22 Nov, 2023 1 commit
  6. 21 Nov, 2023 1 commit
  7. 06 Nov, 2023 1 commit
  8. 27 Oct, 2023 1 commit
  9. 07 Oct, 2023 1 commit
  10. 22 Sep, 2023 1 commit
  11. 18 Sep, 2023 1 commit
    • philipwangOvO's avatar
      [Docs] Readme in longeval (#389) · f57c0702
      philipwangOvO authored
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      f57c0702
  12. 15 Sep, 2023 1 commit
    • Hubert's avatar
      [Feat] implementation for support promptbench (#239) · a11cb45c
      Hubert authored
      * [Feat] support adv_glue dataset for adversarial robustness
      
      * reorg files
      
      * minor fix
      
      * minor fix
      
      * support prompt bench demo
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      a11cb45c
  13. 12 Sep, 2023 1 commit
  14. 07 Sep, 2023 1 commit
  15. 23 Aug, 2023 1 commit
  16. 17 Aug, 2023 1 commit
    • Ezra-Yu's avatar
      [Feat] Add codegeex2 and Humanevalx (#210) · 17ccaa59
      Ezra-Yu authored
      * add codegeex2
      
      * add humanevalx dataset
      
      * add evaluator
      
      * update evaluator
      
      * update configs
      
      * update clean code
      
      * update configs
      
      * fix lint
      
      * remove sleep
      
      * fix lint
      
      * update docs
      
      * fix lint
      17ccaa59
  17. 11 Aug, 2023 1 commit
  18. 10 Aug, 2023 2 commits
  19. 01 Aug, 2023 1 commit
  20. 13 Jul, 2023 1 commit
  21. 06 Jul, 2023 3 commits
  22. 04 Jul, 2023 1 commit