1. 28 Mar, 2024 1 commit
  2. 13 Mar, 2024 1 commit
  3. 11 Mar, 2024 1 commit
  4. 06 Mar, 2024 1 commit
  5. 01 Feb, 2024 1 commit
  6. 18 Jan, 2024 2 commits
  7. 17 Jan, 2024 2 commits
    • Mo Li's avatar
      Added support for multi-needle testing in needle-in-a-haystack test (#802) · acae5609
      Mo Li authored
      
      
      * Add NeedleInAHaystack Test
      
      * Apply pre-commit formatting
      
      * Update configs/eval_hf_internlm_chat_20b_cdme.py
      Co-authored-by: default avatarSongyang Zhang <tonysy@users.noreply.github.com>
      
      * add needle in haystack test
      
      * update needle in haystack test
      
      * update plot function in tools_needleinahaystack.py
      
      * optimizing needleinahaystack dataset generation strategy
      
      * modify minor formatting issues
      
      * add English version support
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * fix needleinahaystack test eval bug
      
      * fix needleinahaystack config bug
      
      * Added support for multi-needle testing in needle-in-a-haystack test
      
      * Optimize the code for plotting in the needle-in-a-haystack test.
      
      * Correct the typo in the dataset parameters.
      
      * update needleinahaystack test docs
      
      ---------
      Co-authored-by: default avatarSongyang Zhang <tonysy@users.noreply.github.com>
      acae5609
    • RunningLeon's avatar
      [Feature] Update evaluate turbomind (#804) · 0836aec6
      RunningLeon authored
      * update
      
      * fix
      
      * fix
      
      * fix
      0836aec6
  8. 08 Jan, 2024 1 commit
  9. 25 Dec, 2023 1 commit
  10. 23 Dec, 2023 1 commit
  11. 21 Dec, 2023 1 commit
  12. 19 Dec, 2023 2 commits
  13. 15 Dec, 2023 1 commit
  14. 13 Dec, 2023 1 commit
  15. 12 Dec, 2023 1 commit
  16. 11 Dec, 2023 1 commit
  17. 08 Dec, 2023 1 commit
  18. 23 Nov, 2023 1 commit
  19. 22 Nov, 2023 1 commit
  20. 21 Nov, 2023 2 commits
  21. 16 Nov, 2023 1 commit
  22. 10 Nov, 2023 1 commit
  23. 27 Oct, 2023 1 commit
  24. 25 Oct, 2023 1 commit
  25. 07 Oct, 2023 1 commit
  26. 22 Sep, 2023 1 commit
  27. 18 Sep, 2023 1 commit
    • philipwangOvO's avatar
      [Docs] Readme in longeval (#389) · f57c0702
      philipwangOvO authored
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      f57c0702
  28. 15 Sep, 2023 1 commit
    • Hubert's avatar
      [Feat] implementation for support promptbench (#239) · a11cb45c
      Hubert authored
      * [Feat] support adv_glue dataset for adversarial robustness
      
      * reorg files
      
      * minor fix
      
      * minor fix
      
      * support prompt bench demo
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      a11cb45c
  29. 07 Sep, 2023 1 commit
  30. 06 Sep, 2023 1 commit
  31. 17 Aug, 2023 1 commit
    • Ezra-Yu's avatar
      [Feat] Add codegeex2 and Humanevalx (#210) · 17ccaa59
      Ezra-Yu authored
      * add codegeex2
      
      * add humanevalx dataset
      
      * add evaluator
      
      * update evaluator
      
      * update configs
      
      * update clean code
      
      * update configs
      
      * fix lint
      
      * remove sleep
      
      * fix lint
      
      * update docs
      
      * fix lint
      17ccaa59
  32. 10 Aug, 2023 2 commits
  33. 06 Jul, 2023 2 commits
  34. 05 Jul, 2023 1 commit