  1. 17 Jan, 2024 2 commits
    • Added support for multi-needle testing in needle-in-a-haystack test (#802) · acae5609
      Mo Li authored
      
      
      * Add NeedleInAHaystack Test
      
      * Apply pre-commit formatting
      
      * Update configs/eval_hf_internlm_chat_20b_cdme.py
      Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
      
      * add needle in haystack test
      
      * update needle in haystack test
      
      * update plot function in tools_needleinahaystack.py
      
      * optimizing needleinahaystack dataset generation strategy
      
      * modify minor formatting issues
      
      * add English version support
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * fix needleinahaystack test eval bug
      
      * fix needleinahaystack config bug
      
      * Added support for multi-needle testing in needle-in-a-haystack test
      
      * Optimize the code for plotting in the needle-in-a-haystack test.
      
      * Correct the typo in the dataset parameters.
      
      * update needleinahaystack test docs
      
      ---------
      Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
    • [Feature] Update evaluate turbomind (#804) · 0836aec6
      RunningLeon authored
      * update
      
      * fix
      
      * fix
      
      * fix
  2. 08 Jan, 2024 2 commits
  3. 05 Jan, 2024 1 commit
  4. 25 Dec, 2023 1 commit
  5. 23 Dec, 2023 1 commit
  6. 21 Dec, 2023 1 commit
  7. 19 Dec, 2023 2 commits
  8. 15 Dec, 2023 1 commit
  9. 13 Dec, 2023 1 commit
  10. 12 Dec, 2023 1 commit
  11. 11 Dec, 2023 1 commit
  12. 08 Dec, 2023 1 commit
  13. 23 Nov, 2023 2 commits
  14. 22 Nov, 2023 2 commits
  15. 21 Nov, 2023 2 commits
  16. 16 Nov, 2023 1 commit
  17. 14 Nov, 2023 1 commit
  18. 13 Nov, 2023 2 commits
    • [Doc] Update README (#582) · 01a0f2f3
      Songyang Zhang authored
    • [Feature] Use dataset in local path (#570) · 689ffe5b
      Fengzhe Zhou authored
      * update commonsenseqa
      
      * update drop
      
      * update flores_first100
      
      * update gsm8k
      
      * update humaneval
      
      * update lambda
      
      * update obqa
      
      * update piqa
      
      * update race
      
      * update siqa
      
      * update story_cloze
      
      * update strategyqa
      
      * update tydiqa
      
      * update winogrande
      
      * update doc
      
      * update hellaswag
      
      * fix obqa
      
      * update collections
      
      * update .zip name
  19. 10 Nov, 2023 1 commit
  20. 07 Nov, 2023 1 commit
  21. 06 Nov, 2023 1 commit
  22. 02 Nov, 2023 1 commit
  23. 27 Oct, 2023 2 commits
  24. 25 Oct, 2023 1 commit
  25. 09 Oct, 2023 1 commit
  26. 07 Oct, 2023 3 commits
  27. 22 Sep, 2023 1 commit
  28. 19 Sep, 2023 1 commit
  29. 18 Sep, 2023 1 commit
    • [Docs] Readme in longeval (#389) · f57c0702
      philipwangOvO authored
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
  30. 15 Sep, 2023 1 commit
    • [Feat] implementation for support promptbench (#239) · a11cb45c
      Hubert authored
      * [Feat] support adv_glue dataset for adversarial robustness
      
      * reorg files
      
      * minor fix
      
      * minor fix
      
      * support prompt bench demo
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix