1. 28 Apr, 2024 2 commits
  2. 26 Apr, 2024 2 commits
  3. 19 Apr, 2024 1 commit
  4. 02 Apr, 2024 1 commit
  5. 28 Mar, 2024 1 commit
  6. 13 Mar, 2024 1 commit
  7. 11 Mar, 2024 1 commit
  8. 06 Mar, 2024 1 commit
  9. 01 Feb, 2024 1 commit
  10. 18 Jan, 2024 2 commits
  11. 17 Jan, 2024 2 commits
    • Added support for multi-needle testing in needle-in-a-haystack test (#802) · acae5609
      Mo Li authored
      
      
      * Add NeedleInAHaystack Test
      
      * Apply pre-commit formatting
      
      * Update configs/eval_hf_internlm_chat_20b_cdme.py
      Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
      
      * add needle in haystack test
      
      * update needle in haystack test
      
      * update plot function in tools_needleinahaystack.py
      
      * optimizing needleinahaystack dataset generation strategy
      
      * modify minor formatting issues
      
      * add English version support
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * fix needleinahaystack test eval bug
      
      * fix needleinahaystack config bug
      
      * Added support for multi-needle testing in needle-in-a-haystack test
      
      * Optimize the code for plotting in the needle-in-a-haystack test.
      
      * Correct the typo in the dataset parameters.
      
      * update needleinahaystack test docs
      
      ---------
      Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
    • [Feature] Update evaluate turbomind (#804) · 0836aec6
      RunningLeon authored
      * update
      
      * fix
      
      * fix
      
      * fix
  12. 08 Jan, 2024 1 commit
  13. 05 Jan, 2024 1 commit
  14. 25 Dec, 2023 1 commit
  15. 23 Dec, 2023 1 commit
  16. 21 Dec, 2023 1 commit
  17. 19 Dec, 2023 2 commits
  18. 15 Dec, 2023 1 commit
  19. 13 Dec, 2023 1 commit
  20. 12 Dec, 2023 1 commit
  21. 11 Dec, 2023 1 commit
  22. 08 Dec, 2023 1 commit
  23. 23 Nov, 2023 1 commit
  24. 22 Nov, 2023 1 commit
  25. 21 Nov, 2023 2 commits
  26. 16 Nov, 2023 1 commit
  27. 14 Nov, 2023 1 commit
  28. 10 Nov, 2023 1 commit
  29. 27 Oct, 2023 1 commit
  30. 25 Oct, 2023 1 commit
  31. 07 Oct, 2023 1 commit
  32. 22 Sep, 2023 1 commit
  33. 18 Sep, 2023 1 commit
    • [Docs] Readme in longeval (#389) · f57c0702
      philipwangOvO authored
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
      
      * [Docs] Readme in longeval
  34. 15 Sep, 2023 1 commit
    • [Feat] implementation for support promptbench (#239) · a11cb45c
      Hubert authored
      * [Feat] support adv_glue dataset for adversarial robustness
      
      * reorg files
      
      * minor fix
      
      * minor fix
      
      * support prompt bench demo
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix
      
      * minor fix