1. 17 Jan, 2024 1 commit
    • Mo Li's avatar
      Added support for multi-needle testing in needle-in-a-haystack test (#802) · acae5609
      Mo Li authored
      
      
      * Add NeedleInAHaystack Test
      
      * Apply pre-commit formatting
      
      * Update configs/eval_hf_internlm_chat_20b_cdme.py
      Co-authored-by: default avatarSongyang Zhang <tonysy@users.noreply.github.com>
      
      * add needle in haystack test
      
      * update needle in haystack test
      
      * update plot function in tools_needleinahaystack.py
      
      * optimizing needleinahaystack dataset generation strategy
      
      * modify minor formatting issues
      
      * add English version support
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * change NeedleInAHaystackDataset to dynamic loading
      
      * fix needleinahaystack test eval bug
      
      * fix needleinahaystack config bug
      
      * Added support for multi-needle testing in needle-in-a-haystack test
      
      * Optimize the code for plotting in the needle-in-a-haystack test.
      
      * Correct the typo in the dataset parameters.
      
      * update needleinahaystack test docs
      
      ---------
      Co-authored-by: default avatarSongyang Zhang <tonysy@users.noreply.github.com>
      acae5609
  2. 16 Jan, 2024 1 commit
  3. 12 Jan, 2024 1 commit
  4. 11 Jan, 2024 1 commit
  5. 09 Jan, 2024 1 commit
  6. 08 Jan, 2024 3 commits
  7. 05 Jan, 2024 2 commits
  8. 04 Jan, 2024 1 commit
  9. 02 Jan, 2024 1 commit
  10. 01 Jan, 2024 2 commits
  11. 29 Dec, 2023 3 commits
  12. 28 Dec, 2023 2 commits
  13. 27 Dec, 2023 3 commits
  14. 26 Dec, 2023 1 commit
  15. 25 Dec, 2023 1 commit
  16. 23 Dec, 2023 1 commit
  17. 20 Dec, 2023 2 commits
  18. 19 Dec, 2023 2 commits
  19. 14 Dec, 2023 1 commit
  20. 13 Dec, 2023 1 commit
  21. 12 Dec, 2023 1 commit
  22. 11 Dec, 2023 2 commits
  23. 09 Dec, 2023 1 commit
  24. 08 Dec, 2023 1 commit
  25. 06 Dec, 2023 1 commit
    • bittersweet1999's avatar
      New subjective judgement (#660) · 1c95790f
      bittersweet1999 authored
      
      
      * TabMWP
      
      * TabMWP
      
      * fixed
      
      * fixed
      
      * fixed
      
      * done
      
      * done
      
      * done
      
      * add new subjective judgement
      
      * add new subjective judgement
      
      * add new subjective judgement
      
      * add new subjective judgement
      
      * add new subjective judgement
      
      * modified to a more general way
      
      * modified to a more general way
      
      * final
      
      * final
      
      * add summarizer
      
      * add new summarize
      
      * fixed
      
      * fixed
      
      * fixed
      
      ---------
      Co-authored-by: default avatarcaomaosong <caomaosong@pjlab.org.cn>
      1c95790f
  26. 01 Dec, 2023 3 commits