  1. 21 May, 2024 2 commits
    • Update MathBench (#1176) · 1448be00
      liushz authored
      
      
      * Add Math Evaluation with Judge Model Evaluator
      
      * Add Math Evaluation with Judge Model Evaluator
      
      * Add Math Evaluation with Judge Model Evaluator
      
      * Add Math Evaluation with Judge Model Evaluator
      
      * Fix Llama-3 meta template
      
      * Fix MATH with JudgeLM Evaluation
      
      * Fix MATH with JudgeLM Evaluation
      
      * Fix MATH with JudgeLM Evaluation
      
      * Fix MATH with JudgeLM Evaluation
      
      * Update accelerator
      
      * Update MathBench
      
      ---------
      Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
    • [Sync] update evaluator (#1175) · 2b3d4150
      Fengzhe Zhou authored
  2. 20 May, 2024 1 commit
  3. 17 May, 2024 1 commit
  4. 16 May, 2024 1 commit
    • update test workflow (#1167) · 94eb9056
      zhulinJulia24 authored
      
      
      * Update pr-run-test.yml
      
      * Update daily-run-test.yml
      
      * Update daily-run-test.yml
      
      * Update pr-run-test.yml
      
      * Update daily-run-test.yml
      
      * Update daily-run-test.yml
      
      * Update daily-run-test.yml
      
      * Update daily-run-test.yml
      
      * Update oc_score_baseline.yaml
      
      * Update daily-run-test.yml
      
      * Update oc_score_assert.py
      
      ---------
      Co-authored-by: zhulin1 <zhulin1@pjlab.org.cn>
  5. 15 May, 2024 5 commits
  6. 14 May, 2024 4 commits
  7. 13 May, 2024 2 commits
  8. 11 May, 2024 1 commit
  9. 09 May, 2024 2 commits
  10. 08 May, 2024 3 commits
  11. 06 May, 2024 5 commits
  12. 30 Apr, 2024 3 commits
  13. 29 Apr, 2024 3 commits
  14. 28 Apr, 2024 5 commits
  15. 26 Apr, 2024 2 commits