• Leo Gao's avatar
    Massive refactor · 778e0f91
    Leo Gao authored
    - Extract evaluator (still needs work to clean up)
    - Add tests for evaluator
    - Fix all the things that break on the new tests
    - Misc cleanup
    778e0f91
test_tasks.py 1.66 KB