Unverified Commit dea8d7bd authored by Hailey Schoelkopf, committed by GitHub

Update README.md

parent c9bbec6e
# Language Model Evaluation Harness
## Announcement
**A new v0.4.0 release of lm-evaluation-harness is available!**
New updates and features include:
- Internal refactoring
- Config-based task creation and configuration
- Easier import of externally defined task config files (via `--include_path`, or by passing a path to a YAML file directly; see the sketch after this list)
- Support for Jinja2 prompt design, easy modification of prompts, and prompt imports from Promptsource
- More advanced configuration options, including output post-processing, answer extraction, multiple LM generations per document, configurable few-shot settings, and more
- Speedups and support for new modeling libraries, including faster data-parallel HF model usage, vLLM support, MPS support with HuggingFace, and more
- Logging and usability changes
- New tasks, including CoT BIG-Bench-Hard, Belebele, user-defined task groupings, and more
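
For orientation, here is a minimal sketch of running an evaluation through the Python API, assuming the v0.4.0 `lm_eval.simple_evaluate` entry point; exact argument names and available backends may differ in your installed version, so treat the documentation in `docs/` as authoritative:

```python
# Minimal sketch, assuming the v0.4.0 `lm_eval.simple_evaluate` entry point;
# argument names and available backends may differ in your installed version.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                      # HuggingFace backend; vLLM is also supported
    model_args="pretrained=EleutherAI/pythia-160m",  # example checkpoint, chosen for illustration
    tasks=["hellaswag"],                             # any registered task names work here
    num_fewshot=0,                                   # configurable few-shot setting
)
print(results["results"])                            # per-task metrics
```

The equivalent command-line invocation goes through the `lm_eval` console script, where externally defined YAML task configs can be picked up with `--include_path` as noted above.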
Please see our updated documentation pages in `docs/` for more details.
Development will continue on the `main` branch, and we encourage you to give us feedback on desired features and ways to improve the library further, or to ask questions, either in issues or PRs on GitHub, or in the [EleutherAI discord](https://discord.gg/eleutherai)!
## Overview
This project provides a unified framework to test generative language models on a large number of different evaluation tasks.
......