-
Cameron7195 authored
* Created a new task for gsm8k which corresponds to the cot settings and prompt formatting described by Meta to evaluate Llama. Useful for replicating Llama performance on GSM8K benchmark. * fixing formatting * fixing formatting
cd35aecb