Created a new task for gsm8k which corresponds to the Llama cot settings… (#2215)
* Created a new task for gsm8k which corresponds to the cot settings and prompt formatting described by Meta to evaluate Llama. Useful for replicating Llama performance on GSM8K benchmark. * fixing formatting * fixing formatting
Showing
Please register or sign in to comment