lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Stars: 97
Forks: 31
Language: Python
Last Updated: May 28, 2024

Similar Repos