Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Stars

4

Forks

1

Language

Python

Last Updated

Aug 07, 2023

Similar Repos