attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Stars

479

Forks

37

Language

Python

Last Updated

May 17, 2024

Similar Repos