InfMoE

Inference framework for MoE layers based on TensorRT with Python binding

Stars

41

Forks

5

Language

C++

Last Updated

Apr 25, 2024

Similar Repos