Easy and Efficient Transformer : Scalable Inference Solution For Large NLP model
-
Updated
Aug 6, 2024 - Python
Easy and Efficient Transformer : Scalable Inference Solution For Large NLP model
This is a course project created for Advance topics in NLP
Add a description, image, and links to the bert-inference-performance topic page so that developers can more easily learn about it.
To associate your repository with the bert-inference-performance topic, visit your repo's landing page and select "manage topics."