Skip to content

Fast BPE algorithm to generate byte pair encodings from text corpus, it's written in rust and approximately 20x faster than it's python implementation

Notifications You must be signed in to change notification settings

IAmPara0x/fast-bpe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fast Byte Pair Encoding

This contains the bpe_train which trains on the text corpus to generate byte pairs. It's approximately 20x faster than the version written in python!!.

About

Fast BPE algorithm to generate byte pair encodings from text corpus, it's written in rust and approximately 20x faster than it's python implementation

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published