Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made available to me through the TPU Research Cloud Program.
- An assortment of Hebrew corpora, which I have made available here
- oscar / unshuffled_deduplicated_he - Homepage | Dataset Permalink
The Open Super-large Crawled ALMAnaCH coRpus is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.
- Model configs
- Available on Huggingface
- A Google Colab Notebook is available here
- Model configs
- Available on Huggingface
- A Google Colab Notebook is available here
- Model configs
- Available on Huggingface
- A Google Colab Notebook is available here
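
Since the models are available on Huggingface, a minimal sketch of generating text from one of them with the `transformers` library might look like the following. The helper name `generate_hebrew`, the sampling settings, and the placeholder model id in the usage comment are assumptions for illustration and do not come from this README; substitute the id shown on the model's Huggingface page.

```python
def generate_hebrew(model_id: str, prompt: str, max_new_tokens: int = 50) -> str:
    """Return a sampled continuation of `prompt` from a causal LM on the Hugging Face Hub.

    `model_id` is whatever id the model's Hub page shows; none is assumed here.
    """
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize the prompt into PyTorch tensors (input_ids and attention_mask).
    inputs = tokenizer(prompt, return_tensors="pt")

    # Nucleus sampling; these values are illustrative, not the author's settings.
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        top_p=0.95,
        pad_token_id=tokenizer.eos_token_id,
    )
    # The decoded string contains the prompt followed by the generated text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Usage (placeholder id, replace with the real one from the model page):
# print(generate_hebrew("<namespace>/<hebrew-gpt-neo-model>", "שלום, קוראים לי"))
```

The larger checkpoints may need a GPU or reduced precision to run comfortably; the Colab notebooks linked above are the author's own reference for that.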