Fine Tuning with Arabic #33
Comments
@gwkrsrch, any help please?
Hi @Mahmuod1, there are several options you can take. You may modify the layout/textbox generation modules to produce the desired RTL layout. There would be several lines of code to modify, e.g., in textbox and layouts.
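As a rough illustration of the RTL layout idea, here is a minimal sketch that places glyph boxes from the right edge toward the left. `place_rtl` is a hypothetical helper, not part of SynthDoG, and it assumes the glyph widths are already known; it ignores Arabic contextual shaping and ligatures, which a real renderer would also have to handle.

```python
from typing import List, Tuple


def place_rtl(widths: List[float], right_edge: float,
              gap: float = 0.0) -> List[Tuple[float, float]]:
    """Return (left, right) x-extents for glyphs laid out right to left.

    `widths` is in logical (reading) order: the first glyph sits flush
    against `right_edge` and each later glyph extends further left.
    This is a hypothetical helper, not a SynthDoG function.
    """
    boxes = []
    cursor = right_edge
    for w in widths:
        left = cursor - w
        boxes.append((left, cursor))
        cursor = left - gap  # move the pen leftwards for the next glyph
    return boxes


# Three glyphs of widths 10, 20 and 5 on a line whose right edge is x=100.
boxes = place_rtl([10.0, 20.0, 5.0], right_edge=100.0, gap=2.0)
```

The only difference from an LTR layout here is that the cursor starts at the right edge and decreases, so an existing LTR textbox routine could in principle be adapted by mirroring its cursor logic.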
Then, using a perspective transformation (or other transformations), you can embed the synthetic paper into a background. Although the idea is simple, you should see reasonable results. You may further enhance the quality of the generated samples via various techniques, but that is optional. Hope this helps :) Feel free to reopen this or open another issue if you have anything new to share.
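To make the perspective step concrete, the sketch below estimates a 3x3 homography from the four corners of a synthetic page to four target corners on a background, then maps points through it. It is pure Python and only transforms coordinates; warping actual pixels would normally be left to an image library. The function names and the corner values are illustrative assumptions, not SynthDoG code.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def homography(src: Sequence[Point], dst: Sequence[Point]) -> List[List[float]]:
    """Homography mapping 4 src points to 4 dst points (h33 fixed to 1)."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        a.append([x, y, 1.0, 0.0, 0.0, 0.0, -u * x, -u * y])
        b.append(u)
        a.append([0.0, 0.0, 0.0, x, y, 1.0, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def warp_point(h: List[List[float]], p: Point) -> Point:
    """Apply the homography to one point (with perspective divide)."""
    x, y = p
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return ((h[0][0] * x + h[0][1] * y + h[0][2]) / w,
            (h[1][0] * x + h[1][1] * y + h[1][2]) / w)


# Corners of a 200x300 synthetic page and an assumed target quad on a background.
page = [(0.0, 0.0), (200.0, 0.0), (200.0, 300.0), (0.0, 300.0)]
quad = [(50.0, 40.0), (230.0, 60.0), (210.0, 330.0), (30.0, 310.0)]
H = homography(page, quad)
warped = [warp_point(H, p) for p in page]
```

The same matrix can also be used to transform text-box coordinates, so the annotations stay aligned with the warped page.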
Thanks, @gwkrsrch, for your detailed instructions.
As a general tip, to train a model for a new language, you need to take care of the token vocabulary/tokenizer. #11 would be useful to you :)
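One quick way to see whether a vocabulary needs attention is to check which characters of the target-language text never appear in any token. The sketch below does this with a toy vocabulary and corpus; `uncovered_chars` and the sample data are hypothetical, and a character-level check is only a crude proxy (a tokenizer with byte-level fallback, for instance, would behave differently).

```python
from typing import Iterable, Set


def uncovered_chars(corpus: Iterable[str], vocab: Iterable[str]) -> Set[str]:
    """Return non-space characters in `corpus` that occur in no vocab token.

    Hypothetical helper for illustration; it does not call any real
    tokenizer API.
    """
    known: Set[str] = set()
    for token in vocab:
        known.update(token)
    seen: Set[str] = set()
    for line in corpus:
        seen.update(ch for ch in line if not ch.isspace())
    return seen - known


# Toy data (assumed): a Latin-only vocabulary and a corpus with one Arabic word.
toy_vocab = ["▁the", "▁doc", "ument", "s"]
toy_corpus = ["the documents", "مرحبا"]
missing = uncovered_chars(toy_corpus, toy_vocab)
```

If `missing` is large for your Arabic corpus, the existing vocabulary most likely has to be extended or replaced before fine-tuning.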
First, I would like to thank you for this repo.
I want to work with Arabic, which is an RTL language.
Could you give me a brief overview of the changes I would need to make to add Arabic to SynthDoG in order to create an Arabic dataset,
and of the changes needed in the model creation?