Handwritten Visual Document Understanding

Create virtual environment

conda env create -f environment.yml
conda activate ocr37
python -m ipykernel install --user --name ocr37

Model fine-tuning

Train a new model

cd src
python train.py --config config/train_nist.yaml

Update config/train_nist.yaml to match your dataset and training setup.
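
Before editing the config, it can help to see which settings it defines. The following is a minimal sketch, assuming the file is a plain YAML file with unindented `key: value` entries; `top_level_keys` is a hypothetical helper, not part of this repository, and it scans lines rather than parsing YAML fully.

```python
from pathlib import Path


def top_level_keys(config_path):
    """Return the top-level keys of a simple YAML file by line scanning.

    Not a full YAML parser: it only looks at unindented `key: value`
    lines and skips blank lines, comments, and list items.
    """
    keys = []
    for line in Path(config_path).read_text(encoding="utf-8").splitlines():
        if not line.strip() or line.lstrip().startswith("#"):
            continue  # blank line or comment
        if line[0] in " \t-":
            continue  # indented line or list item, so not a top-level key
        key, sep, _ = line.partition(":")
        if sep:
            keys.append(key.strip())
    return keys


if __name__ == "__main__":
    # Path taken from the training command; run this from the src directory.
    config = Path("config/train_nist.yaml")
    if config.is_file():
        print(top_level_keys(config))
    else:
        print(f"{config} not found; run this from the src directory")
```

The printed names are whatever the config actually contains; the sketch does not assume any particular setting exists.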

Testing

cd src
python test.py \
  --dataset_name_or_path ./dataset/training/nist_form \
  --pretrained_model_name_or_path ./result/train_nist/20230905_212956 \
  --save_path ./dataset/result/nist_form \
  --split test

Update the following arguments accordingly:

--dataset_name_or_path: path to the test dataset
--pretrained_model_name_or_path: path to the directory where the trained model is stored
--save_path: folder in which to save the results
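
The three arguments above can also be filled in programmatically. The sketch below assumes that training writes each run to a sub-directory named with a `YYYYMMDD_HHMMSS` timestamp (inferred only from the example path `20230905_212956`); `latest_run_dir` and `build_test_command` are hypothetical helpers, not part of this repository.

```python
import re
from pathlib import Path

# Assumed naming scheme for run directories, e.g. 20230905_212956.
RUN_DIR_PATTERN = re.compile(r"^\d{8}_\d{6}$")


def latest_run_dir(result_root):
    """Return the newest timestamp-named sub-directory of result_root, or None."""
    root = Path(result_root)
    if not root.is_dir():
        return None
    runs = sorted(
        (p for p in root.iterdir() if p.is_dir() and RUN_DIR_PATTERN.match(p.name)),
        key=lambda p: p.name,  # timestamp names sort chronologically
    )
    return runs[-1] if runs else None


def build_test_command(dataset, model, save_path, split="test"):
    """Assemble the argument list for test.py from the flags listed above."""
    return [
        "python", "test.py",
        "--dataset_name_or_path", str(dataset),
        "--pretrained_model_name_or_path", str(model),
        "--save_path", str(save_path),
        "--split", split,
    ]


if __name__ == "__main__":
    # Run from the src directory, as in the testing command.
    model_dir = latest_run_dir("./result/train_nist")
    if model_dir is None:
        print("no training run found under ./result/train_nist")
    else:
        cmd = build_test_command(
            "./dataset/training/nist_form", model_dir, "./dataset/result/nist_form"
        )
        print(" ".join(cmd))
```

To execute the command instead of printing it, the list can be passed to `subprocess.run(cmd, check=True)`.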

Reference

This work uses the pre-training script and model from Donut:

@inproceedings{kim2022donut,
  title     = {OCR-Free Document Understanding Transformer},
  author    = {Kim, Geewook and Hong, Teakgyu and Yim, Moonbin and Nam, JeongYeon and Park, Jinyoung and Yim, Jinyeong and Hwang, Wonseok and Yun, Sangdoo and Han, Dongyoon and Park, Seunghyun},
  booktitle = {European Conference on Computer Vision (ECCV)},
  year      = {2022}
}
