Stars
OCR
5 repositories
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
A Comprehensive Toolkit for High-Quality PDF Content Extraction