AI-Optical-Character-Recognition (AI-OCR): Extracting data from images

This repository is the backend code for a tool with which you can extract data from images using visual LLMs. The frontend code (using streamlit) can be found here: AI-OCR-Frotend.

Installation

To use the AI-OCR tool, it is best if you install this repository for the backend, as well as the frontend repository, i.e. follow these steps:

Clone this repository for the backend

git clone https://github.com/jWinman91/AI-OCR.git
cd ai-ocr

Install the required dependencies for the backend:

pip install -r requirements.txt

Pull and run the coachdb docker file with the following command:

docker run -e COUCHDB_USER=admin -e COUCHDB_PASSWORD=JensIsCool -p 5984:5984 -d --name config_db couchdb:latest

Clone the frontend repository

git clone https://github.com/jWinman91/AI-OCR-Frontend.git
cd ai-ocr-frondend

Install the required dependencies for the frontend:

pip install -r requirements.txt

Usage

You can then start the backend by running:

python app.py $IP_ADDRESS

Make sure that the docker container for the coachdb is running.

Since, the backend uses fastapi, you could now try it out via the fastapi docs by going to $IP_ADDRESS:5000/docs.

But you can also start the frontend now by running:

chmod +x start_up.sh
./start_up.sh

from within the cloned frontend repository.

A streamlit window will automaticall open in your browser. Within the web application you'll then find two pages on the sidebar:

AI-OCR: Webpage for running the actual optical character recognition
Model Configurations: Subpage for configuring the models (e.g. ChatGPT, Llava, ...)

Example

Acknowledgments

Hugging Face - Framework for working with state-of-the-art natural language processing models.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
configs		configs
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
app_llama_cpp.py		app_llama_cpp.py
app_openai.py		app_openai.py
install.sh		install.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI-Optical-Character-Recognition (AI-OCR): Extracting data from images

Table of Contents

Installation

Usage

Example

Acknowledgments

About

Releases

Packages

Languages

License

jWinman91/AI-OCR

Folders and files

Latest commit

History

Repository files navigation

AI-Optical-Character-Recognition (AI-OCR): Extracting data from images

Table of Contents

Installation

Usage

Example

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages