- First, install PyTorch with CUDA support from the PyTorch website.
- Ensure all required libraries are installed. You can install them using pip:
  pip install -r requirements.txt
This project performs table and cell extraction from images using pre-trained models for table detection and structure recognition. It includes functionality to visualize detected tables, crop and classify cells, and a graphical user interface (GUI) for cell classification.
- The images from the dataset are passed to the model to extract the last column of the table, which contains the handwritten answers: "True" or "False".
- Follow the GUI prompts to classify extracted cell images into categories: "true", "false", or "none".
- Install the training dependencies:
  pip install torch torchvision matplotlib tqdm
- The dataset is organized into training and testing directories, each containing images classified into 'false', 'none', and 'true'.
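The expected layout would look something like the following (the class folder names come from the list above; the root and split directory names are assumptions):

```
data/
├── train/
│   ├── false/
│   ├── none/
│   └── true/
└── test/
    ├── false/
    ├── none/
    └── true/
```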
- The TinyVGG model is a simple CNN architecture with two convolutional blocks followed by a fully connected classifier.
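A minimal sketch of that architecture, assuming 64x64 RGB inputs and the conventional TinyVGG layer layout; the exact hyperparameters (kernel sizes, hidden units) may differ from this project's implementation:

```python
import torch
from torch import nn

class TinyVGG(nn.Module):
    """Two convolutional blocks followed by a fully connected classifier."""
    def __init__(self, input_channels: int, hidden_units: int, num_classes: int):
        super().__init__()
        self.block_1 = nn.Sequential(
            nn.Conv2d(input_channels, hidden_units, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(hidden_units, hidden_units, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=2),  # 64x64 -> 32x32
        )
        self.block_2 = nn.Sequential(
            nn.Conv2d(hidden_units, hidden_units, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(hidden_units, hidden_units, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=2),  # 32x32 -> 16x16
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(hidden_units * 16 * 16, num_classes),  # assumes 64x64 input
        )

    def forward(self, x):
        return self.classifier(self.block_2(self.block_1(x)))
```

For this task `num_classes` would be 3, matching the 'false', 'none', and 'true' folders.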
- The `torchvision.transforms` module is used to apply transformations to the dataset.
- The transformed datasets are then loaded using `ImageFolder`.
- The training time is measured using the `timeit` library.
The helper functions are downloaded from Daniel Bourke's PyTorch repository:
https://raw.githubusercontent.com/mrdbourke/pytorch-deep-learning/main/helper_functions.py
Required libraries are imported, and the device is configured (CUDA or CPU).
- `MaxResize`: Resizes images while maintaining aspect ratio.
- `box_cxcywh_to_xyxy` and `rescale_bboxes`: For bounding box processing.
- `outputs_to_objects`: Converts model outputs to detected objects.
- `fig2img`: Converts a Matplotlib figure to a PIL Image.
- `visualize_detected_tables`: Visualizes detected tables with bounding boxes.
- `objects_to_crops`: Processes bounding boxes into cropped images and tokens.
- `get_cell_coordinates_by_row`: Extracts cell coordinates by row.
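The bounding-box helpers implement a standard coordinate conversion. A sketch of the underlying math on plain tuples (the real helpers typically operate on tensors in batches):

```python
def box_cxcywh_to_xyxy(box):
    """Convert (center_x, center_y, width, height) to corner form (x0, y0, x1, y1)."""
    cx, cy, w, h = box
    return (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)

def rescale_bboxes(boxes, size):
    """Scale normalized (cx, cy, w, h) boxes up to pixel coordinates of an image."""
    width, height = size
    out = []
    for b in boxes:
        x0, y0, x1, y1 = box_cxcywh_to_xyxy(b)
        out.append((x0 * width, y0 * height, x1 * width, y1 * height))
    return out
```

Detection transformers emit normalized center-format boxes, so this conversion is needed before cropping table regions out of the original image.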
Loads pre-trained models for table detection and structure recognition.
TinyVGG: A neural network model used to classify cell contents. The model is instantiated and loaded with pre-trained weights.
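Restoring pre-trained weights typically follows the save/load pattern below; the checkpoint filename and the stand-in model here are assumptions for illustration (the real model is the TinyVGG classifier):

```python
import torch
from torch import nn

# Stand-in for the TinyVGG classifier; the real model has conv blocks.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 3))

# Save a checkpoint, then restore it into a fresh instance.
torch.save(model.state_dict(), "tfc_model.pth")  # filename is an assumption

restored = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 3))
restored.load_state_dict(torch.load("tfc_model.pth", map_location="cpu"))
restored.eval()  # switch to inference mode for classification
```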
Reads correct answers from `ModelAnswer.csv` and sets up the evaluation dataset folder.
`perform_extraction_and_classify_image`: Main function to extract table cells and classify their contents.
Iterates through images, performs extraction and classification, and saves results to `marks.csv`.
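The evaluation loop can be sketched as follows; `classify` is a placeholder standing in for `perform_extraction_and_classify_image`, and the folder and column names are assumptions:

```python
import csv
from pathlib import Path

def classify(image_path):
    """Placeholder for perform_extraction_and_classify_image."""
    return ["true", "false", "none"]  # one prediction per extracted cell

def evaluate_folder(folder, correct_answers, out_csv="marks.csv"):
    """Score every paper image in a folder and write the results to a CSV."""
    rows = []
    for image_path in sorted(Path(folder).glob("*.jpg")):
        predictions = classify(image_path)
        marks = sum(p == a for p, a in zip(predictions, correct_answers))
        rows.append({"image": image_path.name, "marks": marks})
    with open(out_csv, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["image", "marks"])
        writer.writeheader()
        writer.writerows(rows)
    return rows
```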
The AutoEval API provides an endpoint for evaluating question papers using image processing and machine learning techniques.
- Ensure all required dependencies are installed:
  pip install flask pillow torch torchvision pandas
- Make sure all necessary models (`tfc_model`, `detection_transform`, `structure_transform`, etc.) are properly loaded and available.
- The Flask application code is located in the `Evaluation.ipynb` file. Run the cells in this file (except the part under the runner code) to start the API server.
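A minimal sketch of such a Flask endpoint; the `/evaluate` route name is an assumption, the `image` and `csv` field names follow the request spec, and the extraction step is a placeholder:

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

@app.route("/evaluate", methods=["POST"])  # route name is an assumption
def evaluate():
    # Reject requests missing either upload with a 400, per the error spec.
    if "image" not in request.files or "csv" not in request.files:
        return jsonify({"error": "Missing image or csv file"}), 400
    try:
        # Placeholder: run table extraction and cell classification on the
        # uploaded image, and parse the answers from the uploaded CSV.
        predictions = []      # would come from the classifier
        correct_answers = []  # would be parsed from the CSV
        total_marks = sum(p == a for p, a in zip(predictions, correct_answers))
        return jsonify({
            "total_marks": total_marks,
            "predictions": predictions,
            "correct_answers": correct_answers,
        })
    except Exception:
        return jsonify({"error": "Processing error"}), 500
```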
Evaluates a question paper image using the provided correct answers.
Request:
- Method: POST
- Content-Type: multipart/form-data
- Body:
  - `image`: JPG image file of the question paper
  - `csv`: CSV file containing correct answers
Response:
- Content-Type: application/json
- Body:
  { "total_marks": int, "predictions": [string], "correct_answers": [string] }
Error Responses:
- 400 Bad Request: Missing files or invalid file types
- 500 Internal Server Error: Processing error
The API client provides a convenient way to interact with the AutoEval API from Python scripts.
- Install the required library:
  pip install requests
- The client script is located in the `post-request.py` file.
- Refer to `post-request.py` for details on how to use the `evaluate_paper` function to send requests to the API.
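A hedged sketch of what such a client might look like; the URL and route are assumptions, the field names mirror the API description above, and `post-request.py` remains the authoritative implementation:

```python
import requests

def evaluate_paper(image_path, csv_path, url="http://localhost:5000/evaluate"):
    """POST a question-paper image and answer CSV to the AutoEval API.

    The default URL and route are assumptions; adjust them to match
    the running server.
    """
    with open(image_path, "rb") as img, open(csv_path, "rb") as answers:
        files = {
            "image": (image_path, img, "image/jpeg"),
            "csv": (csv_path, answers, "text/csv"),
        }
        response = requests.post(url, files=files)
    response.raise_for_status()
    # Expected shape: {"total_marks": int, "predictions": [...], "correct_answers": [...]}
    return response.json()
```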