Hand2Num

A deep learning model that converts hand gestures into numerical values (1-5) using Convolutional Neural Networks for efficient and accurate recognition.
Project Overview

This project is a real-time hand gesture recognition system that uses computer vision and deep learning to classify hand gestures from webcam input. The system leverages MediaPipe for hand landmark detection and a custom Convolutional Neural Network (CNN) for gesture classification.
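
In practice the pipeline has two stages: MediaPipe locates 21 hand landmarks per frame, and the CNN classifies an image derived from those landmarks. A minimal sketch of the detection stage using the classic MediaPipe "solutions" API (the function name is illustrative, not the repository's actual code):

```python
import cv2
import mediapipe as mp

mp_hands = mp.solutions.hands

def detect_landmarks(frame_bgr):
    """Return the detected hand landmark sets for one BGR frame, or None."""
    with mp_hands.Hands(static_image_mode=True, max_num_hands=1) as hands:
        # MediaPipe expects RGB input, while OpenCV captures frames as BGR.
        results = hands.process(cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB))
    return results.multi_hand_landmarks
```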

Key Features

  • Real-time hand gesture recognition
  • Uses MediaPipe for hand landmark detection
  • Custom CNN model for gesture classification
  • Developed and trained on Google Colab
  • Supports multiple gesture categories

Key Training Environment Features

  • Direct Google Drive file access
  • Compressed image dataset handling
  • Automated model training and checkpointing
  • GPU/TPU acceleration for faster computations
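
For the acceleration bullet, a typical TensorFlow setup cell in Colab looks like the following; this is a generic snippet, not necessarily the notebook's exact code:

```python
import tensorflow as tf

try:
    # Attach to the TPU runtime if Colab provides one.
    resolver = tf.distribute.cluster_resolver.TPUClusterResolver()
    tf.config.experimental_connect_to_cluster(resolver)
    tf.tpu.experimental.initialize_tpu_system(resolver)
    strategy = tf.distribute.TPUStrategy(resolver)
except ValueError:
    # No TPU attached: fall back to the default (CPU/GPU) strategy.
    strategy = tf.distribute.get_strategy()

# Models built under strategy.scope() are replicated across the TPU cores.
```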

Technologies Used

  • Development Platform: Google Colab
  • Hardware Acceleration: TPU
  • Computer Vision: OpenCV (cv2)
  • Hand Tracking: MediaPipe
  • Deep Learning: TensorFlow/Keras
  • Programming Language: Python

Project Structure

1. Data Generation (generate.py)

  • Captures hand landmark images using webcam
  • Processes and saves landmark images for training
  • Supports different hand configurations (left/right, normal/flipped)
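
A hedged sketch of this capture flow in the spirit of generate.py; the file names, per-class sample count, and blank-canvas rendering are assumptions, not the script's actual code:

```python
import cv2
import mediapipe as mp
import numpy as np

mp_hands = mp.solutions.hands
mp_draw = mp.solutions.drawing_utils

cap = cv2.VideoCapture(0)
with mp_hands.Hands(max_num_hands=1) as hands:
    saved = 0
    while saved < 100:  # hypothetical per-class sample count
        ok, frame = cap.read()
        if not ok:
            break
        results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.multi_hand_landmarks:
            # Draw the landmark skeleton on a blank canvas so the CNN sees
            # only hand geometry, not the cluttered background.
            canvas = np.zeros_like(frame)
            mp_draw.draw_landmarks(canvas, results.multi_hand_landmarks[0],
                                   mp_hands.HAND_CONNECTIONS)
            cv2.imwrite(f"one_left_normal_{saved}.png", canvas)
            # cv2.flip with flipCode=1 mirrors horizontally, covering the
            # "flipped" configuration without a second capture session.
            cv2.imwrite(f"one_left_flipped_{saved}.png", cv2.flip(canvas, 1))
            saved += 1
cap.release()
```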

Preprocessed Images

Each gesture (One through Five) is captured in four configurations: Left-Normal, Right-Normal, Left-Flipped, and Right-Flipped. Sample preprocessed landmark images exist for all twenty gesture/configuration combinations.

2. Model Training (Project_HGR.ipynb)

  • Prepares and preprocesses image dataset
  • Builds a Convolutional Neural Network (CNN)
  • Trains and validates the gesture recognition model
  • Saves the best-performing model
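
A minimal Keras sketch of the kind of CNN and checkpointing described above; the layer sizes, input shape, and file name are illustrative assumptions, not the notebook's actual architecture:

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(128, 128, 3)),   # assumed input size
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(5, activation="softmax"),  # classes One..Five
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Keep only the weights that achieve the best validation accuracy.
checkpoint = tf.keras.callbacks.ModelCheckpoint(
    "best_model.keras", monitor="val_accuracy", save_best_only=True)
# model.fit(train_ds, validation_data=val_ds, epochs=20, callbacks=[checkpoint])
```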

Model Architecture

(model architecture diagram)

Sample Testing Results

(sample test-result images)

3. Live Classification (live_cam_test.py)

  • Loads pre-trained model
  • Processes real-time webcam input
  • Performs hand gesture recognition
  • Displays prediction results
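
A sketch of such a real-time loop, assuming the same illustrative 128x128 landmark-canvas input as the training sketch above; the labels and model path are placeholders, not the script's actual code:

```python
import cv2
import numpy as np
import tensorflow as tf
import mediapipe as mp

LABELS = ["One", "Two", "Three", "Four", "Five"]
model = tf.keras.models.load_model("best_model.keras")  # assumed path
mp_hands = mp.solutions.hands
mp_draw = mp.solutions.drawing_utils

cap = cv2.VideoCapture(0)
with mp_hands.Hands(max_num_hands=1) as hands:
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.multi_hand_landmarks:
            # Render landmarks the same way the training data was generated.
            canvas = np.zeros_like(frame)
            mp_draw.draw_landmarks(canvas, results.multi_hand_landmarks[0],
                                   mp_hands.HAND_CONNECTIONS)
            x = cv2.resize(canvas, (128, 128))[np.newaxis] / 255.0  # assumed scaling
            pred = LABELS[int(np.argmax(model.predict(x, verbose=0)))]
            cv2.putText(frame, pred, (10, 40),
                        cv2.FONT_HERSHEY_SIMPLEX, 1.2, (0, 255, 0), 2)
        cv2.imshow("Hand2Num", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
cap.release()
cv2.destroyAllWindows()
```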

Sample Real-time Testing Results

(five sample real-time prediction frames)

Setup and Reproduction

Prerequisites

  • Google Account
  • Google Colab access
  • Prepared image dataset

Steps to Reproduce

  1. Open Google Colab
  2. Create a new notebook
  3. Upload or link the required Python scripts
  4. Mount Google Drive
  5. Upload the compressed image dataset
  6. Run the training notebook (Project_HGR.ipynb)
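
Steps 4 and 5 usually amount to a single Colab cell like the following; the archive path is a placeholder, not the project's actual file name:

```python
from google.colab import drive
import zipfile

# Make Google Drive files directly accessible from the notebook.
drive.mount("/content/drive")

# Extract the compressed dataset to local storage for faster I/O.
with zipfile.ZipFile("/content/drive/MyDrive/hand2num_dataset.zip") as zf:
    zf.extractall("/content/dataset")
```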

Model Deployment

After training in Colab:

  • Download the best-performing model
  • Use live_cam_test.py for real-time gesture recognition
  • Ensure all dependencies are installed locally
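
For the dependency bullet, the core packages install under their standard PyPI names (pin versions as needed for compatibility with the saved model):

```
pip install tensorflow opencv-python mediapipe numpy
```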

Potential Improvements

  • Increase training dataset diversity
  • Implement data augmentation
  • Experiment with model architectures
  • Add more gesture categories
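
For the data-augmentation bullet, one hedged option is Keras preprocessing layers prepended to the model; the factors below are illustrative, not tuned values:

```python
import tensorflow as tf

# These layers are active only while training and become no-ops at inference.
augment = tf.keras.Sequential([
    tf.keras.layers.RandomRotation(0.05),         # small in-plane rotations
    tf.keras.layers.RandomTranslation(0.1, 0.1),  # shift up to 10% each axis
    tf.keras.layers.RandomZoom(0.1),
])
# Prepend to the CNN, e.g.: model = tf.keras.Sequential([augment, base_cnn])
```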

Limitations

  • Requires good lighting conditions
  • Performance depends on training data quality
  • Currently supports a limited number of gesture categories
