Image Colorization

Authors: Aneesh Thippa, Arya Miryala, Matthew Lo, Shaan Mistry

Date: 06.11.2024

Abstract

This project uses deep learning to build an image colorizer that transforms grayscale images into color images. We host our dataset on Hugging Face and fine-tune the Stability AI Stable Diffusion model with ControlNet so that the colorized images align closely with the desired color schemes. The project demonstrates the effectiveness of combining state-of-the-art diffusion models with fine-tuning techniques for image colorization.

Introduction

Image colorization enhances the visual appeal of images and has practical applications such as restoring old photographs and improving medical imaging. The challenge lies in generating colors that are realistic and contextually accurate. Our approach addresses this challenge by fine-tuning the Stability AI Stable Diffusion model with ControlNet, using Hugging Face for dataset management.

Methodology

1. Data Preparation & Preprocessing

Datasets: COCO and Kaggle
Deployment: Hugging Face Hub
Preprocessing: Conversion to grayscale, resizing to 512x512 pixels, and creation of text prompts
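
The preprocessing steps listed above can be sketched roughly as follows. This is a minimal NumPy illustration, not the project's actual pipeline: the Rec. 601 luma weights, the nearest-neighbour resize, the helper names, and the record keys `image`, `conditioning_image`, and `text` are all assumptions made for the example, and the caption is simply passed through because the report does not say how prompts were generated.

```python
import numpy as np

TARGET_SIZE = 512  # side length stated in the preprocessing step


def to_grayscale(rgb):
    """Convert a uint8 RGB array (H, W, 3) to uint8 grayscale (H, W).

    Uses Rec. 601 luma weights (an assumption; the report does not
    name the grayscale formula).
    """
    weights = np.array([0.299, 0.587, 0.114])
    return np.rint(rgb.astype(np.float64) @ weights).astype(np.uint8)


def resize_nearest(img, size=TARGET_SIZE):
    """Nearest-neighbour resize to (size, size) for 2-D or 3-D arrays."""
    h, w = img.shape[:2]
    rows = np.arange(size) * h // size  # source row for each output row
    cols = np.arange(size) * w // size  # source column for each output column
    return img[rows[:, None], cols]


def make_example(rgb, caption):
    """Build one training record (hypothetical field names)."""
    target = resize_nearest(rgb)
    return {
        "image": target,                             # color ground truth
        "conditioning_image": to_grayscale(target),  # grayscale model input
        "text": caption,                             # text prompt
    }


example = make_example(np.full((100, 200, 3), 255, dtype=np.uint8), "a sandy beach")
print(example["image"].shape, example["conditioning_image"].shape)
```

In practice an image library with proper interpolation would likely give better resized images than nearest-neighbour sampling; the sketch only shows the order of operations.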

2. Model Selection

Initial Approach: Generative Adversarial Networks (GANs)
Final Approach: Stable Diffusion model for its stability and consistency

3. Training

Platform: Google Cloud Platform with Nvidia L4 GPU
Models Trained: Three models with varying parameters and datasets
Final Model: Trained on 50,000 images for 3 epochs, resulting in satisfactory colorization
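
To give a sense of the training budget, the number of optimizer updates can be estimated from the dataset size and epoch count. The 50,000 images and 3 epochs come from the report; the batch size of 4 and the `optimizer_steps` helper are hypothetical, since the report does not state the batch size used.

```python
import math


def optimizer_steps(num_images, epochs, batch_size, grad_accum=1):
    """Number of optimizer updates for a full training run."""
    steps_per_epoch = math.ceil(num_images / (batch_size * grad_accum))
    return steps_per_epoch * epochs


# 50,000 images and 3 epochs are from the report; batch size 4 is an assumption.
print(optimizer_steps(50_000, 3, batch_size=4))  # 37500
```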

4. CIELAB Color Space

We combined the L* channel of the grayscale input image with the a* and b* channels of the model's output image. This preserves the input's lightness levels and improves clarity.
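
The channel combination described above can be sketched as below. This is an illustrative NumPy version under stated assumptions: images are float sRGB arrays in [0, 1], the conversion uses the standard sRGB/D65 formulas, and the function names (`rgb_to_lab`, `lab_to_rgb`, `merge_luminance`) are hypothetical rather than taken from the project's notebook.

```python
import numpy as np

# Linear sRGB -> XYZ matrix and D65 reference white.
M = np.array([[0.4124564, 0.3575761, 0.1804375],
              [0.2126729, 0.7151522, 0.0721750],
              [0.0193339, 0.1191920, 0.9503041]])
WHITE = np.array([0.95047, 1.0, 1.08883])
DELTA = 6 / 29


def rgb_to_lab(rgb):
    """Convert float sRGB in [0, 1], shape (..., 3), to CIELAB."""
    rgb = np.asarray(rgb, dtype=np.float64)
    # Undo the sRGB gamma curve.
    linear = np.where(rgb <= 0.04045, rgb / 12.92, ((rgb + 0.055) / 1.055) ** 2.4)
    xyz = linear @ M.T / WHITE
    f = np.where(xyz > DELTA ** 3, np.cbrt(xyz), xyz / (3 * DELTA ** 2) + 4 / 29)
    L = 116 * f[..., 1] - 16
    a = 500 * (f[..., 0] - f[..., 1])
    b = 200 * (f[..., 1] - f[..., 2])
    return np.stack([L, a, b], axis=-1)


def lab_to_rgb(lab):
    """Convert CIELAB, shape (..., 3), back to float sRGB clipped to [0, 1]."""
    lab = np.asarray(lab, dtype=np.float64)
    fy = (lab[..., 0] + 16) / 116
    fx = fy + lab[..., 1] / 500
    fz = fy - lab[..., 2] / 200
    f = np.stack([fx, fy, fz], axis=-1)
    xyz = np.where(f > DELTA, f ** 3, 3 * DELTA ** 2 * (f - 4 / 29)) * WHITE
    # Out-of-gamut values are clipped before gamma encoding.
    linear = np.clip(xyz @ np.linalg.inv(M).T, 0.0, 1.0)
    return np.where(linear <= 0.0031308, 12.92 * linear,
                    1.055 * linear ** (1 / 2.4) - 0.055)


def merge_luminance(gray_input_rgb, colorized_rgb):
    """Keep L* from the grayscale input and a*, b* from the colorized output."""
    lab_in = rgb_to_lab(gray_input_rgb)
    lab_out = rgb_to_lab(colorized_rgb)
    merged = np.concatenate([lab_in[..., :1], lab_out[..., 1:]], axis=-1)
    return lab_to_rgb(merged)
```

Here `gray_input_rgb` is the grayscale input replicated across three channels. Because lightness is taken directly from the input, fine detail that the diffusion model may blur is restored, while the model contributes only the chroma.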

Results

Evaluation Metrics

PSNR (Peak Signal-to-Noise Ratio)
- Higher values indicate better quality.
- Results improved with detailed prompts.
SSIM (Structural Similarity Index)
- Measures structural information, luminance, and contrast.
- Results improved with detailed prompts.
CIEDE2000
- Measures perceived color differences.
- Results improved with detailed prompts.
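
As a rough illustration of the first two metrics, the sketch below computes PSNR and a simplified, single-window SSIM with NumPy. It is not the project's evaluation code: the function names are hypothetical, and standard SSIM averages the statistic over local sliding windows rather than over the whole image as done here. CIEDE2000 is omitted because its formula is lengthy; an established implementation such as scikit-image's `skimage.color.deltaE_ciede2000` is probably the safer choice for it.

```python
import numpy as np


def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB; higher means closer to the reference."""
    ref = np.asarray(reference, dtype=np.float64)
    tst = np.asarray(test, dtype=np.float64)
    mse = np.mean((ref - tst) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10 * np.log10(max_value ** 2 / mse))


def global_ssim(reference, test, max_value=255.0):
    """SSIM computed once over the whole image (a simplification)."""
    x = np.asarray(reference, dtype=np.float64).ravel()
    y = np.asarray(test, dtype=np.float64).ravel()
    c1 = (0.01 * max_value) ** 2  # stabilizes the luminance term
    c2 = (0.03 * max_value) ** 2  # stabilizes the contrast/structure term
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = np.mean((x - mx) * (y - my))
    return float(((2 * mx * my + c1) * (2 * cov + c2))
                 / ((mx ** 2 + my ** 2 + c1) * (vx + vy + c2)))
```

Both functions take the ground-truth color image as `reference` and the colorized output as `test`, with pixel values in the range 0 to `max_value`.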

Limitations

1. Storage Limitations

Limited storage restricted the dataset size and made checkpoint-based training necessary.

2. Computational Power

Limited access to powerful GPUs affected training efficiency.

3. Time Constraints

Long training times delayed progress and iterations.

Conclusion

Our project successfully developed an image colorizer using the Stability AI Stable Diffusion model and ControlNet. This approach highlights the potential of deep learning techniques for practical applications. Future work includes further training and incorporating object recognition models to produce more accurate colorizations.

Team Member Contributions

Aneesh Thippa & Shaan Mistry: Model training, literature review, parameter tuning, inference.
Arya Miryala: Dataset creation, Hugging Face deployment, preprocessing, model training.
Matthew Lo: Evaluation algorithms, metric research, initial preprocessing.
