Panoptic Segmentator

Introduction

Welcome to PanopticSegmentator, a cutting-edge web application that empowers users to perform panoptic segmentation on images, videos, and even live webcam feeds. Panoptic segmentation goes beyond traditional semantic segmentation by not only classifying objects in an image but also distinguishing between stuff (e.g., background) and things (e.g., objects).

Installation and Usage

To run the web application, you have two options:

Option 1: Local Installation with Conda

Install necessary dependencies:

apt-get install ffmpeg libsm6 libxext6 ninja-build libglib2.0-0 libsm6 libxrender-dev libxext6 libgl1-mesa-glx

Clone the repository:

git clone https://github.com/kafkaGen/panoptic-segmentator

Create and activate a Conda environment:

conda env create -n <env_name> --file requirements.yaml
conda activate <env_name>

Install Python requirements:

pip install -r requirements.txt
pip install -U openmim

Install additional packages:

mim install mmengine "mmcv>=2.0.0" mmdet git+https://github.com/cocodataset/panopticapi.git

Download required models:
```
python setup.py --download-models
```

Run the application using Streamlit and FastAPI:

uvicorn core.api:app --host 0.0.0.0 --port 8000 & streamlit run streamlit_app.py --server.port 8501

Option 2: Docker Installation

Build the Docker image locally:

docker build -t panoptic-segmentator:latest .

Or pull the Docker image from Docker Hub:

docker pull olko123123123/panoptic-segmentator:latest && docker tag olko123123123/panoptic-segmentator:latest panoptic-segmentator:latest

Run the Docker container using the provided script:
```
bash container-run.sh
```
NOTE: container-run.sh automatically determines whether your machine supports NVIDIA GPU and runs the Docker container accordingly on CPU or GPU.

Access the Application Online

Alternatively, you can try the application online here. Please note that this is an AWS EC2 free-tier instance, so be patient with its performance.

REST API for batch inference

While the web application provides an intuitive interface for individual use, it may not be the most efficient solution for large-scale content processing. For such scenarios, the REST API implementation supports batch inference for both images and videos. Below are examples demonstrating how to make API calls for image and video batch inference.

Image Batch Inference

To perform image batch inference, use the following Python code:

import base64
import os
from io import BytesIO

import numpy as np
import requests
from PIL import Image

url = "<host>/images/?model_name=<model-name>"

file_list, open_files = [], []
for path in os.listdir(path_to_imgs):
    file_path = os.path.join(path_to_imgs, path)
    open_file = open(file_path, "rb")
    if ".jpg" in file_path:
        file_list.append(("images", (path, open_file, "image/jpeg")))
    else:
        file_list.append(("images", (path, open_file, "image/png")))
    open_files.append(open_file)

response = requests.post(url, files=file_list)
for fl in open_files:
    fl.close()
imgs = response.json()["segmented_images_bytes"]
imgs = [np.array(Image.open(BytesIO(base64.b64decode(img)))) for img in imgs]

Video Batch Inference

For video batch inference, utilize the following Python code:

import base64
import os
import tempfile

import cv2
import requests


url = "<host>/videos/?model_name=<model-name>"

file_list, open_files = [], []
for path in os.listdir(path_to_videos):
    file_path = os.path.join(path_to_videos, path)
    open_file = open(file_path, "rb")
    if ".mp4" in file_path:
        file_list.append(("videos", (path, open_file, "video/mp4")))
    open_files.append(open_file)


response = requests.post(url, files=file_list)
for fl in open_files:
    fl.close()
videos_encoded = response.json()["segmented_videos_bytes"]
for video in videos_encoded:
    video_decoded = base64.b64decode(video)

    temp_file = tempfile.NamedTemporaryFile(suffix=".mp4", delete=False)
    temp_file.write(video_decoded)
    temp_file_path = temp_file.name

    cap = cv2.VideoCapture(temp_file_path)

    while True:
        ret, frame = cap.read()
        if not ret:
            break

        rgb_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
        cv2.imshow("Video", frame)
        if cv2.waitKey(30) & 0xFF == ord("q"):
            break

    cap.release()
    cv2.destroyAllWindows()
    temp_file.close()
    os.remove(temp_file_path)

Feel free to adapt these examples to suit your specific use case and integrate them seamlessly into your workflow for efficient batch processing.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.github/workflows		.github/workflows
core		core
models		models
pages		pages
settings		settings
test		test
utils		utils
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
README.md		README.md
container-run.sh		container-run.sh
demo.gif		demo.gif
docker-compose-cpu.yml		docker-compose-cpu.yml
docker-compose-gpu.yml		docker-compose-gpu.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
requirements.yaml		requirements.yaml
setup.py		setup.py
streamlit_app.py		streamlit_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Panoptic Segmentator

Introduction

Installation and Usage

Option 1: Local Installation with Conda

Option 2: Docker Installation

Access the Application Online

REST API for batch inference

Image Batch Inference

Video Batch Inference

About

Releases

Packages

Languages

kafkaGen/panoptic-segmentator

Folders and files

Latest commit

History

Repository files navigation

Panoptic Segmentator

Introduction

Installation and Usage

Option 1: Local Installation with Conda

Option 2: Docker Installation

Access the Application Online

REST API for batch inference

Image Batch Inference

Video Batch Inference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages