Skip to content

Latest commit

 

History

History
253 lines (200 loc) · 12.4 KB

README.md

File metadata and controls

253 lines (200 loc) · 12.4 KB

✂️ CarveKit ✂️


To obtain the foreground masks:

  1. python -m carvekit -i data/
  2. python fill_hole.py

The higher resolution images from the picture above can be seen in the docs/imgs/compare/ and docs/imgs/input folders.

📙 README Language

Russian English

📄 Description:

Automated high-quality background removal framework for an image using neural networks.

🎆 Features:

  • High Quality
  • Batch Processing
  • NVIDIA CUDA and CPU processing
  • FP16 inference: Fast inference with low memory usage
  • Easy inference
  • 100% remove.bg compatible FastAPI HTTP API
  • Removes background from hairs
  • Easy integration with your code

⛱ Try yourself on Google Colab

⛓️ How does it work?

It can be briefly described as

  1. The user selects a picture or a folder with pictures for processing
  2. The photo is preprocessed to ensure the best quality of the output image
  3. Using machine learning technology, the background of the image is removed
  4. Image post-processing to improve the quality of the processed image

🎓 Implemented Neural Networks:

Networks Target Accuracy
Tracer-B7 (default) General (objects, animals, etc) 90% (mean F1-Score, DUTS-TE)
U^2-net Hairs (hairs, people, animals, objects) 80.4% (mean F1-Score, DUTS-TE)
BASNet General (people, objects) 80.3% (mean F1-Score, DUTS-TE)
DeepLabV3 People, Animals, Cars, etc 67.4% (mean IoU, COCO val2017)

Recommended parameters for different models

Networks Segmentation mask size Trimap parameters (dilation, erosion)
tracer_b7 640 (30, 5)
u2net 320 (30, 5)
basnet 320 (30, 5)
deeplabv3 1024 (40, 20)

Notes:

  1. The final quality may depend on the resolution of your image, the type of scene or object.
  2. Use U2-Net for hairs and Tracer-B7 for general images and correct parameters.
    It is very important for final quality! Example images was taken by using U2-Net and FBA post-processing.

🖼️ Image pre-processing and post-processing methods:

🔍 Preprocessing methods:

  • none - No preprocessing methods used.

They will be added in the future.

✂ Post-processing methods:

  • none - No post-processing methods used.
  • fba (default) - This algorithm improves the borders of the image when removing the background from images with hair, etc. using FBA Matting neural network. This method gives the best result in combination with u2net without any preprocessing methods.

🏷 Setup for CPU processing:

  1. pip install carvekit --extra-index-url https://download.pytorch.org/whl/cpu

The project supports python versions from 3.8 to 3.10.4

🏷 Setup for GPU processing:

  1. Make sure you have an NVIDIA GPU with 8 GB VRAM.
  2. Install CUDA Toolkit and Video Driver for your GPU
  3. pip install carvekit --extra-index-url https://download.pytorch.org/whl/cu113

The project supports python versions from 3.8 to 3.10.4

🧰 Interact via code:

If you don't need deep configuration or don't want to deal with it

import torch
from carvekit.api.high import HiInterface

# Check doc strings for more information
interface = HiInterface(object_type="hairs-like",  # Can be "object" or "hairs-like".
                        batch_size_seg=5,
                        batch_size_matting=1,
                        device='cuda' if torch.cuda.is_available() else 'cpu',
                        seg_mask_size=640,  # Use 640 for Tracer B7 and 320 for U2Net
                        matting_mask_size=2048,
                        trimap_prob_threshold=231,
                        trimap_dilation=30,
                        trimap_erosion_iters=5,
                        fp16=False)
images_without_background = interface(['./tests/data/cat.jpg'])
cat_wo_bg = images_without_background[0]
cat_wo_bg.save('2.png')

                   

If you want control everything

import PIL.Image

from carvekit.api.interface import Interface
from carvekit.ml.wrap.fba_matting import FBAMatting
from carvekit.ml.wrap.tracer_b7 import TracerUniversalB7
from carvekit.pipelines.postprocessing import MattingMethod
from carvekit.pipelines.preprocessing import PreprocessingStub
from carvekit.trimap.generator import TrimapGenerator

# Check doc strings for more information
seg_net = TracerUniversalB7(device='cpu',
              batch_size=1)

fba = FBAMatting(device='cpu',
                 input_tensor_size=2048,
                 batch_size=1)

trimap = TrimapGenerator()

preprocessing = PreprocessingStub()

postprocessing = MattingMethod(matting_module=fba,
                               trimap_generator=trimap,
                               device='cpu')

interface = Interface(pre_pipe=preprocessing,
                      post_pipe=postprocessing,
                      seg_pipe=seg_net)

image = PIL.Image.open('tests/data/cat.jpg')
cat_wo_bg = interface([image])[0]
cat_wo_bg.save('2.png')
                   

🧰 Running the CLI interface:

  • python3 -m carvekit -i <input_path> -o <output_path> --device <device>

Explanation of args:

Usage: carvekit [OPTIONS]

  Performs background removal on specified photos using console interface.

Options:
  -i ./2.jpg                   Path to input file or dir  [required]
  -o ./2.png                   Path to output file or dir
  --pre none                   Preprocessing method
  --post fba                   Postprocessing method.
  --net tracer_b7              Segmentation Network. Check README for more info.
  --recursive                  Enables recursive search for images in a folder
  --batch_size 10              Batch Size for list of images to be loaded to
                               RAM

  --batch_size_seg 5           Batch size for list of images to be processed
                               by segmentation network

  --batch_size_mat 1           Batch size for list of images to be processed
                               by matting network

  --seg_mask_size 640          The size of the input image for the
                               segmentation neural network. Use 640 for Tracer B7 and 320 for U2Net

  --matting_mask_size 2048     The size of the input image for the matting
                               neural network.
  --trimap_dilation 30       The size of the offset radius from the
                                  object mask in pixels when forming an
                                  unknown area
  --trimap_erosion 5        The number of iterations of erosion that the
                                  object's mask will be subjected to before
                                  forming an unknown area
  --trimap_prob_threshold 231
                                  Probability threshold at which the
                                  prob_filter and prob_as_unknown_area
                                  operations will be applied

  --device cpu                 Processing Device.
  --fp16                       Enables mixed precision processing. Use only with CUDA. CPU support is experimental!
  --help                       Show this message and exit.


📦 Running the Framework / FastAPI HTTP API server via Docker:

Using the API via docker is a fast and non-complex way to have a working API.

Our docker images are available on Docker Hub.
Version tags are the same as the releases of the project with suffixes -cpu and -cuda for CPU and CUDA versions respectively.

Important Notes:

  1. Docker image has default front-end at / url and FastAPI backend with docs at /docs url.

  2. Authentication is enabled by default.
    Token keys are reset on every container restart if ENV variables are not set.
    See docker-compose.<device>.yml for more information.
    You can see your access keys in the docker container logs.

  3. There are examples of interaction with the API.
    See docs/code_examples/python for more details

🔨 Creating and running a container:

  1. Install docker-compose
  2. Run docker-compose -f docker-compose.cpu.yml up -d # For CPU Processing
  3. Run docker-compose -f docker-compose.cuda.yml up -d # For GPU Processing

Also you can mount folders from your host machine to docker container and use the CLI interface inside the docker container to process files in this folder.

Building a docker image on Windows is not officially supported. You can try using WSL2 or "Linux Containers Mode" but I haven't tested this.

☑️ Testing

☑️ Testing with local environment

  1. pip install -r requirements_test.txt
  2. pytest

☑️ Testing with Docker

  1. Run docker-compose -f docker-compose.cpu.yml run carvekit_api pytest # For testing on CPU
  2. Run docker-compose -f docker-compose.cuda.yml run carvekit_api pytest # For testing on GPU

👪 Credits: More info

💵 Support

You can thank me for developing this project and buy me a small cup of coffee ☕

Blockchain Cryptocurrency Network Wallet
Ethereum ETH / USDT / USDC / BNB / Dogecoin Mainnet 0x7Ab1B8015020242D2a9bC48F09b2F34b994bc2F8
Ethereum ETH / USDT / USDC / BNB / Dogecoin BSC (Binance Smart Chain) 0x7Ab1B8015020242D2a9bC48F09b2F34b994bc2F8
Bitcoin BTC - bc1qmf4qedujhhvcsg8kxpg5zzc2s3jvqssmu7mmhq
ZCash ZEC - t1d7b9WxdboGFrcVVHG2ZuwWBgWEKhNUbtm
Tron TRX - TH12CADSqSTcNZPvG77GVmYKAe4nrrJB5X
Monero XMR Mainnet 48w2pDYgPtPenwqgnNneEUC9Qt1EE6eD5MucLvU3FGpY3SABudDa4ce5bT1t32oBwchysRCUimCkZVsD1HQRBbxVLF9GTh3
TON TON - EQCznqTdfOKI3L06QX-3Q802tBL0ecSWIKfkSjU-qsoy0CWE

📧 Feedback

I will be glad to receive feedback on the project and suggestions for integration.

For all questions write: farvard34@gmail.com