Torchclust: Clustering Algorithms written with PyTorch for running on GPU

Torchclust was developed to remove the need to move PyTorch tensors from the GPU to the CPU and convert them to NumPy arrays just to use libraries such as scikit-learn.

Torchclust features implementations of common clustering algorithms with a scikit-learn feel.

Implemented algorithms

Centroid-based Clustering
- KMeans
- MeanShift
Density-based Clustering
- DBSCAN
- Gaussian Mixture Model
Deep / Learning-based Clustering
- Self-Organising Maps
Metrics
- Internal
  - Silhouette Score
  - Inertia
  - Davies-Bouldin Index
  - Calinski-Harabasz Score / Variance Ratio Criterion
- External
  - Purity Score
  - Rand Index
  - Adjusted Rand Index
  - Mutual Information
  - Normalised Mutual Information
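
To make two of the external metrics above concrete, here is a minimal sketch of the purity score and the Rand index in plain Python. It illustrates the standard definitions only and is not Torchclust's implementation; the function names `purity_score` and `rand_index` are chosen for this example and may not match the library's API.

```python
from collections import Counter
from itertools import combinations


def purity_score(labels_true, labels_pred):
    """Fraction of points that share the majority true class of their cluster."""
    # Count how often each true class appears inside each predicted cluster.
    clusters = {}
    for truth, cluster in zip(labels_true, labels_pred):
        clusters.setdefault(cluster, Counter())[truth] += 1
    majority_total = sum(max(counts.values()) for counts in clusters.values())
    return majority_total / len(labels_true)


def rand_index(labels_true, labels_pred):
    """Fraction of point pairs on which the two labelings agree.

    A pair agrees when both labelings put the two points together,
    or both put them apart. Requires at least two points.
    """
    pairs = list(combinations(range(len(labels_true)), 2))
    agreements = sum(
        (labels_true[i] == labels_true[j]) == (labels_pred[i] == labels_pred[j])
        for i, j in pairs
    )
    return agreements / len(pairs)


truth = [0, 0, 0, 1, 1, 1]
pred = [1, 1, 0, 0, 0, 0]
print(purity_score(truth, pred))
print(rand_index(truth, pred))
```

Both functions take two equal-length sequences of labels. The pairwise loop in `rand_index` is quadratic in the number of points, so it is only suitable for small inputs.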

Contributing

This is still an ongoing project, and contributions from the open-source community are warmly welcomed.

Contributions can be made in various forms:

- Writing docs / updating the README
- Fixing bugs
- Writing more efficient implementations of existing algorithms
- Implementing more algorithms

Installation

Be sure the GPU version of PyTorch is installed if you intend to run the algorithms on a GPU.

pip install torchclust
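
One way to confirm that the installed PyTorch build can see a GPU is sketched below. The helper name `describe_torch_backend` is hypothetical and not part of Torchclust; it relies only on the standard library and on `torch.cuda.is_available()`.

```python
import importlib.util


def describe_torch_backend():
    """Return "missing", "cpu", or "cuda" for the local PyTorch install."""
    # Check for the package first so the helper also works without PyTorch.
    if importlib.util.find_spec("torch") is None:
        return "missing"
    import torch

    # True only when a CUDA-enabled build finds a usable GPU.
    return "cuda" if torch.cuda.is_available() else "cpu"


print(describe_torch_backend())
```

If this prints `cpu` on a machine with a GPU, the CPU-only build of PyTorch is probably the one installed.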

Usage

KMeans on Gaussian blobs

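import torch
import matplotlib.pyplot as plt

from torchclust.utils.datasets import make_blobs
from torchclust.centroid import KMeans

# Generate 1000 two-dimensional points around 3 centers;
# the second return value is not needed here and is discarded.
x, _ = make_blobs(1000, num_features=2, centers=3)

# Fit KMeans with 3 clusters and get a cluster label for each point.
kmeans = KMeans(num_clusters=3)
labels = kmeans.fit_predict(x)

# Plot the points, coloured by their assigned cluster.
plt.scatter(x[:, 0], x[:, 1], c=labels)
plt.show()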
