Classic Datasets

Loaders for classic datasets commonly used in Machine Learning:

Dataset	# Samples	# Features	# Classes	Balance
Ionosphere	351	34	2	0.56
Letter Recognition	20000	16	10	0.91
Telescope	19020	10	2	0.54
Pen Digits	10992	16	10	0.92
Robot Navigation	5456	24	4	0.15
Segmentation	2310	16	7	1.00
USPS	9298	256	10	0.46
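
The Balance column is not defined in this README. One reading that fits the listed values (for example 1.00 for Segmentation, whose classes are all the same size) is the ratio of the smallest class size to the largest. The sketch below computes that ratio from a list of labels. Both the definition and the helper name `class_balance` are assumptions for illustration, not part of classicdata.

```python
from collections import Counter


def class_balance(labels):
    """Return smallest class size divided by largest class size.

    Assumed definition of "Balance"; 1.0 means perfectly balanced classes.
    """
    counts = Counter(labels)
    if not counts:
        raise ValueError("labels must not be empty")
    return min(counts.values()) / max(counts.values())


# Toy labels: three samples of class "a", four of class "b".
toy_labels = ["a"] * 3 + ["b"] * 4
print(round(class_balance(toy_labels), 2))  # 3 / 4 -> 0.75
```

With a loaded dataset, the same function could be applied to its labels, e.g. `class_balance(list(ionosphere.labels))`.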

Installation

pip install classicdata

Run python -m classicdata.info to list all implemented datasets.

Example Usage

from classicdata import Ionosphere

ionosphere = Ionosphere()

# Use ionosphere.points and ionosphere.labels...
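
As one illustration of using the points and labels, the sketch below splits a dataset into training and test parts. It runs on stand-in data so that it needs no download. `split_dataset` is a hypothetical helper, not part of classicdata, and it only assumes that points and labels are equally long, indexable sequences.

```python
import random


def split_dataset(points, labels, test_fraction=0.25, seed=0):
    """Shuffle sample indices and split points/labels into train and test."""
    if len(points) != len(labels):
        raise ValueError("points and labels must have the same length")
    indices = list(range(len(points)))
    random.Random(seed).shuffle(indices)  # deterministic for a fixed seed
    n_test = int(round(len(indices) * test_fraction))
    test_idx, train_idx = indices[:n_test], indices[n_test:]

    def pick(seq, idx):
        return [seq[i] for i in idx]

    return (pick(points, train_idx), pick(labels, train_idx),
            pick(points, test_idx), pick(labels, test_idx))


# Stand-in data; with classicdata installed you would pass
# ionosphere.points and ionosphere.labels instead.
points = [[float(i), float(i) * 2.0] for i in range(8)]
labels = [i % 2 for i in range(8)]

train_x, train_y, test_x, test_y = split_dataset(points, labels)
print(len(train_x), len(test_x))  # 6 training samples, 2 test samples
```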

Related Projects

There are other projects. They are more mature, more robust, and generally better. This project is called classicdata because it sticks to small, simple, classic datasets, which is sometimes all you need. Other times, consider the following projects.

OpenML: better, faster, stronger; more complex, though
sklearn.datasets: limited selection; no metadata
torchvision.datasets: limited selection; datasets too modern (big)
TensorFlow Datasets: datasets too modern (big)
