Skip to content

The problem of choosing a metric for determining the degree similarities between objects of different nature (for strings, outliers, mixed and sparse data)

Notifications You must be signed in to change notification settings

AndrewSalygin/metrics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Metrics

The problem of choosing a metric for determining the degree similarities between objects of different nature (for strings, outliers, mixed and sparse data)

Sparse data

Initial class distribution and dictionaries used:

image

Research results

image

Identifying outliers in data

Source dataset:

image

Mahalanobis distance

The result of the application:

image

Cook's Distance

The result of the application:

image

DBSCAN method

The result of the application:

image

Mixed data

KNN

image

KMeans & K-Prototype

image

KMedoids

image

About

The problem of choosing a metric for determining the degree similarities between objects of different nature (for strings, outliers, mixed and sparse data)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages