Why does the algorithm more faster in the high dimensional? #2

Tsepu · 2023-04-20T07:30:16Z

Why does the algorithm faster in the high dimensional?

I tried the algorithm using several cases (1000 points with 2-8-dimensional).
It returns results faster in low dimensional than high dimensional. Is there any reason?

Thanks

DataOmbudsman · 2023-10-12T20:44:56Z

Several things can explain this depending on your data and the eps parameter you used. As the number of dimensions increases, distance between data points change. Thus, "being within eps distance" gets another meaning, which can e.g., heavily influence the calculation cost of connected component search during insertion.

KwaiYii-Center · 2024-05-22T08:27:05Z

IncDBSCAN is very slow when using about 70w points with 1024 dimension. The distance metric is cosine and eps is set to 0.12. Is there any solution? thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why does the algorithm more faster in the high dimensional? #2

Why does the algorithm more faster in the high dimensional? #2

Tsepu commented Apr 20, 2023

DataOmbudsman commented Oct 12, 2023

KwaiYii-Center commented May 22, 2024

Why does the algorithm more faster in the high dimensional? #2

Why does the algorithm more faster in the high dimensional? #2

Comments

Tsepu commented Apr 20, 2023

DataOmbudsman commented Oct 12, 2023

KwaiYii-Center commented May 22, 2024