Halve runtime when simplifying large lines #4

s3cur3 · 2022-05-26T17:28:17Z

First, let me say thank you for putting together this library—it saved us a ton of time in our project. 🙏

This PR dramatically improves performance on lines with a large number of points, primarily by reducing the number of times we have to iterate over the list.

Removes calls to length/1, since it's linear in the size of the list
Cuts down on the number of times we access the last element of a list
Cuts down on the number of new lists we construct
Cuts down on the number of times we iterate the full list to find the maximum distance

This also adds a very simple benchmark to the test suite, which can be run with:

$ mix test --only bench

Benchmarks on my machines (both running Elixir 13.4 + OTP 24, so the Intel machine has the benefit of the JIT):

Machine	Per-iteration Runtime Before	After
2019 Intel iMac	~0.15 sec	~0.07 sec
2020 M1 Macbook Air	~0.19 sec	~0.08 sec

This dramatically improves performance on lines with a large number of points, primarily by reducing the number of times we have to iterate over the list. - Removes calls to `length/1`, since it's linear in the size of the list - Cuts down on the number of times we access the last element of a list - Cuts down on the number of new lists we construct - Cuts down on the number of times we iterate the full list to find the maximum distance This also adds a very simple benchmark to the test suite, which can be run with: $ mix test --only bench On my machine (an M1 Mac running Elixir 13.4 + OTP 24, so no JIT support), the included benchmark runs in roughly 0.19 seconds per iteration in the shipping version of the code. With my changes applied, that drops down to roughly 0.08 seconds.

s3cur3 · 2022-05-26T17:28:59Z

lib/simplify.ex

-  @spec simplify(%Geo.LineString{}, number) :: %Geo.LineString{}
+  @spec simplify(Geo.LineString.t(), number) :: Geo.LineString.t()


Not perf related, but I couldn't help myself. (Cuts down on unnecessary recompiles by removing the compile-time dependency on the Geo library.)

pkinney · 2022-05-28T18:18:46Z

Thanks, @s3cur3 !

I can definitely confirm a big performance boost (2022 M1 Max Macbook Pro):

Before:

## SimplifyBench
benchmark name                       iterations   average time
short linestring - high tolerance        200000   8.35 µs/op
short linestring - normal tolerance      100000   11.79 µs/op
short linestring - low tolerance         100000   12.00 µs/op

After:

## SimplifyBench
benchmark name                       iterations   average time
short linestring - high tolerance       1000000   1.93 µs/op
short linestring - normal tolerance     1000000   2.68 µs/op
short linestring - low tolerance        1000000   2.85 µs/op

I'll merge this in then release as a 2.0.0 after I do a bit of long-overdue dependency updates and such.

pkinney · 2022-05-28T19:52:13Z

Released as part of v2.0.0

s3cur3 · 2022-05-28T21:24:34Z

Thank you so much! 🎉

s3cur3 commented May 26, 2022

View reviewed changes

pkinney merged commit 3535e1d into pkinney:master May 28, 2022

pkinney mentioned this pull request May 28, 2022

V2.0.0 #5

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Halve runtime when simplifying large lines #4

Halve runtime when simplifying large lines #4

s3cur3 commented May 26, 2022 •

edited

Loading

s3cur3 May 26, 2022

pkinney May 28, 2022

pkinney commented May 28, 2022

pkinney commented May 28, 2022

s3cur3 commented May 28, 2022

		@spec simplify(%Geo.LineString{}, number) :: %Geo.LineString{}
		@spec simplify(Geo.LineString.t(), number) :: Geo.LineString.t()

Halve runtime when simplifying large lines #4

Halve runtime when simplifying large lines #4

Conversation

s3cur3 commented May 26, 2022 • edited Loading

s3cur3 May 26, 2022

Choose a reason for hiding this comment

pkinney May 28, 2022

Choose a reason for hiding this comment

pkinney commented May 28, 2022

pkinney commented May 28, 2022

s3cur3 commented May 28, 2022

s3cur3 commented May 26, 2022 •

edited

Loading