Further speed improvements #8

folkertdev · 2020-06-17T18:19:15Z

and also a bug fix, see comments

by specializing to the amount of bits rotated, the called function is a one-argument function, so elm does not have to apply the usual F and A wrappers that enable currying

folkertdev

On my machine this now does ~16MB/s in a benchmark, and the 90mb file takes just over 5 seconds of scripting time.

folkertdev · 2020-06-17T18:19:59Z

src/SHA1.elm

-    State (Tuple5 (initial.a + a) (initial.b + b) (initial.c + c) (initial.d + d) (initial.e + e))
+    State
+        { a = initial.a + a
+        , b = initial.b + b
+        , c = initial.c + c
+        , d = initial.d + d
+        , e = initial.e + e
+        }


A plain record is much faster than using the alias (which is a function internally, the record literal is really an object literal in JS)

folkertdev · 2020-06-17T18:21:07Z

src/SHA1.elm

@@ -373,24 +374,84 @@ So in the recursion, `b16` is dropped, all the others shift one position to the
 Then the `deltaState` is also updated with the `value`.

 -}
-reduceWords i deltaState b16 b15 b14 b13 b12 b11 b10 b9 b8 b7 b6 b5 b4 b3 b2 b1 =
-    if (i - blockSize) < 0 then
+reduceWords i ((DeltaState { a, b, c, d, e }) as deltaState) b16 b15 b14 b13 b12 b11 b10 b9 b8 b7 b6 b5 b4 b3 b2 b1 =


This function now performs 8 calculations per iteration.That means fewer function calls, but diminishing returns mean that it's not worth it to go up to 16.

folkertdev · 2020-06-17T18:21:55Z

src/SHA1.elm

+                (shiftedA + f + e + int)
+                    |> Bitwise.shiftRightZfBy 0


This shiftRightZfBy 0 fixes the bug on the large file reported in #5

folkertdev · 2020-06-17T18:22:41Z

src/SHA1.elm

+rotateLeftBy1 : Int -> Int
+rotateLeftBy1 i =
+    -- because of how `rotateLeftBy1` is used, the `Bitwise.shiftRightZfBy 0` is not required
+    Bitwise.or (Bitwise.shiftRightZfBy 31 i) (Bitwise.shiftLeftBy 1 i)


specializing to shift by 1 is important. This function takes only 1 argument now, an so elm does not need to apply wrappers that enable currying of this function. Much faster.

TSFoster · 2020-06-18T12:27:33Z

Brilliant, thanks. Love learning about elm perf from you, thanks for all the explanations

TSFoster · 2020-06-18T23:52:01Z

Now published as 2.1.1

(cc @BKSpurgeon)

folkertdev added 3 commits June 17, 2020 17:29

remove function call and inline loop

7a26505

optimize bit rotation

36bf8be

by specializing to the amount of bits rotated, the called function is a one-argument function, so elm does not have to apply the usual F and A wrappers that enable currying

optimize calculateDigestDeltas

30baa0b

folkertdev commented Jun 17, 2020

View reviewed changes

folkertdev mentioned this pull request Jun 17, 2020

Hashing elm/bytes #5

Closed

TSFoster merged commit 6a2b3ae into TSFoster:master Jun 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Further speed improvements #8

Further speed improvements #8

folkertdev commented Jun 17, 2020

folkertdev left a comment

folkertdev Jun 17, 2020

folkertdev Jun 17, 2020

folkertdev Jun 17, 2020

folkertdev Jun 17, 2020

TSFoster commented Jun 18, 2020

TSFoster commented Jun 18, 2020

Further speed improvements #8

Further speed improvements #8

Conversation

folkertdev commented Jun 17, 2020

folkertdev left a comment

Choose a reason for hiding this comment

folkertdev Jun 17, 2020

Choose a reason for hiding this comment

folkertdev Jun 17, 2020

Choose a reason for hiding this comment

folkertdev Jun 17, 2020

Choose a reason for hiding this comment

folkertdev Jun 17, 2020

Choose a reason for hiding this comment

TSFoster commented Jun 18, 2020

TSFoster commented Jun 18, 2020