optimize `IO.whenA` #4135

Jasper-M · 2024-09-12T15:22:43Z

I noticed that this simple recursive loop

def loop(i: Long): IO[Unit] =
  IO.whenA(i % 1000_000 == 0)(IO.println(i / 1000_000)).flatMap{ _ =>
    IO.whenA(i < Long.MaxValue)(loop(i + 1))
  }

loop(0).unsafeRunSync()

ends with

1071
1072
1073
java.lang.NegativeArraySizeException: -2147483648
	at cats.effect.ArrayStack.checkAndGrow(ArrayStack.scala:73)
	at cats.effect.ArrayStack.push(ArrayStack.scala:34)
	at cats.effect.IOFiber.runLoop(IOFiber.scala:367)
	at cats.effect.IOFiber.autoCedeR(IOFiber.scala:1423)
	at cats.effect.IOFiber.run(IOFiber.scala:119)
	at cats.effect.unsafe.WorkerThread.run(WorkerThread.scala:743)
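
A note on the number in the exception: the last value printed is 1073, so the loop died a little past 1.073 billion iterations, which is close to 2^30 = 1073741824. That is consistent with a backing array of 2^30 slots being doubled and the new length overflowing `Int`. The sketch below only checks the arithmetic. The doubling policy and the name `doubled` are assumptions for illustration, not taken from `ArrayStack.scala`.

```scala
object GrowOverflow {
  // Assumed growth policy (hypothetical): double the current capacity.
  def doubled(capacity: Int): Int = capacity * 2

  def main(args: Array[String]): Unit = {
    val capacity = 1 << 30 // 1073741824 slots
    // 2^31 does not fit in a signed 32-bit Int, so the product wraps around
    // to Int.MinValue, the same value shown in the exception message.
    println(doubled(capacity))
    // Passing that value to `new Array[AnyRef](...)` would throw
    // java.lang.NegativeArraySizeException.
  }
}
```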

The equivalent `if`/`else` code, by contrast, seems to keep running without noticeable GC pauses.
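
For reference, a minimal sketch of what the `if`/`else` variant might look like. The comment above does not show it, so the exact shape is an assumption. The `limit` parameter is hypothetical and only there so the sketch terminates; the reported loop runs up to `Long.MaxValue`.

```scala
import cats.effect.IO
import cats.effect.unsafe.implicits.global

object IfElseLoop {
  // Same loop as in the report, with each IO.whenA replaced by a plain
  // if/else so that no extra voiding step wraps the recursive call.
  // `limit` is a hypothetical parameter added for this sketch.
  def loop(i: Long, limit: Long): IO[Unit] =
    (if (i % 1000000 == 0) IO.println(i / 1000000) else IO.unit).flatMap { _ =>
      if (i < limit) loop(i + 1, limit) else IO.unit
    }

  def main(args: Array[String]): Unit =
    loop(0, 3000000).unsafeRunSync()
}
```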

armanbilge · 2024-09-12T20:30:50Z

Interesting, that must be because of the `void(...)` in the `Applicative#whenA` implementation? We avoid that because we require that the argument is already voided.

https://github.com/typelevel/cats/blob/927a9bb957530b144506e96fc78bff67553858be/core/src/main/scala/cats/Applicative.scala#L263-L264

See also:

ArrayIndexOutOfBoundsException in ByteStack #3907

armanbilge

Shall we add a test case based on your example?

Jasper-M · 2024-09-13T15:34:40Z

We could do that, but this slimmed-down version still takes 20 seconds to complete on my machine. And that's on the happy path, where it doesn't fail or use a ridiculous amount of memory.

def loop(i: Long): IO[Unit] =
  IO.unit >> IO.whenA(i < 1_100_000_000)(loop(i + 1))

armanbilge

Ah yes, d'oh, good point. Probably not worth the strain on CI.

lenguyenthanh · 2024-09-13T19:50:00Z

> Interesting, that must be because of the void(...) in the Applicative#whenA implementation?

Should we do something about `IO.void`, for example:

 def void: IO[Unit] =
-  map(_ => ())
+  isInstanceOf[IO[Unit]] match {
+    case true => this.asInstanceOf[IO[Unit]]
+    case _ => map(_ => ())
+  }

armanbilge · 2024-09-13T19:57:27Z

> isInstanceOf[IO[Unit]]

@lenguyenthanh Unfortunately this won't work because type parameters are erased at runtime, so it will always be true.
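
A small sketch of the erasure behaviour described here, using a hypothetical `Box` type instead of `IO` so it runs without any dependencies. The compiler emits an unchecked warning on the type test, which is the hint that the `Unit` argument cannot be inspected at runtime.

```scala
object ErasureDemo {
  // Hypothetical stand-in for IO: a generic wrapper with one type parameter.
  final case class Box[A](value: A)

  // At runtime only the class `Box` is tested; the type argument is erased.
  def looksLikeUnitBox(b: Box[_]): Boolean = b.isInstanceOf[Box[Unit]]

  def main(args: Array[String]): Unit = {
    println(looksLikeUnitBox(Box(())))   // true
    println(looksLikeUnitBox(Box(42)))   // also true, although this is a Box[Int]
  }
}
```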

djspiewak

This makes quite a bit of sense honestly. Also it raises the question of when and why whenA would actually be desirable…

Can you add an override in the Async typeclass implementation for both methods as well? That will allow the optimization to work in polymorphic contexts.

Jasper-M · 2024-09-23T08:26:17Z

> Can you add an override in the Async typeclass implementation for both methods as well? That will allow the optimization to work in polymorphic contexts.

The problem, as @armanbilge pointed out, is that the typeclass method takes a `=> F[A]` whereas the `IO` method takes a `=> IO[Unit]`. So I don't think you can do any better than `if (cond) void(f) else unit` in the typeclass. Maybe the method signature in cats-core should be fixed, but that seems hard to do without breaking things.
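
To make the difference concrete, here is a rough sketch of the two shapes side by side. The names `whenAGeneric` and `whenAUnit` are hypothetical, and the body of the `IO`-specific version is an assumption for illustration rather than a copy of the code in this PR. It needs cats-effect 3 on the classpath.

```scala
import cats.Applicative
import cats.effect.IO
import cats.effect.unsafe.implicits.global

object WhenASignatures {
  // Typeclass-style shape: `f` may produce any A, so the result has to be
  // voided, which adds one extra `map` around every call.
  def whenAGeneric[F[_], A](cond: Boolean)(f: => F[A])(implicit F: Applicative[F]): F[Unit] =
    if (cond) F.void(f) else F.unit

  // IO-style shape: the argument is already an IO[Unit], so it can be
  // returned as-is without wrapping (assumed body, for illustration).
  def whenAUnit(cond: Boolean)(action: => IO[Unit]): IO[Unit] =
    if (cond) action else IO.unit

  def main(args: Array[String]): Unit = {
    // Accepts an IO[Int]; the 42 is discarded by `void`.
    whenAGeneric(true)(IO.pure(42)).unsafeRunSync()
    // Only accepts an IO[Unit]; nothing needs to be discarded.
    whenAUnit(true)(IO.println("ran")).unsafeRunSync()
  }
}
```

With the second shape a caller holding an `IO[Int]` has to call `.void` explicitly, which is the inconsistency with cats-core discussed in the following comments.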

djspiewak

Ooooooh yeah this is exceptionally annoying. Both because the signature in Cats is definitely wrong, but also because we're inconsistent with it and I didn't realize. At any rate, bigger fish than we can fry right now.

armanbilge · 2024-09-23T20:40:03Z

> Both because the signature in Cats is definitely wrong

This might be a matter of opinion, see a discussion on this topic in typelevel/cats#4352 (comment).

Jasper-M added 3 commits September 12, 2024 15:11

optimize IO.whenA

9d6d7f2

clean up imports

72d547c

scalafmt

062120d

armanbilge changed the title ~~optimize IO.whenA~~ optimize `IO.whenA` Sep 12, 2024

armanbilge reviewed Sep 12, 2024

armanbilge approved these changes Sep 13, 2024

djspiewak requested changes Sep 23, 2024

djspiewak approved these changes Sep 23, 2024

djspiewak merged commit cb76f95 into typelevel:series/3.5.x Sep 23, 2024
27 of 31 checks passed

iRevive mentioned this pull request Sep 27, 2024

benchmarks: add BatchSpanProcessor benchmark typelevel/otel4s#791

Merged

1 task

armanbilge added the 🍄 enhancement label Oct 27, 2024

counter2015 mentioned this pull request Oct 28, 2024

Check target array length before allocating #4159

Merged
