-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance and Benchmarks #104
Conversation
It's interesting, in the last commit I found
|
It is possible that julia> @btime t = Libtask.CTask(f, args...);
258.923 ms (766345 allocations: 43.38 MiB)
julia> @btime Libtask.step_in(t.tf.tape, args)
95.054 ns (1 allocation: 48 bytes)
julia> @btime f(args...)
2.549 μs (38 allocations: 1.95 KiB)
(2.0, VarInfo (2 variables (μ, σ), dimension 2; logp: -1.2750123006e7))
So it appears that a lot of time is spent on repetitively constructing |
Without Cache:
With IR and Tape Cache:
|
In spite of numeric test failures and a few errors, unit tests finished in about 2 hours on my machine:
|
Co-authored-by: David Widmann <devmotion@users.noreply.github.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can confirm that the tests now run correctly - we can rerun the Turing CI once this PR is merged. Fingers crossed!
This PR is ready to merge. @yebai |
No description provided.