-
-
Notifications
You must be signed in to change notification settings - Fork 66
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Release 0.7 #132
Release 0.7 #132
Conversation
Nope, I think this is good, but maybe there's a way to exclude the ViTs from the release for now? I'm working on a refactor for those and there might be breaking API changes when that finalises (which would probably make releases more complicated). If it's not a biggie then that's fine (a lot of the models will probably require minor API tweaks for adding more general implementations in the future, though, so I'm a little unsure how this will play out) |
That's fine, we can always make a breaking release for this package when needed. I just want to avoid releasing a new API and immediately changing it within days. I commented out the ViT code from this release. When you submit the refactor, just comment it back in. |
Maybe it's worth trying to expose the activation function to the user at a uniform level in the API before the release? It's not much of a pain but some models do it (ResNet, MobileNet now) and others don't (DenseNet, frex.) One idea I had was that the function for returning the layers could expose this, with a documentation note that the original model can be used from the constructor while additional customisation can be added using the layers and wrapping them (this is implicit right now but isn't documented - also, docs keep failing, not sure what that's about) Also, another quite minor change (but could potentially help clean up the code a little bit): |
You mean like
Taking a look over all the places it is used, it is almost always part of a larger |
Yeah true, this probably needs quite some thinking for making it somewhat similar for all the networks - it probably doesn't need to hold up v0.7.
Hmmm, true. I also see that Chain on Flux master works even with arrays inside it, so pretty soon we can do away with the splats anyways I reckon |
|
1055055
to
d162d9e
Compare
I tested the |
The ViT tests are still commented out, is that intentional until we find a fix for the OOMs? |
It almost always OOMs for all systems right? |
Yep, only macOS is spared because the runners have more memory. One solution could maybe be to remove tests for |
Let's see what happens |
Okay, the Flux v0.13 release seems imminent, so let's just wait for that so we can bump our compat. |
This should wait on #125 before merging and releasing (which I will review tomorrow). @theabhirath any other PRs with API changes that you think should go in this release?
Last minute changes: