You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you guys for open sourcing this amazing work, I was curious though if there are any plans for a Triton implementation for a higher level implementation. I would like to experiment with this project in tandem with a library I have been working on to accelerate diffusion models but I am not entirely familiar with CUDA yet.
Looking forward to your response 🙂
The text was updated successfully, but these errors were encountered:
Mainly looking for an implementation I can easily play around with, hopefully stuff like bias and activation fusion, extension to 2D, etc. Is there a reference pytorch implementation anywhere I can look at?
Thank you guys for open sourcing this amazing work, I was curious though if there are any plans for a Triton implementation for a higher level implementation. I would like to experiment with this project in tandem with a library I have been working on to accelerate diffusion models but I am not entirely familiar with CUDA yet.
Looking forward to your response 🙂
The text was updated successfully, but these errors were encountered: