Implementation of Multi-head Self-attention transformers from scrach This is a hobby project implemented with an intention of gaining hands-on experience with transformers and self-attention. Reference This blog post was followed as a reference.