A custom implementation of the Transformer architecture.
This project involves building the core components of the Transformer architecture from the ground up. It serves as an educational tool to deeply understand self-attention mechanisms and sequence-to-sequence modeling.
For feedback or suggestions, contact me at: dev.jhawar.cs@gmail.com