This project implements an Encoder-Decoder Transformer model from scratch (using PyTorch's nn.MultiheadAttention but not nn.TransformerEncoder/Decoder) to perform basic arithmetic (addition and ...
This Transformer Arithmetic Project was completed as part of Assignment 5-2 from the EECS 498-007 Deep Learning for Computer Vision course offered by the University of Michigan. Although I am a ...