This is a PyTorch implementation of the Transformer from scratch.
The Transformer is a powerful sequence transduction model, and here we implement every detail of it. We trained our model to translate from English to Chinese.
TODO:
- Preprocess the EN-ZH dataset
- Construct the training part of the Transformer
- Optimize the hyperparameters to obtain the best model
- Add an interface that we can use to do the translation
Overall architecture of the model:
The Transformer has an encoder-decoder structure like this:
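
As a rough illustration of the encoder-decoder data flow, here is a minimal sketch. It is not this repository's from-scratch code: the class name `EncoderDecoder`, its parameters, and the small default sizes are hypothetical, and PyTorch's built-in `nn.TransformerEncoder` / `nn.TransformerDecoder` layers stand in for the hand-written blocks. Positional encoding is left out to keep the sketch short.

```python
import torch
import torch.nn as nn


class EncoderDecoder(nn.Module):
    """Hypothetical skeleton: encode the source once, then decode against it."""

    def __init__(self, src_vocab, tgt_vocab, d_model=64, nhead=4, num_layers=2):
        super().__init__()
        # Token embeddings (positional encoding omitted in this sketch).
        self.src_embed = nn.Embedding(src_vocab, d_model)
        self.tgt_embed = nn.Embedding(tgt_vocab, d_model)
        # Assumption: built-in layers stand in for the from-scratch blocks.
        enc_layer = nn.TransformerEncoderLayer(
            d_model, nhead, dim_feedforward=128, batch_first=True
        )
        dec_layer = nn.TransformerDecoderLayer(
            d_model, nhead, dim_feedforward=128, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers)
        self.decoder = nn.TransformerDecoder(dec_layer, num_layers)
        # Projects decoder states to scores over the target vocabulary.
        self.generator = nn.Linear(d_model, tgt_vocab)

    def forward(self, src, tgt):
        # src: (batch, src_len) token ids; tgt: (batch, tgt_len) token ids.
        memory = self.encoder(self.src_embed(src))
        tgt_len = tgt.size(1)
        # Causal mask: True marks positions a target token may NOT attend to.
        causal_mask = torch.triu(
            torch.ones(tgt_len, tgt_len, dtype=torch.bool, device=tgt.device),
            diagonal=1,
        )
        out = self.decoder(self.tgt_embed(tgt), memory, tgt_mask=causal_mask)
        return self.generator(out)  # (batch, tgt_len, tgt_vocab)
```

For example, `EncoderDecoder(src_vocab=100, tgt_vocab=120)` applied to a source batch of shape `(2, 7)` and a target batch of shape `(2, 5)` returns logits of shape `(2, 5, 120)`. The encoder output (`memory`) is computed once per source sentence, and the decoder attends to it while the causal mask keeps each target position from seeing later positions.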