Transformer from scratch

This is a PyTorch implementation of the Transformer from scratch.

The Transformer is a powerful sequence transduction model, and this repository implements every detail of it. We trained the model to translate from English to Chinese.

TODO:

Preprocess the EN-ZH Dataset
Construct the training part of the transformer
Optimize the hyperparameters to obtain the best model
Add an interface for running translations

Architecture of Transformer

Overall architecture of the model:

The Transformer has an encoder-decoder structure, as shown in Architecture.png.
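
As a rough illustration of the encoder-decoder data flow, here is a minimal PyTorch sketch with one encoder layer and one decoder layer. It is not the code in this repository's model directory: the names `MultiHeadAttention`, `EncoderLayer`, and `DecoderLayer`, the layer sizes, and the random input tensors are all assumptions made for the example. Token embeddings, positional encodings, padding masks, dropout, layer stacking, and the final vocabulary projection are left out to keep it short.

```python
import math

import torch
import torch.nn as nn


def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_head)
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    if mask is not None:
        # Positions where mask is False are blocked from attention.
        scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v


class MultiHeadAttention(nn.Module):
    # Hypothetical name; a plain multi-head attention block.
    def __init__(self, d_model, n_heads):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.w_q = nn.Linear(d_model, d_model)
        self.w_k = nn.Linear(d_model, d_model)
        self.w_v = nn.Linear(d_model, d_model)
        self.w_o = nn.Linear(d_model, d_model)

    def _split(self, x):
        # (batch, seq, d_model) -> (batch, heads, seq, d_head)
        b, t, _ = x.shape
        return x.view(b, t, self.n_heads, self.d_head).transpose(1, 2)

    def forward(self, query, key, value, mask=None):
        q = self._split(self.w_q(query))
        k = self._split(self.w_k(key))
        v = self._split(self.w_v(value))
        out = scaled_dot_product_attention(q, k, v, mask)
        b, _, t, _ = out.shape
        # Merge the heads back into a single d_model-sized vector.
        return self.w_o(out.transpose(1, 2).reshape(b, t, -1))


def feed_forward(d_model, d_ff):
    return nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model))


class EncoderLayer(nn.Module):
    def __init__(self, d_model, n_heads, d_ff):
        super().__init__()
        self.self_attn = MultiHeadAttention(d_model, n_heads)
        self.ff = feed_forward(d_model, d_ff)
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x):
        # Self-attention, then feed-forward, each with residual + layer norm.
        x = self.norm1(x + self.self_attn(x, x, x))
        return self.norm2(x + self.ff(x))


class DecoderLayer(nn.Module):
    def __init__(self, d_model, n_heads, d_ff):
        super().__init__()
        self.self_attn = MultiHeadAttention(d_model, n_heads)
        self.cross_attn = MultiHeadAttention(d_model, n_heads)
        self.ff = feed_forward(d_model, d_ff)
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.norm3 = nn.LayerNorm(d_model)

    def forward(self, y, memory):
        t = y.size(1)
        # Causal mask: each target position sees itself and earlier positions only.
        causal = torch.ones(t, t, device=y.device).tril().bool()
        y = self.norm1(y + self.self_attn(y, y, y, causal))
        # Cross-attention: queries from the decoder, keys/values from the encoder.
        y = self.norm2(y + self.cross_attn(y, memory, memory))
        return self.norm3(y + self.ff(y))


torch.manual_seed(0)
d_model, n_heads, d_ff = 32, 4, 64  # illustrative sizes, not tuned values
encoder = EncoderLayer(d_model, n_heads, d_ff)
decoder = DecoderLayer(d_model, n_heads, d_ff)

src = torch.randn(2, 7, d_model)  # stand-in for embedded source (English) tokens
tgt = torch.randn(2, 5, d_model)  # stand-in for embedded target (Chinese) tokens

memory = encoder(src)        # (2, 7, 32)
out = decoder(tgt, memory)   # (2, 5, 32)
print(out.shape)
```

The decoder output keeps the target sequence length while attending over the full encoder output, which is what lets the source and target sentences differ in length.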
