Neural Machine Translation: a Practical Session

This repo contains a Google Colab notebook which presents a practical session to introduce the training of neural machine translation systems to students. It was created for the lab sessions of the machine translation module of the IARFID master from Universitat Politècnica de València.

You can open the notebook by cliking on .

Overview

The goal of this lab session is to build machine translation systems based on neural networks (neural machine translation; NMT) from a dataset of bilingual parallel sentences using a custom version of the OpenNMT-py toolkit (Klein et al., 2017).

Dataset

The dataset we are going to use in this practical session is the Spanish–English language pair of the EuTrans corpus (Casacuberta et al., 2004), whose content involves the interaction of a customer with a receptionist at the frontdesk of a hotel. It comes with the custom version of OpenNMT-py that we are using. It is located at OpenNMT-py/dataset/EuTrans.

Here we can see an example of its content:

por favor, ¿nos puede dar la llave de la habitación?

can you give us the key to the room, please?

Network description

The neural network that we are going to use for training the NMT system has the following configuration:

Encoder and decoder are both Transformer with 64 neurons.
2 layers.
Hidden Transformer feed-forward of size 64.
2 self-attention heads.
Source word vector of size 64.
Target word vector of size 64.

Colab resources

References

Casacuberta, F., Ney, H., Och, F.J., Vidal, E., Vilar, J.M., Barrachina, S., García-Varea, I., Llorens, D., Hinarejos, C.D., & Molau, S. (2004). Some approaches to statistical and finite-state speech-to-speech translation. Comput. Speech Lang., 18, 25–47.
Klein, G., Kim, Y., Deng, Y., Senellart, J., and Rush, A. M. (2017). OpenNMT: Open-Source Toolkit for Neural Machine Translation. In Proceedings of the Association for Computational Linguistics: System Demonstration, 67–72.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
nmt-practical-session.ipynb		nmt-practical-session.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Machine Translation: a Practical Session

Overview

Dataset

Network description

Colab resources

References

About

Releases

Packages

Languages

License

midobal/nmt-practical-session

Folders and files

Latest commit

History

Repository files navigation

Neural Machine Translation: a Practical Session

Overview

Dataset

Network description

Colab resources

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages