TMX2Moses

TMX2Moses transforms translation memory files into a parallell corpus of two aligned bitext files suitable for training a statistical machine translation system like Moses. Thus TMX2Moses makes it possible to train your machine translation system on a corpus consisting of the documents you (or some one else) have translated. This is a great advantage as the vocabulary and style is adapted to the domain you are working on.

The software has been tested mainly on TMX-files created with OmegaT, but it works with TMX-files created with other CAT-tools (Computer Aided Translation) as well.

Please read the QuickStart if you would like to test the program. You can test on the TMX-file example_en-sv-omegat.tmx (a translation to Swedish of a Wikipedia article on translation) or your own TMX-file. If you enter the name of a folder all TMX-files in the folder will be processed.

Contributions are welcome. Just make a fork, make a pull request and I will merge as soon as possible.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
Compiled for java 1.7		Compiled for java 1.7
Compiled		Compiled
Source		Source
.gitattributes		.gitattributes
.gitignore		.gitignore
QuickStart_TMX2Moses.pdf		QuickStart_TMX2Moses.pdf
QuickStart_TMX2Moses.txt		QuickStart_TMX2Moses.txt
README.md		README.md
TMX2Moses.jar		TMX2Moses.jar
TMX2Moses.jar.sig		TMX2Moses.jar.sig
Translate TMX2Moses to your language.txt		Translate TMX2Moses to your language.txt
credit.txt		credit.txt
example_en-sv-omegat.tmx		example_en-sv-omegat.tmx
license.txt		license.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TMX2Moses

About

Releases

Packages

Languages

License

havet/TMX2Moses

Folders and files

Latest commit

History

Repository files navigation

TMX2Moses

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages