Skip to content

This is my 5th place solution to the AIJ Junior "Digital Peter".

Notifications You must be signed in to change notification settings

Shemka/DigitalPeterSolution

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Digital Peter Contest Solution

Task description

Digital Peter is an educational task with a historical slant created on the basis of several AI technologies (Computer Vision, NLP, and knowledge graphs). The task was prepared jointly with the Saint Petersburg Institute of History (N.P.Lihachov mansion) of Russian Academy of Sciences, Federal Archival Agency of Russia and Russian State Archive of Ancient Acts.

Contestants are invited to create an algorithm for line-by-line recognition of manuscripts written by Peter the Great.

Approach

All images were resized to 128x1024 and normalized. Also rotated images were in dataset, so I used simple heuristic for their rotation. Only Random rotations and crops were used as augmentaions. Only lines with the most frequent chars were chosen. Then I used NN with following architecture: custom resnet-like backbone for image feature extraction, 3 GRU-256 with 0.1 dropout and CTC loss. Then sequence of probabilites were encoded with greedy search.

Public

изображение

Private

изображение

About

This is my 5th place solution to the AIJ Junior "Digital Peter".

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published