To install you must have:
GNU/Linuxpython3.9g++
curl -sSL https://install.python-poetry.org | python3 - --preview
export PATH="/home/$USER/.local/bin:$PATH"git clone --recursive https://github.com/naereni/NoteScribe.git
cd NoteScribe && sh install.shIf you will you encounter a problem with ctcdecode - you should run several times python setup.py install in directory NoteScribe/third_party/ctcdecode
This task is from National Tech Olympiad (NTO). Problem is get all text from photo of school notebook. The text can be in russian or in english.
A sequence of two models: segmentation and recognition. First, the segmentation model predicts the mask polygons of each word in the photo. Then these words are cut out of the image along the contour of the mask (drops are obtained for each word) and fed into the recognition model. The result is a list of recognized words with their coordinates.
- X101-FPN from detectron2.model_zoo + augmentation + high resolution
- CRNN architecture with Resnet-34 backbone and BiLSTM, pre-trained for the top 1 models of the competition Digital Peter
- KenLM, trained on competition data Feedback, "Решу ОГЭ/ЕГЭ", and also CTCDecoder
Christofari with NVIDIA Tesla V100 and docker image jupyter-cuda10.1-tf2.3.0-pt1.6.0-gpu:0.0.82
In this telegram project, the bot was written using the aiogram library, that is, it supported synchronous I/O, as well as the model itself, in other words, each subsequent user who launched the bot does not increase the task execution time of the previous user.
In this project, implementation is telegram bot. Realization is containing in ./src. To start bot you have to run ./src/server.py, and for security reasons, API key must contain in environment variables. Like a export TG_API_TOKEN="123".
![]() |
![]() |

