Telegram bot for handwritten text recognition

Install

To install you must have:

GNU/Linux
python3.9
g++

Poerty

curl -sSL https://install.python-poetry.org | python3 - --preview
export PATH="/home/$USER/.local/bin:$PATH"

NoteScribe

git clone --recursive https://github.com/naereni/NoteScribe.git
cd NoteScribe && sh install.sh

If you will you encounter a problem with ctcdecode - you should run several times python setup.py install in directory NoteScribe/third_party/ctcdecode

Solution

What the problem?

This task is from National Tech Olympiad (NTO). Problem is get all text from photo of school notebook. The text can be in russian or in english.

How this work?

A sequence of two models: segmentation and recognition. First, the segmentation model predicts the mask polygons of each word in the photo. Then these words are cut out of the image along the contour of the mask (drops are obtained for each word) and fed into the recognition model. The result is a list of recognized words with their coordinates.

Models

Instance Segmentation

X101-FPN from detectron2.model_zoo + augmentation + high resolution

Character Recognition

CRNN architecture with Resnet-34 backbone and BiLSTM, pre-trained for the top 1 models of the competition Digital Peter

Beam Search

KenLM, trained on competition data Feedback, "Решу ОГЭ/ЕГЭ", and also CTCDecoder

Resources

Christofari with NVIDIA Tesla V100 and docker image jupyter-cuda10.1-tf2.3.0-pt1.6.0-gpu:0.0.82

Implementation

About

In this telegram project, the bot was written using the aiogram library, that is, it supported synchronous I/O, as well as the model itself, in other words, each subsequent user who launched the bot does not increase the task execution time of the previous user.

How to start

In this project, implementation is telegram bot. Realization is containing in ./src. To start bot you have to run ./src/server.py, and for security reasons, API key must contain in environment variables. Like a export TG_API_TOKEN="123".

Examples

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
examples		examples
src		src
third_party		third_party
train		train
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Telegram bot for handwritten text recognition

Install

Poerty

NoteScribe

Solution

What the problem?

How this work?

Models

Resources

Implementation

About

How to start

Examples

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Telegram bot for handwritten text recognition

Install

Poerty

NoteScribe

Solution

What the problem?

How this work?

Models

Resources

Implementation

About

How to start

Examples

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages