Interactive OCR with Tesseract
Label Studio can be used to interactively work with OCR (Optical Character Recognition) models like Tesseract.
Create Interactive Model
You can use Label Studio ML Backend to start an interactive OCR model:
- Clone the repository:
git clone https://github.com/HumanSignal/label-studio-ml-backend.git
- Go to the example directory:
label_studio_ml/examples/tesseract
- Start the server:
docker-compose up
This starts the server, which listens on http://localhost:9090.
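Before moving on, it can help to confirm that the server is reachable. The sketch below is a minimal check using only the Python standard library. It assumes the backend exposes a /health endpoint at the address above; the helper names health_url and backend_is_up are hypothetical and not part of any Label Studio package.

```python
import urllib.error
import urllib.request

# Address printed by `docker-compose up` in the step above.
ML_BACKEND_URL = "http://localhost:9090"


def health_url(base_url: str) -> str:
    """Build the health-check URL (assumes a /health endpoint exists)."""
    return base_url.rstrip("/") + "/health"


def backend_is_up(base_url: str, timeout: float = 5.0) -> bool:
    """Return True if the backend answers the health check with HTTP 200."""
    try:
        with urllib.request.urlopen(health_url(base_url), timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    print("ML backend reachable:", backend_is_up(ML_BACKEND_URL))
```

If this reports False, check that the container is still running and that port 9090 is not blocked or already in use.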
Connect to Label Studio
Next, connect to the running Label Studio instance. You need an API key (API_KEY), which you can find under Account & Settings -> API Key.
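As a rough illustration of how the API key is used, the sketch below builds an authenticated request against the Label Studio REST API with the standard library only. It assumes Label Studio runs at http://localhost:8080 and that the key is a legacy-style token sent as "Authorization: Token <key>"; newer token types may need a different scheme. The environment variable names and the helper functions are hypothetical.

```python
import json
import os
import urllib.request

# Assumed defaults; adjust to your own deployment.
LABEL_STUDIO_URL = os.environ.get("LABEL_STUDIO_URL", "http://localhost:8080")
API_KEY = os.environ.get("LABEL_STUDIO_API_KEY", "your-api-key")  # placeholder


def auth_headers(api_key: str) -> dict:
    """Headers for token authentication (assumes a legacy-style token)."""
    return {
        "Authorization": f"Token {api_key}",
        "Content-Type": "application/json",
    }


def build_list_projects_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for /api/projects."""
    url = base_url.rstrip("/") + "/api/projects"
    return urllib.request.Request(url, headers=auth_headers(api_key), method="GET")


def list_projects(base_url: str, api_key: str):
    """Send the request; a successful response means the key was accepted."""
    req = build_list_projects_request(base_url, api_key)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


if __name__ == "__main__":
    req = build_list_projects_request(LABEL_STUDIO_URL, API_KEY)
    print(req.get_method(), req.full_url)
```

Calling list_projects(LABEL_STUDIO_URL, API_KEY) against a running instance is a simple way to verify both the URL and the key before creating anything.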
Create a project
To create a project, specify the label_config, which defines the labeling interface and the label ontology.
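The following sketch shows one way project creation could look through the REST API, by posting a title and a label_config to /api/projects. The labeling configuration here is an illustrative assumption (an image, rectangle labels, and a per-region text area for the transcription), not necessarily the one the Tesseract example expects. The URL, key, project title, and helper names are placeholders.

```python
import json
import urllib.request
import xml.etree.ElementTree as ET

LABEL_STUDIO_URL = "http://localhost:8080"  # assumed default address
API_KEY = "your-api-key"                    # placeholder

# Illustrative OCR labeling config (an assumption, adapt to your data).
LABEL_CONFIG = """
<View>
  <Image name="image" value="$ocr"/>
  <RectangleLabels name="bbox" toName="image" smart="true">
    <Label value="Text" background="green"/>
  </RectangleLabels>
  <TextArea name="transcription" toName="image" editable="true"
            perRegion="true" rows="5" placeholder="Recognized Text"
            displayMode="region-list"/>
</View>
""".strip()


def build_create_project_request(base_url, api_key, title, label_config):
    """Build (but do not send) a POST request that creates a project."""
    ET.fromstring(label_config)  # raises ParseError if the XML is malformed
    payload = {"title": title, "label_config": label_config}
    return urllib.request.Request(
        base_url.rstrip("/") + "/api/projects",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Token {api_key}",  # assumes a legacy-style token
            "Content-Type": "application/json",
        },
        method="POST",
    )


def create_project(base_url, api_key, title, label_config):
    """Send the request and return the created project as a dict."""
    req = build_create_project_request(base_url, api_key, title, label_config)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


if __name__ == "__main__":
    req = build_create_project_request(
        LABEL_STUDIO_URL, API_KEY, "Interactive OCR", LABEL_CONFIG
    )
    print(req.get_method(), req.full_url)
```

The response from create_project should include the new project's id, which is needed when attaching the model.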
Connect OCR Model to Project
To connect your running OCR model to the project, specify the model URL and the project ID.
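A minimal sketch of this step, assuming the REST endpoint /api/ml accepts the model URL and the project ID in a JSON body. The project ID of 1, the connection title, and the helper names are hypothetical; use the ID returned when the project was created.

```python
import json
import urllib.request

LABEL_STUDIO_URL = "http://localhost:8080"  # assumed default address
API_KEY = "your-api-key"                    # placeholder
ML_BACKEND_URL = "http://localhost:9090"    # the OCR server started earlier
PROJECT_ID = 1                              # hypothetical project ID


def build_connect_model_request(base_url, api_key, model_url, project_id,
                                title="Tesseract OCR"):
    """Build (but do not send) a POST request that attaches an ML backend."""
    payload = {"url": model_url, "project": project_id, "title": title}
    return urllib.request.Request(
        base_url.rstrip("/") + "/api/ml",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Token {api_key}",  # assumes a legacy-style token
            "Content-Type": "application/json",
        },
        method="POST",
    )


def connect_model(base_url, api_key, model_url, project_id):
    """Send the request and return the server's description of the connection."""
    req = build_connect_model_request(base_url, api_key, model_url, project_id)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


if __name__ == "__main__":
    req = build_connect_model_request(
        LABEL_STUDIO_URL, API_KEY, ML_BACKEND_URL, PROJECT_ID
    )
    print(req.get_method(), req.full_url, req.data.decode("utf-8"))
```

Note that the model URL must be reachable from the machine where Label Studio itself runs. If Label Studio runs in a container, localhost may not point at the OCR server, and a host or service name may be needed instead.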