FastAPI Microservice to Extract Text from Images

Python & FastAPI Tutorial: Create an AI microservice to extract text from images

Objective

Learn how to deploy an AI microservice REST API endpoint using FastAPI, pytesseract, and the Streamlit Cloud platform.

Streamlit deployment

Check out the application live here 🚀

Screenshot 📸

Project Implementation Outline

Setup Requirements.txt
Setup Environment
Setup FastAPI App
FastAPI & Jinja Templates
FastAPI & PyTest
FastAPI Git & pre-commit
Deploy to DigitalOcean
Deploy Docker App to DigitalOcean App Platform
FastAPI Settings & Environment Variables & dotenv
Handling File Uploads
Automated Testing File Uploads
Image Upload Validation & Tests
Implementing Tesseract & pytesseract
Authorization Headers
Production Endpoint & Authorization Tests
One-Click Deploy on DigitalOcean App Platform

Project Requirements

fastapi
gunicorn
uvicorn
jinja2
pytest
requests
pre-commit
python-dotenv
python-multipart
aiofiles
pillow

Working with Pre-Commit

First, run: pip install pre-commit
Create a .pre-commit-config.yaml file in the project root
In the terminal, run: pre-commit install
Next, run the hooks against every file: pre-commit run --all-files

Working with PyTest

Install pytest using pip
Add a pytest.ini file to exclude directories that pytest should not search for test code
Run pytest -s to disable output capturing, so that responses printed by the endpoint tests appear on stdout
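
To illustrate what the -s flag changes, here is a minimal sketch of a test module. The helper is_allowed_image and its extension list are hypothetical and not taken from this repository; a real test in this project would call the FastAPI endpoints instead.

```python
# test_sketch.py: hypothetical example, not part of this repository
from pathlib import Path

# Assumed set of accepted upload extensions (illustrative only).
ALLOWED_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def is_allowed_image(filename: str) -> bool:
    """Return True when the file name ends in an accepted image extension."""
    return Path(filename).suffix.lower() in ALLOWED_EXTENSIONS


def test_is_allowed_image():
    result = is_allowed_image("scan.PNG")
    # pytest captures this output for passing tests unless it runs with -s.
    print("is_allowed_image('scan.PNG') ->", result)
    assert result is True
    assert is_allowed_image("notes.txt") is False
```

Running pytest on this file reports only pass or fail, while pytest -s also shows the printed line.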

Working with Tesseract OCR

Reference - Tesseract OCR GitHub

Check the PyPI package here

Use the following command to install pytesseract, a Python wrapper for Google's Tesseract OCR engine:

pip install pytesseract

On Windows, install Tesseract using the installer available at: https://github.com/UB-Mannheim/tesseract/wiki
Note the Tesseract path from the installation. The default installation path at the time of this edit was: C:\Users\USER\AppData\Local\Tesseract-OCR. It may change, so check the path on your machine.
Install pytesseract if you have not already done so: pip install pytesseract
Set the Tesseract path in the script before calling image_to_string:

pytesseract.pytesseract.tesseract_cmd = r'C:\Users\USER\AppData\Local\Tesseract-OCR\tesseract.exe'
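
Because the install path can differ between machines, it may help to look the executable up instead of hard-coding it. The following is a minimal sketch under that assumption; resolve_tesseract_cmd is a hypothetical helper, not part of pytesseract or this repository, and it uses only the standard library.

```python
import os
import shutil
from typing import Iterable, Optional


def resolve_tesseract_cmd(
    fallbacks: Iterable[str] = (), binary_name: str = "tesseract"
) -> Optional[str]:
    """Return a path to the tesseract executable, or None if it is not found."""
    # Prefer an executable that is already on PATH.
    found = shutil.which(binary_name)
    if found:
        return found
    # Otherwise try the explicit install locations in order.
    for candidate in fallbacks:
        if os.path.isfile(candidate):
            return candidate
    return None


# Usage (requires pytesseract to be installed):
# import pytesseract
# cmd = resolve_tesseract_cmd(
#     [r"C:\Users\USER\AppData\Local\Tesseract-OCR\tesseract.exe"]
# )
# if cmd:
#     pytesseract.pytesseract.tesseract_cmd = cmd
```

If the helper returns None, Tesseract is neither on PATH nor at any of the listed locations, so the installation path needs to be checked by hand.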

Securing the Endpoint using Auth Tokens

Generate a random token using Python's secrets library:

import secrets

secrets.token_urlsafe(32)

Save the generated tokens in a .env file.
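
One way the stored token could be checked against an incoming request is sketched below. This is an assumption-laden illustration, not the repository's implementation: the "Bearer <token>" header format, the is_authorized helper, and the APP_AUTH_TOKEN variable name are all hypothetical.

```python
import os
import secrets
from typing import Optional


def generate_token(nbytes: int = 32) -> str:
    """Create a URL-safe random token, as in the snippet above."""
    return secrets.token_urlsafe(nbytes)


def is_authorized(authorization: Optional[str], expected_token: str) -> bool:
    """Check an Authorization header of the assumed form 'Bearer <token>'."""
    if not authorization or not expected_token:
        return False
    scheme, _, token = authorization.partition(" ")
    if scheme.lower() != "bearer":
        return False
    # compare_digest avoids leaking information through timing differences.
    return secrets.compare_digest(token.encode(), expected_token.encode())


# APP_AUTH_TOKEN is a hypothetical variable name; load it from the .env file
# (for example with python-dotenv) before reading it here.
expected = os.environ.get("APP_AUTH_TOKEN") or generate_token()
print(is_authorized(f"Bearer {expected}", expected))
print(is_authorized("Bearer wrong-token", expected))
```

Comparing with secrets.compare_digest rather than == is a common precaution for secret values, since an ordinary string comparison can return early on the first mismatched character.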

kshitijzutshi/FAST-API-Text-OCR