OCR-on-Image-ROI-with-Tesseract

Applying OCR on manually selected Region of Interests (using mouse drag) for Text extraction from Images

Code Flow Steps

Install pytesseract() and setting it to the path variable
- Tesseract Download Link: https://github.com/UB-Mannheim/tesseract/wiki
Import the required libraries
Read the image file into python using OpenCV’s imread() method
Resize (if necessary) the images and converting them into grey scale using OpenCV’s resize () and cvtColor() methods respectively
Extract the Region of Interest from the image manually using mouse drag.
- Starting coordinates are stored when the left mouse button is pressed and the ending coordinates when the left mouse button is released.
- Extract the region between these starting and ending coordinates when ‘enter’ is pressed. If ‘c’ is pressed the coordinates are cleared.
Optical Character Recognition (OCR) is then applied on the ROI using pytesseract. (Instead of Tesseract engine, Google Vision or Azure Vision could also be used).

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
LICENSE		LICENSE
OCR_ROI_result.png		OCR_ROI_result.png
OCR_with_mouse_drag.py		OCR_with_mouse_drag.py
README.md		README.md
Test image 2.jpg		Test image 2.jpg
Test image1.jpg		Test image1.jpg