Recognizes text from any images and prints them.
- This Python Script performs text detection using OpenCV’s EAST text detector, a highly accurate deep learning text detector used to detect text in natural scene images.Once we have detected the text regions with OpenCV, we’ll then extract each of the text ROIs and pass them into Tesseract, enabling us to build an entire OpenCV OCR pipeline!
- This project can help you recognize text on an image. OCR technology solves the problem by converting text images into text data that can be analyzed by other business software. You can then use the data to conduct analytics, streamline operations, automate processes, and improve productivity.It is based on OpenCV, enabling us to apply deep learning.
The project currently does the following things.
- Text Recognition
- Text Detection
- Prints Recognized Text
See below for our future steps.
- Converting Text to Audio
- Language Conversion
- In order to perform OpenCV OCR text recognition, we will need to install various python packages such as Tesseract v4 which includes a highly accurate deep learning-based model for text recognition, OpenCV and other required packages.
- Tesseract v4: It includes a highly accurate deep learning-based model for text recognition.
- OpenCV: To run this script you’ll need OpenCV installed. Version 3.4.2 or better is required.
- frozen_east_text_recognition.pb: The EAST text detector. This CNN is pre-trained for text detection and ready to go.
- imutils: This package will be used for non-maxima suppression.
- argparse
- numpy
A step by step series of examples that tell you how to get a development env running. Before getting on with the installations, make sure to download the frozen_east_text_recognition.pb from link.
Installing Tesseract v4:
- In Windows: Under python Terminal type: pip install tesseract. Installing OpenCV:
- In Windows: Under python Terminal type: pip install opencv-python
- frozen_east_text_recognition.pb: It can be downloaded from the project as it is already included(must be kept in the same folder as the script).
- imutils: Under python Terminal type: pip install imutils
- argparse: Under python Terminal type: pip install argparse
- numpy: Under python Terminal type: pip install numpy
- using OpenCV’s EAST text detector, a highly accurate deep learning text detector used to detect text in natural scene images.
- Once we have detected the text regions with OpenCV, we’ll then extract each of the text ROIs and pass them into Tesseract, enabling us to build an entire OpenCV OCR pipeline!
Please read contributing.md for details on our code of conduct, and the process for submitting pull requests to us.
This project is licensed under the MIT License - see the LICENSE file for details.