Printed-Text-recognition-and-conversion

Introduction

There is a large demand for storing the information in paper documents on a computer or storage disk so that it can later be searched and reused. A simple way to do this is to scan the documents and store them as images. However, the contents of a scanned image cannot be searched or edited, so reusing the information means reading the document line by line and word by word. Converting scanned images directly to PDF does not help either, because the resulting file is still neither editable nor searchable.

The aim of this project was to build software that identifies and recognizes typed English text in an image (.jpg, .jpeg, .png) and converts it to an editable format (.txt, etc.), so that the text can be modified directly without retyping the document manually. The project uses image processing techniques and machine learning algorithms.

Approach:

Image Processing:
1. Binarization
2. Skew correction

Segmentation:
1. Line segmentation
2. Character segmentation

Training is done using a CNN model.
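
The binarization and segmentation steps can be illustrated with a minimal sketch. It is not the project's actual code: the fixed threshold of 128, the projection-profile technique, and the function names `binarize`, `find_runs`, `segment_lines`, and `segment_characters` are all assumptions made for illustration. Skew correction and the CNN are left out, and the page is assumed to be already deskewed with dark text on a light background.

```python
import numpy as np


def binarize(gray, threshold=128):
    """Return a boolean mask that is True where a pixel is ink (dark).

    The fixed threshold is an assumption; a real pipeline may pick it
    adaptively.
    """
    return np.asarray(gray) < threshold


def find_runs(profile):
    """Return (start, end) pairs, end exclusive, for each run of non-zero entries."""
    occupied = np.concatenate(([False], np.asarray(profile) > 0, [False]))
    # Indices where the profile switches between empty and occupied.
    changes = np.flatnonzero(occupied[1:] != occupied[:-1])
    return [(int(s), int(e)) for s, e in zip(changes[::2], changes[1::2])]


def segment_lines(ink):
    """Split a binary page into text lines using the row-wise ink count."""
    return find_runs(ink.sum(axis=1))


def segment_characters(ink, line):
    """Split one text line into characters using the column-wise ink count."""
    top, bottom = line
    return find_runs(ink[top:bottom].sum(axis=0))


# Synthetic 8x10 "page": white background with three dark blobs.
page = np.full((8, 10), 255, dtype=np.uint8)
page[1:3, 1:3] = 0  # first line, first glyph
page[1:3, 5:8] = 0  # first line, second glyph
page[5:7, 2:6] = 0  # second line, single glyph

ink = binarize(page)
lines = segment_lines(ink)
print(lines)                              # [(1, 3), (5, 7)]
print(segment_characters(ink, lines[0]))  # [(1, 3), (5, 8)]
```

Each `(start, end)` pair can be used to crop a line or character from the image before passing it to the classifier. Projection profiles only separate glyphs that have a blank row or column between them, so touching characters would need a different approach.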

To install

The project is written in Python 3.

Required libraries

    NumPy
    OpenCV
    Sklearn
    Scikit
    TensorFlow
    PyQt4

To run through GUI

    python gui.py

To run on CLI

    python main.py

Authors

Roshni Ram

Ishita Das

Rohit Shamdasani

Ayush Mudgal
