readmrz detects the machine-readable zone (MRZ) on ID cards and passports and extracts the text in that zone.
This zone contains the given name, surname, date of birth, and other details of the person to whom the document was issued. Its layout is standardized across new-generation identity cards and passports.
In short, readmrz is a tool for reading the MRZ on identity cards and passports.
Please install Tesseract before installing the package.
On macOS,
$ brew install tesseract
On Ubuntu,
$ sudo apt-get install -y tesseract-ocr
On Windows,
$ choco install tesseract
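To confirm that the install step worked, a minimal sketch like the following checks whether a `tesseract` executable is visible on the PATH. The helper name `find_tesseract` is hypothetical and not part of readmrz; the check only shows that the binary can be found, not that readmrz will use that particular copy.

```python
import shutil


def find_tesseract():
    """Return the full path of the tesseract executable, or None if it is not on PATH.

    Hypothetical helper for illustration; not part of readmrz.
    """
    return shutil.which("tesseract")


path = find_tesseract()
if path is None:
    print("tesseract was not found on PATH; install it before installing readmrz")
else:
    print(f"tesseract found at {path}")
```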
Then you can install the latest release,
$ pip install readmrz
>>> import json
>>> from readmrz import MrzDetector, MrzReader
>>> detector = MrzDetector()
>>> reader = MrzReader()
>>> image = detector.read('/path/to/file')
>>> cropped = detector.crop_area(image)
>>> result = reader.process(cropped)
>>> print(json.dumps(result, indent=4))
{
    "surname": "STEARNE",
    "name": "JOHN TIMOTHY KELLY",
    "country": "CAN",
    "nationality": "CAN",
    "birth_date": "580702",
    "expiry_date": "240904",
    "sex": "M",
    "document_type": "P",
    "document_number": "GA302922",
    "optional_data": "",
    "birth_date_hash": "0",
    "expiry_date_hash": "3",
    "document_number_hash": "0",
    "final_hash": "2"
}
Or, using a URL,
>>> import json
>>> from readmrz import MrzDetector, MrzReader
>>> detector = MrzDetector()
>>> reader = MrzReader()
>>> image = detector.read_from_url('/url/to/image')
>>> cropped = detector.crop_area(image)
>>> result = reader.process(cropped)
>>> print(json.dumps(result, indent=4))
{
    "surname": "STEARNE",
    "name": "JOHN TIMOTHY KELLY",
    "country": "CAN",
    "nationality": "CAN",
    "birth_date": "580702",
    "expiry_date": "240904",
    "sex": "M",
    "document_type": "P",
    "document_number": "GA302922",
    "optional_data": "",
    "birth_date_hash": "0",
    "expiry_date_hash": "3",
    "document_number_hash": "0",
    "final_hash": "2"
}
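The `*_hash` fields in the output above look like MRZ check digits. Assuming they follow the usual ICAO Doc 9303 scheme (repeating weights 7, 3, 1; digits count as themselves, A to Z as 10 to 35, the filler `<` as 0; sum modulo 10), the sketch below recomputes them from the sample values. The function name `mrz_check_digit` is hypothetical and not part of readmrz, and `final_hash` is left out because it covers several fields of the raw MRZ line.

```python
def mrz_check_digit(field):
    """Compute a check digit with weights 7, 3, 1 (assumed ICAO Doc 9303 scheme).

    Hypothetical helper for illustration; not part of readmrz.
    """
    weights = (7, 3, 1)
    total = 0
    for index, char in enumerate(field):
        if "0" <= char <= "9":
            value = int(char)
        elif "A" <= char <= "Z":
            value = ord(char) - ord("A") + 10  # A=10 ... Z=35
        elif char == "<":
            value = 0  # filler contributes nothing
        else:
            raise ValueError(f"unexpected MRZ character: {char!r}")
        total += value * weights[index % 3]
    return total % 10


# Sample values copied from the output shown above.
result = {
    "birth_date": "580702",
    "expiry_date": "240904",
    "document_number": "GA302922",
    "birth_date_hash": "0",
    "expiry_date_hash": "3",
    "document_number_hash": "0",
}

# Compare each recomputed digit with the one read from the document.
checks = {
    name: mrz_check_digit(result[name]) == int(result[name + "_hash"])
    for name in ("birth_date", "expiry_date", "document_number")
}
print(checks)
# {'birth_date': True, 'expiry_date': True, 'document_number': True}
```

A mismatch usually points to an OCR misread (for example `O` read as `0`), so a check like this can flag results that need a second look.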
The result is returned as a dict, so it's easy to access the fields. You can also use the command line,
$ readmrz -f /path/to/file
or, using a URL,
$ readmrz -u /url/to/image
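The `birth_date` and `expiry_date` values in the result are six-digit `YYMMDD` strings, so the century has to be inferred. The sketch below is one possible heuristic, not something readmrz provides: expiry dates are assumed to fall in the 2000s, and a birth date that would land in the future is moved back 100 years. The function name `parse_mrz_date` and its parameters are hypothetical.

```python
from datetime import date
from typing import Optional


def parse_mrz_date(value: str, is_expiry: bool = False,
                   today: Optional[date] = None) -> date:
    """Convert a YYMMDD string to a date using a simple century heuristic.

    Hypothetical helper for illustration; not part of readmrz.
    """
    if len(value) != 6 or not (value.isascii() and value.isdigit()):
        raise ValueError(f"expected six digits (YYMMDD), got {value!r}")
    today = today or date.today()
    year = 2000 + int(value[:2])
    month, day = int(value[2:4]), int(value[4:6])
    # Assumption: nobody is born in the future, so roll such dates back a century.
    if not is_expiry and date(year, month, day) > today:
        year -= 100
    return date(year, month, day)


print(parse_mrz_date("580702", today=date(2024, 1, 1)))  # 1958-07-02
print(parse_mrz_date("240904", is_expiry=True))          # 2024-09-04
```

The heuristic cannot tell apart holders older than 100 years, so treat the inferred century as a best guess.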
Please see the notebook for the results step by step.
Before contributing, please check the pylint and flake8 steps in the workflow.