Sequence® is a collection of libraries for automation of cross-application e-discovery and forensic workflows.
This connector contains steps to perform optical character recognition (OCR) on image files. It uses the Tesseract open source library as the OCR engine.
The following needs to be installed:
- <path> = 'MyImage.bmp'
- <imageData> = FileRead <path>
- <imageFormat> = GetImageFormat <path>
- <imageText> = TesseractOCR <imageData> <imageFormat>
- Print <imageText>
https://sequence.sh/playground
Can be downloaded from the Releases page.
Release nuget packages are available from nuget.org.