Text Recognition and PDF/A Generation with the 'ocrservice’ Microservice

yuuvis® RAD 7.x »

The 'ocrservice' service is a microservice that generates text documents used for full-text indexing from image documents in conjunction with the text recognition software ABBYY FineReader. It can also be used to generate PDF files with hidden text, various PDF/A formats, and highly compressed PDFs.

Configuration

The 'ocrservice' service works with the following default settings:

PDF profile: format PDF/A1b
PDF profile: procedure Balanced
Text profile Default: TextExport.ini
File transfer to yuuvis® RAD rendition-plus Stream
Number of cores for ABBYY FineReader 1

These settings can be modified in the ocr-prod.yml configuration file located in the \servicemanager\config directory.

Example of a configuration in the ocr-prod.yml file:

finereader:
	profile:
		pdfa: PDFA1bBalanced.ini
		text: TextExport.ini
rest:
	transferPolicy: stream
finereader.engine.numberOfCores: 1

The example corresponds to the default settings.

Only the settings that differ from the default settings need to be entered.