Conversor de texto OCR¶
The OCR text converter is a tool to parse the contents of an image and detect areas with text to convert into editable and translatable characters files.
The tool can batch optical character recognition (OCR) over images, and their translations in many languages using an online translator engine. It also allows you to review the text and make corrections and offers spell checking.
The tool use in background the Tesseract, a powerful open-source optical character recognition engine available for Linux, macOS, and Windows.
To perform text conversions, select the scanned images including text to recognize and start the tool from the menu
, or use the icon OCR Text Converter from the Tools tab on the right sidebar. The following dialog must appear:On the right side, the Text recognition tab shows on the top of view the Tesseract binary program version detected on your system. If none is present, you will need to install it on your system. Below, the Tesseract settings can be customized to process images.
The Languages setting specifies the language used for OCR. In the Default mode, when processing digital text with multiple languages, Tesseract can automatically recognize languages using Latin alphabets such as English or French, but it’s not compatible with languages using hieroglyphs such as Chinese or Japanese. You can use the Orientation and Script Detection mode instead or a specific language module if available.
The Segmentation mode settings specify the Tesseract page segmentation mode to use while processing images. Possible choices are listed below:
OSD solo: solo detección de orientación y de escritura (OSD).
Con OSD: segmentación de página automática con OSD.
Sin OSD: segmentación de página automática, pero sin OSD ni OCR.
Predeterminado: segmentación de página completamente automática, pero sin OSD.
Col. de texto: suponer que es una sola columna de texto de tamaño variable.
Alineado verticalmente: suponer que es un solo bloque uniforme de texto alineado verticalmente.
Bloque: suponer que es un solo bloque uniforme de texto.
Línea: tratar la imagen como una sola línea de texto.
Palabra: tratar la imagen como una sola palabra.
Palabra en círculo: tratar la imagen como una sola palabra en un círculo.
Carácter: tratar la imagen como un solo carácter.
Texto escaso: texto escaso; encontrar la mayor cantidad de texto posible sin ningún orden en particular.
Texto escaso + OSD: texto escaso con OSD.
Línea en bruto: tratar la imagen como una única línea de texto, evitando los trucos específicos de Tesseract.
If you want more details about the Tesseract Segmentation Mode you can read this online tutorial.
The Engine mode setting specifies the Tesseract OCR internal engine to use while processing images. Possible choices are listed below:
Legacy: Legacy engine only (older engine not based on the neural network).
LSTM: Neural network LSTM (Long Short-Term Memory deep-learning) engine only.
Legacy + LSTM: Both legacy and LSTM engines will be used.
Default: Default value. Let Tesseract choose the best engine based on what is available.
The Resolution Dpi settings specify the resolution as Dot Per Inch (DPI) for the input images.
If the Use Multi-cores setting is enabled, files from the list will be processed in parallel with Tesseract.
The Store result in will specify where to place the text contents recognized by Tesseract while processing images. Possible choices are listed below:
Text file: Store OCR result in a separate text file in the same directory as the processed image.
Metadata: Store OCR result in alternative-language XMP tag from image metadata.
On the bottom of this view, the OCR result can be translated into different languages using one online translation engine. You can set more than one translation language to process images. Corresponding translations will be hosted in separate text files or in extra metadata entries depending on the Store result in setting. See this page from the manual for more details about the Localize Settings.
The Text Review tab on the right side allows editing the OCR result for each image processed with Tesseract. Select one item from the list on the left side and OCR result will be displayed in a text editor. You can fix text if necessary or apply spell-checking. See this page from the manual for more details about the Spell-Checking Settings.
On the bottom of the dialog, the Default button allows resetting all settings to the default values. The Start OCR drop-down button allows the processing of the currently selected images from the list or all items. Finally, the Close button will stop all OCR processes if any and close the dialog.