Recognition languages

Contents
[ ]

Aspose.OCR for JavaScript via C++ can recognize a text in a large number of languages and all popular writing scripts, including texts with mixed languages.

To specify a language for recognition, provide one of the following values in language_alphabet property in recognition settings:

Value Numeric value Language
Module.Language.NONE 0 Extended Latin characters, including diacritics
Module.Language.BEL 22 Belorussian
Module.Language.BUL 23 Bulgarian
Module.Language.CHI 21 Chinese
Module.Language.CZE 7 Czech
Module.Language.DAN 8 Danish
Module.Language.DEU 2 German
Module.Language.DUM 9 Dutch
Module.Language.ENG 1 English
Module.Language.EST 10 Estonian
Module.Language.FIN 11 Finnish
Module.Language.FRA 5 French
Module.Language.HIN 28 Hindi
Module.Language.ITA 6 Italian
Module.Language.KAZ 27 Kazakh
Module.Language.LAV 12 Latvian
Module.Language.LIT 13 Lithuanian
Module.Language.NOR 14 Norwegian
Module.Language.POL 15 Polish
Module.Language.POR 3 Portuguese
Module.Language.RUM 16 Romanian
Module.Language.RUS 24 Russian
Module.Language.SLK 18 Slovak
Module.Language.SLV 19 Slovene
Module.Language.SPA 4 Spanish
Module.Language.SRP 25 Serbian
Module.Language.SRP_HRV 17 Serbo-Croatian
Module.Language.SWE 20 Swedish
Module.Language.UKR 26 Ukrainian

If this parameter is omitted, the OCR engine will assume that the text is written in extended Latin.

Example

The following code sample demonstrates how to specify the recognition language:

// Prepare images
var source = Module.WasmAsposeOCRInput();
source.url = filename;
var content = new Module.WasmAsposeOCRInputs();
content.push_back(source);
// Set recognition language
var settings = Module.WasmAsposeOCRRecognitionSettings();
settings.language_alphabet = Module.Language.DEU;
// Recognize image
var result = Module.AsposeOCRRecognize(content, settings);