Recognition languages

Aspose.OCR for Java can recognize a large number of languages and all popular writing scripts, including texts with mixed languages.

Universal (mixed language) OCR models

  • Extended Latin
  • Cyrillic
  • Cyrillic/English
  • Chinese
  • Devanagari

Worldwide

  • English
  • Latin

Europe

  • Albanian
  • Belarusan (Belorussian)
  • Bosnian
  • Bulgarian
  • Catalan
  • Chechen
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hungarian (Magyar)
  • Italian
  • Kabardian
  • Latvian
  • Lithuanian
  • Low German
  • Norwegian
  • Occitan
  • Polish
  • Portuguese
  • Romanian
  • Russian
  • Serbian (Cyrillic)
  • Serbo-Croatian (Latin)
  • Slovak
  • Slovene (Slovenian)
  • Spanish
  • Swedish
  • Turkish
  • Ukrainian

Central Asia

  • Azerbaijani (Azeri)
  • Kazakh
  • Turkmen
  • Uyghur

East Asia

  • Cantonese
  • Dong
  • Gan
  • Hakka
  • Hmong
  • Japanese
  • Korean
  • Mandarin (Chinese)
  • Min Bei
  • Min Dong
  • Min Nan
  • Mongolian
  • Pu-Xian
  • Wu (Changzhou)
  • Xiang
  • Zhuang

South Asia

  • Awadhi
  • Bhojpuri
  • Chattisgarhi (Laria, Khaltahi)
  • Dhundari
  • Garhwali
  • Haryanvi
  • Hindi
  • Kanauji
  • Konkani
  • Kumauni
  • Lamani (Lambadi)
  • Magahi
  • Maithili
  • Malvi
  • Marathi
  • Marwari
  • Mewari
  • Mewati
  • Nepali
  • Rajbanshi
  • Urdu
  • Wagdi

Southeast Asia

  • Betawi
  • Bikol
  • Bouyei (Buyi, Giáy)
  • Cebuano
  • Hiligaynon
  • Ilocano
  • Indonesian
  • Kapampangan
  • Kannada
  • Makassar (Makasar)
  • Malay (Melayu)
  • Minangkabau
  • Muong
  • Musi
  • Palembang
  • Pangasinan
  • Sasak
  • Sundanese (Sunda)
  • Tagalog (Pilipino)
  • Tamil
  • Telugu
  • Vietnamese
  • Waray-Waray

Middle East and North Africa

  • Arabic (mixed texts in Arabic and English are also supported)
  • Dimli
  • Gilaki
  • Kurdish (Kurmanji)
  • Persian (Farsi)

Africa

  • Afrikaans
  • Bemba (Chibemba)
  • Chichewa (Chewa, Nyanja)
  • Gusii
  • Hausa
  • Kanuri
  • Kikongo
  • Luguru
  • Luo
  • Makua (Makhuwa)
  • Malagasy
  • Meru
  • Nandi
  • Ndebele
  • Oromo
  • Rwanda
  • Serer-Sine
  • Shona (Karanga)
  • Somali
  • Sotho (Northern)
  • Sotho (Southern)
  • Sukuma
  • Swahili
  • Swati (Swazi)
  • Tonga
  • Tsonga
  • Tswana
  • Tumbuka
  • Umbundu
  • Wolaytta
  • Xhosa
  • Yao
  • Yoruba
  • Zulu

Americas

  • K’iche'
  • Quechua

Supported handwritten languages

Aspose.OCR for Java can extract text from handwritten notes in a number of languages based on Extended Latin alphabet:

  • English
  • Indonesian
  • German
  • Italian
  • Malay (Melayu)
  • Hausa
  • Swahili
  • Yoruba
  • Oromo
  • Dutch
  • Malagasy
  • Zhuang
  • Somali
  • Chichewa (Chewa, Nyanja)
  • Rwanda
  • Min Bei
  • Zulu
  • Min Dong
  • Swedish
  • Hiligaynon
  • Hmong
  • Shona (Karanga)
  • Xhosa
  • Betawi
  • Afrikaans
  • Minangkabau
  • Sotho (Southern)
  • Bikol
  • Danish
  • Kanuri
  • Tswana
  • Luo
  • Sukuma
  • Tsonga
  • Bemba (Chibemba)
  • Nandi
  • Palembang
  • Umbundu
  • Sotho (Northern)
  • Waray-Waray
  • Lamani (Lambadi)
  • Low German
  • Musi
  • Pu-Xian
  • Bouyei (Buyi, Giáy)
  • Ndebele
  • Sasak
  • Swati (Swazi)
  • Gusii
  • Meru
  • Wolaytta
  • Dong
  • Pangasinan
  • Makassar (Makasar)
  • Tumbuka
  • Serer-Sine
  • Tonga
  • K’iche'
  • Luguru
  • Latin