Extracting text from street photos

Aspose.OCR offers a special recognition algorithm for extracting content from images with sparse text and noisy/colored backgrounds. This method significantly improves OCR accuracy in the following business cases:

  • Read text from street photos.
  • Segment and identify road signs and signboards within street images.
  • Locate price tags and interpret the extracted text as prices.
  • Find and aggregate regions of interest on food labels, such as nutritional information or ingredient lists.
  • Identify and analyze car license plates.
  • Extract text from menus and catalogs.

To extract text from such images, use the RecognizeStreetPhoto() method of the AsposeOCR class.

The method takes an OcrInput object and returns a list of RecognitionResult objects, one per image, each containing the text recognized in that image.

AsposeOCR api = new AsposeOCR();
// Load photos into the recognition batch
OcrInput input = new OcrInput(InputType.SingleImage);
input.add("photo1.png");
input.add("photo2.png");
// Recognize text; one result is returned per image
ArrayList<RecognitionResult> results = api.RecognizeStreetPhoto(input);
results.forEach((result) -> {
	System.out.println(result.recognitionText);
});
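
Business cases such as reading price tags need a post-processing step on the recognized text. The following is a minimal sketch of one way to do it, assuming the recognition text is already available as a plain string. The PriceExtractor class, its extractPrices() method, and the regular expression are hypothetical helpers written for illustration; they are not part of Aspose.OCR. The pattern accepts only amounts prefixed with $, € or £, so unrelated numbers such as weights are skipped.

```java
import java.math.BigDecimal;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Aspose.OCR.
final class PriceExtractor {
    // A currency symbol ($, euro, pound), an optional space, then digits
    // with an optional two-digit fraction separated by '.' or ','.
    private static final Pattern PRICE =
            Pattern.compile("[$\u20AC\u00A3]\\s?(\\d+(?:[.,]\\d{2})?)");

    // Returns every price-like amount found in the text, in order of appearance.
    static List<BigDecimal> extractPrices(String recognizedText) {
        List<BigDecimal> prices = new ArrayList<>();
        Matcher matcher = PRICE.matcher(recognizedText);
        while (matcher.find()) {
            // Normalize a decimal comma to a dot before parsing.
            prices.add(new BigDecimal(matcher.group(1).replace(',', '.')));
        }
        return prices;
    }

    public static void main(String[] args) {
        // Stand-in for the text of a recognition result.
        String text = "Coffee $4.99\nTea \u20AC 3,50\nNet 500 g";
        System.out.println(extractPrices(text));
    }
}
```

In practice, the string passed to extractPrices() would be the recognition text of each result returned by RecognizeStreetPhoto().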

Limitations

  • This method supports only Latin letters and numbers.
  • This method does not support recognition settings.