Finding word bounding boxes

Aspose.OCR for Java can automatically find the coordinates of image regions containing words. This can be useful for highlighting detected areas when previewing an image or extracting individual blocks of text.

To get bounding boxes of all words in images, provided in OcrInput object, use DetectRectangles method. Specify AreasType.WORDS as the areasType parameter of the method. isDetectAreas parameter of the method is ignored.

The method returns a list of RectangleOutput objects containing coordinates of each word in each image.

PDF documents can contain more than one image per page. Therefore, the resulting list can contain more RectangleOutput objects than the number of pages in the document.

Property	Type	Description
`Rectangles`	`ArrayList<Rectangle>`	Coordinates of each word of an image (top-left corner, width and height), returned as a list of `Rectangle` objects.
`ImageIndex`	`int`	Sequence number of the image on the page. When working with single-page images, this value is always 0.
`Page`	`int`	Page number. When working with single-page images, this value is always 0.
`Source`	`String`	The full path or URL of the source file. If the file is provided as a `BufferedImage` object, an array of pixels, or a Base64 string, this value will be empty.

Example

The following code example shows how to detect words in multiple images:

AsposeOcr api = new AsposeOcr();
// Add images to detection batch
OcrInput images = new OcrInput(InputType.SingleImage);
images.Add("source1.png");
images.Add("source2.jpg");
// Get word coordinates
ArrayList<RectangleOutput> areas = api.DetectRectangles(images, AreasType.WORDS);
areas.forEach((area) -> {
	area.Rectangles.forEach((rectangle) -> {
		System.out.println("File: " + area.Source + " | " + rectangle.x + ", " + rectangle.y + ", " + rectangle.width + ", " + rectangle.height);
	});
});

Finding line bounding boxes Finding character bounding boxes