Analyzing your prompt, please hold on...
An error occurred while retrieving the results. Please refresh the page and try again.
Aspose.OCR for .NET can automatically analyze image content and identify the different types of layout blocks within it.
Pass one or more images as aspose.ocr.OcrInput
object to aspose.ocr.detect_document_layout
method and get the layout blocks as aspose.ocr.LayoutOutput
object.
The method accepts the collection of images in any of the supported formats and returns them as an array of LayoutOutput
objects with the following properties:
aspose.ocr.LayoutOutput
This class stores a layout block detected in an image.
Property | Type | Description |
---|---|---|
source |
string |
The full path to the file or URL, if applicable. Empty for images provided as a stream, byte array, or Base64. |
page |
int |
Page number for multi-page images. |
paragraphs |
Array of ContentArea |
Detected paragraphs. |
images |
Array of ContentArea |
Detected illustrations. |
headers |
Array of ContentArea |
Detected headers. |
tables |
Array of ContentArea |
Detected tables. |
lists |
Array of ContentArea |
Detected lists. |
captions |
Array of ContentArea |
Detected captions. |
equations |
Array of ContentArea |
Detected equations. |
Aspose.OCR.ContentArea
This class stores a layout block detected in an image.
Property | Type | Description |
---|---|---|
index |
int |
The sequential index of the content area, unique within the entire image. |
rectangle |
aspose.ocr.Rectangle |
The bounding rectangle of the content area. |
The following code sample demonstrates how to detect the image layout:
### Detect layout on the image
# Instantiate Aspose.OCR API
api = AsposeOcr()
# Add image to the recognition batch
input = OcrInput(InputType.SINGLE_IMAGE)
input.add("source.png")
# Detect layout
results = api.detect_document_layout(input)
# Print result
for result in results:
print("paragraphs:" + str(len(result.paragraphs)))
print("images:" + str(len(result.images)))
print("tables:" + str(len(result.tables)))
print("headers:" + str(len(result.headers)))
print("lists:" + str(len(result.lists)))
print("captions:" + str(len(result.captions)))
print("equations:" + str(len(result.equations)))
# Recognize only headers
# Set rectangles for recognition
set = RecognitionSettings()
set.recognition_areas = [area.rectangle for area in result.headers] # use paragraphs or any other areas
# Recognize areas (rectangles)
results = api.recognize(input, set)
for result in results:
print(result.recognition_text)
Analyzing your prompt, please hold on...
An error occurred while retrieving the results. Please refresh the page and try again.