Extract Images from PDF File using Java
Contents
[
Hide
]
Aspose.PDF for Java supports direct image-resource extraction and placement-based filtering.
Extract an embedded image by index
Use this example when you need to save a specific image resource from a PDF page.
- Open the source PDF Document.
- Access the target XImage from the page resources.
- Save the image stream to an output file.
public static void extractImage(Path inputFile, Path outputFile) throws Exception {
try (Document document = new Document(inputFile.toString());
OutputStream outputImage = Files.newOutputStream(outputFile)) {
XImage image = document.getPages().get_Item(1).getResources().getImages().get_Item(1);
image.save(outputImage);
}
}
Extract images from a specific page region
Use this example when only images placed inside a selected rectangle should be exported.
- Define the target Rectangle and open the source PDF.
- Use ImagePlacementAbsorber to inspect image placements on the page.
- Save only the images whose placement fits inside the selected region.
public static void extractImageFromSpecificRegion(Path inputFile, Path outputFile) throws Exception {
Rectangle rectangle = new Rectangle(0, 0, 590, 590, true);
try (Document document = new Document(inputFile.toString())) {
ImagePlacementAbsorber absorber = new ImagePlacementAbsorber();
document.getPages().get_Item(1).accept(absorber);
int index = 1;
for (ImagePlacement imagePlacement : absorber.getImagePlacements()) {
Point point1 = new Point(imagePlacement.getRectangle().getLLX(), imagePlacement.getRectangle().getLLY());
Point point2 = new Point(imagePlacement.getRectangle().getURX(), imagePlacement.getRectangle().getURX());
if (rectangle.contains(point1, true) && rectangle.contains(point2, true)) {
Path indexedOutputFile = Path.of(outputFile.toString().replace("index", String.valueOf(index)));
try (OutputStream outputImage = Files.newOutputStream(indexedOutputFile)) {
imagePlacement.getImage().save(outputImage);
}
index++;
}
}
}
}