استخراج الصور من PDF (الواجهات)

تتيح لك فئة PdfExtractor استخراج الصور من ملف PDF. First off, you need to create an object of PdfExtractor class and bind input PDF file using bindPdf method. After that, call extractImage method to extract all the images into memory. Once the images are extracted, you can get those images with the help of hasNextImage and getNextImage methods. You need to loop through all the extracted images using a while loop. In order to save the images to disk, you can call the overload of the getNextImage method which takes file path as argument. The following code snippet shows you how to extract images from the whole PDF to files.

أولاً، تحتاج إلى إنشاء كائن من فئة PdfExtractor وربط ملف PDF المدخل باستخدام طريقة bindPdf. بعد ذلك، استدعِ طريقة extractImage لاستخراج جميع الصور في الذاكرة. بمجرد استخراج الصور، يمكنك الحصول على تلك الصور بمساعدة طريقتي hasNextImage وgetNextImage. تحتاج إلى التجول عبر جميع الصور المستخرجة باستخدام حلقة while. من أجل حفظ الصور على القرص، يمكنك استدعاء التحميل الزائد لطريقة getNextImage التي تأخذ مسار الملف كحجة. يوضح لك مقتطف الشيفرة التالي كيفية استخراج الصور من ملف PDF بالكامل إلى ملفات.

public static void ExtractImages()
 {
    //Create an extractor and bind it to the document
    Document document = new Document(_dataDir + "sample.pdf");
    PdfExtractor extractor = new PdfExtractor(document);
    extractor.setStartPage(1);
    extractor.setEndPage(3);            

    //Run the extractor
    extractor.extractImage();
    int imageNumber = 1;
    //Iterate througth extracted images collection
    while (extractor.hasNextImage())
    {
        //Retrieve image from collection and save it in a file 
        extractor.getNextImage(_dataDir + String.format("image%03d.png", imageNumber++),ImageType.getPng());
    }
}

استخراج كائنات الرسم البياني من مستند PDF (واجهات)