Features List

Aspose.HTML for Java Features

Aspose.HTML is a programming API that allows developers to create, open, manipulate, navigate through and convert (X)HTML documents into the supported output formats. Aspose.HTML implements the W3C HTML specification, so its classes and properties have names similar to those used in the specification.

This API mimics the behavior of a headless browser and offers the following features.

  • Creating a new HTML document or opening an existing one from different sources (see Aspose.HTML.Examples.QuickStart.DocumentOpenTests in the examples project)
  • HTML manipulation: creating, editing, removing and replacing HTML nodes via the API
  • Saving an HTML document
  • Extracting CSS styles for a particular HTML node
  • Configuring a document sandbox that affects how HTML documents are processed. For example, CSS styles can depend on screen size, so the environment can be configured independently of the machine the code runs on
  • Navigating through an HTML document in different ways
  • Scripting that allows manipulating the HTML DOM via JavaScript
  • Converting HTML documents into the supported output formats: JPEG, PNG, BMP, TIFF, PDF, XPS
  • Converting (X)HTML and similar formats, such as ePub and MHTML, to the formats listed above
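The node operations in the list above (create, edit, replace, remove) follow the W3C DOM naming. As a rough sketch only, the snippet below uses the JDK's own org.w3c.dom interfaces rather than Aspose.HTML classes, so it runs without the library; the class name DomNodeSketch and the method run are hypothetical names chosen for illustration.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical illustration class; it uses the JDK's W3C DOM, not Aspose.HTML.
final class DomNodeSketch {

    // Builds a tiny tree, then edits, replaces and removes nodes.
    // Returns "<first child name>:<child count>" of the body element.
    static String run() {
        Document doc;
        try {
            doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException(e);
        }

        // Create nodes.
        Element body = doc.createElement("body");
        doc.appendChild(body);
        Element p = doc.createElement("p");
        p.setTextContent("first");
        body.appendChild(p);
        Element span = doc.createElement("span");
        body.appendChild(span);

        // Edit a node.
        p.setTextContent("edited");

        // Replace a node: the argument order is (newChild, oldChild).
        Element div = doc.createElement("div");
        body.replaceChild(div, span);

        // Remove a node.
        body.removeChild(p);

        return body.getFirstChild().getNodeName() + ":" + body.getChildNodes().getLength();
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

Because Aspose.HTML implements the same specification, its node methods are expected to carry similar names, but the exact classes and signatures should be checked against the API reference.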

Note

  • The string representation of a CSS Color value depends on the context, which can be either the ‘specified’ or the ‘computed’ state.
    • In the ‘computed’ state, a CSS Color is represented as an rgb/rgba function.
    • In the ‘specified’ state, a CSS Color is represented as the original string specified by the author.
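To make the two states concrete, the sketch below converts a ‘specified’ hex color string into the rgb() form used in the ‘computed’ state. It is plain Java and does not call Aspose.HTML; the class CssColorSketch and the method toComputed are hypothetical helpers, and only the six-digit #rrggbb notation is handled.

```java
// Hypothetical helper, not part of Aspose.HTML.
final class CssColorSketch {

    // Converts a specified "#rrggbb" value into its computed "rgb(r, g, b)" form.
    static String toComputed(String specified) {
        String s = specified.trim();
        if (!s.matches("#[0-9a-fA-F]{6}")) {
            throw new IllegalArgumentException("Only #rrggbb is supported in this sketch: " + specified);
        }
        int r = Integer.parseInt(s.substring(1, 3), 16);
        int g = Integer.parseInt(s.substring(3, 5), 16);
        int b = Integer.parseInt(s.substring(5, 7), 16);
        return "rgb(" + r + ", " + g + ", " + b + ")";
    }

    public static void main(String[] args) {
        String specified = "#FF8000";            // as written by the author
        String computed = toComputed(specified); // rgb(255, 128, 0)
        System.out.println(specified + " -> " + computed);
    }
}
```

The specified string is kept exactly as the author wrote it, including letter case, while the computed form is normalized to decimal channel values.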

The URL object is based on the Java Framework implementation and follows the W3C URL specification. The difference between the Framework and the W3C specification is the order of the constructor parameters.
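The parameter-order difference can be illustrated with the JDK alone, assuming that "Framework" here refers to java.net.URL. That class takes the base (context) first and the relative spec second, whereas the URL constructor in the web specification takes the URL string first and the base second. The class UrlOrderSketch and the method resolve are hypothetical names, and no Aspose.HTML type is used.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical illustration class; it uses java.net.URL, not the Aspose.HTML URL object.
final class UrlOrderSketch {

    // Resolves a relative reference against a base using the JDK order: (base, relative).
    @SuppressWarnings("deprecation") // the URL constructors are deprecated since Java 20
    static String resolve(String base, String relative) {
        try {
            URL context = new URL(base);
            return new URL(context, relative).toString();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        // JDK order: base first, relative second.
        // Web specification order, for comparison: new URL("images/logo.png", base).
        System.out.println(resolve("https://example.com/docs/index.html", "images/logo.png"));
    }
}
```

Which of the two orders the Aspose.HTML URL class uses should be confirmed in its API reference before relying on either.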

General Features

  • Written completely in Java and works with JRE.
  • JDK environment required.
  • Supports desktop and JSP applications.
  • API reference in HTML and Microsoft Help format.
  • Supports JDK 1.6 or higher.
  • 32-bit OS support.
  • 64-bit OS support.

Supported HTML versions

Aspose.HTML for Java supports HTML and (X)HTML.

Namespaces structure in API

[Figure: namespace structure of the API]

Text

  • Extract text from pages.
  • Search for text in pages.
  • Add text to an HTML file.

Document

  • Create, edit, remove and replace HTML nodes
  • Extract CSS styles for a particular HTML node
  • Convert HTML documents into various supported image formats: JPEG, PNG, BMP, TIFF
  • Convert HTML documents to PDF format
  • Convert HTML documents to XPS format