Logical Levels of Nodes in a Document
This documentation sometimes refers to a group of node classes as belonging to a “level” in a document, such as “block-level”, “inline-level” (also known as “inline”), or “row-level” nodes. These levels in a document are differentiated purely logically and are not explicitly expressed by inheritance or other Aspose.Words DOM means. The node level is used to describe the place in the document tree where the node would typically occur.
In the previous article, we already talked about the relationship between nodes and the fact that not all nodes are allowed to be a child of any nodes. For example, Cell can only be a Row child, and a Row can only be a Table child, and so on. These relationships are also applicable for the logical division of nodes into levels in the document.
The following sections describe the logical levels of nodes in Aspose.Words and the classes that belong to each level.
Document and Section Logical Level
A Word document consists of one or more sections, represented by the Section class and separated by section breaks. A section can define its own page size, margins, orientation, number of text columns, and headers and footers.
Document and Section level nodes have the structure as shown in the following diagram.
A section contains the main text, as well as headers and footers for the first, even, and odd pages. These different “flows” of text are called stories.
In Aspose.Words, the Section node contains the Body and HeaderFooter story nodes. The Body object stores the main text. The HeaderFooter objects store the text for each header and footer. The text of any story consists of paragraphs and tables, respectively represented by the Paragraph and Table objects of the Block-level.
Additionally, each Word document can contain a glossary, which is represented by the GlossaryDocument node in Aspose.Words. A glossary document contains BuildingBlocks, AutoText, and AutoCorrect entries.
GlossaryDocument includes BuildingBlock nodes representing different types of glossary document entries. Each BuildingBlock contains sections that can be inserted, removed, and copied into documents.
Block Logical Level
Block-level nodes represent containers for content and content controls, and can occur in the document tree child nodes in the following nodes:
- Body
- Header
- Footer
- Footnote
- Comment
- Shape
- GroupShape
- Cell
- StructuredDocumentTag
Block-level nodes are represented by the following classes:
- Tables and Paragraphs, which are the most important block-level nodes
- Bookmarks, which occurs both at the block-level and at the inline-level
- StructuredDocumentTag, which represent custom markup and can contain both content and content controls
The following diagram shows the block-level elements.
Inline Logical Level
Inline-level nodes represent the actual content of the document and can be contained in the following containers:
- Paragraph – the most common container
- SmartTag
- StructuredDocumentTag
Inline-level elements are represented by the following classes:
- Run – runs of text formatted differently
- BookmarkStart and BookmarkEnd represent bookmarks
- [CommentRangeStart]https://reference.aspose.com/words/java/com.aspose.words/comment/RangeStart), CommentRangeEnd, Comment and Footnote represent annotations
- FieldStart, FieldChar, FieldSeparator and FieldEnd that represent field characters, and FormField represent Word fields
- SpecialChar represents special characters in the document
- Shape and GroupShape represent shapes, drawings, images, etc.
- SmartTag and StructuredDocumentTag represent custom markup
The following diagram shows the inline-level nodes structure.
Shapes in Microsoft Word include Office Art AutoShapes, text boxes, images, OLE objects, and ActiveX controls, all of which are represented using the Shape
class. Some shapes can also contain text, so Shape nodes in Aspose.Words can contain block-level nodes.
Shapes can be grouped inside each other using GroupShape nodes.
Table, Row, and Cell Node Level
The table consists of nodes of rows and cells. Table elements are represented by the following classes:
- Row represents a table row
- Cell represents a table cell
- StructuredDocumentTag represent custom markup
The following diagram shows the node structures of the Table, Row, and Cell levels.