Delete redundant spaces after line break while importing HTML

Effect of setting the HtmlLoadOptions.DeleteRedundantSpaces property to false and true

The following screenshot shows the effect of setting this property to false and true.

todo:image_alt_text

Delete redundant spaces after line break while importing HTML

The following sample code shows the usage of the HtmlLoadOptions.DeleteRedundantSpaces  property. Please set it true or false to get the output as shown in the above screenshot.

// For complete examples and data files, please go to https://github.com/aspose-cells/Aspose.Cells-for-Java
// The path to the documents directory
String dataDir = Utils.getSharedDataDir(DeleteRedundantSpacesFromHtml.class) + "TechnicalArticles/";
// Sample Html containing redundant spaces after <br> tag
String html = "<html>" + "<body>" + "<table>" + "<tr>" + "<td>" + "<br> This is sample data"
+ "<br> This is sample data" + "<br> This is sample data" + "</td>" + "</tr>" + "</table>"
+ "</body>" + "</html>";
// Convert Html to byte array
byte[] byteArray = html.getBytes();
// Set Html load options and keep precision true
HtmlLoadOptions loadOptions = new HtmlLoadOptions(LoadFormat.HTML);
loadOptions.setDeleteRedundantSpaces(true);
// Convert byte array into stream
java.io.ByteArrayInputStream stream = new java.io.ByteArrayInputStream(byteArray);
// Create workbook from stream with Html load options
Workbook workbook = new Workbook(stream, loadOptions);
// Access first worksheet
Worksheet worksheet = workbook.getWorksheets().get(0);
// Auto fit the sheet columns
worksheet.autoFitColumns();
// Save the workbook
workbook.save(dataDir + "DRSFromHtml_out-" + loadOptions.getDeleteRedundantSpaces() + ".xlsx", SaveFormat.XLSX);
System.out.println("File saved");