导入HTML时删除换行后多余空格
Contents
[
Hide
]
请使用HtmlLoadOptions.DeleteRedundantSpaces属性,并将其设置为true以删除换行标记后的所有多余空格。默认情况下,此属性为false,并且输出的Excel文件中保留了多余的空格。
将HtmlLoadOptions.DeleteRedundantSpaces属性设置为false和true的效果
以下截图显示了将此属性设置为false和true的效果。
在导入HTML时删除换行后的多余空格
以下示例代码显示了HtmlLoadOptions.DeleteRedundantSpaces属性的用法。请将其设置为true或false,以获得上述截图中显示的输出。
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
// For complete examples and data files, please go to https://github.com/aspose-cells/Aspose.Cells-for-Java | |
// The path to the documents directory | |
String dataDir = Utils.getSharedDataDir(DeleteRedundantSpacesFromHtml.class) + "TechnicalArticles/"; | |
// Sample Html containing redundant spaces after <br> tag | |
String html = "<html>" + "<body>" + "<table>" + "<tr>" + "<td>" + "<br> This is sample data" | |
+ "<br> This is sample data" + "<br> This is sample data" + "</td>" + "</tr>" + "</table>" | |
+ "</body>" + "</html>"; | |
// Convert Html to byte array | |
byte[] byteArray = html.getBytes(); | |
// Set Html load options and keep precision true | |
HtmlLoadOptions loadOptions = new HtmlLoadOptions(LoadFormat.HTML); | |
loadOptions.setDeleteRedundantSpaces(true); | |
// Convert byte array into stream | |
java.io.ByteArrayInputStream stream = new java.io.ByteArrayInputStream(byteArray); | |
// Create workbook from stream with Html load options | |
Workbook workbook = new Workbook(stream, loadOptions); | |
// Access first worksheet | |
Worksheet worksheet = workbook.getWorksheets().get(0); | |
// Auto fit the sheet columns | |
worksheet.autoFitColumns(); | |
// Save the workbook | |
workbook.save(dataDir + "DRSFromHtml_out-" + loadOptions.getDeleteRedundantSpaces() + ".xlsx", SaveFormat.XLSX); | |
System.out.println("File saved"); |