Find and Replace
You can easily navigate within your document using a keyboard and mouse, but if you have many pages to scroll through, it will take quite a while to find specific text in a long document. It will be more time consuming when you want to replace certain characters or words that you have used in your document. The “Find and replace” functionality enables you to find a sequence of characters in a document and replace it with another sequence of characters.
Aspose.Words allows you to find a specific string or regular expression pattern in your document and replace it with an alternative without installing and using additional applications such as Microsoft Word. This will speed up many typing and formatting tasks, potentially saving you hours of work.
This article explains how to apply string replacement and regular expressions with the support of metacharacters.
Ways to Find and Replace
Aspose.Words provides two ways to apply the find and replace operation by using the following:
- Simple string replacement – to find and replace a specific string with another, you need to specify a search string (alphanumeric characters) that is going to be replaced according to all occurrences with another specified replacement string. Both strings must not contain symbols. Take into account that string comparison can be case-sensitive, or you may be unsure of spelling or have several similar spellings.
- Regular expressions – to specify a regular expression to find the exact string matches and replace them according to your regular expression. Note that a word is defined as being made up of only alphanumeric characters. If replacement is executed with only whole words being matched and the input string happens to contain symbols, then no phrases will be found.
In addition, you can use special metacharacters with simple string replacement and regular expressions to specify breaks within the find and replace operation.
Aspose.Words presents the find and replace functionality with the Aspose.Words.Replacing namespace. You can work with many options during the find and replace process using FindReplaceOptions class.
Find and Replace Text Using Simple String Replacement
You can use one of the Replace methods to find or replace a particular string and return the number of replacements that were made. In this case, you can specify a string to be replaced, a string that will replace all its occurrences, whether the replacement is case-sensitive, and whether only stand-alone words will be affected.
The following code example shows how to find the string “CustomerName” and replace it with the string “James Bond”:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
Document doc = new Document(); | |
DocumentBuilder builder = new DocumentBuilder(doc); | |
builder.Writeln("Hello _CustomerName_,"); | |
doc.Range.Replace("_CustomerName_", "James Bond", new FindReplaceOptions(FindReplaceDirection.Forward)); | |
doc.Save(ArtifactsDir + "FindAndReplace.ReplaceWithString.docx"); |
You can notice the difference between the document before applying simple string replacement:

And after applying simple string replacement:

Find and Replace Text Using Regular Expressions
A regular expression (regex) is a pattern that describes a certain sequence of text. Suppose you want to replace all double occurrences of a word with a single word occurrence. Then you can apply the following regular expression to specify the double-word pattern: ([a-zA-Z]+) \1
.
Use the other Replace method to search and replace particular character combinations by setting the Regex
parameter as the regular expression pattern to find matches.
The following code example shows how to replace strings that match a regular expression pattern with a specified replacement string:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
Document doc = new Document(); | |
DocumentBuilder builder = new DocumentBuilder(doc); | |
builder.Writeln("sad mad bad"); | |
FindReplaceOptions options = new FindReplaceOptions(); | |
doc.Range.Replace(new Regex("[s|m]ad"), "bad", options); | |
doc.Save(ArtifactsDir + "FindAndReplace.ReplaceWithRegex.docx"); |
You can notice the difference between the document before applying string replacement with regular expressions:

And after applying string replacement with regular expressions:

Find and Replace String Using Metacharacters
You can use metacharacters in the search string or the replacement string if a particular text or phrase is composed of multiple paragraphs, sections, or pages. Some of the metacharacters include &p for a paragraph break, &b for a section break, &m for a page break, and &l for a line break.
The following code example shows how to replace text with paragraph and page break:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
Document doc = new Document(); | |
DocumentBuilder builder = new DocumentBuilder(doc); | |
builder.Font.Name = "Arial"; | |
builder.Writeln("First section"); | |
builder.Writeln(" 1st paragraph"); | |
builder.Writeln(" 2nd paragraph"); | |
builder.Writeln("{insert-section}"); | |
builder.Writeln("Second section"); | |
builder.Writeln(" 1st paragraph"); | |
FindReplaceOptions findReplaceOptions = new FindReplaceOptions(); | |
findReplaceOptions.ApplyParagraphFormat.Alignment = ParagraphAlignment.Center; | |
// Double each paragraph break after word "section", add kind of underline and make it centered. | |
int count = doc.Range.Replace("section&p", "section&p----------------------&p", findReplaceOptions); | |
// Insert section break instead of custom text tag. | |
count = doc.Range.Replace("{insert-section}", "&b", findReplaceOptions); | |
doc.Save(ArtifactsDir + "FindAndReplace.ReplaceTextContainingMetaCharacters.docx"); |
Find and Replace String in Header/Footer of a Document
You can find and replace text in the header/footer section of a Word document using the HeaderFooter class.
The following code example shows how to replace the text of the header section in your document:
You can notice the difference between the document before applying footer string replacement:

And after applying footer string replacement:

Ignore Text During Find and Replace
While applying the find and replace operation, you can ignore certain segments of the text. So, certain parts of the text can be excluded from the search, and the find and replace can be applied only to the remaining parts.
Aspose.Words provides many find and replace properties for ignoring text such as IgnoreDeleted, IgnoreFieldCodes, IgnoreFields, IgnoreFootnotes, and IgnoreInserted.
The following code example shows how to ignore text inside delete revisions:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
Document doc = new Document(); | |
DocumentBuilder builder = new DocumentBuilder(doc); | |
// Insert non-revised text. | |
builder.Writeln("Deleted"); | |
builder.Write("Text"); | |
// Remove first paragraph with tracking revisions. | |
doc.StartTrackRevisions("author", DateTime.Now); | |
doc.FirstSection.Body.FirstParagraph.Remove(); | |
doc.StopTrackRevisions(); | |
FindReplaceOptions options = new FindReplaceOptions { IgnoreDeleted = true }; | |
Regex regex = new Regex("e"); | |
doc.Range.Replace(regex, "*", options); | |
Console.WriteLine(doc.GetText()); | |
options.IgnoreDeleted = false; | |
doc.Range.Replace(regex, "*", options); | |
Console.WriteLine(doc.GetText()); |
Customize Find and Replace Operation
Aspose.Words provides many different properties to find and replace text such as applying specific format with ApplyFont and ApplyParagraphFormats properties, using substitutions in replacement patterns with UseSubstitutions property, and others.
The following code example shows how to highlight a specific word in your document:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
Document doc = new Document(MyDir + "Footer.docx"); | |
FindReplaceOptions options = new FindReplaceOptions(); | |
options.ApplyFont.HighlightColor = Color.DarkOrange; | |
doc.Range.Replace(new Regex("(header|footer)"), "", options); |
Aspose.Words allows you to use IReplacingCallback interface to create and call a custom method during a replace operation. You may have some use cases where you need to customize the find and replace operation such as replacing text specified with a regular expression with HTML tags, so basically you will apply replace with inserting HTML.
If you need to replace a string with an HTML tag, apply the IReplacingCallback interface to customize the find and replace operation so the match starts at the beginning of a run with the match node of your document. Let us provide several examples of using IReplacingCallback.
The following code example shows how to replace text specified with HTML:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
public void ReplaceWithHtml() | |
{ | |
Document doc = new Document(); | |
DocumentBuilder builder = new DocumentBuilder(doc); | |
builder.Writeln("Hello <CustomerName>,"); | |
FindReplaceOptions options = new FindReplaceOptions(); | |
options.ReplacingCallback = new ReplaceWithHtmlEvaluator(options); | |
doc.Range.Replace(new Regex(@" <CustomerName>,"), string.Empty, options); | |
doc.Save(ArtifactsDir + "FindAndReplace.ReplaceWithHtml.docx"); | |
} | |
private class ReplaceWithHtmlEvaluator : IReplacingCallback | |
{ | |
internal ReplaceWithHtmlEvaluator(FindReplaceOptions options) | |
{ | |
mOptions = options; | |
} | |
/// <summary> | |
/// NOTE: This is a simplistic method that will only work well when the match | |
/// starts at the beginning of a run. | |
/// </summary> | |
ReplaceAction IReplacingCallback.Replacing(ReplacingArgs args) | |
{ | |
DocumentBuilder builder = new DocumentBuilder((Document) args.MatchNode.Document); | |
builder.MoveTo(args.MatchNode); | |
// Replace '<CustomerName>' text with a red bold name. | |
builder.InsertHtml("<b><font color='red'>James Bond, </font></b>"); | |
args.Replacement = ""; | |
return ReplaceAction.Replace; | |
} | |
private readonly FindReplaceOptions mOptions; | |
} |
The following code example shows how to prepend a line number to each line:
// For complete examples and data files, please go to https://github.com/aspose-words/Aspose.Words-for-.NET.git. | |
public void LineCounter() | |
{ | |
Document doc = new Document(); | |
DocumentBuilder builder = new DocumentBuilder(doc); | |
builder.Writeln("This is first line"); | |
builder.Writeln("Second line"); | |
builder.Writeln("And last line"); | |
// Prepend each line with line number. | |
FindReplaceOptions opt = new FindReplaceOptions() { ReplacingCallback = new LineCounterCallback() }; | |
doc.Range.Replace(new Regex("[^&p]*&p"), "", opt); | |
doc.Save(ArtifactsDir + "FindAndReplace.LineCounter.docx"); | |
} | |
internal class LineCounterCallback : IReplacingCallback | |
{ | |
public ReplaceAction Replacing(ReplacingArgs args) | |
{ | |
Debug.WriteLine(args.Match.Value); | |
args.Replacement = string.Format("{0} {1}", mCounter++, args.Match.Value); | |
return ReplaceAction.Replace; | |
} | |
private int mCounter = 1; | |
} |