Analyzing your prompt, please hold on...
An error occurred while retrieving the results. Please refresh the page and try again.
Suppose you are a web developer, graphic designer, researcher, journalist, or student, in a word, a person working on his own project. In that case, you will obviously need images for your project in the development process. To download images from website manually, you spend a very long time. You will have to copy each image URL and then save the image, but you can use Aspose.HTML for .NET library to extract images from website programmatically.
In this article we demonstrate how to extract both standard images (<img> tags) and website icons (<link rel="icon">) using the Aspose.HTML for .NET API. Automating the download process is far more efficient than hunting for files manually. Let’s get started!
Most pictures in an HTML document are represented using the <img> element. Here is an example of how to use Aspose.HTML for .NET to find images specified by this element. So, to download images from website, you should take a few following steps:
HTMLDocument class and pass it the URL of the website from which you want to find images.<img> elements. The method returns a list of the HTML document’s <img> elements.Select() method to create a distinct collection of relative image URLs and the
GetAttribute(“src”) method to extract the src attribute of each <img> element.HTMLDocument class.File.WriteAllBytes() method to save each image to a local file. 1// Extract images from website using C#
2
3// Open a document you want to download images from
4using HTMLDocument document = new HTMLDocument("https://docs.aspose.com/svg/net/drawing-basics/svg-shapes/");
5
6// Collect all <img> elements
7HTMLCollection images = document.GetElementsByTagName("img");
8
9// Create a distinct collection of relative image URLs
10IEnumerable<string> urls = images.Select(element => element.GetAttribute("src")).Distinct();
11
12// Create absolute image URLs
13IEnumerable<Url> absUrls = urls.Select(src => new Url(src, document.BaseURI));
14
15foreach (Url url in absUrls)
16{
17 // Create an image request message
18 using RequestMessage request = new RequestMessage(url);
19
20 // Extract image
21 using ResponseMessage response = document.Context.Network.Send(request);
22
23 // Check whether a response is successful
24 if (response.IsSuccess)
25 {
26 // Save image to a local file system
27 File.WriteAllBytes(Path.Combine(OutputDir, url.Pathname.Split('/').Last()), response.Content.ReadAsByteArray());
28 }
29}Note: It is important to adhere to copyright laws and obtain proper permission or licensing before using saved images for commercial purposes. We do not support data extraction and use of other people’s files for commercial purposes without their permission.
Icons are a kind of image in HTML documents that are specified using <link> elements with the rel attribute set to icon. Let’s look at how to extract icons from website using the Aspose.HTML for .NET library:
HTMLDocument class and pass it the URL of the website from which you want to extract icons.<link> elements.Where() method that filters the collection based on the link => link.GetAttribute("rel") == "icon" expression. Thus, the icons collection will contain only links with a rel attribute with the value icon.Select() method to create a distinct collection of relative icon URLs and the
GetAttribute(“href”) method to extract the href attribute of each <link> element.HTMLDocument class.File.WriteAllBytes() method to save icons to a local file. As a result, you will have a collection of icons from website in your local folder. 1// Download icons from website using C#
2
3// Open a document you want to download icons from
4using HTMLDocument document = new HTMLDocument("https://docs.aspose.com/html/net/message-handlers/");
5
6// Collect all <link> elements
7HTMLCollection links = document.GetElementsByTagName("link");
8
9// Leave only "icon" elements
10IEnumerable<Element> icons = links.Where(link => link.GetAttribute("rel") == "icon");
11
12// Create a distinct collection of relative icon URLs
13IEnumerable<string> urls = icons.Select(icon => icon.GetAttribute("href")).Distinct();
14
15// Create absolute icon URLs
16IEnumerable<Url> absUrls = urls.Select(src => new Url(src, document.BaseURI));
17
18foreach (Url url in absUrls)
19{
20 // Create a downloading request
21 using RequestMessage request = new RequestMessage(url);
22
23 // Extract icon
24 using ResponseMessage response = document.Context.Network.Send(request);
25
26 // Check whether a response is successful
27 if (response.IsSuccess)
28 {
29 // Save icon to a local file system
30 File.WriteAllBytes(Path.Combine(OutputDir, url.Pathname.Split('/').Last()), response.Content.ReadAsByteArray());
31 }
32}You can use these C# examples to automate extracting all images from website, which can be helpful for tasks such as archiving, researching, analyzing web content, or any other application for personal use. Also, this is great for web designers and developers wanting to pull images from sites without diving into the source code.
| Problem | Cause | Solution |
|---|---|---|
| Images are not downloaded | src attribute contains a data URI or empty string | Skip data URIs (src.StartsWith("data:")) and filter out empty values before downloading |
| Relative URLs resolve incorrectly | BaseURI is missing or the page uses <base> tag | Use document.BaseURI or read the <base> element and combine it with the relative path |
| HTTP errors (404/403) stop the script | No error handling around Network.Send | Check response.IsSuccess and continue the loop on failure |
| Files are overwritten | Duplicate file names from different URLs | Generate unique names (e.g., include a counter or hash of the URL) |
| Icons are missed | rel can contain multiple values like "shortcut icon", so == "icon" fails. | Split rel by spaces and check if it contains "icon". Also match "shortcut" or "apple-touch-icon" if needed. |
Aspose.HTML for .NET makes extracting images and icons from any web page straightforward and reliable. By leveraging the DOM API and built‑in networking features, you can automate bulk downloads, archive web content, or gather assets for research without manual effort. The provided examples cover both standard images and favicon extraction, complete with error handling and best‑practice tips.
You can download the complete C# examples and data files from GitHub.
Analyzing your prompt, please hold on...
An error occurred while retrieving the results. Please refresh the page and try again.