Skip to main content
Lection Logolection
LectionFree Tool

XML Sitemap Viewer

Enter any domain or sitemap URL to view all pages, last modified dates, and export the full list as CSV or JSON.

Try: wikipedia.org, github.com, nytimes.com, or any site you want to explore

What is an XML sitemap?

An XML sitemap is a file that lists every important page on a website. It tells search engines like Google and Bing where to find content and how often pages are updated. Most sites store their sitemap at /sitemap.xml, though larger sites use a sitemap index file that links to multiple sub-sitemaps organized by section or content type.

For web scrapers and researchers, sitemaps are invaluable. Rather than crawling a site page-by-page, you can read its sitemap to get a complete list of URLs upfront, including last-modified dates that tell you which pages have changed recently.

How to use this sitemap viewer

  1. Enter a domain (e.g., example.com) or paste a direct sitemap URL.
  2. The tool auto-discovers the sitemap by trying /sitemap.xml and parsing robots.txt.
  3. If a sitemap index is returned, click Load on any sub-sitemap to drill in.
  4. Filter URLs with the search box to find specific pages or sections.
  5. Download the full URL list as CSV or JSON for use in spreadsheets or scraping tools.

Common use cases

  • Build a seed URL list before scraping a site with a tool like Lection.
  • Check a competitor's content structure and publishing frequency.
  • Verify that your own sitemap contains all the pages you expect.
  • Find recently updated pages using last-modified dates for monitoring campaigns.
  • Audit a site before migration to ensure no pages are missing from the new sitemap.
Want to schedule and automate these scrapes? Create a free account to get started.Create Account

Frequently asked questions

What is a sitemap index file?

A sitemap index is a master sitemap that links to multiple child sitemaps. Large sites use them to stay within the 50,000 URL limit per sitemap file. This viewer detects index files automatically and lets you load each sub-sitemap individually.

Why does this tool show fewer URLs than the sitemap count?

To keep results fast and readable, this viewer shows the first 500 URLs from a single sitemap file. Download the CSV or JSON to get the complete list. For sitemap indexes, load each sub-sitemap separately.

What does the lastmod date mean?

The lastmod field tells search engines (and you) when a page was last updated. It is set by the website owner, so accuracy varies — some sites update it with every CMS publish, others leave it stale.

Why can't the tool find a sitemap for a site I know has one?

Some sites use non-standard sitemap paths, block server-to-server requests, or generate sitemaps dynamically. Try pasting the full sitemap URL directly (e.g., example.com/sitemap_index.xml) instead of just the domain.

How is this useful for web scraping?

Sitemaps give you a complete, structured list of all URLs on a site before you start scraping. Instead of following links page-by-page, you can download the sitemap, filter for the pages you want, and feed that list directly into your scraper for faster, more targeted data collection.

Related resources