/ Site Crawler Fixes / Sitemap Mismatches

How to Fix Sitemap Mismatches

Your sitemap tells Google "these are the URLs I want indexed". Every URL in the sitemap is a promise. When promises break — 404s, redirects, noindex pages, missing URLs — Google's trust in your sitemap as a crawl-priority signal degrades. This guide walks through every category of sitemap mismatch and the fix for each.

1. Run the sitemap audit

Step 1
Run the Site Crawler
Run the Site Crawler with Sitemap audit enabled. The crawler fetches every URL in your sitemap and checks status code, indexability, canonical alignment and lastmod accuracy.
Step 2
Review the categorised report
Mismatches are grouped:
  • URLs returning 404, 410, 5xx
  • URLs that 301 to a different URL
  • URLs with noindex meta or X-Robots-Tag
  • URLs canonical-tagged to a different URL
  • URLs with implausibly recent lastmod
  • URLs missing from sitemap but reachable on the site

2. Fix 404s and 410s in sitemap

Step 1
Decide per URL: restore or remove
For each 404 in the sitemap: is the URL meant to exist?
  • Yes (accidental deletion, broken route) → restore the page
  • No (intentional removal) → remove from the sitemap; 410 or 301 as appropriate
Step 2
Remove from sitemap
How depends on your sitemap source:
  • WordPress (Yoast/Rank Math): sitemap auto-regenerates as you delete pages. If the dead URL still appears after deletion, clear sitemap cache via plugin settings.
  • Shopify: sitemap auto-managed; deleted products fall out within 24-48h.
  • Static/headless: rebuild your sitemap generator. Make sure the URL discovery step queries actual live pages, not a cached source.

3. Remove redirects from sitemap

Step 1
Update sitemap to use canonical URLs
If /old-url/ 301s to /new-url/, your sitemap should contain /new-url/, not /old-url/. Many sitemap generators export the old URL because their source-of-truth was last updated before the redirect.
⚠️ Having redirects in your sitemap doesn't break SEO — Google follows the redirect — but it wastes crawl budget and degrades sitemap trust. Clean up at every regeneration.

4. Remove noindex pages from sitemap

Step 1
Decide per URL: drop the noindex or drop from sitemap
A page that says noindex shouldn't be in your sitemap. Either:
  • Remove the noindex if the page is actually worth indexing
  • Remove the URL from the sitemap if the noindex is correct

5. Add missing URLs

Step 1
Review crawler-found, sitemap-missing list
URLs the Site Crawler reached but that aren't in the sitemap are signalling-gaps. Most should be added.
Step 2
Find the sitemap generator's gap
Common causes: custom post types in WordPress that the SEO plugin doesn't include by default, dynamic routes in a headless app that the sitemap script doesn't traverse, products marked "hidden" in Shopify that still have public URLs.

6. Fix stale lastmod

Step 1
Verify lastmod source
Open your sitemap.xml and look at the <lastmod> values. If they all match the same date (likely your last deploy), your sitemap generator is using build time, not content modification time. Bad signal to Google.
Step 2
Fix the generator
Sitemap generators should read each page's actual last-modified date from the CMS. Examples:
  • WordPress: post_modified is correct; do NOT use post_date alone
  • Headless CMS: use the updatedAt field, not query time
  • Static sites: use git log dates per content file
If you can't get accurate lastmod, omit it entirely. Empty is better than wrong.

7. Regenerate and submit

Step 1
Regenerate
Force a regeneration of the sitemap from your generator. Verify the new sitemap reflects your fixes by spot-checking URLs:
curl -s https://yourdomain.com/sitemap.xml | head -50
Step 2
Submit in Search Console
Search Console → Sitemaps. Confirm your sitemap is listed with no errors. If you have a sitemap index, submit the index URL only — Google reads it and discovers the child sitemaps.

🕷 Re-run the Site Crawler

Verify sitemap mismatches are resolved.

Run Site Crawler →
Related Guides: Site Crawler Fixes  ·  Fix Robots Blocks  ·  Robots & Sitemap Fixes  ·  Robots & Sitemap Guide
💬 Got a problem?