/ Site Crawler Fixes / Orphan Pages

How to Fix Orphan Pages

An orphan page is a URL that no other page on your site links to. It exists, it's in your sitemap, Google can find it, but nothing inside your site endorses it. Orphan pages typically rank poorly because they have zero internal link equity, and they accumulate over time as sites grow. This guide walks through identifying orphans, deciding which to keep, building proper inbound links, and preventing future orphans from being created.

1. Build the orphan list

The Site Crawler identifies orphans by comparing what it can reach by crawling internal links against your declared sitemap.xml. Pages in the sitemap that the crawl never reaches are orphans.

Step 1
Run the Site Crawler with sitemap cross-reference
Run a fresh Site Crawler with the option Include sitemap URLs enabled. The crawler crawls from your homepage and follows internal links; then it checks every URL in sitemap.xml against what it found. URLs in the sitemap but not reached are orphans.
Step 2
Export the orphan list
Export to CSV. Columns: orphan URL, page title, last-modified date, sitemap inclusion. You'll often see 5-15% of indexed pages on mid-sized sites flagged as orphans.

2. Decide per page: keep or delete

Don't make a global decision. Each orphan is a separate keep-or-delete call based on its traffic potential and strategic role.

SignalDecision
Has Search Console impressions or clicks last 90 daysKEEP — build inbound links
Has external backlinks (Search Console Links report)KEEP — build inbound links and amplify
Zero traffic, zero backlinks, no strategic roleDELETE — 410 Gone
Zero traffic but related content exists301 REDIRECT to related content
Deliberate landing page (campaign, PPC, partner)KEEP — exclude from sitemap; no-index if appropriate

3. Build inbound links for keepers

Step 1
Find related parent content
For each orphan worth keeping, identify 3-5 pages that should contextually link to it. Use the site search (site:yourdomain.com [topic] in Google) or your CMS search to find pages about related topics.
Step 2
Add contextual links
Edit each related page to add a contextual link to the orphan, ideally in the body content (not just a "related posts" widget). Use descriptive anchor text — not "click here" or "read more". The first link in body content carries more weight than a sidebar link.
Step 3
Add to hub pages
If you have category/hub pages, add the previously-orphan URL to the relevant hub. Hub pages aggregate topical authority and pass it to linked spokes.
💡 Adding ONE good contextual link from a relevant high-authority page does more than five sidebar links from low-authority pages. Quality of source matters more than quantity.

4. Handle the delete-or-redirect orphans

301 Redirect

For orphans with backlinks but no strategic role, 301 to the closest related live page. Set up via:

# WordPress: Redirection plugin, add rule
# nginx: rewrite ^/old-orphan/$ /new-target/ permanent;
# Apache: Redirect 301 /old-orphan/ /new-target/
# Shopify: Online Store > Navigation > URL Redirects

410 Gone (clean deletion)

For orphans with zero traffic and zero links, 410 is cleaner than 404. It tells Google the removal is intentional, and Google processes 410s faster than 404s.

# nginx
location = /old-orphan/ {
    return 410;
}

# Apache .htaccess
Redirect 410 /old-orphan/

Then remove the URL from sitemap.xml to stop Google re-discovering it.

5. Prevent future orphans

Step 1
Editorial policy: no publish without inbound link
Require any new page to have at least one inbound internal link from a relevant parent page before publishing. Either via editorial workflow or CMS validation (custom plugin that checks for inbound links before allowing publish).
Step 2
Quarterly orphan audit
Orphans accumulate as sites grow. Add an orphan audit to a quarterly SEO routine. Catch new orphans early when content memory is fresh and decisions are easier.

🕷 Re-run the Site Crawler

Verify orphan count has dropped after your fixes.

Run Site Crawler →
Related Guides: Site Crawler Fixes  ·  Fix Broken Links  ·  Fix Thin Content  ·  Site Crawler Guide
💬 Got a problem?