HTTP errors silently damage your SEO every day. Google crawls your site, hits a 404 or a 500, and leaves empty-handed — wasting crawl budget, losing rankings, and sending users to dead ends. Here is everything you need to know about finding and fixing them.
🔴 Run HTTP Error Audit All Performance Tools →Every time a browser or search engine requests a page from your server, the server responds with a three-digit status code. Codes starting with 4 are client errors — the request was wrong or the page does not exist. Codes starting with 5 are server errors — something broke on your end.
| Code | Name | What it means | SEO impact |
|---|---|---|---|
| 404 | Not Found | Page does not exist | Wastes crawl budget, removes page from index |
| 410 | Gone | Page permanently deleted | Faster deindex than 404 — use intentionally |
| 403 | Forbidden | Access denied | Google cannot crawl the page — treated like a soft block |
| 401 | Unauthorised | Login required | Page will not be indexed |
| 408 | Request Timeout | Server took too long to respond | Googlebot may abandon and not return |
| 429 | Too Many Requests | Rate limited | Googlebot backs off — can slow down full crawl |
| Code | Name | What it means | SEO impact |
|---|---|---|---|
| 500 | Internal Server Error | Generic server crash | Page may be deindexed if persistent |
| 502 | Bad Gateway | Upstream server failed | Signals unreliable infrastructure to Google |
| 503 | Service Unavailable | Server overloaded or in maintenance | Temporary — Google retries. Persistent = deindex risk |
| 504 | Gateway Timeout | Upstream took too long | Same as 503 — retry behaviour from Googlebot |
Yes — and in multiple ways.
Google allocates a crawl budget to every site — a limit on how many pages it will crawl in a given period. Every 404 Googlebot visits is a wasted crawl. On large sites with hundreds of broken URLs, this means important pages get crawled less often or not at all.
If a page that previously ranked starts returning 404, Google will eventually remove it from the index. Any backlinks pointing to that URL stop passing authority. If you do not redirect the URL, those rankings and that link equity are gone.
Users who land on a 404 page leave immediately. High bounce rates on error pages are a negative user experience signal. On e-commerce sites, a 404 on a product page means a lost sale.
There are three main methods — ranked by thoroughness:
The HTTP Error Audit at AIWebPageSEO crawls your entire site, follows every internal link, and records the HTTP status code returned by each page. Results are shown in a filterable table — filter by 4xx errors, 5xx errors, redirects, or 200 OK. You can export to CSV for a full report.
It works by starting at your homepage, extracting all internal links, visiting each one, and repeating — exactly as Googlebot does. The scan size is tiered by number of pages:
| Scan size | Best for | Cost |
|---|---|---|
| 10 pages | Small sites, landing pages, quick checks | 10p |
| 50 pages | Blogs, small business sites | 50p |
| 100 pages | Medium sites, e-commerce stores | £5.00 |
| 1,000 pages | Large sites, agencies, full domain audits | £9.99 |
Google Search Console's Coverage report shows pages returning 404 or other errors that Googlebot has encountered. The limitation: it only shows pages Google has already tried to crawl. New broken pages may take days or weeks to appear. It also does not show errors on pages that are not yet in Google's index.
Your web server logs every request and its status code. Parsing your NGINX or Apache access logs will reveal every 4xx and 5xx response across all traffic — not just Googlebot. The downside is that logs require technical access and manual processing, and they do not show you which pages linked to the broken URLs.
For each 404 you find, you have four options:
500 errors are server-side — check your error logs immediately. Common causes and fixes:
The right frequency depends on how often your site changes:
A soft 404 is a page that returns HTTP 200 OK but displays a "page not found" message. The server says everything is fine — but the content is an error page. Search engines can detect these and may deindex them.
Common causes of soft 404s:
Fix soft 404s by ensuring your error pages return the correct HTTP status code — 404 or 410 — not 200.
Run the HTTP Error Audit and get a complete list of every 4xx and 5xx error across your site — filtered, sortable, and exportable as CSV. From 10p for small sites up to £9.99 for 1,000 pages.
Run HTTP Error Audit →