How to Fix Every Ecommerce Crawl Finding
The Ecommerce Crawl is a specialised crawler for online stores: it understands faceted navigation, product variants, out-of-stock states, and the URL patterns specific to Shopify, WooCommerce, and Magento. Large stores can have 100x more URLs than content sites, so crawl budget matters more here than anywhere. This index covers every finding.
By finding type
Pick the finding matching yours:
Fix faceted URL explosion (planned)
Filter combinations create thousands of URL variants. Most are low-value. Robots.txt patterns to block and a canonical strategy to consolidate (Search Console's URL Parameters tool was retired in 2022, so it is no longer an option).
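As a rough illustration of how filter variants might be separated from pages worth crawling, the sketch below counts the facet parameters in a URL and flags anything above a threshold. The parameter names in FACET_PARAMS and the one-facet threshold are assumptions for the example, not rules the crawler is known to apply.

```python
from urllib.parse import urlsplit, parse_qsl

# Hypothetical facet parameter names; replace with the ones your store uses.
FACET_PARAMS = {"color", "size", "brand", "price", "sort"}


def facet_count(url: str) -> int:
    """Count the query parameters in `url` that are known facet filters."""
    query = urlsplit(url).query
    pairs = parse_qsl(query, keep_blank_values=True)
    return sum(1 for key, _ in pairs if key in FACET_PARAMS)


def is_low_value(url: str, max_facets: int = 1) -> bool:
    """Treat a URL as low-value once it stacks more than `max_facets` filters."""
    return facet_count(url) > max_facets
```

URLs flagged this way are candidates for a robots.txt block or a canonical pointing at the unfiltered listing; which one fits depends on whether the variant already has links or rankings.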
Fix duplicate product URLs (planned)
Same product at /products/widget and /collections/sale/products/widget. Canonical pointing at the primary URL. Internal links updated to the canonical. The Shopify-specific challenge.
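A minimal sketch of the consolidation step, assuming the Shopify-style paths named above: it maps a collection-scoped product URL to its primary /products/ URL. The function name is hypothetical, and the choice to drop the query string and fragment is an assumption that suits product URLs only.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Matches /collections/<handle>/products/<handle>, with an optional trailing slash.
_COLLECTION_PRODUCT = re.compile(r"^/collections/[^/]+(/products/[^/]+)/?$")


def canonical_product_url(url: str) -> str:
    """Return the primary /products/... URL for a product URL.

    Collection-scoped paths are rewritten; other paths are kept as they are.
    The query string and fragment are always dropped.
    """
    parts = urlsplit(url)
    match = _COLLECTION_PRODUCT.match(parts.path)
    path = match.group(1) if match else parts.path
    return urlunsplit((parts.scheme, parts.netloc, path, "", ""))
```

The same mapping can be used to check that each duplicate's canonical tag points at the primary URL and to find internal links that still use the collection-scoped path.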
Fix out-of-stock product handling (planned)
Delete (lose rankings), 410 (lose link equity), 301 to category (right answer usually), noindex (waste crawl), or keep live with stock-status schema (best for temporary OOS). The decision tree.
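One way to read the options above as a decision tree is sketched below. The two inputs and the fallback to a 410 when no relevant category exists are assumptions made for the example; the text above only says that keeping the page live suits temporary stock-outs and that a 301 to the category is usually right otherwise.

```python
from enum import Enum


class OosAction(Enum):
    KEEP_LIVE = "keep live with stock-status schema"
    REDIRECT_TO_CATEGORY = "301 to category"
    GONE = "410"


def out_of_stock_action(temporary: bool, has_relevant_category: bool) -> OosAction:
    """Pick a handling strategy for an out-of-stock product page."""
    if temporary:
        # The product is coming back: keep the URL and mark it out of stock.
        return OosAction.KEEP_LIVE
    if has_relevant_category:
        # Permanently gone, but a close category exists to receive the redirect.
        return OosAction.REDIRECT_TO_CATEGORY
    # Assumed fallback: nothing relevant to redirect to.
    return OosAction.GONE
```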
Fix orphan product pages (planned)
Products in the sitemap that no category links to. Common cause: a deprecated collection was deleted. The relink-or-delete decision, the bulk-fix workflow.
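The orphan check described here amounts to a set difference. The sketch below assumes you already have the sitemap's product URLs and a mapping from each category page to the product URLs it links to; both inputs and the function name are hypothetical.

```python
from typing import Dict, Iterable, List


def find_orphans(sitemap_urls: Iterable[str],
                 category_links: Dict[str, Iterable[str]]) -> List[str]:
    """Return sitemap URLs that no category page links to, sorted for stable output."""
    linked = set()
    for targets in category_links.values():
        linked.update(targets)
    return sorted(set(sitemap_urls) - linked)
```

Each URL in the result then goes through the relink-or-delete decision.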
Fix crawl budget waste (planned)
Googlebot crawling 10,000 URLs of which 9,000 are filter variants. The crawl-budget rebalancing: block low-value, prioritise high-value, monitor via log analysis.
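To monitor this from logs, a simple starting metric is the share of Googlebot-requested URLs that are filter variants. The sketch below assumes the URLs have already been extracted from the access log, and it treats any URL with a query string as a filter variant, which is a simplification you may need to tighten.

```python
from typing import Sequence
from urllib.parse import urlsplit


def filter_variant_share(crawled_urls: Sequence[str]) -> float:
    """Fraction of crawled URLs that carry a query string (0.0 for an empty log)."""
    if not crawled_urls:
        return 0.0
    variants = sum(1 for url in crawled_urls if urlsplit(url).query)
    return variants / len(crawled_urls)
```

In the example above, 9,000 filter variants out of 10,000 crawled URLs gives a share of 0.9.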
Fix variant URL strategy (planned)
Sizes, colours, options. Separate URLs (good for SEO, bad for canonicalisation) vs single URL with JS variants (good for canonicalisation, bad for individual variant indexing). The trade-off.
Fix collection vs category architecture (planned)
Shopify-specific: collections can overlap (a t-shirt in "Mens", "Sale", "Summer 2026"). The canonical-collection decision, the cross-collection SEO strategy.
Fix seasonal product handling (planned)
Christmas decorations stay live 365 days a year. The hide-vs-keep-live decision, the URL-preservation strategy for next year, the year-over-year content refresh pattern.
By platform
Crawl patterns by ecommerce platform:
Fix Shopify crawl issues (planned)
Collections-as-categories quirk, the variant URL strategy, the apps that explode crawl space, the robots.txt limitations.
Fix WooCommerce crawl issues (planned)
Product-cat / product-tag overlap, the attribute-page explosion, the .htaccess-rule patterns for crawl shaping.
Fix Magento / large-catalogue crawl (planned)
Layered navigation, the multi-store catalogue, the catalogue-search URL handling, the bulk-rule patterns at scale.
What our Ecommerce Crawl checks
The crawler maps every URL on your store, identifies faceted-nav explosion, finds duplicate product URLs, flags out-of-stock products, surfaces orphans, and measures crawl-budget efficiency. For the full reference, see the Ecommerce SEO Guide.
Crawl your store first
Most large stores discover 30-50% of their crawl budget is going to URLs that shouldn't be crawled at all.
Run Ecommerce Crawl