Google may ignore Sitemaps if they contain invalid URLs.


Google's John Mueller confirmed on Twitter that the search engine may stop fetching Sitemap.xml files if they contain invalid URLs. This does not happen if the URLs return content or redirect; in that case Google keeps trying them. Otherwise, Google will suspend retrieval.

“We will stop fetching Sitemaps if the URLs are invalid, but if you return content or redirect (which is recommended), we will keep trying them. However, this should not be a problem, since Sitemap files overall are just a tiny part of all the URLs that we fetch from a site,” explained Mueller.

Valentin Pletzer

@VorticonCmdr

Replying to @JohnMu

The sitemap files themselves. The HTTP-URLs are accessed nearly as often as the HTTPS.

🍌 John 🍌

@JohnMu

We’d stop fetching sitemap files if the URLs are invalid, but if you’re returning content or redirecting (which is kinda recommended), we’ll keep trying them. It shouldn’t cause problems, since overall sitemap files are only a tiny-tiny part of all URLs fetched from a site.


Recall that Google uses XML Sitemap data as one signal for determining canonical URLs, but it is a weaker signal than the rel="canonical" attribute.
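
For site owners who want to audit their own Sitemap, the sketch below shows one way to pull the listed URLs out of a Sitemap file and bucket an HTTP status code as content, redirect, or invalid. It is only an illustration: the function names `extract_locs` and `classify_status` are hypothetical, the sample URLs are placeholders, and treating every 4xx/5xx response as "invalid" is an assumption, since Mueller did not spell out what Google counts as an invalid URL.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol for <urlset> and <sitemapindex>.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def extract_locs(xml_text):
    """Return every <loc> URL found in a sitemap or sitemap index document."""
    root = ET.fromstring(xml_text)
    return [el.text.strip() for el in root.iter(SITEMAP_NS + "loc") if el.text]


def classify_status(status):
    """Bucket an HTTP status code.

    Assumption: anything outside 2xx/3xx is treated as 'invalid'.
    Google's actual criteria are not stated in the source.
    """
    if 200 <= status < 300:
        return "content"
    if 300 <= status < 400:
        return "redirect"
    return "invalid"


# Placeholder sitemap used only for illustration.
sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/old-page</loc></url>
</urlset>"""

urls = extract_locs(sample)
```

In practice, each URL in `urls` would be requested with an HTTP client that does not follow redirects automatically, and the returned status code passed to `classify_status`.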