XML Sitemaps & Indexing

XML Sitemap Submitted but Pages Are Not Indexed

A successful sitemap submission means Google could fetch and read the file—not that every listed URL will be indexed. Check the affected URLs in Page Indexing and URL Inspection, then correct crawlability, indexability, canonical, content, or internal-linking problems.

XML sitemap submitted successfully while pages remain unindexed in Google

Quick overview

Summary

Confirm the actual indexing status by filtering the Page Indexing report by sitemap and inspecting representative URLs individually.
Check each important URL’s signals, including its HTTP status, robots access, noindex directives, canonical URL, redirects, content, and internal links.
Fix or remove unsuitable URLs, request indexing for selected corrected pages, and resubmit the sitemap only after substantial sitemap changes or a fetch failure.

By Murat · Technical SEO Specialist

Seeing Success beside an XML sitemap in Google Search Console feels reassuring. Unfortunately, the reassurance can disappear quickly when the Page Indexing report shows that many—or even all—of the submitted pages remain unindexed.

It is the technical SEO equivalent of receiving confirmation that your application was delivered, then realizing nobody promised it would be approved.

The sitemap may be working exactly as intended. A sitemap helps Google discover your preferred URLs, but discovery, crawling, canonical selection, and indexing are separate stages. A page can therefore appear in a successfully processed sitemap and still remain unindexed because Google has not crawled it, cannot access it, has detected an indexing directive, considers it a duplicate, or has not selected it for inclusion in the index.

The correct response is not to keep submitting the same sitemap. First determine which submitted URLs are affected, whether they should be indexed, and which Search Console status applies. Then fix the page-level or site-wide cause, update the sitemap where necessary, and allow Google time to process the changes.

What Does “Success” Mean in the Sitemaps Report?

In the Google Search Console Sitemaps report, Success means that Google fetched and read the sitemap without a sitemap-level error.

It does not mean that:

Every URL was crawled
Every URL was accepted as canonical
Every URL was indexed
Every indexed URL will appear for searches
Google considers every submitted page useful or distinct

The Discovered pages number is also easy to misinterpret. It represents the page URLs Google parsed from the sitemap. It is not an indexed-page count.

A sitemap is therefore better understood as a discovery and canonical-preference signal—not a guaranteed admission ticket to Google’s index.

If the sitemap shows Success and the expected number of discovered URLs, the sitemap file itself is probably not the main problem. The next step is to investigate the submitted pages.

Why Are Fewer Pages Indexed Than the Sitemap Contains?

The number of indexed pages will often be lower than the number of URLs found in a sitemap because not every submitted URL is necessarily suitable for indexing.

For example, a sitemap may contain:

URLs that redirect elsewhere
Pages carrying a noindex directive
Duplicate or near-duplicate pages
URLs canonicalized to another page
Deleted pages returning 404 responses
Pages Google interprets as soft 404s
URLs blocked from crawling
New URLs Google has discovered but not crawled
Crawled pages Google has not selected for indexing

Some of these conditions are genuine mistakes. Others are expected exclusions.

A duplicate URL with a correct canonical pointing to an indexed primary version does not also need to be indexed. Similarly, a redirected URL should normally give way to its destination. The objective is not to force every URL into the index; it is to get the canonical version of every valuable, indexable page indexed.

Before attempting a fix, ask a more useful question:

Is this an important canonical page that should appear in Google Search?

If the answer is no, exclusion may be appropriate. The sitemap should then be corrected so it no longer presents that URL as a preferred indexing candidate.

What Should You Check First?

Start with the affected sitemap in Google Search Console rather than checking random URLs one by one.

1. Confirm the sitemap details

Open Indexing → Sitemaps and verify:

The status is Success
The last read date is reasonably recent
The discovered-page count is plausible
The correct sitemap or sitemap index was submitted
The sitemap belongs to the correct Search Console property

If the sitemap cannot be fetched or contains parsing errors, address that problem first. A sitemap that shows Success, however, usually requires page-level investigation.

2. Filter the Page Indexing report by sitemap

Open Indexing → Pages, select the relevant sitemap where the filtering option is available, and review the reasons under Why pages aren’t indexed.

Do not look only at the total unindexed count. Group the URLs by reason, affected template, content type, and business importance.

For example, 500 filtered faceted-navigation URLs may require a different response from five core service pages marked Crawled – currently not indexed.

For a more detailed workflow, see how to diagnose indexing issues in Google Search Console.

3. Inspect representative URLs

Choose representative examples from each important status group and open them in URL Inspection.

Compare:

The indexed URL result
The last crawl date
Whether crawling was allowed
Whether the page was fetched successfully
The user-declared canonical
The Google-selected canonical
The referring sitemap
The referring pages, when reported

Then run Test Live URL to check the current version.

The indexed result and live test answer different questions. The indexed result describes what Google knows from its stored version. The live test checks whether the current page appears accessible and potentially indexable now.

A successful live test does not prove that Google will index the page. It may simply show that a previous technical problem has been corrected since Google last crawled it.

Why Are Pages in the Sitemap Not Indexed?

The URL cannot be crawled or accessed

An intended indexable URL should normally:

Return a successful 200 HTTP response
Be accessible without authentication
Permit Googlebot crawling
Load its main content reliably
Avoid server errors and redirect loops

A sitemap cannot override a blocked, unavailable, or failing page.

Check for:

Robots.txt disallow rules
401, 403, 404, or 5xx responses
Redirect loops or excessive redirect chains
Intermittent hosting or firewall failures
Essential content or resources unavailable to Googlebot

Do not use robots.txt as an indexing-control substitute. Robots.txt primarily controls crawling. If Google cannot crawl a page, it may be unable to see other instructions placed on that page.

The page contains a noindex directive

A noindex meta robots tag or X-Robots-Tag response header tells Google not to index the resource.

If the page is intentionally excluded, remove it from the sitemap.

If the page should be indexed:

Remove the noindex directive.
Confirm the live page no longer contains it.
Check that Google can crawl the page.
Request indexing for a small number of important corrected URLs.
Monitor the Page Indexing report.

Submitting a noindex URL in a sitemap sends conflicting instructions: the sitemap presents the URL as a preferred page, while the page itself asks not to be indexed.

The URL redirects elsewhere

A redirected URL is not normally the version Google should index. The redirect destination is the indexing candidate.

Replace redirected sitemap entries with their final canonical destinations. Also update internal links so that they point directly to the final URLs rather than requiring search engines to follow old addresses.

The sitemap should describe the current website, not provide a guided tour of its former URLs.

Google considers another URL canonical

A sitemap provides a canonical signal, but it does not force Google to accept the submitted URL as canonical.

Google may select another URL when it finds:

Duplicate or substantially similar content
Conflicting canonical tags
Internal links pointing mainly to another version
Redirects favouring another URL
HTTP and HTTPS versions
Hostname or trailing-slash inconsistencies
Parameterized or filtered versions
Inconsistent hreflang and canonical signals

Inspect the submitted URL and compare the user-declared canonical with the Google-selected canonical.

If Google selected the correct alternative, remove the duplicate URL from the sitemap. If Google selected the wrong URL, investigate the combined signals rather than changing only the sitemap.

The canonical tag, internal links, redirects, sitemap inclusion, hreflang annotations, and page content should support the same preferred URL.

The page is a soft 404

A soft 404 usually returns a successful response while presenting content that resembles an error, empty result, unavailable item, or page with insufficient meaningful information.

Common examples include:

Empty category pages
Deleted products still returning 200
“No results found” pages
Thin location pages with almost no unique information
Templates whose main content failed to render

If the resource genuinely no longer exists, return an appropriate 404 or 410 response and remove it from the sitemap.

If the page should exist, restore or strengthen its primary content and verify what Google sees through the rendered live test.

The URL is Discovered – currently not indexed

This status means Google knows about the URL but has not crawled it yet.

For a few recently published pages, waiting may be reasonable. For a large or persistent group, check:

Whether the server is reliable
Whether the site generates excessive low-value URLs
Whether the pages are deeply buried
Whether important pages have contextual internal links
Whether the sitemap contains outdated or unsuitable URLs
Whether the affected pattern creates substantial duplication

The status does not automatically prove a content-quality problem. It does, however, justify reviewing how efficiently the site helps Google find and prioritize its important pages.

The URL is Crawled – currently not indexed

This status means Google crawled the page but did not index it at that time. Google may index it later, but simply resubmitting the same unchanged URL is not a meaningful fix.

Review the page alongside comparable indexed pages. Check:

Whether the content is substantially distinct
Whether another URL covers the same intent
Whether the page has a clear primary purpose
Whether the rendered main content is complete
Whether Google selected another canonical
Whether the page is internally linked from relevant indexed pages
Whether the issue affects an entire template

For a fuller explanation of overlapping causes, review the broader reasons why Google may not index a page.

Do not respond by adding arbitrary text until a tool score changes. Determine what the page contributes, whether that contribution is already available elsewhere, and whether the page deserves to exist as a separate search result.

The page has weak internal discovery

A sitemap can help Google discover a URL, but it does not replace internal linking or coherent site architecture.

An important page should normally be reachable through relevant navigation, category pages, hub pages, or contextual links. A URL that exists only in the sitemap may appear technically available while remaining disconnected from the rest of the website.

A page with no internal links exists, but the website behaves as though the two have never been introduced.

Add useful links from relevant indexed pages using descriptive anchor text. Avoid creating artificial sitewide links merely to increase link counts; the links should make sense to users and clarify the page’s place within the website.

Reasons pages listed in a successful XML sitemap can remain unindexed — A successful sitemap can still contain pages affected by crawl, indexing, canonical, content, or internal-linking problems.

Which URLs Should Be Removed From the Sitemap?

A well-maintained sitemap should primarily contain URLs that are:

Canonical
Intended for indexing
Accessible to Google
Returning a successful response
Current and useful
Part of the live website

Use the following guidance:

URL condition	Keep in sitemap?	Recommended action
Canonical 200 page intended for indexing	Yes	Keep and maintain
Page with `noindex`	No	Remove unless `noindex` was accidental
Redirected URL	No	Replace it with the final destination
Duplicate or alternate non-canonical URL	No	Include the preferred canonical version
404 or 410 URL	No	Remove from the sitemap
Soft 404	Usually no	Restore meaningful content or return the correct status
Server-error URL	Not while unresolved	Fix the server problem and verify stability
Staging, account, search, or utility page	Usually no	Exclude unless it has a genuine search purpose
Discovered or crawled but not indexed canonical page	Potentially	Keep if it is valuable and genuinely intended for indexing

Do not remove a valuable page merely because it has not been indexed yet. First determine whether the page is technically sound, distinct, useful, canonical, and properly linked.

Should You Fix, Update, Resubmit, or Wait?

The right action depends on the condition you find.

Fix the page when:

An important page is blocked accidentally
An unintended noindex directive exists
The page returns an incorrect response
The canonical points to the wrong URL
The main content does not render correctly
An important page is orphaned or buried
The content substantially duplicates another page unintentionally

Update the sitemap when:

It contains redirects, errors, noindex pages, or non-canonical URLs
New canonical pages have been added
Old URLs have been permanently removed
A sitemap index points to obsolete child sitemaps
Significant page updates require accurate <lastmod> values

Only update <lastmod> when the page has changed meaningfully. Automatically changing every modification date without a corresponding page update makes the field less useful.

Resubmit the sitemap when:

Google could not fetch it and the problem has been corrected
The sitemap has been substantially restructured
Large groups of URLs have been added, removed, or replaced
A major migration or URL change has occurred

Google periodically rereads successfully processed sitemaps. Routine page-level fixes do not necessarily require removing and resubmitting the same sitemap.

Use Request Indexing when:

A small number of important pages were added or substantially corrected
The live test confirms the current version is accessible
You need Google to revisit a specific URL

Request Indexing is a recrawl request, not an indexing command. Repeating it for the same unchanged URL does not create a faster queue, however satisfying that might feel.

Wait when:

The pages are new
The sitemap is clean
The URLs are accessible, indexable, canonical, and internally linked
No template-level problem is apparent
Google has not yet had reasonable time to recrawl the site

Waiting is appropriate only after the technical and editorial signals have been checked. “Wait and see” is less useful when hundreds of valuable URLs share the same preventable error.

Technical SEO workflow to check, fix, validate, and monitor unindexed sitemap URLs — Check the status, correct the underlying signal, validate the live URL, and then allow Google time to recrawl.

How Long Should You Wait After Submitting a Sitemap?

There is no fixed indexing deadline. Crawling and recrawling can take anywhere from several days to several weeks.

Evaluate progress using:

The sitemap’s last read date
The page’s last crawl date
Changes in the Page Indexing report
URL Inspection results
Server logs, where available
Whether the issue affects isolated pages or a large template group

A recently published page that is clean and internally linked may simply need time. A core commercial page that has remained unindexed for weeks deserves closer inspection, especially when comparable pages are indexed normally.

Avoid using daily site: searches as your primary monitoring method. Search Console provides more useful diagnostic information about the indexed URL, crawl history, canonical selection, and exclusion reason.

How Should You Prioritize a Site-Wide Sitemap Indexing Problem?

Do not begin with the largest URL count automatically. Begin with the pages whose absence creates the greatest business or search-visibility impact.

A practical priority order is:

Revenue-generating product, service, category, or lead-generation pages
Important editorial pages targeting established demand
Template-level problems affecting many valuable URLs
Newly published pages central to current campaigns
Low-value archives, parameters, utility pages, and obsolete URLs

Then consider:

Number of affected important URLs
Severity of the issue
Whether the cause is template-wide
Implementation effort
Risk of changing canonicals, redirects, or crawl controls
Ease of validating the correction

When the symptoms extend beyond the sitemap, a broader process may be needed to find and fix technical SEO issues across templates, internal links, directives, and canonical signals.

Common questions

Frequently Asked Questions

Does submitting an XML sitemap guarantee that Google will index the pages?

No. An XML sitemap helps Google discover important URLs, but it does not guarantee that Google will crawl or index every submitted page. Each URL must still be accessible, indexable, canonical, and suitable for inclusion in Google’s index.

Why does my sitemap say Success when the pages are not indexed?

Success means Google was able to fetch and process the sitemap file. It does not mean that every URL in the sitemap was crawled or indexed. Use the Page Indexing report and URL Inspection to investigate the status of individual pages.

Should I resubmit my sitemap after fixing unindexed pages?

Usually, you do not need to resubmit the sitemap after every page-level fix because Google periodically rereads submitted sitemaps. Resubmit it when the sitemap previously failed, its location changed, or its URL inventory was substantially updated.

How long does Google take to index pages after sitemap submission?

There is no fixed indexing timeframe. Google may recrawl a page within several days, but some pages can take several weeks or may not be indexed at all. The timing depends on factors such as crawl demand, site quality, internal linking, server reliability, and the page’s indexing signals.

Should noindex, redirected, or non-canonical URLs be included in a sitemap?

Generally, no. A sitemap should primarily contain accessible, canonical URLs that return a successful 200 response and are intended for indexing. Redirected, noindex, duplicate, error, and non-canonical URLs should normally be removed and replaced with their preferred canonical destinations where applicable.

Final Thoughts

A successfully submitted XML sitemap confirms that Google could process the file. It does not confirm that every URL inside the sitemap was crawled, selected as canonical, or indexed.

Start by identifying the exact status of the affected URLs. Separate intentional exclusions from genuine problems, inspect important examples, and correct conflicting crawlability, indexability, canonical, response, content, and internal-linking signals.

Keep the sitemap limited to current canonical URLs that are genuinely intended for indexing. Request indexing selectively after meaningful fixes, resubmit the sitemap when the file itself has changed substantially or previously failed, and otherwise allow time for recrawling.

When large groups of important pages remain unindexed or several signals conflict, Technical SEO audit and implementation support can help distinguish sitemap symptoms from the underlying site-wide cause.

Need a clear next step?

Turn the diagnosis into a prioritized fix plan.

I can investigate the underlying issue, separate high-impact problems from low-value warnings, and provide practical implementation and validation guidance.

Discuss Your Website