By
Murat · Technical SEO Specialist
Publishing a page does not guarantee that it will appear in Google Search.
Before a page can rank, Google must first discover its URL, crawl it,
process its content, and decide to add it to the index. A problem at any
stage can prevent the page from appearing in search results.
Your page may be blocked from crawling, marked with a
noindex directive, redirected to another URL, treated as a
duplicate, or simply not considered useful enough to index. Google may
also know that the URL exists but have not crawled it yet.
The correct fix depends on the actual reason the page was excluded.
Repeatedly requesting indexing without diagnosing the cause is unlikely
to solve the problem.
This guide explains how to find out why Google is not indexing your pages
and what you can do about it.
How Do You Know Whether a Page Is Indexed?
The best place to check an individual URL is the URL Inspection tool in
Google Search Console.
Enter the exact URL you want to investigate. Google may report that:
- The URL is indexed
- The URL is unknown to Google
- The page has been discovered but not indexed
- The page has been crawled but not indexed
- Indexing is blocked by a
noindex directive
- Crawling is blocked by
robots.txt
- The URL redirects to another page
- Google selected a different canonical URL
- The page returned an error
For a broader view, use the Page Indexing report in Search Console. This
report groups URLs according to whether they are indexed and explains why
other URLs were excluded.
Not every excluded URL is a problem. Redirected pages, duplicate URL
variations, deleted pages, and deliberately excluded content may correctly
remain outside the index.
Focus on important pages that you genuinely want people to find through
Google, such as your service, product, category, location, and valuable
informational pages.
You can also perform a quick Google search using:
site:example.com/page-url/
However, a site: search is not a complete or definitive
indexing report. Use Search Console when diagnosing a specific page.
1. Google Has Not Discovered the Page
Google cannot crawl a page if it does not know that the URL exists.
Google mainly discovers new pages through links from pages it already
knows. It may also find URLs through XML sitemaps and other signals.
A new page may remain undiscovered when:
- No other page links to it
- It is missing from the XML sitemap
- It is buried deep within the site
- Important links are not technically crawlable
- The site is new and has few external or internal discovery signals
- The page is accessible only through a form or internal search function
If URL Inspection says the URL is unknown to Google, make sure it has at
least one relevant internal link from an indexed page.
For example, a new service page could be linked from:
- Your main services page
- Your homepage, when appropriate
- A relevant navigation menu
- Related blog articles
- A category or topic hub
- Breadcrumb navigation
The page should also be included in your XML sitemap if it is canonical,
indexable, and important enough to appear in search.
Do not rely only on the sitemap. A page that appears in the sitemap but has
no internal links may still look isolated from the rest of the website.
2. Google Cannot Crawl the Page
Google must be able to access a page before it can fully process and index
its content.
First, check whether the URL is blocked in your robots.txt
file. This file is usually available at:
https://example.com/robots.txt
A rule such as the following would prevent Googlebot from requesting pages
in the specified directory:
User-agent: *
Disallow: /services/
That would be a serious issue if your main commercial pages were stored
under /services/.
However, robots.txt should not be used as the primary method for
keeping a page out of Google. It controls crawling rather than guaranteeing
removal from the index. A blocked URL may still be known through links and
could potentially appear without a useful snippet.
Other crawl-access problems can include:
- Server downtime
- DNS or hosting errors
- Firewall restrictions
- Security tools blocking Googlebot
- Login or authentication requirements
- Excessive rate limiting
- Incorrect content delivery network settings
- Repeated server errors
- Broken URL structures
Use URL Inspection’s live test to check whether Google can access the
current version of the page. For larger websites, server logs and a
technical crawler can provide additional evidence.
3. The Page Does Not Return a Successful Status Code
Google’s basic technical requirements state that an indexable page should
work correctly and return a successful HTTP response.
A normal live page will usually return a 200 status code.
Other responses can explain why a URL is not indexed:
- 301 or 308: The URL permanently redirects elsewhere
- 302 or 307: The URL temporarily redirects elsewhere
- 404: The page was not found
- 410: The page was intentionally removed
- 500: The server encountered an internal error
- 503: The service is temporarily unavailable
A redirected URL is generally not expected to be indexed as a separate
page. Google will normally evaluate the destination URL instead.
A page can also be treated as a soft 404. This happens when the server
returns 200, but the page appears to contain no meaningful content or
communicates that the requested content does not exist.
Check the actual response code and make sure the page loads normally for
both users and Googlebot.
4. The Page Contains a Noindex Directive
A noindex directive explicitly tells supported search engines
not to include a page in search results.
It may appear in the page’s HTML:
<meta name="robots" content="noindex">
It can also be delivered through an X-Robots-Tag HTTP header.
Unintended noindex directives commonly appear after:
- A staging site is moved to the live domain
- A website is redesigned
- A CMS visibility setting is changed
- An SEO plugin is configured incorrectly
- A page template is copied
- Developers temporarily block indexing and forget to reverse the change
Inspect the page source, HTTP headers, CMS settings, and SEO plugin
configuration.
If the page should be indexed, remove the noindex directive and
confirm that Google can crawl the page.
This last point matters because Google must be allowed to crawl the page to
see that the noindex directive has been removed. Blocking the
same page in robots.txt may prevent Google from detecting the
updated indexing instruction.
5. The URL Redirects to Another Page
Google Search Console may classify a URL as Page with redirect.
This is usually expected when an old URL has been permanently redirected to
a newer or preferred page. Google generally indexes the destination rather
than the redirecting URL.
A redirect becomes a problem when:
- An important page redirects unintentionally
- The redirect points to an irrelevant destination
- The URL redirects back to itself
- Multiple redirects create a long chain
- HTTP and HTTPS rules conflict
- WWW and non-WWW rules conflict
- The mobile or desktop version redirects incorrectly
Test the URL and follow the complete redirect path.
For example:
Old URL → Temporary URL → Current URL
should usually be simplified to:
Old URL → Current URL
Also update internal links so they point directly to the final destination
rather than relying on unnecessary redirects.
6. Google Considers the Page a Duplicate
Google tries to select one representative URL when several pages contain
the same or very similar content. This selected URL is known as the
canonical version.
Your page may not be indexed separately when:
- It duplicates another page
- Its canonical tag points elsewhere
- Google chooses a different canonical
- URL parameters create multiple versions
- HTTP and HTTPS versions are both accessible
- WWW and non-WWW versions are not consolidated
- Multiple category paths generate the same page
- Location, product, or service pages have nearly identical content
Check the canonical tag in the page’s HTML:
<link rel="canonical" href="https://example.com/preferred-page/">
If this points to another URL, you are signalling that the other page is the
preferred version.
Google can also choose a different canonical from the one you declared when
it receives conflicting signals.
To strengthen your preferred URL:
- Use a self-referencing canonical where appropriate
- Link internally to the canonical URL
- Include only canonical URLs in the sitemap
- Redirect unnecessary duplicate versions
- Keep protocol and hostname versions consistent
- Avoid creating multiple pages for the same search intent
- Make pages meaningfully distinct when they require separate URLs
Duplicate pages do not all need to be indexed. The real question is whether
Google has selected the correct representative URL.
7. The Page Was Discovered but Is Currently Not Indexed
Discovered – currently not indexed generally means that Google
knows about the URL but has not yet crawled it.
This does not automatically mean that the page has a specific technical
error.
Possible contributing factors include:
- The site has a very large number of low-value URLs
- Google is crawling the site inefficiently
- The server has experienced availability problems
- The page has weak internal linking
- The site generates many duplicate or parameter-based URLs
- Google has not yet prioritized crawling the page
- The page is new
Review whether the page is linked from relevant indexed content and included
in a clean XML sitemap.
Also look at the wider website. If thousands of unnecessary filtered,
parameter, tag, archive, or internal-search URLs are being generated,
Google may spend time discovering URLs that have little search value.
For a normal small website, do not assume that “crawl budget” is always the
main problem. Start with page discoverability, site quality, internal links,
server reliability, and unnecessary URL generation.
8. The Page Was Crawled but Is Currently Not Indexed
Crawled – currently not indexed means Google visited the page but
did not add it to the index at that time.
The page may be indexed later, so this status is not necessarily permanent.
It also does not always indicate a purely technical problem.
Review the page critically:
- Does it provide original and useful information?
- Is it substantially similar to another page?
- Does it satisfy a clear search intent?
- Is it only a thin variation of a product, service, or location page?
- Is the main content complete?
- Does the page contain mostly boilerplate?
- Is it linked from other relevant pages?
- Does it belong in the index as an independent result?
For example, creating 50 location pages where only the city name changes may
not give Google a strong reason to index every version.
Similarly, empty category pages, placeholder pages, expired listings,
automatically generated archives, and very similar product variations may
offer limited independent value.
The solution is not to add words merely to make the page longer. Improve
the page’s purpose, originality, completeness, and usefulness. When several
pages target the same intent, consolidation may be more appropriate than
expanding each one.
9. Your XML Sitemap Contains the Wrong URLs
An XML sitemap helps Google discover URLs, but it does not guarantee that
those URLs will be crawled or indexed.
Your sitemap should generally contain URLs that are:
- Canonical
- Indexable
- Live
- Returning a successful response
- Important enough to appear in search
Avoid including:
- Redirected URLs
- Deleted pages
noindex pages
- Non-canonical duplicates
- Blocked URLs
- Staging URLs
- Internal search results
- Low-value parameter variations
When the sitemap contains mixed signals, it becomes less useful as a list of
your preferred pages.
Clean the sitemap, resubmit it in Search Console when necessary, and monitor
its processing status.
10. Important Pages Have Weak Internal Linking
Internal links help Google discover pages and understand how they relate to
the rest of your website.
A page with no internal links is commonly called an orphan page. Even when
it appears in the sitemap, its isolation may signal that it is not an
important part of the website.
Link to priority pages from relevant content using descriptive anchor text.
For example, instead of:
Learn more
a clearer link might be:
Learn more about our technical SEO audit service
Do not add excessive or unrelated links simply to influence indexing.
Internal links should create a useful path for both visitors and search
engines.
How to Fix and Resubmit an Unindexed Page
Use the following process:
- Inspect the exact URL in Google Search Console.
- Identify the reported indexing reason.
- Run a live URL test.
- Confirm that the page returns 200.
- Check
robots.txt.
- Check for meta robots and
X-Robots-Tag directives.
- Review the canonical tag.
- Confirm the page is internally linked.
- Verify that the sitemap contains the canonical URL.
- Assess whether the page is original, complete, and useful.
- Fix the underlying issue.
- Request indexing when appropriate.
- Monitor the URL and Page Indexing report.
Google says recrawling can take anywhere from several days to several weeks.
Requesting indexing does not guarantee immediate crawling or inclusion in
search results.
Do not repeatedly submit the same unchanged page. Fix the problem first,
validate the live URL, and then allow Google time to process the update.
Does Every Page Need to Be Indexed?
No.
Many websites contain URLs that should not appear as independent search
results, including:
- Shopping cart pages
- Account pages
- Internal search results
- Filter and sorting variations
- Duplicate print pages
- Tracking URLs
- Thank-you pages
- Staging pages
- Expired or deleted content
- Low-value archives
The goal is not to maximize the number of indexed pages. It is to ensure
that the right pages are indexed.
A healthy Page Indexing report can therefore include many legitimately
excluded URLs. Investigate exclusions based on page purpose and business
value, not simply because Search Console labels them as “Not indexed.”
Final Thoughts
When Google does not index a page, requesting indexing should not be your
first and only response.
Begin by identifying where the process failed:
- Has Google discovered the URL?
- Can Google crawl it?
- Does it return a successful status?
- Is indexing blocked?
- Does it redirect?
- Is another URL canonical?
- Is the page sufficiently useful and distinct?
- Can Google reach it through internal links?
Then fix the underlying cause and validate the result.
Google does not guarantee that every eligible page will be indexed.
However, clear discovery paths, reliable server access, correct indexing
directives, consistent canonical signals, useful content, and strong
internal linking give your important pages the best opportunity to appear
in search.
Need help diagnosing an indexing problem?
I can investigate the cause, prioritize the required fixes, and provide
clear implementation guidance.
Contact Murat