10 February 2026

Why JavaScript Pages are Crawled but Not Indexed

Hero image for 'Why JavaScript Pages are Crawled but Not Indexed.' Image by Vincent van Zalinge.

In Brief

Crawled does not mean indexed. Once Google can fetch a JavaScript page, the next question is whether the rendered result is clear, canonical, stable, useful, and worth storing. Rendering support helps, but it does not overcome weak content, confused canonicals, blocked resources, unstable metadata, poor internal links, or duplicate pages.

"Crawled but not indexed" is one of those Search Console statuses that sounds more helpful than it is. In the Page indexing report it appears as "Crawled - currently not indexed".

It tells you Google found and fetched the URL. It does not tell you whether Google found enough useful content, trusted the canonical, rendered the page successfully, discovered a near‑duplicate, hit a quality issue, or decided the page was not worth storing.

On JavaScript‑heavy sites, the diagnosis is often made too late. People assume that because Google can render JavaScript, the JavaScript is not the problem. That is the wrong conclusion. Rendering support is not the same as an indexing guarantee.

A page can be crawlable and still be weak. It can return 200 and still have no useful body content before rendering. It can render correctly for you and fail for crawlers because of blocked resources, timing, auth, errors, or unstable metadata. It can have content and still lose the canonical battle.

The job is to separate discovery, rendering, indexing, and ranking. They are related, but they are not the same failure.

First, Check What Google Can Fetch Without Your Browser

Start with the boring checks.

For the affected URL, confirm:

status code
final URL after redirects
canonical URL
robots meta tag
X‑Robots‑Tag header
robots.txt rules
hreflang where relevant
sitemap inclusion
internal links pointing to the page

Do this before opening React DevTools. A JavaScript site can still have a plain HTTP problem.

If a page is blocked by robots.txt, has noindex, points its canonical elsewhere, redirects unexpectedly, or only appears in a stale sitemap, indexing may fail before rendering becomes relevant.

Google's documentation on JavaScript SEO basics, robots.txt, and sitemaps is worth keeping close during this kind of work. It is easy to chase framework details whilst missing a directive that says "do not index this".
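The checks in this section can be scripted so that they run the same way for every affected URL. What follows is a minimal sketch in TypeScript, not a definitive audit tool. It assumes you have already captured the response yourself. `FetchSnapshot`, `collectIndexingFlags`, and the helper names are hypothetical, and the regex‑based tag parsing is a simplification that a real HTML parser would handle better. It does not cover robots.txt, hreflang, or sitemap inclusion.

```typescript
// Hypothetical shape for one captured response. Populate it from curl output,
// a crawler export, or your own fetch wrapper.
interface FetchSnapshot {
  requestedUrl: string;
  finalUrl: string; // URL after any redirects
  status: number;
  headers: Record<string, string>; // header names lower-cased
  html: string; // raw response body, before JavaScript runs
}

// Read the attributes of one tag, e.g. <meta name="robots" content="noindex">.
function parseAttributes(tag: string): Record<string, string> {
  const attributes: Record<string, string> = {};
  const pattern = /([a-zA-Z:-]+)\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s"'>]+))/g;
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(tag)) !== null) {
    const name = (match[1] ?? "").toLowerCase();
    attributes[name] = match[2] ?? match[3] ?? match[4] ?? "";
  }
  return attributes;
}

// Find every opening tag with the given name and return its attributes.
function findTags(html: string, tagName: string): Record<string, string>[] {
  const tags = html.match(new RegExp(`<${tagName}\\b[^>]*>`, "gi")) ?? [];
  return tags.map((tag) => parseAttributes(tag));
}

// "noindex" and "none" both tell crawlers not to index the page.
function blocksIndexing(directive: string): boolean {
  return directive
    .toLowerCase()
    .split(/[\s,:]+/)
    .some((token) => token === "noindex" || token === "none");
}

// Return a list of things to review before looking at rendering.
function collectIndexingFlags(snapshot: FetchSnapshot): string[] {
  const flags: string[] = [];

  if (snapshot.status !== 200) {
    flags.push(`status is ${snapshot.status}, not 200`);
  }
  if (snapshot.finalUrl !== snapshot.requestedUrl) {
    flags.push(`redirects to ${snapshot.finalUrl}`);
  }
  if (blocksIndexing(snapshot.headers["x-robots-tag"] ?? "")) {
    flags.push("X-Robots-Tag header blocks indexing");
  }

  for (const meta of findTags(snapshot.html, "meta")) {
    const name = (meta["name"] ?? "").toLowerCase();
    const isRobotsMeta = name === "robots" || name === "googlebot";
    if (isRobotsMeta && blocksIndexing(meta["content"] ?? "")) {
      flags.push(`meta ${name} blocks indexing`);
    }
  }

  const canonicals = findTags(snapshot.html, "link")
    .filter((link) => (link["rel"] ?? "").toLowerCase() === "canonical")
    .map((link) => link["href"] ?? "");
  if (canonicals.length === 0) {
    flags.push("no canonical in the raw HTML");
  } else if (canonicals.length > 1) {
    flags.push("more than one canonical tag");
  } else if (canonicals[0] !== snapshot.finalUrl) {
    flags.push(`canonical points to ${canonicals[0]}`);
  }

  return flags;
}
```

An empty result does not prove the page is indexable. It only means none of these plain HTTP and directive checks fired, so the next place to look is the rendered output.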

Compare Raw HTML with Rendered HTML

Raw HTML is what the crawler receives first. Rendered HTML is what exists after JavaScript has run.

Both matter.

If the raw HTML is a thin app shell, the page depends heavily on rendering. That might be acceptable for some pages, but it raises the cost of indexing. It also makes the page more fragile: a failed API request, blocked script, uncaught error, or slow third‑party dependency can leave the rendered output incomplete.

For each affected template, compare:

raw document title
rendered document title
meta description
canonical
h1
body copy
internal links
product, article, or service details
structured data
image alt text
pagination links

The page does not need every word in the first response, but it should not depend on a fragile browser‑only chain to expose its main meaning.
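One way to make the raw versus rendered comparison repeatable is to reduce each version to a small summary and diff the two. The sketch below assumes you already have both HTML strings, for example the raw response body and the rendered HTML copied from a URL inspection tool or a headless browser. `PageSummary`, `summarise`, and `compareRawAndRendered` are hypothetical names, and the regular expressions are a rough approximation of real HTML parsing.

```typescript
interface PageSummary {
  title: string;
  canonical: string;
  h1: string;
  wordCount: number; // words of visible body text
  linkCount: number; // anchors that carry an href attribute
}

// Return the first capture group of a pattern, with whitespace collapsed.
function firstMatch(html: string, pattern: RegExp): string {
  const match = pattern.exec(html);
  return (match?.[1] ?? "").replace(/\s+/g, " ").trim();
}

// Find the canonical href regardless of attribute order inside the tag.
function canonicalOf(html: string): string {
  for (const tag of html.match(/<link\b[^>]*>/gi) ?? []) {
    if (/rel\s*=\s*["']?canonical/i.test(tag)) {
      return /href\s*=\s*["']?([^"'\s>]+)/i.exec(tag)?.[1] ?? "";
    }
  }
  return "";
}

function summarise(html: string): PageSummary {
  // Drop the head and non-visible elements, then strip the remaining tags.
  const visibleText = html
    .replace(/<(head|script|style|noscript)\b[\s\S]*?<\/\1>/gi, " ")
    .replace(/<[^>]+>/g, " ");
  const words = visibleText.split(/\s+/).filter((word) => word.length > 0);

  return {
    title: firstMatch(html, /<title[^>]*>([\s\S]*?)<\/title>/i),
    canonical: canonicalOf(html),
    h1: firstMatch(html, /<h1[^>]*>([\s\S]*?)<\/h1>/i).replace(/<[^>]+>/g, ""),
    wordCount: words.length,
    linkCount: (html.match(/<a\b[^>]*\shref\s*=/gi) ?? []).length,
  };
}

// List every summary field that differs between the two versions.
function compareRawAndRendered(rawHtml: string, renderedHtml: string): string[] {
  const raw = summarise(rawHtml);
  const rendered = summarise(renderedHtml);
  const fields: (keyof PageSummary)[] = ["title", "canonical", "h1", "wordCount", "linkCount"];
  return fields
    .filter((field) => raw[field] !== rendered[field])
    .map((field) => `${field}: raw=${JSON.stringify(raw[field])} rendered=${JSON.stringify(rendered[field])}`);
}
```

A raw summary with a generic title, no canonical, no h1, and a word count near zero is the thin app shell described above. Differences are not automatically a problem, but each one is something the page depends on rendering to supply.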

This is why optimising HTML markup for SEO is still relevant on modern React and Next.js sites. Better rendering does not remove the need for coherent document structure.

Look for Metadata That Changes After Load

Client‑side metadata is a common cause of confusion.

A page might show the right title in your browser tab after the app has loaded, but the crawler may initially see a generic title. The same can happen with meta descriptions, canonicals, Open Graph tags, and structured data.

Check whether metadata is:

present in server‑rendered output
route‑specific
stable before and after hydration
consistent with the canonical URL
consistent with visible content
free from staging or preview values

React SPAs often rely on libraries to patch metadata during route changes. That can work, but it is easier to get wrong than a server‑rendered metadata model. If a page matters for search, metadata should not be treated as a late client‑side side effect.
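The checklist above can be expressed as a small comparison between the metadata in the server response and the metadata after the app has hydrated. This is a sketch under stated assumptions: you collect both sets of values yourself, `RouteMetadata` and `findMetadataProblems` are hypothetical names, and the non‑production markers are placeholders to replace with whatever your own environments use.

```typescript
interface RouteMetadata {
  url: string; // the URL the page is served from
  title: string;
  description: string;
  canonical: string;
}

// Hypothetical markers for non-production hosts. Adjust to your own setup.
const NON_PRODUCTION_HINTS = ["staging.", "preview.", "localhost"];

function findMetadataProblems(
  serverRendered: RouteMetadata,
  afterHydration: RouteMetadata,
  genericTitle: string, // the fallback title the app shell ships with
): string[] {
  const problems: string[] = [];

  if (serverRendered.title === genericTitle) {
    problems.push("server-rendered title is the generic app title");
  }

  // Metadata should be the same before and after hydration.
  (["title", "description", "canonical"] as const).forEach((field) => {
    if (serverRendered[field] !== afterHydration[field]) {
      problems.push(`${field} changes after hydration`);
    }
  });

  // The canonical should be absolute and on the same host as the page.
  try {
    const canonicalHost = new URL(serverRendered.canonical).hostname;
    const pageHost = new URL(serverRendered.url).hostname;
    if (canonicalHost !== pageHost) {
      problems.push(`canonical host ${canonicalHost} differs from page host ${pageHost}`);
    }
  } catch {
    problems.push("canonical or page URL is not an absolute URL");
  }

  const canonical = serverRendered.canonical.toLowerCase();
  if (NON_PRODUCTION_HINTS.some((hint) => canonical.includes(hint))) {
    problems.push("canonical looks like a staging or preview URL");
  }

  return problems;
}
```

The function checks the server‑rendered values on purpose, since those are the ones available before any client‑side patching has run.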

Check the Canonical Decision

Many "not indexed" problems are really canonical problems.

Google may crawl the URL, inspect it, and decide that another URL is the better representative. That may be right. It may also be a signal that your site is creating duplicate or near‑duplicate pages.

Check for:

canonical pointing to the wrong environment
canonical pointing to the homepage or category page
multiple URLs with the same content
filtered pages without a crawl strategy
uppercase and lowercase variants
trailing slash differences
query parameters that create duplicates
pages with identical titles and descriptions

If a JavaScript app builds page state from URL parameters, it can accidentally create many crawlable variants. Search engines then have to decide which one, if any, deserves to be indexed.

Canonicalisation is not a magic fix for weak content. It is a signal. If the signal conflicts with internal links, redirects, sitemap entries, or visible content, do not expect it to solve the problem alone.
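To see how many crawlable variants a site is producing, it can help to normalise a list of URLs from a crawl or log export and group the ones that collapse to the same address. The sketch below is illustrative only. It assumes the site treats paths as case‑insensitive and ignores trailing slashes, which is a property of your routing rather than of URLs in general. The `IGNORED_PARAMS` list and the function names are hypothetical.

```typescript
// Hypothetical list of parameters that do not change page content.
const IGNORED_PARAMS = new Set(["utm_source", "utm_medium", "utm_campaign", "ref"]);

function normaliseUrl(input: string): string {
  const url = new URL(input);
  url.hash = "";

  // Assumption: /Blog/Post/ and /blog/post serve the same content on this site.
  url.pathname = url.pathname.toLowerCase().replace(/\/+$/, "") || "/";

  // Drop ignored parameters and sort the rest so order does not matter.
  const kept: [string, string][] = [];
  url.searchParams.forEach((value, name) => {
    if (!IGNORED_PARAMS.has(name.toLowerCase())) {
      kept.push([name, value]);
    }
  });
  kept.sort(([a], [b]) => a.localeCompare(b));
  url.search = new URLSearchParams(kept).toString();

  return url.toString();
}

// Group URLs by normalised form and keep only groups with several variants.
function findDuplicateClusters(urls: string[]): Map<string, string[]> {
  const groups = new Map<string, string[]>();
  for (const url of urls) {
    const key = normaliseUrl(url);
    groups.set(key, [...(groups.get(key) ?? []), url]);
  }

  const clusters = new Map<string, string[]>();
  groups.forEach((variants, key) => {
    if (variants.length > 1) {
      clusters.set(key, variants);
    }
  });
  return clusters;
}
```

Each cluster is a set of URLs that search engines may have to choose between. Whether the right answer is a redirect, a canonical, or removing the links that generate the variants depends on the site.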

Internal Links Decide Whether the Page Looks Important

Sitemaps help discovery. Internal links explain importance and context.

If an important page is only reachable through a search form, filter interaction, infinite scroll, client‑side state, or a button with no real href, crawlers may not treat it as part of the main site structure.

Review:

top navigation
breadcrumbs
category links
related articles
product or service cross‑links
footer links
pagination
HTML anchors generated before hydration

Anchor text matters as well. "Read more" repeated 200 times is less useful than links that describe the destination. This is not just an accessibility concern. It helps the page estate describe itself.
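A quick way to review links at template level is to scan the raw HTML for anchors that have no usable href or that use generic anchor text. This is a rough sketch, assuming the HTML string comes from the server response before hydration. `auditAnchors`, `LinkFinding`, and the `GENERIC_ANCHORS` list are hypothetical, and the regex matching will miss edge cases that a proper HTML parser would catch. It cannot see buttons or divs that navigate through click handlers, so those still need a manual look.

```typescript
interface LinkFinding {
  href: string;
  text: string;
  problem: "no crawlable href" | "generic anchor text";
}

// Hypothetical list of anchor texts that say nothing about the destination.
const GENERIC_ANCHORS = new Set(["read more", "click here", "learn more", "more", "here"]);

function auditAnchors(html: string): LinkFinding[] {
  const findings: LinkFinding[] = [];
  const anchorPattern = /<a\b([^>]*)>([\s\S]*?)<\/a>/gi;
  let match: RegExpExecArray | null;

  while ((match = anchorPattern.exec(html)) !== null) {
    const attributes = match[1] ?? "";
    // Strip nested tags and collapse whitespace to get the visible text.
    const text = (match[2] ?? "").replace(/<[^>]+>/g, " ").replace(/\s+/g, " ").trim();

    const hrefMatch = /(?:^|\s)href\s*=\s*(?:"([^"]*)"|'([^']*)')/i.exec(attributes);
    const href = (hrefMatch?.[1] ?? hrefMatch?.[2] ?? "").trim();

    const hasCrawlableHref =
      href !== "" && href !== "#" && !href.toLowerCase().startsWith("javascript:");

    if (!hasCrawlableHref) {
      findings.push({ href, text, problem: "no crawlable href" });
    } else if (GENERIC_ANCHORS.has(text.toLowerCase())) {
      findings.push({ href, text, problem: "generic anchor text" });
    }
  }

  return findings;
}
```

Running this against the raw HTML rather than the rendered DOM is deliberate, since it shows which links exist before any client‑side code has run.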

If the article supports a commercial service, connect it naturally to the relevant service page. For example, pages affected by rendering or indexation issues should point readers towards technical SEO for JavaScript applications or JavaScript SEO rendering and indexing fixes where the service genuinely fits.

Rendering Success Does Not Fix Thin Content

Sometimes JavaScript is blamed because it is visible and technical. The real issue is content quality.

A page can render correctly and still be ignored because it is too similar to other pages, too thin, not internally supported, or not clearly useful. That is especially common with location pages, tag pages, filtered commerce pages, and programmatically generated templates.

Ask:

Does this page answer a distinct query?
Is the content meaningfully different from adjacent pages?
Would a user be satisfied if this URL appeared in search?
Does the page have enough internal support?
Does it link to useful next steps?
Is the title aligned with the content, not just the template?

The technical work gives the page a fair chance. It does not create substance for it.
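For templated pages, a rough similarity score can make "too similar to adjacent pages" less of a gut feeling. The sketch below compares two blocks of body text using Jaccard similarity over three‑word shingles. This is a common near‑duplicate heuristic and is offered only as a triage aid. It is not a description of how any search engine measures duplication, the function names are hypothetical, and the tokeniser only handles unaccented Latin letters and digits.

```typescript
// Break text into overlapping runs of `size` consecutive words.
function shingles(text: string, size = 3): Set<string> {
  const words = text
    .toLowerCase()
    .replace(/[^a-z0-9\s]/g, " ")
    .split(/\s+/)
    .filter((word) => word.length > 0);

  const result = new Set<string>();
  for (let i = 0; i + size <= words.length; i++) {
    result.add(words.slice(i, i + size).join(" "));
  }
  return result;
}

// Jaccard similarity: shared shingles divided by all distinct shingles.
// Returns a value between 0 (nothing shared) and 1 (identical shingle sets).
function jaccardSimilarity(a: string, b: string): number {
  const left = shingles(a);
  const right = shingles(b);

  let shared = 0;
  left.forEach((shingle) => {
    if (right.has(shingle)) {
      shared++;
    }
  });

  const union = left.size + right.size - shared;
  return union === 0 ? 0 : shared / union;
}
```

On real pages, compare the main body copy rather than the full template, otherwise shared navigation and footer text will inflate every score. Two location pages that differ only in the place name will score high, which is a prompt to ask the questions above rather than a verdict.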

A Practical Debugging Order

When a JavaScript page is crawled but not indexed, I would usually work through this order:

1. Confirm the URL is meant to be indexed.
2. Check status code, redirects, robots, noindex, and canonical.
3. Check sitemap and internal links.
4. Compare raw HTML and rendered HTML.
5. Inspect server logs or crawl data if available.
6. Check browser console errors and failed network requests.
7. Compare affected and indexed templates.
8. Review duplicate content and canonical clusters.
9. Strengthen the page content, links, and metadata if the technical surface is sound.
10. Request validation only after the root cause is fixed.

That order avoids one of the most common mistakes: asking Google to reprocess a page that has not actually changed.

Wrapping Up

"Crawled but not indexed" is not a single diagnosis.

On JavaScript sites, it usually means you need to check the whole path from discovery to rendered meaning: HTTP response, directives, canonical, raw HTML, rendered HTML, internal links, and page quality.

Google can render JavaScript, but your job is not to make Google work harder than necessary. Important pages should expose their purpose clearly, link into the site properly, and avoid fragile client‑side dependencies for core content and metadata.

Key Takeaways

Crawling, rendering, indexing, and ranking are different stages.
Check directives and canonicals before blaming JavaScript.
Compare raw and rendered HTML for affected templates.
Make important metadata stable before hydration.
Use crawlable internal links with descriptive anchor text.
Do not expect a technically valid page to index if the content is thin or duplicated.

