Technical SEO for React SaaS: Sitemaps, Structured Data & Core Web Vitals • RiseGravity

Why React apps lose SEO they didn't have to

React makes great products and, by default, mediocre SEO. A standard client-side-rendered SPA ships an almost-empty HTML shell and fills it in with JavaScript. Search engines can render JavaScript, but they do it on a budget and a delay—so anything that depends on JS to show your title, description, or content is a coin flip on whether it gets indexed well.

The good news: technical SEO is one of the highest-ROI things you can fix, and almost all of it is engineering, not guesswork. This is the checklist we apply to React and SaaS sites—the same techniques behind this very site (a React 19 + Vite build with a generated sitemap, native metadata, and JSON-LD).

Key takeaways

Rendering strategy is the foundation—make sure crawlers get real HTML.
Every indexable page needs a unique title, description, and canonical.
Structured data (JSON-LD) earns rich results and clarifies your content.
A correct sitemap + robots.txt tells crawlers exactly what to index.
Core Web Vitals are a ranking factor—and a conversion factor.

Get the rendering strategy right

This is the decision everything else rests on. The question: when a crawler requests a page, does it get your content in the HTML, or an empty shell?

Client-side rendering (CSR) — the SPA default. Worst case for SEO; content depends on JS execution. Fine for app screens behind login, risky for marketing/content pages.
Static site generation (SSG) / prerendering — HTML is generated at build time. Excellent for SEO and speed; ideal for marketing sites, blogs, docs, and project pages whose content is known at build. (This site prerenders content authored as Markdown.)
Server-side rendering (SSR) — HTML is rendered per request. Best when content is dynamic and personalized but must still be indexable.

The rule of thumb: public, indexable pages should ship real HTML via SSG or SSR. Reserve pure CSR for authenticated app surfaces that don't need to rank. If you're on a Vite SPA, prerender your public routes; if you're on Next.js, choose SSG/SSR per route.

Metadata: unique, accurate, per page

Every indexable page needs its own metadata. Duplicated or missing titles are one of the most common—and most fixable—SEO problems.

Title (~50–60 chars): primary keyword first, brand last.
Meta description (~150–160 chars): a compelling summary; it's your search-result ad copy. It doesn't directly rank, but it drives click-through, which does.
Open Graph + Twitter cards: control how links look when shared (title, description, image).

React 19 hoists <title> and <meta> rendered anywhere in your component tree into <head>, so you can colocate SEO with the page—no helper library required:

// React 19 hoists these into <head> automatically
function ArticleHead({ title, description, image, url }) {
  return (
    <>
      <title>{title}</title>
      <meta name="description" content={description} />
      <link rel="canonical" href={url} />
      <meta property="og:title" content={title} />
      <meta property="og:description" content={description} />
      <meta property="og:image" content={image} />
    </>
  );
}

Canonical URLs: end duplicate-content ambiguity

Search engines treat example.com/pricing, example.com/pricing/, example.com/pricing?ref=twitter, and the www variant as potentially different URLs. A <link rel="canonical"> tells them which one is authoritative, consolidating ranking signals instead of splitting them.

Set a self-referencing canonical on every indexable page (absolute URL).
Pick one host (www vs apex) and one trailing-slash convention—then redirect the rest with 301s.
Strip tracking params from the canonical so campaign links don't fragment your authority.

Structured data with JSON-LD

Structured data is machine-readable markup (schema.org) that tells search engines exactly what a page is—an article, a product, an organization, an FAQ. It powers rich results (star ratings, FAQ accordions, article cards) and improves how your content is understood. JSON-LD in a script tag is Google's preferred format.

Match the schema type to the page:

Organization on the homepage (name, logo, social profiles, contact).
BlogPosting / Article on posts (headline, datePublished, author, image).
Product / Offer on product pages (price, availability, rating).
BreadcrumbList for navigation context.
FAQPage for FAQ sections—frequently rewarded with expandable rich results (which is one reason the posts on this blog end with an FAQ).

const jsonLd = {
  "@context": "https://schema.org",
  "@type": "BlogPosting",
  headline: title,
  datePublished: date,
  author: { "@type": "Organization", name: "RiseGravity" },
  image,
};
// React 19 hoists this script into the document head
return <script type="application/ld+json">{JSON.stringify(jsonLd)}</script>;

Validate with Google's Rich Results Test before you ship—malformed schema earns nothing.

Sitemaps and robots.txt: tell crawlers what matters

A sitemap is the index of pages you want crawled, with a lastmod so search engines know what changed. For content-driven sites, generate it from the same source as your content so it never drifts out of sync. On this site, a post-build script reads the Markdown frontmatter (slug + date) and emits sitemap.xml—add a post, rebuild, and it's in the sitemap automatically, no manual list to maintain.

// Generate sitemap entries from content frontmatter at build time
const urls = posts.map((p) => ({
  loc: `${BASE_URL}/blog/${p.slug}`,
  lastmod: p.date,
}));

robots.txt complements it: allow what should be crawled, disallow private/duplicate areas (app dashboards, internal search results), and point to the sitemap:

User-agent: *
Allow: /
Disallow: /app/
Sitemap: https://example.com/sitemap.xml

Then submit the sitemap in Google Search Console and watch the Coverage report for pages that are excluded—it's the fastest way to find indexing problems.

Core Web Vitals: speed is a ranking signal

Google uses Core Web Vitals—real-world measures of speed and stability—as a ranking factor, and they correlate strongly with conversion. The three to track:

Metric	Measures	Good
LCP (Largest Contentful Paint)	Loading — when the main content appears	< 2.5 s
INP (Interaction to Next Paint)	Responsiveness — input lag	< 200 ms
CLS (Cumulative Layout Shift)	Visual stability — unexpected movement	< 0.1

High-leverage fixes that map directly to React/Vite builds:

Code-split and lazy-load routes and heavy components so the initial bundle is small (better LCP and INP).
Reserve space for images and embeds (set width/height or aspect-ratio) to kill layout shift (CLS).
Serve modern image formats (WebP/AVIF), lazy-load below-the-fold images, and size them responsively.
Preload the LCP asset (hero image or font) and avoid render-blocking resources.
Keep main-thread work light—defer non-critical JS so interactions stay snappy (INP).

Measure with Lighthouse and, more importantly, with field data (PageSpeed Insights / Search Console), because lab scores and real-user experience diverge.

A technical SEO checklist for your next React build

Public pages render real HTML (SSG/prerender or SSR), not an empty shell.
Unique title + meta description on every indexable page.
Self-referencing canonical; one host + slash convention enforced with 301s.
Open Graph / Twitter tags with a share image.
JSON-LD matched to each page type (Organization, Article, Product, FAQ…).
Sitemap generated from your content source; lastmod accurate.
robots.txt allows the right paths, disallows private ones, links the sitemap.
Submitted to Search Console; Coverage report monitored.
Core Web Vitals in the green (LCP, INP, CLS), verified with field data.
Semantic HTML and image alt text—good for accessibility and SEO.

Common React SEO mistakes (and how to catch them)

Most React SEO problems aren't exotic—they're the same handful of issues, shipped over and over. Knowing them turns audits from guesswork into a checklist.

Routing without real URLs. Anchor-based or hash routing (/#/pricing) and click handlers that swap content without changing the URL leave crawlers with one page. Use real, server-resolvable paths (the History API) so every view has a crawlable, shareable, canonical URL.
Content gated behind interaction. If content only appears after a click, scroll, or tab change, assume it won't be indexed. Render important content in the initial markup; reserve lazy reveal for genuinely secondary material.
Duplicate or templated metadata. Shipping the same <title> and description on every route is one of the most common—and most damaging—mistakes. Generate metadata per page from the page's own data.
Broken or missing canonicals. No canonical (or a canonical pointing at the wrong host/variant) splits ranking signals across duplicate URLs. Set a correct, absolute, self-referencing canonical everywhere.
Soft 404s. A React app that renders "not found" content with a 200 OK status confuses crawlers into indexing empty pages. Return a real 404 (or 410) status for missing content, and 301 for moved content.
JavaScript-only links. Crawlers follow <a href>, not onClick navigation. Use real anchor tags so your internal link graph is discoverable.
Images without dimensions or alt text. Missing dimensions cause layout shift (hurting CLS); missing alt hurts accessibility and image search.

How to catch them: crawl your own site with a tool like Screaming Frog (it sees what a bot sees), check rendered HTML with Google's URL Inspection tool in Search Console, and watch the Coverage and Core Web Vitals reports over time. The gap between "what my browser shows" and "what the crawler renders" is where React SEO bugs live.

Frequently asked questions

Can Google index a client-side React app? It can render JavaScript, but on a delayed, limited budget—so CSR-only public pages risk slow, incomplete, or missed indexing. For anything that must rank, ship real HTML via static generation/prerendering or SSR, and keep pure CSR for authenticated app screens.

Do I need a library like react-helmet for meta tags? Not on React 19. It hoists <title>, <meta>, <link>, and JSON-LD <script> rendered anywhere in your component tree into the document head, so you can colocate SEO tags with each page without a helper.

Does structured data improve rankings? Indirectly. It rarely boosts ranking on its own, but it makes content eligible for rich results (FAQ accordions, article cards, star ratings) that raise click-through, and it helps search engines understand your pages—both of which help performance.

How do I keep my sitemap from going stale? Generate it at build time from the same source as your content (frontmatter, CMS, or database) rather than maintaining a hand-edited list. Add the lastmod date from the content itself so crawlers see accurate change signals.

Build it fast, build it to be found

Technical SEO and performance are engineering problems with engineering solutions—and on a well-built React site they're mostly a matter of doing the foundations right once. If you want a marketing site or SaaS that's fast, accessible, and built to rank from day one, see our Projects, read Design Systems That Scale, or reach out at contact@risegravity.com.