Free Backlink CheckerFree Backlink Checker
Crawly
All articles
Technical SEOContent

What is Thin Content and Why Does It Matter?

Thin content fails to satisfy search intent regardless of word count. Here is what Google considers thin, what to do about it, and how to find it.

16 May 2026 · 6 min read

Thin content is content that fails to satisfy search intent. Word count is not the definition. A 2,000-word page stuffed with boilerplate and padding can be thinner than a 400-word page that gives the user exactly what they came for. Google's measure of thin content is functional: does this page deliver genuine value to the person who searched for it?

The practical consequence of thin content is poor or absent rankings. Google may choose to ignore thin pages, rank them far below competitors who cover the topic properly, or remove them from the index entirely.

What types of content does Google consider thin?

Auto-generated pages

Pages produced at scale by software without meaningful human input: templated location pages that swap city names into identical copy, auto-generated product descriptions pulled from a database without editorial review, or pages created by crawling and republishing data from other sources. Google's systems are effective at identifying content that follows a mechanical template across hundreds or thousands of URLs.

Affiliate pages with no added value

Affiliate content that reproduces manufacturer descriptions, specification tables from the retailer's feed, and generic marketing copy adds nothing to what already exists across dozens of other affiliate sites. Google's guidance on affiliate content is explicit: pages that function only as thin fronts for other sites' content, with no original analysis, user reviews, or editorial perspective, are candidates for ranking suppression.

Doorway pages

Pages created specifically to rank for a keyword variation rather than to serve a user need. A local services site that builds separate pages for "plumber in Manchester", "plumber in Salford", and "plumber in Stockport", each with near-identical content except the city name, is producing doorway pages. These are a form of thin content and a violation of Google's spam policies when done at scale.

Scraped content

Content copied or lightly reworded from other sites. Google's systems compare content across the web. Pages that reproduce content found elsewhere without significant original contribution are ranked accordingly.

Boilerplate pages

Pages where the unique content on the page is minimal relative to the template: navigation, headers, footers, sidebars, legal disclaimers. Tag archive pages, author index pages, and internal search result pages often fall into this category. These are not necessarily thin in terms of intent, but they do not warrant indexing in most cases.

What does thin content look like in ecommerce?

Ecommerce sites are particularly exposed to thin content problems.

Category pages with one or two products have almost no unique content: a page title, a brief description, and one product listing. These pages rarely satisfy search intent and rarely rank. The practical fix is either to consolidate them into a parent category with a canonical tag or to noindex them until the category grows.

Product pages using manufacturer descriptions are near-duplicates of every other retailer stocking the same product. The manufacturer description appears verbatim on the brand's own site, on every distributor, and on competing retailers. Differentiation requires original content: user reviews, editorial commentary, comparison context, or detailed application guidance written by your team.

Filtered and faceted category pages created by size, colour, material, or price filters often produce hundreds of near-duplicate pages with minimal unique content. Proper handling via canonical tags or noindex is essential for large ecommerce catalogues, and directly affects crawl budget.

What was Google Panda?

Google Panda was an algorithm update first released in 2011, designed to reduce the rankings of low-quality, thin, and duplicate content. It had an immediate and severe impact on content farms and sites that had scaled content production without regard for quality.

Panda was eventually rolled into Google's core algorithm and is no longer a named, periodic update. The content quality signals it introduced are now part of how Google evaluates every page on every crawl. The lesson from Panda is not historical: it is that content quality signals are baked into the ranking system at a fundamental level, and thin content carries a persistent ranking disadvantage regardless of other signals.

Google's Helpful Content system, introduced in 2022 and absorbed into core updates by 2024, extended this further. It evaluates content at site level, not just page level. A high proportion of thin or unhelpful content on a site can suppress the rankings of even the well-written pages elsewhere on the same domain.

How do you fix thin content?

The decision comes down to three options: expand the content, consolidate the URL, or remove it from the index.

Expand the content

If the page has genuine ranking potential and serves a real user need, expand it. Add original analysis, specific examples, data, user perspectives, or structured guidance the user cannot get from a quick skim of a competitor page. A thin product page becomes useful with a detailed specification breakdown, a "who is this for" section, and honest editorial commentary. A thin location page becomes useful with specific local information, real imagery, customer references from that area, and practical access information.

Expansion is only the right answer if the page can credibly satisfy the intent behind the keyword. If it cannot, consolidation or noindex is more honest.

Consolidate with a canonical or 301 redirect

If a thin page is a near-duplicate of a stronger page, point a canonical tag at the stronger page, or redirect it with a 301. This concentrates the ranking signals onto the page most likely to rank. Use this approach for parameter variants, near-identical category pages, and ecommerce facets that do not merit standalone indexing.

Noindex

If the page needs to exist for users (an internal search result, a thank-you page, a filtered view you cannot redirect cleanly) but has no value in search results, use a noindex tag. This keeps the page accessible but removes it from Google's index and focuses crawl budget on the pages that matter.

When should you noindex rather than improve?

This is the practical question most site owners struggle with. A useful framework:

Scenario Recommended action
Page has backlinks or ranking history Improve content first, then reassess
Page serves users but not search Noindex
Page is a near-duplicate of a better page Canonical or 301 redirect
Page exists only as a technical artefact Noindex or remove entirely
Page has real keyword potential but is currently thin Expand content
Page is a filtered or faceted variant Noindex or canonical to clean URL

The key question is whether the page can realistically serve a search query better than the alternatives. If the answer is no, noindex is cleaner than leaving thin content in the index dragging down the rest of the site.

How do you identify thin content across your site?

Manual review does not scale. You need a crawler that reports on every page.

Crawly's desktop app crawls your entire site and reports word count for every page, giving you a ranked view of your lowest-content URLs. Pair this with indexability data to identify which thin pages are currently indexed, and you have a clear action list: expand, consolidate, or noindex. Filter by page indexability status to prioritise the pages that are indexed and ranking poorly over those that are already excluded.


Thin content is not about meeting a word count target. It is about whether each page on your site earns its place in Google's index by genuinely serving the person who lands on it. Download Crawly to find candidate thin pages across your site and prioritise which ones to fix first.

Try it yourself with Crawly

Free to download. No page cap. Claude Code MCP built in.

Download free