How to Do an SEO Content Audit
A content audit reviews every page on your site to decide whether to keep, improve, consolidate, or remove it. Here is the full process, step by step.
16 May 2026 · 8 min read
A content audit is a systematic review of every page on your site to decide whether to keep it, improve it, consolidate it with another page, or remove it entirely. It is not a one-off exercise: content accumulates over time, and pages that were useful two years ago may now be outdated, thin, or competing with each other for the same keywords.
The practical case for doing one is straightforward. Old, thin, or redundant pages drag down the overall quality signals of a site, waste crawl budget, and can suppress rankings across the whole domain. Google's Helpful Content system evaluates content quality at site level, not just page level. A high proportion of low-quality pages affects the standing of even your best content. A content audit identifies those pages and gives you a clear action plan.
How do you get a complete URL inventory?
The first step is knowing what is actually on your site, not just what you think is there. CMS platforms accumulate URLs from old category pages, tag archives, author pages, outdated blog posts, and thin landing pages that were never properly removed. You will not find all of these by browsing the site or exporting from your CMS.
Run a full site crawl. A crawler follows every internal link from your homepage outwards and returns every URL it finds, along with its status code, title, meta description, word count, and indexability status. This is your true inventory.
Once you have the crawl data, filter it down to the URLs that matter for the audit. You can typically exclude:
- URLs returning 4xx or 5xx status codes (handle these separately as technical issues)
- Pages already marked noindex or excluded by robots.txt
- Utility pages: login, checkout, cart, account pages
What remains is your working inventory: every indexable content page that Google can and does crawl.
What data do you need before making decisions?
A URL list alone is not enough. You need performance data for each page before you can make a sound keep, improve, consolidate, or remove decision.
Pull two data sources:
Google Search Console: export impressions, clicks, and average position per page for the last 12 months. This tells you whether a page is visible in search at all and whether it is generating any traffic. Filter to the page view in the Performance report and export everything.
Analytics (Google Analytics 4 or equivalent): sessions, engagement rate, and average engagement time per page. This tells you whether the traffic a page does receive is meaningful. A page with 200 monthly sessions and an average engagement time of 12 seconds is performing very differently from a page with the same session count and 3 minutes of engagement.
Join these datasets against your URL inventory. The result is a table where every indexable page has: status code, word count, indexability, impressions, clicks, average position, sessions, engagement rate. This table is what you make decisions from.
How do you classify each URL?
Every URL gets assigned one of four actions. Be decisive. The goal is to reduce the number of weak pages on the site, not to find reasons to keep everything.
Keep
Pages that are performing well or are strategically important. Performance means meaningful organic traffic, good rankings (positions 1-20 for a relevant query), or strong engagement signals. Strategic importance means a page that serves a clear business purpose even if organic traffic is low: a key service page, a resource linked from many other pages, a page with strong backlinks.
Keep pages are left alone unless a routine review flags an issue.
Improve
Pages that target a relevant topic but are not performing. These might have some impressions but low clicks, or rankings stuck in positions 15-40 that could be pushed into the top 10 with substantially better content. The topic is worth targeting; the current execution is not good enough.
Improving means genuinely rewriting the page, not adding a few paragraphs. Add original analysis, update outdated data, add structured sections that answer the full search intent, and make the page clearly better than the current top-ranking competitors.
Only classify a page as Improve if you intend to actually improve it. A page classified as Improve and left unchanged is still a weak page.
Consolidate
Pages that cover a topic already handled better by another page on the site, or thin pages whose content could be absorbed into a more comprehensive article. Consolidation means merging the weaker page into the stronger one and placing a 301 redirect from the old URL to the new one.
This is the right action when two articles target the same keyword, when a page is too thin to justify standing alone but contributes something worth keeping, or when you have a category of near-duplicate location or service pages.
Remove
Pages with no organic traffic, no rankings, no internal links worth preserving, no external links, and no strategic purpose. These pages add weight without adding value. Remove them: either return a 404 (if there is genuinely no sensible redirect destination) or redirect to the nearest relevant page.
The concern most people have about removing pages is that they might lose something. In practice, a page with zero impressions and zero clicks over 12 months is contributing nothing. Removing it is cleaner than leaving thin content in the index.
What do you look at when evaluating each page?
Five factors determine how a page is classified.
Word count. Not a proxy for quality, but a useful filter. A page with 150 words is almost always thin content. A page with 2,000 words is not necessarily good, but it is less likely to be trivially thin. Sort your inventory by word count ascending and start your review with the lowest-content pages.
Unique value. Does this page say something that is not said better elsewhere on your own site or on the top-ranking competitor pages? If the answer is no, the page is a consolidate or remove candidate regardless of word count.
Keyword targeting. Is the page targeting a real search query with meaningful volume? Check Search Console for the queries that trigger impressions on this page. A page generating impressions for a query with no competition and no volume is still a weak page.
Traffic trend. Look at the 12-month impression and click trend in Search Console. A page in decline despite no apparent reason is a warning signal. A page that was never indexed and has zero impressions across 12 months is a remove candidate.
Backlinks. Before removing or consolidating any page, check whether it has external links pointing to it. A page with five referring domains pointing to it is worth preserving, even if the content needs improving. If you consolidate it, ensure the 301 redirect is in place to pass link equity to the destination.
How do you consolidate two pages into one?
Consolidation is the most impactful action in most content audits. Done well, it replaces two underperforming pages with one stronger page that inherits all the ranking signals from both.
The process:
- Identify the destination page: the one that will absorb the other. Usually this is the page with better rankings, more backlinks, or a stronger URL structure.
- Review the source page for any content worth keeping: unique statistics, examples, or sections not covered by the destination.
- Rewrite the destination page to be comprehensive. Pull in what was worth keeping from the source. Make the combined article genuinely better than both originals.
- Once the new destination page is live, add a 301 redirect from the source URL to the destination URL. Do not remove the source URL without this redirect in place.
- Update any internal links that pointed to the source URL to point directly to the destination. Internal links to redirecting URLs are not broken, but they are cleaner and pass equity more directly when updated.
- If the source page had a canonical tag pointing elsewhere, remove or update it before the 301 redirect goes live.
Monitor the destination page in Search Console over the following four to eight weeks. Rankings should stabilise and, in many cases, improve as the consolidated signals from both pages concentrate on one URL.
How do you prioritise the work?
A content audit will surface more actions than you can tackle in a week. Prioritise in this order:
First: pages with some impressions but low clicks. These have search visibility and are close to performing. A title tag or content improvement can move them quickly. These are your fastest wins.
Second: consolidation candidates. Merging two thin articles into one stronger one often delivers ranking improvements in four to eight weeks. It also cleans up the site architecture.
Third: removals. Zero-traffic, zero-impression pages with no strategic value. Remove them in bulk. This is low-effort and immediately reduces the proportion of weak content on your domain.
Last: full rewrites of poor-performing pages on competitive topics. These take the most time and have the longest feedback loop.
How often should you run a content audit?
For most sites, once a year is the right cadence. Run one after any major content push to assess what is working before scaling further. Run one after a Google core update that meaningfully changed your rankings: core updates often reward or penalise specific types of content, and the audit will show which pages were affected and why.
Sites that publish aggressively (multiple articles per week) should review content on a rolling basis rather than waiting for an annual audit. Topical authority is built through depth and quality across a subject area. That requires knowing which existing content is weak before publishing more on the same topic.
Crawl budget is a secondary concern for most small sites, but for larger sites with thousands of indexed pages, reducing the volume of thin and low-value content directly improves how Googlebot allocates its crawl time. Removing or consolidating 200 weak pages frees crawl budget for the pages that are worth crawling.
For a deeper look at how page indexability affects your audit decisions, and how canonical tags factor into consolidation, see what is a canonical tag.
The first step of any content audit is a full site crawl. You cannot audit what you cannot see, and you cannot see everything without crawling. Crawly crawls your entire site, reports word count for every page, and flags thin and duplicate content automatically. Download it and run your first crawl before you open a spreadsheet.