seo101

Content Pruning

Here's a hard truth: a 1,000-page site with 200 excellent articles and 800 thin ones isn't treated like a site with 200 excellent articles. It's treated like a low-quality site with occasional bright spots.

Google's quality systems evaluate signals at the page level and the site level. Thin content drags down the average. And since the Helpful Content updates of 2022–2024, that average matters more than it ever has - sites with large fractions of unhelpful content saw sitewide ranking suppression, not just individual page penalties.

Pruning isn't deletion for its own sake. It's quality control.

The content audit process#

Step 1: Export all indexed URLs#

Use Screaming Frog or GSC → Sitemaps. You want every URL Google is actually crawling and indexing - not just what your CMS lists as "published." The two lists are often different.

Step 2: Pull 12 months of GSC data per URL#

Export all pages with their performance metrics. Build a spreadsheet with:

ColumnWhat it reveals
ImpressionsIs Google even considering this page for results?
ClicksAre people choosing it?
CTRIs the title/meta compelling?
Avg. positionWhere does it rank?
Word countVolume (from crawl export)
Last modifiedWhen was it last touched?

Step 3: Sort into three buckets#

Working - Clicks > 10/month or impressions > 1,000 with position under 30. These are earning their keep. Leave them alone unless there's a specific reason to touch them.

Underperforming - Some impressions, few clicks, positions 20–50. These need a judgement call: can they be meaningfully improved? Are they duplicating a better page?

Dead weight - Zero clicks, near-zero impressions, or consistent positions > 50 for a full year. These are the first queue for pruning action.

The three decisions#

For every underperforming or dead-weight URL, you have three options:

Improve#

The page has a real keyword opportunity but hasn't delivered because the content is thin, outdated, or poorly structured.

Indicators: Position 15–40 with meaningful impressions, clear defined intent, obvious gap between your page and what's ranking above you.

Action: Rewrite substantially - add depth, update the data, restructure for clarity. Don't add 300 words of filler and call it done. The job is to become the best result for that query.

Consolidate#

Multiple pages cover the same topic without any of them being the authoritative resource. They're splitting relevance signals and competing against each other.

Indicators: Two or more URLs ranking for the same query cluster, high content similarity, none individually ranking on page one.

Action: Pick the strongest URL as the canonical destination. 301 redirect the others. Merge the best content from each into the winner. Update all internal links to point to the survivor.

Delete and redirect#

The page has no viable keyword opportunity and no user value. No amount of improvement will make it rankworthy - or worth reading.

Indicators: Targeting an irrelevant or zero-opportunity query, auto-generated tag/category pages with minimal content, outdated event or campaign pages with no evergreen relevance.

Options:

  • 301 to the most relevant live page - best for user experience and equity transfer
  • noindex - if the URL must stay accessible but shouldn't be indexed
  • Delete + 410 Gone - for content that served no purpose and where there's no natural redirect destination

The usual suspects#

Thin tag and category pages#

CMS auto-generated pages for tags or categories with only 2–3 posts beneath them. Noindex or consolidate.

Date-archive pages#

/2019/07/ archive pages almost never satisfy real search intent. Noindex or redirect to the corresponding category.

Duplicate location/service pages#

"We serve the [city] area" × 50 cities, all with the same copy. This is a known manual-action trigger. Either write genuinely unique content per location (local case studies, local team bios, local statistics) or collapse them into a single well-structured service area page.

Stale dated content#

"Best SEO Tools for 2018" still indexed in 2026 is an active negative quality signal. Update to the current year or redirect to your live equivalent.

"Me too" blog posts#

Posts written to cover a topic because a competitor covered it - not because your site adds anything different. These are the most uncomfortable to acknowledge and the most important to remove.

What to watch after pruning#

In the 30–90 days after a significant pruning operation:

  • GSC indexed pages count - should fall to match your intended footprint
  • Clicks and impressions for surviving pages - should improve as quality signals concentrate
  • Crawl budget (for larger sites) - Googlebot should spend proportionally more time on your money pages

Recovery from Helpful Content suppression is slow but real. The mechanism: removing low-quality pages raises your site's average quality score. Give it time and at least one core update cycle.

Next: Internal Linking - making sure the pages that survive the prune get the internal authority they deserve.