Sisällön karsiminen: Kun vähemmän sisältöä tarkoittaa enemmän liikennettä
Publishing more content doesn't always mean more traffic. After years of blog posts, landing pages, and campaign-specific URLs, most sites accumulate a significant amount of content that actively hurts their SEO performance. Thin pages dilute topical authority. Duplicate topics cannibalize each other's rankings. Outdated content sends negative quality signals. And all of it consumes crawl budget that should be spent on your best content.
The evidence is consistent: strategic content pruning — removing, consolidating, or updating underperforming content — produces measurable traffic increases. HubSpot famously reported a 25% increase in organic traffic after removing 3,000 blog posts. Ahrefs documented a similar pattern across hundreds of sites: reducing page count by 30-50% while maintaining content quality correlated with a 20-40% traffic increase within 3-6 months.
Here's the methodology we use at Empirium for every content audit.
The Content Bloat Problem
Content bloat is the slow accumulation of pages that provide negative or zero SEO value. It happens naturally over time:
Keyword cannibalization: You wrote "Best CRM for Small Business" in 2022 and "Top CRM Tools for Small Teams" in 2024. Both target the same intent. Google sees two pages competing for the same query and can't decide which to rank. Result: neither ranks well.
Thin content: Short posts from the early days of your blog — 300-word articles that barely scratch the surface of a topic. Google's quality standards have risen dramatically since these were published. They drag down your site's overall quality score.
Outdated information: Articles referencing discontinued products, old pricing, deprecated APIs, or outdated statistics. These don't just fail to help users — they actively mislead them, which Google's algorithms detect through engagement signals.
Duplicate or near-duplicate pages: Parameter URLs, tag archives, author pages, and year-based archives that create dozens of thin pages with overlapping content.
How Bloat Hurts Rankings
- Authority dilution: Your site's accumulated authority gets divided among more pages. Remove low-value pages, and the remaining pages receive a proportionally larger share.
- Crawl budget waste: Google allocates a limited crawl budget per site. Pages that don't deserve ranking consume budget that should go to your best content.
- Quality signals: Google evaluates site quality holistically. A site with 200 excellent pages and 800 thin pages sends weaker quality signals than a site with 200 excellent pages and nothing else.
- User experience: Internal search results and "related posts" features surface low-quality content, reducing engagement metrics sitewide.
Identifying Content to Prune
Use the four-quadrant model to categorize every indexable URL on your site:
The Four-Quadrant Analysis
| Quadrant | Traffic | Quality/Relevance | Action |
|---|---|---|---|
| Keep | High | High | Leave as-is, update regularly |
| Update | Low-Medium | High potential | Refresh content, update data, improve SEO |
| Merge | Low | Overlaps with another page | Consolidate into the stronger page, redirect |
| Delete | Zero/near-zero | Low, outdated, or irrelevant | Remove and redirect (or 410) |
Step-by-Step Identification
1. Pull the data. Export from Google Analytics and Search Console:
- Pages with zero organic sessions in the last 12 months
- Pages with declining traffic trends (down >50% year-over-year)
- Pages with high bounce rates AND low time-on-page (engagement failures)
2. Cross-reference with backlink data. Pull from Ahrefs or SEMrush:
- Pages with zero referring domains
- Pages with only low-quality referring domains
3. Identify cannibalization. Use Search Console's Performance report:
- Filter by query → check which pages appear for the same query
- If multiple pages rank for the same query and both fluctuate in position, they're cannibalizing
4. Assess content quality. For each candidate page:
- Is the information still accurate?
- Does it cover the topic comprehensively (by current standards)?
- Does it serve a unique purpose that no other page on the site serves?
The Numbers
In our experience across dozens of content audits:
| Category | Typical Percentage | Description |
|---|---|---|
| Keep | 20-30% | Performing well, no changes needed |
| Update | 25-35% | Good topic, needs refreshed content/data |
| Merge | 20-30% | Overlapping topics to consolidate |
| Delete | 15-25% | Zero value, should be removed |
Most site owners are surprised by how much content falls into the merge or delete categories. A site with 500 blog posts typically has 75-125 that should be removed entirely.
The Pruning Process
Phase 1: Preparation (Week 1)
- Document everything. Create a spreadsheet with every URL, its category (keep/update/merge/delete), traffic data, backlink count, and planned action.
- Identify redirect targets. Every deleted or merged URL needs a redirect destination — either the merged-into page or the most topically relevant existing page.
- Back up content. Before deleting anything, export all content in case you need to reference it later.
Phase 2: Merges (Weeks 2-3)
Merges produce the most immediate SEO benefit. When you consolidate two mediocre pages into one strong page:
- The merged page inherits backlinks from both sources (via 301 redirect)
- Google receives a clearer signal about which page to rank
- The consolidated content is more comprehensive, which improves topical authority
Merge process:
- Choose the winner — the page with more backlinks, better URL, or higher current traffic
- Incorporate unique valuable content from the loser into the winner
- Update the winner's content to be comprehensive
- 301 redirect the loser URL to the winner URL
- Update all internal links that pointed to the loser
Phase 3: Deletions (Week 3)
For pages with zero traffic, zero backlinks, and no unique value:
Option A: 301 redirect to the most relevant existing page. Use this when a topically relevant target exists. The redirect preserves any minimal link equity and prevents 404 errors.
Option B: 410 Gone status code. Use this when no relevant redirect target exists. A 410 tells Google the content has been intentionally removed (unlike a 404, which could be accidental). Google will drop the URL from its index faster.
Never: Simply noindex the page and leave it live. This wastes crawl budget without any benefit. If the content shouldn't be indexed, it shouldn't exist.
Phase 4: Updates (Weeks 4-6)
For pages in the "update" category:
- Refresh statistics and examples with current data
- Add sections that address gaps (compared to current top-ranking competitors)
- Improve internal linking to and from updated content
- Update
lastmodin your sitemap - Request re-indexing via Search Console
Phase 5: Monitor (Weeks 7-12)
Track these metrics weekly after pruning:
- Total organic sessions (expect a temporary dip of 5-10%, then recovery + growth)
- Average position for target keywords
- Indexed page count in Search Console
- Crawl stats in Search Console (expect improved crawl efficiency)
When NOT to Prune
Not every underperforming page should be removed. Protect content that:
Has Backlinks
A page with zero organic traffic but 15 referring domains is passing authority to your site. Removing it without proper redirects loses that authority. Always redirect pages with backlinks to a relevant target — never delete them outright.
Has Seasonal Traffic
Some content has cyclical traffic patterns — tax-related articles spike in April, holiday gift guides peak in November. Check at least 18 months of data before concluding a page has "zero traffic." It might be dormant between cycles.
Serves Non-SEO Purposes
Customer support articles, internal documentation, policy pages, and post-purchase content may receive zero organic search traffic but serve essential business functions. Don't prune content just because it doesn't rank — prune content that shouldn't be indexed and provides no user value.
Is Brand New
Content published in the last 3-6 months hasn't had time to establish rankings. Google's sandbox effect means new content, especially on new topic clusters, can take months to gain traction. Don't prune content before it's had a fair chance to perform.
FAQ
How quickly will I see results after content pruning?
Typically 4-8 weeks for merges and updates to show ranking improvements. Deletions take longer — Google needs to recrawl redirected URLs and recalculate authority distribution. The full effect of a major pruning effort usually appears within 3-6 months. Expect a small traffic dip in weeks 1-3 as Google reprocesses the changes, followed by recovery and growth.
Can content pruning hurt my SEO?
Yes, if done poorly. Common mistakes: deleting pages with valuable backlinks without redirects, removing content that ranks for niche long-tail queries you didn't check, and pruning too aggressively (removing 70%+ of content at once). Always use data to guide decisions, always set up proper redirects, and prune in phases rather than all at once.
How do I handle content pruning for a multi-language site?
Prune each language version independently. A page that performs well in English might have zero traffic in its French translation. For international sites, also consider whether thin translations are dragging down your overall site quality. Removing low-quality machine translations and focusing on fewer, higher-quality language versions often produces better results than maintaining 20 languages with thin content.
How often should I prune content?
Annually for most sites. Quarterly for sites publishing more than 10 articles per month. The key is building pruning into your content operations rather than treating it as a one-time project. Every new article published should be checked against existing content for cannibalization potential before publication.