Custom Web Audits
Run Audit Demo Audit Audit Types Free Tools Pricing What We Analyze 🔒 Login
← All free tools
📑

Duplicate Content Checker

Enter two URLs and we measure how much of their content overlaps, as a similarity percentage. Catch near-duplicate pages that split rankings or trip duplicate-content issues — before they cost you traffic.

⚡ Interactive demo — sample data
78% content similarity between the two pages — near-duplicate. Search engines may rank only one. Consolidate or canonicalize.
Content similarity: 78% — near-duplicate. These two location pages share nearly all of their body copy. Issue
Verdict: search engines will likely pick one version to rank and split signals across both. Add a canonical or give each page unique local content. Issue
URL 1: 410 words • URL 2: 395 words — similar length, consistent with a single template reused with the city name swapped. Warning
Both pages fetched and compared on main content only — shared menu, header and footer were excluded. Looks good

About this tool

Enter two URLs and we measure how much of their content overlaps, as a similarity percentage. Catch near-duplicate pages that split rankings or trip duplicate-content issues — before they cost you traffic.

How it works

Enter two URLs
Paste two public URLs separated by a comma — for example 'https://site.com/page-a, https://site.com/page-b'. They can be two pages on your own site, your page versus a competitor's, or a www vs non-www version of the same address. We fetch both.
We compare the visible text
From each page we extract the main readable text — stripping scripts, styling, navigation, headers and footers — then break it into overlapping three-word phrases. Comparing those phrase sets tells us how much wording the two pages genuinely share, not just whether they use the same vocabulary.
Read the similarity score
You get a single content-similarity percentage with a clear verdict: near-duplicate, substantial overlap, or distinct. Use it to decide whether to consolidate the pages, add a canonical tag, rewrite one, or leave them alone.

What we check

Common issues we catch

Where this matters

Frequently asked questions

What counts as duplicate content?
Content that is identical or very similar across two or more URLs. That includes obvious copies (the same article on two pages) and sneakier cases like the same page served at www and non-www, or product variants with near-identical descriptions. This tool measures how close two specific URLs are so you can judge for yourself.
Is there a Google penalty for duplicate content?
No — for ordinary duplication there's no penalty. Google simply picks one version as canonical, indexes that, and sets the others aside. The real cost is that your ranking signals get split across URLs instead of pooling on one, and Google might pick a version you didn't intend. (Deliberately copying content to manipulate rankings is a separate spam issue.)
What similarity percentage should I worry about?
We flag 70% and above as near-duplicate — that's where search engines are most likely to collapse the pages into one. The 40–70% band is substantial overlap worth reviewing, especially for templated pages. Under 40% the pages are mostly distinct and at low risk.
How do I fix two pages that are too similar?
You have four main options. Consolidate them into one stronger page and 301-redirect the other; add a canonical tag on the duplicate pointing to the version you want to rank; rewrite one so they genuinely differ; or noindex the lesser page if it must stay live for users but shouldn't compete in search.
Should my product variant pages have canonical tags?
Usually, yes. If color or size variants share nearly all their copy, pointing their canonical tags at one main product URL consolidates ranking signals onto that page instead of spreading them thin. Keep separate indexable pages only where each variant has substantial unique content and search demand.
Does shared header, footer or menu text count against me?
We focus on each page's main content and strip out the navigation, header and footer, so shared site chrome doesn't inflate the score much. Search engines are also good at recognizing boilerplate. It's duplication in the body content that matters, which is what this tool measures.
Can I compare my page against a competitor's?
Yes — just put both URLs in. A high score against a competitor can mean one of you syndicated or copied the other, or that you're both using the same manufacturer or supplier description. It's a useful check for spotting where you need to write something genuinely your own.
Why did two different-looking pages score high?
Most often because the visible body text is largely the same even if the design differs — common with templated location pages or reskinned variants. Layout and styling don't count here; we compare the actual words, so two pages that look different but read the same will score high.

This is one of several free SEO tools from Custom Web Audits. For a complete, prioritized analysis of your whole website, run a full audit.