Slide 41
pa.ag
@peakaceag
Most common causes of duplicate content
For Google, each of the following pairs consists of two different URLs:
▪ Case sensitivity: https://pa.ag/url-A/ vs. https://pa.ag/url-a/
▪ Trailing slashes: https://pa.ag/url-b vs. https://pa.ag/url-b/
▪ non-www vs. www: https://pa.ag vs. https://www.pa.ag
▪ HTTP vs. HTTPS: http://pa.ag vs. https://pa.ag
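
To make the four cases concrete, here is a minimal Python sketch that collapses each variant onto one preferred URL, which is the target a 301 redirect would point to. The policy it encodes (HTTPS, non-www, lowercase paths, trailing slash) and the function name normalize_url are assumptions for illustration, not something the slide prescribes. It is deliberately simplistic: it would, for instance, also append a slash to file-like paths such as /index.php.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Map a URL variant to one preferred form (assumed policy:
    https, non-www host, lowercase path, trailing slash)."""
    parts = urlsplit(url)

    # Host names are case-insensitive; drop a leading "www." (assumed preference).
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]

    # Paths are case-sensitive on the web, so lowercasing is a site policy choice.
    path = parts.path.lower() or "/"
    if not path.endswith("/"):
        path += "/"

    # Always emit https; keep query string and fragment untouched.
    return urlunsplit(("https", host, path, parts.query, parts.fragment))


# The four pairs from the slide: distinct as strings, identical once normalized.
pairs = [
    ("https://pa.ag/url-A/", "https://pa.ag/url-a/"),  # case sensitivity
    ("https://pa.ag/url-b", "https://pa.ag/url-b/"),   # trailing slash
    ("https://pa.ag", "https://www.pa.ag"),            # non-www vs. www
    ("http://pa.ag", "https://pa.ag"),                 # HTTP vs. HTTPS
]
for a, b in pairs:
    assert a != b
    assert normalize_url(a) == normalize_url(b)
```

If a requested URL differs from its normalized form, that is the situation in which a server would typically answer with a 301 to the normalized URL.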
Dealing with duplication issues
▪ 301 redirect: e.g. non-www vs. www, HTTP vs. HTTPS,
casing (upper/lower), trailing slashes, index pages
(index.php)
▪ noindex: e.g. white labelling, internal search result pages,
work-in-progress content, PPC and other landing pages
▪ (Self-referencing) canonicals: e.g. for parameters used for
tracking, session IDs, printer-friendly versions, PDF to
HTML, etc.
▪ 403 / password protection: e.g. staging and development servers
▪ 404/410 gone: e.g. feed-based content that needs to be removed quickly,
other outdated, irrelevant or low-quality content
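
As an illustration of the canonical case in the list above, the following Python sketch strips tracking and session parameters from a URL and builds the matching canonical link element. The parameter names (utm_*, gclid, fbclid, sessionid) and the helper names canonical_url and canonical_link_tag are hypothetical; a real site would need its own list of parameters that do not change page content.

```python
import html
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed examples of parameters that do not change the page content.
TRACKING_PARAMS = {"gclid", "fbclid", "sessionid"}
TRACKING_PREFIXES = ("utm_",)


def canonical_url(url: str) -> str:
    """Return the URL without tracking/session parameters and without fragment."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in TRACKING_PARAMS
        and not key.lower().startswith(TRACKING_PREFIXES)
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link rel="canonical"> element for the page's <head>."""
    href = html.escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


print(canonical_link_tag("https://pa.ag/url-a/?utm_source=mail&page=2&sessionid=abc"))
```

Parameters that do change the content (page=2 in the usage line) are kept, so only the tracking variants collapse onto one canonical URL.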