SlideShare ist ein Scribd-Unternehmen logo
1 von 31
New perspectives on duplicate content
Alexis Sanders
Senior SEO Manager at Merkle
Omi Sido
Technical SEO at Canon Europe
#OnCrawlinOrbit
Why should SEOs care about duplicate content?
#OnCrawlinOrbit
There is no manual
penalty for duplicate
content.
Source: October 2015 Google Hangout
Source: 10 Things I Hate About You
the
website
you
don’t
want to
be
what a user sees: what a bot sees:
Umm, I think I
like the white
shirt better…
Source: Introduction to Information Retrieval (c19)
“by some estimates,
as many as 40% of
the pages on the
web are duplicates
of other pages”
1. Indexing Challenges
2. Lower Link Impact
3. Internal Competition
4. Poor Crawl Bandwidth
Common sources of duplication:
o Repetitive page
o Doorway pages
o Inventory control
o Syndicated content
o PR releases
o Republishing
o Plagiarism
o Non-unique copy
o Localized content
o Thin content
o Staging sites
o HTTP vs. HTTPS
o Subdomain
o URL cases
o File extensions
o Trailing slash
o Index pages
o Parameters
o Pagination
o Mobile Configuration
o Internal site search
technical content
o Facets
o Sorts
o Image-only
How can SEOs find and identify duplicate content?
#OnCrawlinOrbit
1. Know your user journey
7. If your content is stolen:
Request a canonical tag
File a DCMA request
6. Strategically → consolidate,
create, delete, optimize
5. Leverage
appropriate
signaling
4. If the pages are
100% duplicate,
consolidate w/
a 301 redirect
3. Prioritize
duplicate
content
issues that are
affecting
performance
2. Create a strong
hierarchical URL taxonomy
oncrawl > Duplicate Content
oncrawl > Duplicate Content > By Group
Google
• Direct quotes in Google
• Searching via site:searches
• site:
• site: + inurl:
• intitle:
• filetype: (for file extensions)
GSC > Coverage > Duplicate …
•Quetext
•Noplag
•PaperRater
•Grammarly
•CopyScape
Plagiarism Tools
Keyword Density
Resolving duplicate content:
a memorable case
#OnCrawlinOrbit
+64.2% in sessions Y/Y from Google organic.
Adding unique content.
+28.7% in sessions Y/Y from Bing organic.
From only H1  SSR full UX
HTTPS
HTTP
Once fixed, clicks
returned to normal
range on HTTPS w/in
3-4 days.
Accidental HTTP
canonical.
Estimated 5-10%
loss.
What is new for duplicate content
in the past year and a half?
#OnCrawlinOrbit
What will duplicate content management
look like in the future?
#OnCrawlinOrbit
• Less technical-based duplicate content (as CMS wise up)
• More automation (unit testing and external testing)
• Automatically detect high similarity pages and page types
for writers and content managers
• Google continue to improve their existing systems and
detection
• Perhaps an alert system to escalate issue of Google not
using the right canonical
Alexis’ hopes for the future,
Do you have a favorite technical trick?
#OnCrawlinOrbit
• EC2 remote computer instance
• Check mobile first testing tool
• Switch user agent to Googlebot
• Using TechnicalSEO.com’s robots.txt
tool
• Screaming frog log analyzer
• Made with Love’s htaccess checker
• Using Google Data Studio to report on
changes (syncing Sheets with updates,
filtering each page by relevant updates)
Alexis’ tech SEO tricks
Do you have a least favorite technical SEO question?
#OnCrawlinOrbit
Do you have a favorite googlebot?
#OnCrawlinOrbit
Alexis: I like the idea that
Googlebot is tired and
overworked (from crawling 130
trillion URLs).
Do you have a favorite planet?
#OnCrawlinOrbit
Launching the best SEO tips
into space
Next up on June 27th
from Bordeaux, France
FULL AGENDA AT WWW.ONCRAWL.COM/SEOINORBIT

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

SMX East - SEO Tools Panel
SMX East - SEO Tools PanelSMX East - SEO Tools Panel
SMX East - SEO Tools Panel
 
Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...Alexis max-Creating a bot experience as good as your user experience - Alexis...
Alexis max-Creating a bot experience as good as your user experience - Alexis...
 
What's New with Schema and Structured Data - Max Prin - SMX Advanced 2019
What's New with Schema and Structured Data - Max Prin - SMX Advanced 2019What's New with Schema and Structured Data - Max Prin - SMX Advanced 2019
What's New with Schema and Structured Data - Max Prin - SMX Advanced 2019
 
Redefining technical SEO & how we should be thinking about it as an industry ...
Redefining technical SEO & how we should be thinking about it as an industry ...Redefining technical SEO & how we should be thinking about it as an industry ...
Redefining technical SEO & how we should be thinking about it as an industry ...
 
SEMrush- 5 Hours of SEO - Scaling Ranking Data Analysis
SEMrush- 5 Hours of SEO - Scaling Ranking Data AnalysisSEMrush- 5 Hours of SEO - Scaling Ranking Data Analysis
SEMrush- 5 Hours of SEO - Scaling Ranking Data Analysis
 
SMX West 2020 - Leveraging Structured Data for Maximum Effect
SMX West  2020 - Leveraging Structured Data for Maximum EffectSMX West  2020 - Leveraging Structured Data for Maximum Effect
SMX West 2020 - Leveraging Structured Data for Maximum Effect
 
Inbound Marketing Tools - SearchFest
Inbound Marketing Tools - SearchFestInbound Marketing Tools - SearchFest
Inbound Marketing Tools - SearchFest
 
Mobile-First Indexing and AMP - SMX Advanced 2018
Mobile-First Indexing and AMP - SMX Advanced 2018Mobile-First Indexing and AMP - SMX Advanced 2018
Mobile-First Indexing and AMP - SMX Advanced 2018
 
How to build simple web apps to automate your SEO tasks - BrightonSEO Spring ...
How to build simple web apps to automate your SEO tasks - BrightonSEO Spring ...How to build simple web apps to automate your SEO tasks - BrightonSEO Spring ...
How to build simple web apps to automate your SEO tasks - BrightonSEO Spring ...
 
SEO and Analytics
SEO and AnalyticsSEO and Analytics
SEO and Analytics
 
The Real Problems Behind Indexing | 5 Hours of Technical SEO
The Real Problems Behind Indexing | 5 Hours of Technical SEOThe Real Problems Behind Indexing | 5 Hours of Technical SEO
The Real Problems Behind Indexing | 5 Hours of Technical SEO
 
Barry Adams - Search in a screenless world - Nottingham Digital Summit
Barry Adams - Search in a screenless world - Nottingham Digital SummitBarry Adams - Search in a screenless world - Nottingham Digital Summit
Barry Adams - Search in a screenless world - Nottingham Digital Summit
 
Proactive Measures for Good Site Health - Brighton SEO 2014
Proactive Measures for Good Site Health - Brighton SEO 2014Proactive Measures for Good Site Health - Brighton SEO 2014
Proactive Measures for Good Site Health - Brighton SEO 2014
 
5 Time-Saving SEO Alerts to Use Right Now - brightonSEO 2019
5 Time-Saving SEO Alerts to Use Right Now - brightonSEO 20195 Time-Saving SEO Alerts to Use Right Now - brightonSEO 2019
5 Time-Saving SEO Alerts to Use Right Now - brightonSEO 2019
 
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based WebsitesMax Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
 
Conflicting Website Signals & Confused Search Engines | Raleigh SEO Conferenc...
Conflicting Website Signals & Confused Search Engines | Raleigh SEO Conferenc...Conflicting Website Signals & Confused Search Engines | Raleigh SEO Conferenc...
Conflicting Website Signals & Confused Search Engines | Raleigh SEO Conferenc...
 
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
Redefining Technical SEO, #MozCon 2019 by Paul ShapiroRedefining Technical SEO, #MozCon 2019 by Paul Shapiro
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
 
TFM - Using Google Tag Manager for ecom
TFM - Using Google Tag Manager for ecom TFM - Using Google Tag Manager for ecom
TFM - Using Google Tag Manager for ecom
 
Jamie Alberico — How to Leverage Insights from Your Site’s Server Logs | 5 Ho...
Jamie Alberico — How to Leverage Insights from Your Site’s Server Logs | 5 Ho...Jamie Alberico — How to Leverage Insights from Your Site’s Server Logs | 5 Ho...
Jamie Alberico — How to Leverage Insights from Your Site’s Server Logs | 5 Ho...
 
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
 

Ähnlich wie SEO in Orbit - Duplicate Content by OnCrawl

Ähnlich wie SEO in Orbit - Duplicate Content by OnCrawl (20)

Duplicate content presentation March 2012
Duplicate content presentation   March 2012Duplicate content presentation   March 2012
Duplicate content presentation March 2012
 
Common SEO Mistakes During Site Relaunches, Redesigns, Migrations (2018)
Common SEO Mistakes During Site Relaunches, Redesigns, Migrations (2018) Common SEO Mistakes During Site Relaunches, Redesigns, Migrations (2018)
Common SEO Mistakes During Site Relaunches, Redesigns, Migrations (2018)
 
Search Engine Optimization (Seo) for Developers
Search Engine Optimization (Seo) for DevelopersSearch Engine Optimization (Seo) for Developers
Search Engine Optimization (Seo) for Developers
 
BrightonSEO 5 Critical Questions Your Log Files Can Answer September 2016
BrightonSEO 5 Critical Questions Your Log Files Can Answer September 2016BrightonSEO 5 Critical Questions Your Log Files Can Answer September 2016
BrightonSEO 5 Critical Questions Your Log Files Can Answer September 2016
 
Webmaster tools (ICMK485)
Webmaster tools (ICMK485)Webmaster tools (ICMK485)
Webmaster tools (ICMK485)
 
C3 2012 Inside the Searchlight API - Dan Cristo
C3 2012  Inside the Searchlight API - Dan CristoC3 2012  Inside the Searchlight API - Dan Cristo
C3 2012 Inside the Searchlight API - Dan Cristo
 
Website Audit [On Page and Off Page] by Carl Benedic Pantaleon
Website Audit [On Page and Off Page] by Carl Benedic PantaleonWebsite Audit [On Page and Off Page] by Carl Benedic Pantaleon
Website Audit [On Page and Off Page] by Carl Benedic Pantaleon
 
State of Search Marketing - 2019
State of Search Marketing - 2019State of Search Marketing - 2019
State of Search Marketing - 2019
 
Algorithmic Web Spam detection - Matt Peters MozCon
Algorithmic Web Spam detection - Matt Peters MozConAlgorithmic Web Spam detection - Matt Peters MozCon
Algorithmic Web Spam detection - Matt Peters MozCon
 
Search engine optimization
Search engine optimization Search engine optimization
Search engine optimization
 
Introduction To SEO (SEARCH ENGINE OPTIMIZATION)- Learning Catalyst
Introduction To SEO (SEARCH ENGINE OPTIMIZATION)- Learning CatalystIntroduction To SEO (SEARCH ENGINE OPTIMIZATION)- Learning Catalyst
Introduction To SEO (SEARCH ENGINE OPTIMIZATION)- Learning Catalyst
 
Web Components at Scale, HTML5DevConf 2014-10-21
Web Components at Scale, HTML5DevConf 2014-10-21Web Components at Scale, HTML5DevConf 2014-10-21
Web Components at Scale, HTML5DevConf 2014-10-21
 
Search Engine Optimization Primer
Search Engine Optimization PrimerSearch Engine Optimization Primer
Search Engine Optimization Primer
 
Beginner's Guide to SEO [Technical SEO & On Page]
Beginner's Guide to SEO [Technical SEO & On Page]Beginner's Guide to SEO [Technical SEO & On Page]
Beginner's Guide to SEO [Technical SEO & On Page]
 
Content Audit Webinar with Everett & URL Profiler
Content Audit Webinar with Everett & URL ProfilerContent Audit Webinar with Everett & URL Profiler
Content Audit Webinar with Everett & URL Profiler
 
Beginners seo gs v3
Beginners seo gs v3Beginners seo gs v3
Beginners seo gs v3
 
Seo biggner guide 2014
Seo biggner guide 2014Seo biggner guide 2014
Seo biggner guide 2014
 
Seo Made Easy
Seo Made EasySeo Made Easy
Seo Made Easy
 
SEO Essentials for 2021
SEO Essentials for 2021SEO Essentials for 2021
SEO Essentials for 2021
 
The Right Way to Do an SEO Audit
The Right Way to Do an SEO AuditThe Right Way to Do an SEO Audit
The Right Way to Do an SEO Audit
 

Mehr von Alexis Sanders

D13TM Newsletter - July
D13TM Newsletter - JulyD13TM Newsletter - July
D13TM Newsletter - July
Alexis Sanders
 
2016-7 Toastmasters PRM Plan-Website
2016-7 Toastmasters PRM Plan-Website2016-7 Toastmasters PRM Plan-Website
2016-7 Toastmasters PRM Plan-Website
Alexis Sanders
 
Summer TLI 2016 program v7
Summer TLI 2016 program v7Summer TLI 2016 program v7
Summer TLI 2016 program v7
Alexis Sanders
 
Fall Conference 2015 - Hall of Fame Program - v4
Fall Conference 2015 - Hall of Fame Program - v4Fall Conference 2015 - Hall of Fame Program - v4
Fall Conference 2015 - Hall of Fame Program - v4
Alexis Sanders
 
Summer TLI 2015 program-Final-LOW RES
Summer TLI 2015 program-Final-LOW RESSummer TLI 2015 program-Final-LOW RES
Summer TLI 2015 program-Final-LOW RES
Alexis Sanders
 
Winter TLI 2016 program-8_PROOF
Winter TLI 2016 program-8_PROOFWinter TLI 2016 program-8_PROOF
Winter TLI 2016 program-8_PROOF
Alexis Sanders
 
Spring Conference 2016 - program v9
Spring Conference 2016 - program v9Spring Conference 2016 - program v9
Spring Conference 2016 - program v9
Alexis Sanders
 
Fall Conference 2015 v7
Fall Conference 2015 v7Fall Conference 2015 v7
Fall Conference 2015 v7
Alexis Sanders
 

Mehr von Alexis Sanders (13)

The Science of Seeking Your Customer: Research, SEO, CRM
The Science of Seeking Your Customer: Research, SEO, CRMThe Science of Seeking Your Customer: Research, SEO, CRM
The Science of Seeking Your Customer: Research, SEO, CRM
 
quest is in the name: question answering systems
quest is in the name: question answering systemsquest is in the name: question answering systems
quest is in the name: question answering systems
 
skillshare organic search strategies - template
skillshare organic search strategies - templateskillshare organic search strategies - template
skillshare organic search strategies - template
 
BrightonSEO Structured Data by Alexis Sanders
BrightonSEO Structured Data by Alexis SandersBrightonSEO Structured Data by Alexis Sanders
BrightonSEO Structured Data by Alexis Sanders
 
Ocular Dialect
Ocular DialectOcular Dialect
Ocular Dialect
 
D13TM Newsletter - July
D13TM Newsletter - JulyD13TM Newsletter - July
D13TM Newsletter - July
 
2016-7 Toastmasters PRM Plan-Website
2016-7 Toastmasters PRM Plan-Website2016-7 Toastmasters PRM Plan-Website
2016-7 Toastmasters PRM Plan-Website
 
Summer TLI 2016 program v7
Summer TLI 2016 program v7Summer TLI 2016 program v7
Summer TLI 2016 program v7
 
Fall Conference 2015 - Hall of Fame Program - v4
Fall Conference 2015 - Hall of Fame Program - v4Fall Conference 2015 - Hall of Fame Program - v4
Fall Conference 2015 - Hall of Fame Program - v4
 
Summer TLI 2015 program-Final-LOW RES
Summer TLI 2015 program-Final-LOW RESSummer TLI 2015 program-Final-LOW RES
Summer TLI 2015 program-Final-LOW RES
 
Winter TLI 2016 program-8_PROOF
Winter TLI 2016 program-8_PROOFWinter TLI 2016 program-8_PROOF
Winter TLI 2016 program-8_PROOF
 
Spring Conference 2016 - program v9
Spring Conference 2016 - program v9Spring Conference 2016 - program v9
Spring Conference 2016 - program v9
 
Fall Conference 2015 v7
Fall Conference 2015 v7Fall Conference 2015 v7
Fall Conference 2015 v7
 

Kürzlich hochgeladen

Brand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdfBrand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdf
tbatkhuu1
 
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAILBUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
DIGISHIFT INDIA +918368319550
 

Kürzlich hochgeladen (20)

Major SEO Trends in 2024 - Banyanbrain Digital
Major SEO Trends in 2024 - Banyanbrain DigitalMajor SEO Trends in 2024 - Banyanbrain Digital
Major SEO Trends in 2024 - Banyanbrain Digital
 
Labour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxLabour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptx
 
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptx
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
 
Digital Strategy Master Class - Andrew Rupert
Digital Strategy Master Class - Andrew RupertDigital Strategy Master Class - Andrew Rupert
Digital Strategy Master Class - Andrew Rupert
 
Brand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdfBrand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdf
 
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 Reports
 
The Science of Landing Page Messaging.pdf
The Science of Landing Page Messaging.pdfThe Science of Landing Page Messaging.pdf
The Science of Landing Page Messaging.pdf
 
Unraveling the Mystery of The Circleville Letters.pptx
Unraveling the Mystery of The Circleville Letters.pptxUnraveling the Mystery of The Circleville Letters.pptx
Unraveling the Mystery of The Circleville Letters.pptx
 
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAILBUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
 
Social media, ppt. Features, characteristics
Social media, ppt. Features, characteristicsSocial media, ppt. Features, characteristics
Social media, ppt. Features, characteristics
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
 
Social Media Marketing PPT-Includes Paid media
Social Media Marketing PPT-Includes Paid mediaSocial Media Marketing PPT-Includes Paid media
Social Media Marketing PPT-Includes Paid media
 
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptx
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptxDigital-Marketing-Into-by-Zoraiz-Ahmad.pptx
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptx
 
The Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison KaltmanThe Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison Kaltman
 
Factors-Influencing-Branding-Strategies.pptx
Factors-Influencing-Branding-Strategies.pptxFactors-Influencing-Branding-Strategies.pptx
Factors-Influencing-Branding-Strategies.pptx
 
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
 

SEO in Orbit - Duplicate Content by OnCrawl

  • 1. New perspectives on duplicate content Alexis Sanders Senior SEO Manager at Merkle Omi Sido Technical SEO at Canon Europe #OnCrawlinOrbit
  • 2. Why should SEOs care about duplicate content? #OnCrawlinOrbit
  • 3. There is no manual penalty for duplicate content. Source: October 2015 Google Hangout
  • 4. Source: 10 Things I Hate About You the website you don’t want to be
  • 5. what a user sees: what a bot sees: Umm, I think I like the white shirt better…
  • 6. Source: Introduction to Information Retrieval (c19) “by some estimates, as many as 40% of the pages on the web are duplicates of other pages”
  • 7. 1. Indexing Challenges 2. Lower Link Impact 3. Internal Competition 4. Poor Crawl Bandwidth
  • 8. Common sources of duplication:
  • 9. o Repetitive page o Doorway pages o Inventory control o Syndicated content o PR releases o Republishing o Plagiarism o Non-unique copy o Localized content o Thin content o Staging sites o HTTP vs. HTTPS o Subdomain o URL cases o File extensions o Trailing slash o Index pages o Parameters o Pagination o Mobile Configuration o Internal site search technical content o Facets o Sorts o Image-only
  • 10. How can SEOs find and identify duplicate content? #OnCrawlinOrbit
  • 11. 1. Know your user journey 7. If your content is stolen: Request a canonical tag File a DCMA request 6. Strategically → consolidate, create, delete, optimize 5. Leverage appropriate signaling 4. If the pages are 100% duplicate, consolidate w/ a 301 redirect 3. Prioritize duplicate content issues that are affecting performance 2. Create a strong hierarchical URL taxonomy
  • 13. oncrawl > Duplicate Content > By Group
  • 14. Google • Direct quotes in Google • Searching via site:searches • site: • site: + inurl: • intitle: • filetype: (for file extensions)
  • 15. GSC > Coverage > Duplicate …
  • 18. Resolving duplicate content: a memorable case #OnCrawlinOrbit
  • 19. +64.2% in sessions Y/Y from Google organic. Adding unique content.
  • 20. +28.7% in sessions Y/Y from Bing organic. From only H1  SSR full UX
  • 21. HTTPS HTTP Once fixed, clicks returned to normal range on HTTPS w/in 3-4 days. Accidental HTTP canonical. Estimated 5-10% loss.
  • 22. What is new for duplicate content in the past year and a half? #OnCrawlinOrbit
  • 23. What will duplicate content management look like in the future? #OnCrawlinOrbit
  • 24. • Less technical-based duplicate content (as CMS wise up) • More automation (unit testing and external testing) • Automatically detect high similarity pages and page types for writers and content managers • Google continue to improve their existing systems and detection • Perhaps an alert system to escalate issue of Google not using the right canonical Alexis’ hopes for the future,
  • 25. Do you have a favorite technical trick? #OnCrawlinOrbit
  • 26. • EC2 remote computer instance • Check mobile first testing tool • Switch user agent to Googlebot • Using TechnicalSEO.com’s robots.txt tool • Screaming frog log analyzer • Made with Love’s htaccess checker • Using Google Data Studio to report on changes (syncing Sheets with updates, filtering each page by relevant updates) Alexis’ tech SEO tricks
  • 27. Do you have a least favorite technical SEO question? #OnCrawlinOrbit
  • 28. Do you have a favorite googlebot? #OnCrawlinOrbit
  • 29. Alexis: I like the idea that Googlebot is tired and overworked (from crawling 130 trillion URLs).
  • 30. Do you have a favorite planet? #OnCrawlinOrbit
  • 31. Launching the best SEO tips into space Next up on June 27th from Bordeaux, France FULL AGENDA AT WWW.ONCRAWL.COM/SEOINORBIT

Hinweis der Redaktion

  1. October 2015 Google Hangout
  2. Joey: [holding up headshots] “Which one do you like better?” Bianca: “Umm, I think I like the white shirt better.” Joey: “Yeah, it’s-it’s more…” Bianca: “Pensive?” Joey: “Damn, I was going for thoughtful.”
  3. Roll in like star wars? https://www.shapechef.com/blog/star-wars-intro-crawl-in-powerpoint-2013
  4. I have two clients that see a lot of this, one in real estate, one in health. Lot of trying to add unique content and information architecture.
  5. From Google
  6. Incorrect canonicals: ~2/1/19 Fixed: 4/18/19
  7. Same as stuff in the past Finding ways to make content unique, especially when site is massive Machine generated content
  8. Alexis: Check mobile first testing tool, switch user agent to Googlebot Using technicalseo.com’s robots.txt tool Screaming frog log analyzer EC2 remote computer I’ll try to think of something better.
  9. Robots.txt
  10. Alexis – the newest, evergreen one.
  11. Alexis: Jupiter