SlideShare ist ein Scribd-Unternehmen logo
1 von 9
Downloaden Sie, um offline zu lesen
Duplicate Content & Multiple Site
            Issues
            Sasi Parthasarathy
        Program Manager, Microsoft
Topics covered

• Duplicate content
   – Internal content -> URL Canonicalization
   – External content -> Spam, Geo-targeting
• Content Syndication
• Good practices
• Examples Examples Examples
URL canonicalization

•   Less is more - expose only one URL per piece of content – pretty
    please
•   The practice of consolidating all versions of a page under one URL is
    referred to as quot;canonicalizationquot;
•   Helps the search engine; at the same time does not split your rank juice
•   Having too many duplicate URLs will waste crawl time – the crawler might
    spend time indexing duplicate URLs and miss good content
•   4 ways to get to microsoft.com but we need only one
     1. microsoft.com
     2. www.microsoft.com
     3. www.microsoft.com/en/us/default.aspx
     4. www.microsoft.com/en/us/
Few recommendations for canonicalization

• Select WWW or Non-WWW, then redirect the other option to your
  preferred version
• Remove the default filename from the end of your URLs
    – All web servers allow you to select one or more default filenames to serve when
      the browser requests a directory. Check and see if the default filename is at the
      end of the URL and then trim it off
• Link internally to the canonical form of your URL
    – Make sure you always link to the proper canonical form of your URLs from within
      your site
• Remove query string variables or rewrite to readable URLs
    – http://www.mysite.com/downloads/details.aspx?FamilyID=ab99&displaylang=en
      to
      http://www.mysite.com/downloads/en/family/ab99
Why duplicate content?

• Your intention is the key
• If your intent is to manipulate the search engine, you will
  be penalized
  Example1: Multiple domains with very little or no
  difference in content and no clear intent why these
  domains exist
  Example2: If you are trying to falsely promote original
  content as your own (please report any issues with
  copied content to Live Search support)
Going International – Help Search Engines

You may have similar pages but for various regions.
Problems for search engines with geo-targeting:
• No standardized way to tell a search engine which region or
   language your content is targeted for
• Top level domains may not indicate the intended audience. For
   example, http://ma.tt/, an English site or Orange.com, a French
   Telecom site hosted in France.
• Using search unfriendly redirection techniques
Few indicators - Help Live Search while Geo-
                      targeting
• Country code top-level domain (ccTLD). For example, .ca
  specifically targets users in Canada
• Set all your domains in Live Search webmaster tools and make it
  explicit for the region

These indicators will help us show the correct page for the correct
  market
Content Syndication

• Syndicate with caution: For sites that syndicate their content on
  other sites
• From our perspective, we always want to show the version we think
  is appropriate to the user. This may not be the version you want or
  prefer.
• Tip:
   Ask your partner to use robots.txt to stop us from indexing the syndicated material
General tips to help the Search Engine


• Dynamic URLs – if the content is not changing, don’t have too many
  parameters
• 301 is your best friend – use them when you can
• No 302 hijack!!
• When you do a site update, don’t have links to expired pages
• Use robots.txt for anything you don’t want crawlers to crawl
• Consistent naming convention – easy for search engines to
  understand
• Follow standard URL formation practices

Weitere ähnliche Inhalte

Andere mochten auch

Ancient Indian Mathematics And Astronomy
Ancient Indian Mathematics And AstronomyAncient Indian Mathematics And Astronomy
Ancient Indian Mathematics And AstronomyKalaimani Retnasamy
 
Ambit Energy Business Presentation
Ambit Energy Business PresentationAmbit Energy Business Presentation
Ambit Energy Business Presentationtoelerich
 
Poly Books (Japan Finals)
Poly Books (Japan Finals)Poly Books (Japan Finals)
Poly Books (Japan Finals)guestde5b2cc
 
08[1] multimedia
08[1]  multimedia08[1]  multimedia
08[1] multimediavincentlin
 
les discapacitats treball recerca boix
les discapacitats treball recerca boixles discapacitats treball recerca boix
les discapacitats treball recerca boixrecercadiscapacitats
 

Andere mochten auch (8)

Ancient Indian Mathematics And Astronomy
Ancient Indian Mathematics And AstronomyAncient Indian Mathematics And Astronomy
Ancient Indian Mathematics And Astronomy
 
Ambit Energy Business Presentation
Ambit Energy Business PresentationAmbit Energy Business Presentation
Ambit Energy Business Presentation
 
Hungaria
HungariaHungaria
Hungaria
 
Poly Books (Japan Finals)
Poly Books (Japan Finals)Poly Books (Japan Finals)
Poly Books (Japan Finals)
 
08[1] multimedia
08[1]  multimedia08[1]  multimedia
08[1] multimedia
 
les discapacitats treball recerca boix
les discapacitats treball recerca boixles discapacitats treball recerca boix
les discapacitats treball recerca boix
 
Bharamri
BharamriBharamri
Bharamri
 
vb.net
vb.netvb.net
vb.net
 

Kürzlich hochgeladen

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

Kürzlich hochgeladen (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Duplicate Content SES NY 2009

  • 1. Duplicate Content & Multiple Site Issues Sasi Parthasarathy Program Manager, Microsoft
  • 2. Topics covered • Duplicate content – Internal content -> URL Canonicalization – External content -> Spam, Geo-targeting • Content Syndication • Good practices • Examples Examples Examples
  • 3. URL canonicalization • Less is more - expose only one URL per piece of content – pretty please • The practice of consolidating all versions of a page under one URL is referred to as quot;canonicalizationquot; • Helps the search engine; at the same time does not split your rank juice • Having too many duplicate URLs will waste crawl time – the crawler might spend time indexing duplicate URLs and miss good content • 4 ways to get to microsoft.com but we need only one 1. microsoft.com 2. www.microsoft.com 3. www.microsoft.com/en/us/default.aspx 4. www.microsoft.com/en/us/
  • 4. Few recommendations for canonicalization • Select WWW or Non-WWW, then redirect the other option to your preferred version • Remove the default filename from the end of your URLs – All web servers allow you to select one or more default filenames to serve when the browser requests a directory. Check and see if the default filename is at the end of the URL and then trim it off • Link internally to the canonical form of your URL – Make sure you always link to the proper canonical form of your URLs from within your site • Remove query string variables or rewrite to readable URLs – http://www.mysite.com/downloads/details.aspx?FamilyID=ab99&displaylang=en to http://www.mysite.com/downloads/en/family/ab99
  • 5. Why duplicate content? • Your intention is the key • If your intent is to manipulate the search engine, you will be penalized Example1: Multiple domains with very little or no difference in content and no clear intent why these domains exist Example2: If you are trying to falsely promote original content as your own (please report any issues with copied content to Live Search support)
  • 6. Going International – Help Search Engines You may have similar pages but for various regions. Problems for search engines with geo-targeting: • No standardized way to tell a search engine which region or language your content is targeted for • Top level domains may not indicate the intended audience. For example, http://ma.tt/, an English site or Orange.com, a French Telecom site hosted in France. • Using search unfriendly redirection techniques
  • 7. Few indicators - Help Live Search while Geo- targeting • Country code top-level domain (ccTLD). For example, .ca specifically targets users in Canada • Set all your domains in Live Search webmaster tools and make it explicit for the region These indicators will help us show the correct page for the correct market
  • 8. Content Syndication • Syndicate with caution: For sites that syndicate their content on other sites • From our perspective, we always want to show the version we think is appropriate to the user. This may not be the version you want or prefer. • Tip: Ask your partner to use robots.txt to stop us from indexing the syndicated material
  • 9. General tips to help the Search Engine • Dynamic URLs – if the content is not changing, don’t have too many parameters • 301 is your best friend – use them when you can • No 302 hijack!! • When you do a site update, don’t have links to expired pages • Use robots.txt for anything you don’t want crawlers to crawl • Consistent naming convention – easy for search engines to understand • Follow standard URL formation practices