SlideShare ist ein Scribd-Unternehmen logo
1 von 55
#pubcon
Presented by: Dawn Anderson
@dawnieando
‘Myths, Facts And Theories On Crawl
Budget And The Importance Of ‘URL
Importance Optimization’’
#pubcon
Dawn Anderson
• Move It Marketing
• University Lecturer – Digital Marketing
• From Manchester, UK (rains a lot)
• International SEO Consultant – 10+ yrs in SEO
• Pomeranian pooch lover - Bert
• Fascinated by crawling (practice & academia)
• Doesn’t fare well in YouTube screen grabs ;P
• Party trick: Remembering UK postcode areas
(US Zip code equivalent)
• Search Awards Judge
• Twitter chatterer @dawnieando
#pubcon
Defining Crawl Budget
‘Host Load’ =
What can you
handle?
+
‘URL Scheduling’
= What is
important to
crawl & how
often?
#pubcon
Myths About Crawl Budget
#pubcon
Myth – It’s All About Just My Site, Right?
• NO – HOST LOAD is apportioned at an IP
level and shared amongst the sites
there (Host load)
#pubcon
Host Load - When Will This Matter?
• It’s more about server capacity than SEO TBH
• Your site is massive (similar in size e.g. to ’Amazon’)
• Your site is massive and you’re on a shared hosting
• You’re using a CDN and your site is massive
• You have lots of large subdomains sharing space
• Crawlable test or staging sites
• You have ‘infinite loops’ and ‘spider traps’
• You keep throwing server errors during
crawling
‘Average’ sites don’t
normally hit the payload
(‘host load’)
#pubcon
Myth - Google Search Console
Crawl Stats Is Where It’s At Right?
#pubcon
GSC Crawl Stats Is Not Really
Just ‘Web Pages’
• Includes ALL CSS, JS, Zip,
XML, PDF, AMP, HTML
files crawled
• Pages are NOT just single
webpages
https://support.google.com/webmasters/answer/3
5253
Not just ‘web
pages
#pubcon
Visits By ALL The 10 Types Of Googlebots Are
Recorded Together In GSC
Web Image News
Video Feature Phone Smartphone
Mobile
Adsense
Adsense Adsbot
App
Crawler
ALL The
Googlebot Family
#pubcon
It Also Includes All 200 And 30X
Responses
• That massive crawl you thought
you just got on new pages or
existing pages 200 Oks could also
be many, many 30X redirections
• Especially when using * wildcard
redirections on large sites
• NO 400, 500, robotted or
unreachables are recorded here
https://support.google.com/webmasters/answer/3
5253
#pubcon
GSC Doesn’t Even Show You WHAT URLs
Have Been Crawled & When
It will likely just a few URLs being crawled very often, some very rarely and most others
somewhere in between – YOU NEED TO KNOW
#pubcon
REALITY – Server Logs & Log Analysis Is
Where It’s At
AUTOMATE SERVER LOG
RETRIEVAL VIA CRON JOB
grep Googlebot access_log
>googlebot_access.txt
#pubcon
Use Tools Or Just Export, Convert Data
& Use Mr Mu’s Spreadsheet
Spreadsheet - https://goo.gl/1pToL8
#pubcon
For The Avoidance Of Doubt –
I Asked To Be Sure
#pubcon
Why Does This Matter?
On A Large Site You Need To Be Able To
See Through ‘Spider Eyes’
You need to see what
Googlebot
‘REALLY’ thinks of
your site
#pubcon
Myth – It’s The No Of ‘Pages’ Crawled In
GSC Crawl Stats Divided By Days
For all of the reasons
in the previous 7+
slides
#pubcon
Myth – Googlebot Crawls Through Your
Website From One End To The Other
Then Starts Again
• This is where it gets complicated
• Web crawl efficiency is key
• There is an order to things
• Minimizing visibility of existing stale content is
key too – the rest of the web is changing
• Fresh results are vital to searchers
#pubcon
“What I Think You Are Talking About Is
Scheduling” (Illyes, Google)
Remember that time when Mr Mu
kicked Andrey under the table?
(joking JJ)
#pubcon
Why Web Crawling Efficiency?
“WE ARE ALL
PUBLISHERS”
THE NUMBER OF WEBSITES
DOUBLED IN SIZE BETWEEN 2011
AND 2012
AND AGAIN BY 1/3 IN 2014
The Content
‘Explosion’
#pubcon
“We don't index every one of
those trillion pages -- many of
them are similar to each
other” (J Alpert, Google)
“There’s a needle in here
somewhere”
“It’s an important needle
too” If only we could
identify it
“So how many unique pages
does the
web really contain? We
don't know;
we don't have time to look
at them all!”
(J Alpert, Google)
#pubcon
The Duplicate Content ‘Penalty’ Myth
• ‘Real’ duplicates (matching
content checksum) filtered and
not indexed
“Each content filter sends the
retrieved web pages to Dupserver
to determine if they are duplicates
of other web pages”
http://www.google.ch/patents/US20120317089
#pubcon
Duplication & ’The Battle To Be The Single
URL / Content Fingerprint’
URL / CONTENT
FINGERPRINT
REDIRECT
YOU HAVE THE POWER TO
CHOOSE ‘THE ONE’
CANONICALIZATION,
HREFLANG, CONSISTENT
SIGNALS INTERNALLY
#pubcon
NON-
PREFERRED
VERSION
‘IMPOSTER
INDEXATION’ &
‘TOO SIMILAR’
CONTENT
The wrong version
of your URL is
selected and
indexed
Users may pick the wrong version of
the duplicate content and link to that
one. Then signals are dissipated
#pubcon
De-duping, URL Sorting & Scheduling
Original Image -
https://patentimages.storage.googleapis.com/US8666964B1/US08666964-20140304-
D00004.png
https://www.google.com/patents/US8666964
Lots and lots of
patents on crawling
efficiency
#pubcon
Important Pages Are Crawled More Frequently
These pages are important and need to be up to
date. They cannot be returned as stale data
#pubcon
Depth Of Crawl Is Greater In Higher
Quality Sections Of Sites
• Important grandparents and parents
begets ’important’ children and
grandchild URLs
• Higher quality site sections
(descendants) get crawled more
#pubcon
Low Quality Sites Get Crawled Less
Frequently
https://support.google.com/webmasters/answer/35253
They are low importance
#pubcon
Myth – It’s Based Just On PageRank
”There’s a ‘shit-ton’ of other
stuff going on which plays an
important role” (Illyes,
Google)
#pubcon
PageRank Has Become Just One Of Very
Many Things
“WHATEVERYOU ARE THINKING…
WHETHER IT BE ABOUTCRAWLING OR
RANKING… IT (PAGERANK)HAS
BECOME JUSTONE OFVERYMANY
THINGS” (Andrey Lipattsev, Google, 2016)
#pubcon
It’s Mostly Driven By ‘Importance’
“SCHEDULING  IS  MOSTLY  
DRIVEN  BY
IMPORTANCE”  (Illyes,  Google)
IMPORTANCE  MAY  INCLUDE  
PAGERANK  (Patents)  …  BUT  IT  IS  
ONLY  A  PART  OF  IT
RANKING  IS  ALSO  DRIVEN  BY  
IMPORTANCE  (IN  PART)
#pubcon
Page (URL) Importance Is Mahoossively
Important (May Include PageRank)
PAGE IMPORTANCE - The importance of a
page independent of a query
• Location in Site (e.g. home page more important than
parameter 3 level output)
• PageRank
• Page type / file type
• Internal PageRank
• Internal Backlinks
• In-site Anchor Text Consistency
• Relevance (content, anchors and elements) to a topic
(Similarity Importance)
• Directives from in-page robot and robots.txt management
• Parent quality brushes off on child page quality
• Inclusion in XML sitemaps and the index
IMPORTANT PARENTS LIKELY SEEN TO HAVE
IMPORTANT CHILD PAGES
Several Google Patents
#pubcon
But…Importance Signs From Whom?
3 Types Of ‘Importance Signal Sender’?
SEARCHERS WEBMASTERS LINKERATILooking for
results, creating
queries,
triggering
impressions,
demanding
freshness
Hreflang,
Canonicalization,
Internal links, Sitemap
and index inclusion,
Information
Architecture,Anchors,
Building content at a
URL on a topic
Passing PageRank
AND WHY IS ‘IMPORTANCE’ SO
IMPORTANT?
#pubcon
Concept Of Search Engine
Embarrassment
A concept mostly originally
attributed to Joel Wolf
#pubcon
Search Engine Embarrassment
Credit: Joel Wolf Et Al GOODNESS & BADNESS IN SEARCH
ENGINE EMBARRASSMENT
Concept of using probability
estimates to revisit web
pages ‘just in time’ and
based around limiting
‘likelihood of stale pages
being exposed’ to searchers
#pubcon
Search Engine Embarrassment
Probability(Seen_Stale_Data)=Function
(User_View_Rate,Document_Update_R
ate,Web_Crawl_Interval).
#pubcon
Search Engine Embarrassment
User_View_Rate – Likelihood of the document being seen
+
Document_Update_rate – How often it has material changes
+
Web_Crawl_Interval – How often is it currently crawled
COMBINED TO CALCULATE
Probability(Seen_Stale_Data) = Risk of Search Engine Embarrassment?
‘JUST IN TIME SMART CRAWLING’
#pubcon
THEORY - Search Engine Embarrassment
Joel Wolf’s ‘Optimal Crawl
Strategies’ (Search Engine
Embarrassment) Paper is Cited
in this Google Patent
#pubcon
Triggering More ’Real Searcher Impressions’
A SMALL TEST
THE PAGES
BECAME
ARGUABLY
MORE
IMPORTANT
CRAWLING
IMPROVED
RANKING IMPROVED
TRAFFIC IMPROVED
#pubcon
Myth – Don’t We Just Have To Make Random
Changes To Get Crawled More?
NOT ALL CHANGE IS
CREATED EQUAL
#pubcon
WHAT Changed? Was it important?
https://www.seroundtable.com/google-crawl-
frequency-ranking-21153.html
HINTS &
C = ∑ i = 0 n - 1
weight i *
feature
CRITICAL MATERIAL
CHANGE
#pubcon
Randomization & Lying About ‘Change’
To Googlebot Won’t Help
• NOT ALL CHANGE IS IMPORTANT ENOUGH TO BE RECRAWLED
• DO NOT TRY TO MANIPULATE ‘CHANGE’
• You can’t get more crawl just by changing your pages alone &
you may actually be doing your site harm
• WHY – Because… ‘hints’ & ’thresholds’ designed to pick up on
this
• If every URL changes header response will always be modified
since (current date)
• Randomization and shuffling could be preventing Googlebot from
crawling the important pages
• Last-modified is taken into consideration, IF it is correct
• Priority == ignored so don’t make it up
• Change frequency == ignored so don’t make it up
’IMPORTANCE’
BEATS ‘CHANGE’
#pubcon
‘Crawl Rank’ – Causation or Correlation?
• By getting your URL crawled more frequently do
they automatically rank higher?
• “A lot of people confuse crawling with ranking”
(John Mu)
• Crawl Rank - It seems this is more correlation
than causation
• You got your URLs crawled more by making
them more important (e.g. via internal linking
strategies), canonicalization, hreflang, merging
and improving thin content, etc, updating with
fresh and rich content to a topic… and
subsequently ranked higher
“Often times, it is kind of a
relationship that, when we think
something is important we tend to
crawl it more frequently and that
might be more visible in search”
John Mueller, Google
#pubcon
The Four Main Types Of
Cannibalisation– Slideshare
@jonearnshaw
http://www.slideshare.net/jonat
hanearnshaw/seo-46813620
Consistently Avoiding Importance Cannibalisation
You must be consistently
clear in emphasising the
‘importance’ of the right
version of your ‘special
ones’ (your key most
important URLs).
#pubcon
Consistently avoiding ‘Mixed Signals’ & Skewed
URL Importance
GOOGLE CAN GET
CONFUSED AS TO WHICH
PAGE IT SHOULD RANK
FROM YOUR SITE FOR
KEY TERMS – BE CLEAR
ON TARGETS
#pubcon
Consistency - Avoiding ‘importance
dissipation’ from generational cruft
Consider keeping the
same URL for annual
events and optimise
the content for
current year
“Choose a URL
structure that can
stand the test of
time” (John Mu,
Google)
#pubcon
Cool URIs (And URLs) Don’t Change
• The iterative drip, drip, drip of Importance
• Nurture & mature (grow) importance
• Consistent importance signals ongoing
• Think URL as well as URI
“…many, many things can change and your URIs
can and should stay the same” (Sir Tim Berners-
Lee)
COOL URIs DON’T CHANGE
https://www.w3.org/Provider/Style/URI
“allocate URIs which
you will be able to
stand by in 2 years, in
20 years, in 200
years” (Sir Tim-
Berners Lee)
IMPORTANCE VIA
CONSISTENCY
#pubcon
“all over the Web, webmasters are
making decisions which will make
it really difficult for themselves in
the future” (Sir Tim Berners-Lee)
Don’t Let That Be You
#pubcon
THANK	
  YOU
TWITTER - @dawnieando
GOOGLE+ -+DawnAnderson888
LINKEDIN – msdawnanderson
www.move-it-marketing.co.uk
#pubcon
Importance Via Internal Links
Most Important Page 1
Most	
  Important	
  Page	
  2
Most	
  Important	
  Page	
  3
IS THIS
YOUR BLOG??
HOPE NOT
https://support.google.com/webmasters/answer/
138752?hl=en
#pubcon
Descending Importance Clues Via Internal
Links (Breadcrumbs)
SINGLE
TEXT OUTPUT ONLY
BREADCRUMB
FEWER
FEWER
MOST
Image credit:
https://www.smashingmagazine.com/2009/03/breadcrumbs-in-web-
design-examples-and-best-practices/
Home
Category
Sub
Product
#pubcon
YES? … YOU’RE IN
NO? … YOU’RE OUT
(sitemaps and index)
Importance By Inclusion (& Unimportance via
Exclusion
#pubcon
Importance Via Consistently Indicating ‘Correct Version’
of Duplicates
• Canonicalisation
• Choose one https / http / nonwww / www version and 301 redirect the others
• Eliminate ‘too similar’URLs
• Consistency of internal link targets (right site version, right target for
keywords / topics / topic intent / user intent)
• Right version inclusionin XML sitemaps
• Re-optimization/ unpicking of 30X redirect chains internallyand externally
• Review of internal links in GSC for ‘skew’
• Review of existingcontent to improve on topic for ‘importance’
• Save / nurture the URL (thinkfor the long term in URL planning)
• Breadcrumbs
• Minimize boiler plate content
• Minimize regurgitatedcontent in various parts of your site
#pubcon
SOURCES
• Scheduler For Search Engine Crawler -http://www.google.ch/patents/US20120317089
• We Knew The Web Was Big - https://googleblog.blogspot.co.uk/2008/07/we-knew-
web-was-big.html
• https://www.youtube.com/watch?v=GVKcMU7YNOQ
• http://webpromo.expert/google-qa-duplicate-content/
#pubcon
SOURCES
• http://webpromo.expert/google-qa-crawlingrendering/
• https://twitter.com/dergal/status/777782401497980928
• Cool URIs Don’t Change -https://www.w3.org/Provider/Style/URI
• https://searchenginewatch.com/2016/04/06/webpromos-qa-with-googles-andrey-
lipattsev-transcript/
• https://www.youtube.com/watch?v=Wcnz1kCoiks
• https://www.youtube.com/watch?v=MryA3F0ySew
• ‘Optimal Crawling Strategies For Web Search Engines’ -
http://dl.acm.org/citation.cfm?id=511465

Weitere ähnliche Inhalte

Was ist angesagt?

[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages
[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages
[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing PagesAreej AbuAli
 
Beth Barnham Schema Auditing BrightonSEO Slides.pptx
Beth Barnham Schema Auditing BrightonSEO Slides.pptxBeth Barnham Schema Auditing BrightonSEO Slides.pptx
Beth Barnham Schema Auditing BrightonSEO Slides.pptxBethBarnham1
 
Search Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEO
Search Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEOSearch Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEO
Search Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEOKoray Tugberk GUBUR
 
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance FrameworkGoodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance FrameworkAleyda SolĂ­s
 
How to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With PythonHow to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With Pythonsearchsolved
 
Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022Andrew Charlton
 
Data-driven SEO & content strategy to reduce your customer acquisition costs
Data-driven SEO & content strategy to reduce your customer acquisition costsData-driven SEO & content strategy to reduce your customer acquisition costs
Data-driven SEO & content strategy to reduce your customer acquisition costsadlift
 
The Quickest Win in SEO – How to do Internal Linking the Right Way
The Quickest Win in SEO – How to do Internal Linking the Right WayThe Quickest Win in SEO – How to do Internal Linking the Right Way
The Quickest Win in SEO – How to do Internal Linking the Right WayMartin Hayman
 
7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon
7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon
7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOConAleyda SolĂ­s
 
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...Ahrefs
 
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdfCore Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdfSophie Gibson
 
How to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtual
How to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtualHow to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtual
How to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtualAleyda SolĂ­s
 
[BrightonSEO 2019] Restructuring Websites to Improve Indexability
[BrightonSEO 2019] Restructuring Websites to Improve Indexability[BrightonSEO 2019] Restructuring Websites to Improve Indexability
[BrightonSEO 2019] Restructuring Websites to Improve IndexabilityAreej AbuAli
 
How to leverage indexation tracking to monitor issues and improve performance
How to leverage indexation tracking to monitor issues and improve performanceHow to leverage indexation tracking to monitor issues and improve performance
How to leverage indexation tracking to monitor issues and improve performanceSimon Lesser
 
Agrupa y vencerĂĄs - SEO tĂŠcnico
Agrupa y vencerĂĄs - SEO tĂŠcnicoAgrupa y vencerĂĄs - SEO tĂŠcnico
Agrupa y vencerĂĄs - SEO tĂŠcnicoLino UruĂąuela
 
How to get your SEO work prioritised in house - Maddie McCartney.pptx
How to get your SEO work prioritised in house - Maddie McCartney.pptxHow to get your SEO work prioritised in house - Maddie McCartney.pptx
How to get your SEO work prioritised in house - Maddie McCartney.pptxMaddie McCartney
 
Quality Content at Scale Through Automated Text Summarization of UGC
Quality Content at Scale Through Automated Text Summarization of UGCQuality Content at Scale Through Automated Text Summarization of UGC
Quality Content at Scale Through Automated Text Summarization of UGCHamlet Batista
 
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Koray Tugberk GUBUR
 
[TurnDigi 2020] Getting Tech SEO Implemented
[TurnDigi 2020] Getting Tech SEO Implemented[TurnDigi 2020] Getting Tech SEO Implemented
[TurnDigi 2020] Getting Tech SEO ImplementedAreej AbuAli
 
How Search Works
How Search WorksHow Search Works
How Search WorksAhrefs
 

Was ist angesagt? (20)

[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages
[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages
[BrightonSEO 2022] Unlocking the Hidden Potential of Product Listing Pages
 
Beth Barnham Schema Auditing BrightonSEO Slides.pptx
Beth Barnham Schema Auditing BrightonSEO Slides.pptxBeth Barnham Schema Auditing BrightonSEO Slides.pptx
Beth Barnham Schema Auditing BrightonSEO Slides.pptx
 
Search Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEO
Search Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEOSearch Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEO
Search Query Processing: The Secret Life of Queries, Parsing, Rewriting & SEO
 
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance FrameworkGoodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
 
How to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With PythonHow to Automatically Subcategorise Your Website Automatically With Python
How to Automatically Subcategorise Your Website Automatically With Python
 
Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022
 
Data-driven SEO & content strategy to reduce your customer acquisition costs
Data-driven SEO & content strategy to reduce your customer acquisition costsData-driven SEO & content strategy to reduce your customer acquisition costs
Data-driven SEO & content strategy to reduce your customer acquisition costs
 
The Quickest Win in SEO – How to do Internal Linking the Right Way
The Quickest Win in SEO – How to do Internal Linking the Right WayThe Quickest Win in SEO – How to do Internal Linking the Right Way
The Quickest Win in SEO – How to do Internal Linking the Right Way
 
7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon
7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon
7 E-Commerce SEO Mistakes & How to Fix Them #DeepSEOCon
 
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
 
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdfCore Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
 
How to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtual
How to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtualHow to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtual
How to Build your Own SEO Workbook in Google Sheets for Free #WTSVirtual
 
[BrightonSEO 2019] Restructuring Websites to Improve Indexability
[BrightonSEO 2019] Restructuring Websites to Improve Indexability[BrightonSEO 2019] Restructuring Websites to Improve Indexability
[BrightonSEO 2019] Restructuring Websites to Improve Indexability
 
How to leverage indexation tracking to monitor issues and improve performance
How to leverage indexation tracking to monitor issues and improve performanceHow to leverage indexation tracking to monitor issues and improve performance
How to leverage indexation tracking to monitor issues and improve performance
 
Agrupa y vencerĂĄs - SEO tĂŠcnico
Agrupa y vencerĂĄs - SEO tĂŠcnicoAgrupa y vencerĂĄs - SEO tĂŠcnico
Agrupa y vencerĂĄs - SEO tĂŠcnico
 
How to get your SEO work prioritised in house - Maddie McCartney.pptx
How to get your SEO work prioritised in house - Maddie McCartney.pptxHow to get your SEO work prioritised in house - Maddie McCartney.pptx
How to get your SEO work prioritised in house - Maddie McCartney.pptx
 
Quality Content at Scale Through Automated Text Summarization of UGC
Quality Content at Scale Through Automated Text Summarization of UGCQuality Content at Scale Through Automated Text Summarization of UGC
Quality Content at Scale Through Automated Text Summarization of UGC
 
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
 
[TurnDigi 2020] Getting Tech SEO Implemented
[TurnDigi 2020] Getting Tech SEO Implemented[TurnDigi 2020] Getting Tech SEO Implemented
[TurnDigi 2020] Getting Tech SEO Implemented
 
How Search Works
How Search WorksHow Search Works
How Search Works
 

Andere mochten auch

How to use fumbaro wall paper site powered by Plone
How to use fumbaro wall paper site powered by PloneHow to use fumbaro wall paper site powered by Plone
How to use fumbaro wall paper site powered by PloneTakanori Suzuki
 
SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...
SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...
SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...Dawn Anderson MSc DigM
 
AMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And BeyondAMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And BeyondDawn Anderson MSc DigM
 
SEO - Stop Eating Your Words - Avoid Cannibalisation Of Your Sites
SEO - Stop Eating Your Words - Avoid Cannibalisation Of Your SitesSEO - Stop Eating Your Words - Avoid Cannibalisation Of Your Sites
SEO - Stop Eating Your Words - Avoid Cannibalisation Of Your SitesDawn Anderson MSc DigM
 
Crawl Budget Optimization - SMX MĂźnchen 2016
Crawl Budget Optimization - SMX MĂźnchen 2016Crawl Budget Optimization - SMX MĂźnchen 2016
Crawl Budget Optimization - SMX MĂźnchen 2016Bastian Grimm
 
Negotiating crawl budget with googlebots
Negotiating crawl budget with googlebotsNegotiating crawl budget with googlebots
Negotiating crawl budget with googlebotsDawn Anderson MSc DigM
 
Digitized Student Development, Social Media, and Identity
Digitized Student Development, Social Media, and IdentityDigitized Student Development, Social Media, and Identity
Digitized Student Development, Social Media, and IdentityPaul Brown
 
SEOs as Whole Brain T Shaped Marketers
SEOs as Whole Brain T Shaped MarketersSEOs as Whole Brain T Shaped Marketers
SEOs as Whole Brain T Shaped MarketersDawn Anderson MSc DigM
 
Xmersion 5 - TEDx Kalamazoo Talk (2015)
Xmersion 5 - TEDx Kalamazoo Talk (2015)Xmersion 5 - TEDx Kalamazoo Talk (2015)
Xmersion 5 - TEDx Kalamazoo Talk (2015)John Mueller
 
URL Design with Lasso
URL Design with LassoURL Design with Lasso
URL Design with Lassomacsolve
 
El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?
El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?
El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?MarĂ­a Clara Ruiz MartĂ­nez
 
11 Of The Oddest Pets You Might Want To Look After
11 Of The Oddest Pets You Might Want To Look After11 Of The Oddest Pets You Might Want To Look After
11 Of The Oddest Pets You Might Want To Look AfterJane Clarke
 
Erik Proposal Final
Erik Proposal FinalErik Proposal Final
Erik Proposal FinalErik Messier
 
Vision 2030: Gauteng Provincial Fire & Rescue Services - RG Hendricks
Vision 2030: Gauteng Provincial Fire & Rescue Services - RG HendricksVision 2030: Gauteng Provincial Fire & Rescue Services - RG Hendricks
Vision 2030: Gauteng Provincial Fire & Rescue Services - RG HendricksRoland2015
 
Tablas de contenidos
Tablas de contenidosTablas de contenidos
Tablas de contenidosximenahdz
 
2015 Land Markets Survey | REALTORS Land Institute & NAR
2015 Land Markets Survey | REALTORS Land Institute & NAR2015 Land Markets Survey | REALTORS Land Institute & NAR
2015 Land Markets Survey | REALTORS Land Institute & NARREALTORSÂŽ Land Institute
 
Lodgement Order dated 28.01.2017 of Registrar Supreme Court of India
Lodgement Order dated 28.01.2017 of  Registrar Supreme Court of IndiaLodgement Order dated 28.01.2017 of  Registrar Supreme Court of India
Lodgement Order dated 28.01.2017 of Registrar Supreme Court of IndiaOm Prakash Poddar
 
Current challenges in web crawling
Current challenges in web crawlingCurrent challenges in web crawling
Current challenges in web crawlingDenis Shestakov
 

Andere mochten auch (20)

How to use fumbaro wall paper site powered by Plone
How to use fumbaro wall paper site powered by PloneHow to use fumbaro wall paper site powered by Plone
How to use fumbaro wall paper site powered by Plone
 
SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...
SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...
SEO Make Micro-Moments and Wordpress Work For User Journey Mapping With Conte...
 
AMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And BeyondAMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And Beyond
 
SEO - Stop Eating Your Words - Avoid Cannibalisation Of Your Sites
SEO - Stop Eating Your Words - Avoid Cannibalisation Of Your SitesSEO - Stop Eating Your Words - Avoid Cannibalisation Of Your Sites
SEO - Stop Eating Your Words - Avoid Cannibalisation Of Your Sites
 
Crawl Budget Optimization - SMX MĂźnchen 2016
Crawl Budget Optimization - SMX MĂźnchen 2016Crawl Budget Optimization - SMX MĂźnchen 2016
Crawl Budget Optimization - SMX MĂźnchen 2016
 
Negotiating crawl budget with googlebots
Negotiating crawl budget with googlebotsNegotiating crawl budget with googlebots
Negotiating crawl budget with googlebots
 
Digitized Student Development, Social Media, and Identity
Digitized Student Development, Social Media, and IdentityDigitized Student Development, Social Media, and Identity
Digitized Student Development, Social Media, and Identity
 
SEOs as Whole Brain T Shaped Marketers
SEOs as Whole Brain T Shaped MarketersSEOs as Whole Brain T Shaped Marketers
SEOs as Whole Brain T Shaped Marketers
 
Xmersion 5 - TEDx Kalamazoo Talk (2015)
Xmersion 5 - TEDx Kalamazoo Talk (2015)Xmersion 5 - TEDx Kalamazoo Talk (2015)
Xmersion 5 - TEDx Kalamazoo Talk (2015)
 
URL Design with Lasso
URL Design with LassoURL Design with Lasso
URL Design with Lasso
 
El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?
El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?
El sindrome del Impostor - ÂżIncapacidad o Pobre Imagen de sĂ­ mismo/a?
 
11 Of The Oddest Pets You Might Want To Look After
11 Of The Oddest Pets You Might Want To Look After11 Of The Oddest Pets You Might Want To Look After
11 Of The Oddest Pets You Might Want To Look After
 
Erik Proposal Final
Erik Proposal FinalErik Proposal Final
Erik Proposal Final
 
2014 Land Markets Survey
2014 Land Markets Survey2014 Land Markets Survey
2014 Land Markets Survey
 
Vision 2030: Gauteng Provincial Fire & Rescue Services - RG Hendricks
Vision 2030: Gauteng Provincial Fire & Rescue Services - RG HendricksVision 2030: Gauteng Provincial Fire & Rescue Services - RG Hendricks
Vision 2030: Gauteng Provincial Fire & Rescue Services - RG Hendricks
 
Tablas de contenidos
Tablas de contenidosTablas de contenidos
Tablas de contenidos
 
2015 Land Markets Survey | REALTORS Land Institute & NAR
2015 Land Markets Survey | REALTORS Land Institute & NAR2015 Land Markets Survey | REALTORS Land Institute & NAR
2015 Land Markets Survey | REALTORS Land Institute & NAR
 
reshma resume
reshma resumereshma resume
reshma resume
 
Lodgement Order dated 28.01.2017 of Registrar Supreme Court of India
Lodgement Order dated 28.01.2017 of  Registrar Supreme Court of IndiaLodgement Order dated 28.01.2017 of  Registrar Supreme Court of India
Lodgement Order dated 28.01.2017 of Registrar Supreme Court of India
 
Current challenges in web crawling
Current challenges in web crawlingCurrent challenges in web crawling
Current challenges in web crawling
 

Ähnlich wie Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of URL Importance Optimization Pubcon Vegas 2016

Pubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn andersonPubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn andersonDawn Anderson MSc DigM
 
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick StoxA Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stoxpatrickstox
 
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick StoxPubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stoxpatrickstox
 
Sunday Business Post SEO Masterclass - John RIng
Sunday Business Post SEO Masterclass �- John RIngSunday Business Post SEO Masterclass �- John RIng
Sunday Business Post SEO Masterclass - John RIngTinderPoint
 
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...Marty Hayes
 
Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Nate Plaunt
 
Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015Jan Hendrik Merlin Jacob
 
Demand Quest SEO training session 2
Demand Quest SEO training session 2Demand Quest SEO training session 2
Demand Quest SEO training session 2Nate Plaunt
 
Sales Funnel & Content Marketing Audits
Sales Funnel & Content Marketing Audits Sales Funnel & Content Marketing Audits
Sales Funnel & Content Marketing Audits Rhea Drysdale
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithmJaved Khan
 
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUKeeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUJason Mun
 
SEO Checklists
SEO ChecklistsSEO Checklists
SEO ChecklistsJon Payne
 
SEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive GuideSEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive GuideAdam Audette
 
Analyzing search engine results pages(SERPs) All over the worlds
Analyzing search engine results pages(SERPs) All over the worldsAnalyzing search engine results pages(SERPs) All over the worlds
Analyzing search engine results pages(SERPs) All over the worldsAnil Sah
 
How to do a SEO Site Audit
How to do a SEO Site AuditHow to do a SEO Site Audit
How to do a SEO Site AuditKathy Alice Brown
 
From Pandalized to Panda Loved
From Pandalized to Panda LovedFrom Pandalized to Panda Loved
From Pandalized to Panda Lovedpointit
 
SEO Predictions for 2013 & Beyond
SEO Predictions for 2013 & Beyond SEO Predictions for 2013 & Beyond
SEO Predictions for 2013 & Beyond sbedrick
 

Ähnlich wie Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of URL Importance Optimization Pubcon Vegas 2016 (20)

Pubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn andersonPubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn anderson
 
Intro to Google, SEO, and You in 2017
Intro to Google, SEO, and You in 2017Intro to Google, SEO, and You in 2017
Intro to Google, SEO, and You in 2017
 
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick StoxA Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
 
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick StoxPubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
 
Sunday Business Post SEO Masterclass - John RIng
Sunday Business Post SEO Masterclass �- John RIngSunday Business Post SEO Masterclass �- John RIng
Sunday Business Post SEO Masterclass - John RIng
 
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
 
Seo Made Easy
Seo Made EasySeo Made Easy
Seo Made Easy
 
Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2
 
Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015
 
Demand Quest SEO training session 2
Demand Quest SEO training session 2Demand Quest SEO training session 2
Demand Quest SEO training session 2
 
Sales Funnel & Content Marketing Audits
Sales Funnel & Content Marketing Audits Sales Funnel & Content Marketing Audits
Sales Funnel & Content Marketing Audits
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUKeeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
 
SEO Checklists
SEO ChecklistsSEO Checklists
SEO Checklists
 
SEO 2015
SEO 2015SEO 2015
SEO 2015
 
SEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive GuideSEO for Ecommerce: A Comprehensive Guide
SEO for Ecommerce: A Comprehensive Guide
 
Analyzing search engine results pages(SERPs) All over the worlds
Analyzing search engine results pages(SERPs) All over the worldsAnalyzing search engine results pages(SERPs) All over the worlds
Analyzing search engine results pages(SERPs) All over the worlds
 
How to do a SEO Site Audit
How to do a SEO Site AuditHow to do a SEO Site Audit
How to do a SEO Site Audit
 
From Pandalized to Panda Loved
From Pandalized to Panda LovedFrom Pandalized to Panda Loved
From Pandalized to Panda Loved
 
SEO Predictions for 2013 & Beyond
SEO Predictions for 2013 & Beyond SEO Predictions for 2013 & Beyond
SEO Predictions for 2013 & Beyond
 

Mehr von Dawn Anderson MSc DigM

Human vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdfHuman vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdfDawn Anderson MSc DigM
 
Life of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic UpdatesLife of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic UpdatesDawn Anderson MSc DigM
 
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...Dawn Anderson MSc DigM
 
Passage indexing is likely more important than you think
Passage indexing is likely more important than you thinkPassage indexing is likely more important than you think
Passage indexing is likely more important than you thinkDawn Anderson MSc DigM
 
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...Dawn Anderson MSc DigM
 
Google BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual ConferenceGoogle BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual ConferenceDawn Anderson MSc DigM
 
Google BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to KnowGoogle BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to KnowDawn Anderson MSc DigM
 
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender SearchDawn Anderson MSc DigM
 
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...Dawn Anderson MSc DigM
 
Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019Dawn Anderson MSc DigM
 
Google BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard RaceGoogle BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard RaceDawn Anderson MSc DigM
 
The User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive SearchThe User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive SearchDawn Anderson MSc DigM
 
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Dawn Anderson MSc DigM
 
Using topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic searchUsing topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic searchDawn Anderson MSc DigM
 
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...Dawn Anderson MSc DigM
 
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...Dawn Anderson MSc DigM
 
SEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm ShiftSEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm ShiftDawn Anderson MSc DigM
 
Voice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEOVoice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEODawn Anderson MSc DigM
 

Mehr von Dawn Anderson MSc DigM (20)

Human vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdfHuman vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdf
 
Life of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic UpdatesLife of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
 
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
 
Passage indexing is likely more important than you think
Passage indexing is likely more important than you thinkPassage indexing is likely more important than you think
Passage indexing is likely more important than you think
 
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
 
Google BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual ConferenceGoogle BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual Conference
 
Google BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to KnowGoogle BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to Know
 
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
 
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
 
Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019
 
Google BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard RaceGoogle BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard Race
 
The User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive SearchThe User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive Search
 
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
 
Using topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic searchUsing topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic search
 
SEO in a Mobile First World
SEO in a Mobile First WorldSEO in a Mobile First World
SEO in a Mobile First World
 
Modern Ecommerce SEO
Modern Ecommerce SEOModern Ecommerce SEO
Modern Ecommerce SEO
 
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
 
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
 
SEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm ShiftSEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm Shift
 
Voice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEOVoice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEO
 

KĂźrzlich hochgeladen

Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...CIO Business World
 
Michael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysisMichael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysisjunaid794917
 
Jai Institute for Parenting Program Guide
Jai Institute for Parenting Program GuideJai Institute for Parenting Program Guide
Jai Institute for Parenting Program Guidekiva6
 
2024's Top PPC Tactics: Triple Your Google Ads Local Leads
2024's Top PPC Tactics: Triple Your Google Ads Local Leads2024's Top PPC Tactics: Triple Your Google Ads Local Leads
2024's Top PPC Tactics: Triple Your Google Ads Local LeadsSearch Engine Journal
 
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon GarsideInbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garsiderobwhite630290
 
5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software SolutionsDevherds Software Solutions
 
Storyboards for my Final Major Project Video
Storyboards for my Final Major Project VideoStoryboards for my Final Major Project Video
Storyboards for my Final Major Project VideoSineadBidwell
 
Fiverr's Product Marketing Interview Assignment
Fiverr's Product Marketing Interview AssignmentFiverr's Product Marketing Interview Assignment
Fiverr's Product Marketing Interview AssignmentFarrel Brest
 
The Pitfalls of Keyword Stuffing in SEO Copywriting
The Pitfalls of Keyword Stuffing in SEO CopywritingThe Pitfalls of Keyword Stuffing in SEO Copywriting
The Pitfalls of Keyword Stuffing in SEO CopywritingJuan Pineda
 
The power of SEO-driven market intelligence
The power of SEO-driven market intelligenceThe power of SEO-driven market intelligence
The power of SEO-driven market intelligenceHinde Lamrani
 
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDemandbase
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityMonishka Adhikari
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Exploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdfExploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdfadult marketing
 
marketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdfmarketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdfarsathsahil
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentationdgtivemarketingagenc
 
Call Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCR
Call Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCRCall Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCR
Call Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdfSnapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdfEastern Online-iSURVEY
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOSzymon Słowik
 
VIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceVIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceSapana Sha
 

KĂźrzlich hochgeladen (20)

Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
 
Michael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysisMichael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysis
 
Jai Institute for Parenting Program Guide
Jai Institute for Parenting Program GuideJai Institute for Parenting Program Guide
Jai Institute for Parenting Program Guide
 
2024's Top PPC Tactics: Triple Your Google Ads Local Leads
2024's Top PPC Tactics: Triple Your Google Ads Local Leads2024's Top PPC Tactics: Triple Your Google Ads Local Leads
2024's Top PPC Tactics: Triple Your Google Ads Local Leads
 
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon GarsideInbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
 
5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions
 
Storyboards for my Final Major Project Video
Storyboards for my Final Major Project VideoStoryboards for my Final Major Project Video
Storyboards for my Final Major Project Video
 
Fiverr's Product Marketing Interview Assignment
Fiverr's Product Marketing Interview AssignmentFiverr's Product Marketing Interview Assignment
Fiverr's Product Marketing Interview Assignment
 
The Pitfalls of Keyword Stuffing in SEO Copywriting
The Pitfalls of Keyword Stuffing in SEO CopywritingThe Pitfalls of Keyword Stuffing in SEO Copywriting
The Pitfalls of Keyword Stuffing in SEO Copywriting
 
The power of SEO-driven market intelligence
The power of SEO-driven market intelligenceThe power of SEO-driven market intelligence
The power of SEO-driven market intelligence
 
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic Creativity
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Exploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdfExploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdf
 
marketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdfmarketing strategy of tanishq word PPROJECT.pdf
marketing strategy of tanishq word PPROJECT.pdf
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentation
 
Call Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCR
Call Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCRCall Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCR
Call Girls In Aerocity Delhi ❤️8860477959 Good Looking Escorts In 24/7 Delhi NCR
 
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdfSnapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEO
 
VIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceVIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts Service
 

Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of URL Importance Optimization Pubcon Vegas 2016

  • 1. #pubcon Presented by: Dawn Anderson @dawnieando ‘Myths, Facts And Theories On Crawl Budget And The Importance Of ‘URL Importance Optimization’’
  • 2. #pubcon Dawn Anderson • Move It Marketing • University Lecturer – Digital Marketing • From Manchester, UK (rains a lot) • International SEO Consultant – 10+ yrs in SEO • Pomeranian pooch lover - Bert • Fascinated by crawling (practice & academia) • Doesn’t fare well in YouTube screen grabs ;P • Party trick: Remembering UK postcode areas (US Zip code equivalent) • Search Awards Judge • Twitter chatterer @dawnieando
  • 3. #pubcon Defining Crawl Budget ‘Host Load’ = What can you handle? + ‘URL Scheduling’ = What is important to crawl & how often?
  • 5. #pubcon Myth – It’s All About Just My Site, Right? • NO – HOST LOAD is apportioned at an IP level and shared amongst the sites there (Host load)
  • 6. #pubcon Host Load - When Will This Matter? • It’s more about server capacity than SEO TBH • Your site is massive (similar in size e.g. to ’Amazon’) • Your site is massive and you’re on a shared hosting • You’re using a CDN and your site is massive • You have lots of large subdomains sharing space • Crawlable test or staging sites • You have ‘infinite loops’ and ‘spider traps’ • You keep throwing server errors during crawling ‘Average’ sites don’t normally hit the payload (‘host load’)
  • 7. #pubcon Myth - Google Search Console Crawl Stats Is Where It’s At Right?
  • 8. #pubcon GSC Crawl Stats Is Not Really Just ‘Web Pages’ • Includes ALL CSS, JS, Zip, XML, PDF, AMP, HTML files crawled • Pages are NOT just single webpages https://support.google.com/webmasters/answer/3 5253 Not just ‘web pages
  • 9. #pubcon Visits By ALL The 10 Types Of Googlebots Are Recorded Together In GSC Web Image News Video Feature Phone Smartphone Mobile Adsense Adsense Adsbot App Crawler ALL The Googlebot Family
  • 10. #pubcon It Also Includes All 200 And 30X Responses • That massive crawl you thought you just got on new pages or existing pages 200 Oks could also be many, many 30X redirections • Especially when using * wildcard redirections on large sites • NO 400, 500, robotted or unreachables are recorded here https://support.google.com/webmasters/answer/3 5253
  • 11. #pubcon GSC Doesn’t Even Show You WHAT URLs Have Been Crawled & When It will likely just a few URLs being crawled very often, some very rarely and most others somewhere in between – YOU NEED TO KNOW
  • 12. #pubcon REALITY – Server Logs & Log Analysis Is Where It’s At AUTOMATE SERVER LOG RETRIEVAL VIA CRON JOB grep Googlebot access_log >googlebot_access.txt
  • 13. #pubcon Use Tools Or Just Export, Convert Data & Use Mr Mu’s Spreadsheet Spreadsheet - https://goo.gl/1pToL8
  • 14. #pubcon For The Avoidance Of Doubt – I Asked To Be Sure
  • 15. #pubcon Why Does This Matter? On A Large Site You Need To Be Able To See Through ‘Spider Eyes’ You need to see what Googlebot ‘REALLY’ thinks of your site
  • 16. #pubcon Myth – It’s The No Of ‘Pages’ Crawled In GSC Crawl Stats Divided By Days For all of the reasons in the previous 7+ slides
  • 17. #pubcon Myth – Googlebot Crawls Through Your Website From One End To The Other Then Starts Again • This is where it gets complicated • Web crawl efficiency is key • There is an order to things • Minimizing visibility of existing stale content is key too – the rest of the web is changing • Fresh results are vital to searchers
  • 18. #pubcon “What I Think You Are Talking About Is Scheduling” (Illyes, Google) Remember that time when Mr Mu kicked Andrey under the table? (joking JJ)
  • 19. #pubcon Why Web Crawling Efficiency? “WE ARE ALL PUBLISHERS” THE NUMBER OF WEBSITES DOUBLED IN SIZE BETWEEN 2011 AND 2012 AND AGAIN BY 1/3 IN 2014 The Content ‘Explosion’
  • 20. #pubcon “We don't index every one of those trillion pages -- many of them are similar to each other” (J Alpert, Google) “There’s a needle in here somewhere” “It’s an important needle too” If only we could identify it “So how many unique pages does the web really contain? We don't know; we don't have time to look at them all!” (J Alpert, Google)
  • 21. #pubcon The Duplicate Content ‘Penalty’ Myth • ‘Real’ duplicates (matching content checksum) filtered and not indexed “Each content filter sends the retrieved web pages to Dupserver to determine if they are duplicates of other web pages” http://www.google.ch/patents/US20120317089
  • 22. #pubcon Duplication & ’The Battle To Be The Single URL / Content Fingerprint’ URL / CONTENT FINGERPRINT REDIRECT YOU HAVE THE POWER TO CHOOSE ‘THE ONE’ CANONICALIZATION, HREFLANG, CONSISTENT SIGNALS INTERNALLY
  • 23. #pubcon NON- PREFERRED VERSION ‘IMPOSTER INDEXATION’ & ‘TOO SIMILAR’ CONTENT The wrong version of your URL is selected and indexed Users may pick the wrong version of the duplicate content and link to that one. Then signals are dissipated
  • 24. #pubcon De-duping, URL Sorting & Scheduling Original Image - https://patentimages.storage.googleapis.com/US8666964B1/US08666964-20140304- D00004.png https://www.google.com/patents/US8666964 Lots and lots of patents on crawling efficiency
  • 25. #pubcon Important Pages Are Crawled More Frequently These pages are important and need to be up to date. They cannot be returned as stale data
  • 26. #pubcon Depth Of Crawl Is Greater In Higher Quality Sections Of Sites • Important grandparents and parents begets ’important’ children and grandchild URLs • Higher quality site sections (descendants) get crawled more
  • 27. #pubcon Low Quality Sites Get Crawled Less Frequently https://support.google.com/webmasters/answer/35253 They are low importance
  • 28. #pubcon Myth – It’s Based Just On PageRank ”There’s a ‘shit-ton’ of other stuff going on which plays an important role” (Illyes, Google)
  • 29. #pubcon PageRank Has Become Just One Of Very Many Things “WHATEVERYOU ARE THINKING… WHETHER IT BE ABOUTCRAWLING OR RANKING… IT (PAGERANK)HAS BECOME JUSTONE OFVERYMANY THINGS” (Andrey Lipattsev, Google, 2016)
  • 30. #pubcon It’s Mostly Driven By ‘Importance’ “SCHEDULING  IS  MOSTLY   DRIVEN  BY IMPORTANCE”  (Illyes,  Google) IMPORTANCE  MAY  INCLUDE   PAGERANK  (Patents)  …  BUT  IT  IS   ONLY  A  PART  OF  IT RANKING  IS  ALSO  DRIVEN  BY   IMPORTANCE  (IN  PART)
  • 31. #pubcon Page (URL) Importance Is Mahoossively Important (May Include PageRank)
  • 32. PAGE IMPORTANCE - The importance of a page independent of a query • Location in Site (e.g. home page more important than parameter 3 level output) • PageRank • Page type / file type • Internal PageRank • Internal Backlinks • In-site Anchor Text Consistency • Relevance (content, anchors and elements) to a topic (Similarity Importance) • Directives from in-page robot and robots.txt management • Parent quality brushes off on child page quality • Inclusion in XML sitemaps and the index IMPORTANT PARENTS LIKELY SEEN TO HAVE IMPORTANT CHILD PAGES Several Google Patents
  • 33. #pubcon But…Importance Signs From Whom? 3 Types Of ‘Importance Signal Sender’? SEARCHERS WEBMASTERS LINKERATILooking for results, creating queries, triggering impressions, demanding freshness Hreflang, Canonicalization, Internal links, Sitemap and index inclusion, Information Architecture,Anchors, Building content at a URL on a topic Passing PageRank AND WHY IS ‘IMPORTANCE’ SO IMPORTANT?
  • 34. #pubcon Concept Of Search Engine Embarrassment A concept mostly originally attributed to Joel Wolf
  • 35. #pubcon Search Engine Embarrassment Credit: Joel Wolf Et Al GOODNESS & BADNESS IN SEARCH ENGINE EMBARRASSMENT Concept of using probability estimates to revisit web pages ‘just in time’ and based around limiting ‘likelihood of stale pages being exposed’ to searchers
  • 37. #pubcon Search Engine Embarrassment User_View_Rate – Likelihood of the document being seen + Document_Update_rate – How often it has material changes + Web_Crawl_Interval – How often is it currently crawled COMBINED TO CALCULATE Probability(Seen_Stale_Data) = Risk of Search Engine Embarrassment? ‘JUST IN TIME SMART CRAWLING’
  • 38. #pubcon THEORY - Search Engine Embarrassment Joel Wolf’s ‘Optimal Crawl Strategies’ (Search Engine Embarrassment) Paper is Cited in this Google Patent
  • 39. #pubcon Triggering More ’Real Searcher Impressions’ A SMALL TEST THE PAGES BECAME ARGUABLY MORE IMPORTANT CRAWLING IMPROVED RANKING IMPROVED TRAFFIC IMPROVED
  • 40. #pubcon Myth – Don’t We Just Have To Make Random Changes To Get Crawled More? NOT ALL CHANGE IS CREATED EQUAL
  • 41. #pubcon WHAT Changed? Was it important? https://www.seroundtable.com/google-crawl- frequency-ranking-21153.html HINTS & C = ∑ i = 0 n - 1 weight i * feature CRITICAL MATERIAL CHANGE
  • 42. #pubcon Randomization & Lying About ‘Change’ To Googlebot Won’t Help • NOT ALL CHANGE IS IMPORTANT ENOUGH TO BE RECRAWLED • DO NOT TRY TO MANIPULATE ‘CHANGE’ • You can’t get more crawl just by changing your pages alone & you may actually be doing your site harm • WHY – Because… ‘hints’ & ’thresholds’ designed to pick up on this • If every URL changes header response will always be modified since (current date) • Randomization and shuffling could be preventing Googlebot from crawling the important pages • Last-modified is taken into consideration, IF it is correct • Priority == ignored so don’t make it up • Change frequency == ignored so don’t make it up ’IMPORTANCE’ BEATS ‘CHANGE’
  • 43. #pubcon ‘Crawl Rank’ – Causation or Correlation? • By getting your URL crawled more frequently do they automatically rank higher? • “A lot of people confuse crawling with ranking” (John Mu) • Crawl Rank - It seems this is more correlation than causation • You got your URLs crawled more by making them more important (e.g. via internal linking strategies), canonicalization, hreflang, merging and improving thin content, etc, updating with fresh and rich content to a topic… and subsequently ranked higher “Often times, it is kind of a relationship that, when we think something is important we tend to crawl it more frequently and that might be more visible in search” John Mueller, Google
  • 44. #pubcon The Four Main Types Of Cannibalisation– Slideshare @jonearnshaw http://www.slideshare.net/jonat hanearnshaw/seo-46813620 Consistently Avoiding Importance Cannibalisation You must be consistently clear in emphasising the ‘importance’ of the right version of your ‘special ones’ (your key most important URLs).
  • 45. #pubcon Consistently avoiding ‘Mixed Signals’ & Skewed URL Importance GOOGLE CAN GET CONFUSED AS TO WHICH PAGE IT SHOULD RANK FROM YOUR SITE FOR KEY TERMS – BE CLEAR ON TARGETS
  • 46. #pubcon Consistency - Avoiding ‘importance dissipation’ from generational cruft Consider keeping the same URL for annual events and optimise the content for current year “Choose a URL structure that can stand the test of time” (John Mu, Google)
  • 47. #pubcon Cool URIs (And URLs) Don’t Change • The iterative drip, drip, drip of Importance • Nurture & mature (grow) importance • Consistent importance signals ongoing • Think URL as well as URI “…many, many things can change and your URIs can and should stay the same” (Sir Tim Berners- Lee) COOL URIs DON’T CHANGE https://www.w3.org/Provider/Style/URI “allocate URIs which you will be able to stand by in 2 years, in 20 years, in 200 years” (Sir Tim- Berners Lee) IMPORTANCE VIA CONSISTENCY
  • 48. #pubcon “all over the Web, webmasters are making decisions which will make it really difficult for themselves in the future” (Sir Tim Berners-Lee) Don’t Let That Be You
  • 49. #pubcon THANK  YOU TWITTER - @dawnieando GOOGLE+ -+DawnAnderson888 LINKEDIN – msdawnanderson www.move-it-marketing.co.uk
  • 50. #pubcon Importance Via Internal Links Most Important Page 1 Most  Important  Page  2 Most  Important  Page  3 IS THIS YOUR BLOG?? HOPE NOT https://support.google.com/webmasters/answer/ 138752?hl=en
  • 51. #pubcon Descending Importance Clues Via Internal Links (Breadcrumbs) SINGLE TEXT OUTPUT ONLY BREADCRUMB FEWER FEWER MOST Image credit: https://www.smashingmagazine.com/2009/03/breadcrumbs-in-web- design-examples-and-best-practices/ Home Category Sub Product
  • 52. #pubcon YES? … YOU’RE IN NO? … YOU’RE OUT (sitemaps and index) Importance By Inclusion (& Unimportance via Exclusion
  • 53. #pubcon Importance Via Consistently Indicating ‘Correct Version’ of Duplicates • Canonicalisation • Choose one https / http / nonwww / www version and 301 redirect the others • Eliminate ‘too similar’URLs • Consistency of internal link targets (right site version, right target for keywords / topics / topic intent / user intent) • Right version inclusionin XML sitemaps • Re-optimization/ unpicking of 30X redirect chains internallyand externally • Review of internal links in GSC for ‘skew’ • Review of existingcontent to improve on topic for ‘importance’ • Save / nurture the URL (thinkfor the long term in URL planning) • Breadcrumbs • Minimize boiler plate content • Minimize regurgitatedcontent in various parts of your site
  • 54. #pubcon SOURCES • Scheduler For Search Engine Crawler -http://www.google.ch/patents/US20120317089 • We Knew The Web Was Big - https://googleblog.blogspot.co.uk/2008/07/we-knew- web-was-big.html • https://www.youtube.com/watch?v=GVKcMU7YNOQ • http://webpromo.expert/google-qa-duplicate-content/
  • 55. #pubcon SOURCES • http://webpromo.expert/google-qa-crawlingrendering/ • https://twitter.com/dergal/status/777782401497980928 • Cool URIs Don’t Change -https://www.w3.org/Provider/Style/URI • https://searchenginewatch.com/2016/04/06/webpromos-qa-with-googles-andrey- lipattsev-transcript/ • https://www.youtube.com/watch?v=Wcnz1kCoiks • https://www.youtube.com/watch?v=MryA3F0ySew • ‘Optimal Crawling Strategies For Web Search Engines’ - http://dl.acm.org/citation.cfm?id=511465