17. /product/
When folders are not
constant /city/,
segment by number
/product/region/
of slashes (/) using
regex to see the full
/product/region/city/
picture.
18. It turns out that it was the
/region/ city pages.
23. Data Gathered Tool(s) used Double check with:
Before / After traffic metrics (visits, Google Analytics Server logs imported into Splunk.
bounce rate etc..)
Response codes, link architecture, Screaming Frog, IIS crawler Chrome inspect element, manual
On page elements (titles, robots inspection.
directives etc..)
KW rankings, Backlink analysis Authority Labs, Searchmetrics, Custom scripts, manual Google
Open site explorer, Majestic checks.
Indexation, PageRank Scrapebox Custom scripts, SEOstats
Googlebot activity Server logs – with Splunk Google webcache (not perfect!)
Source code similarity (scrape) ImportXML + Excel Manual checks, Text comparing
online (Diffnow)
Environmental activity SEOmoz Algorithm updates, Webmaster forums, other SEOs,
Development queue / logs, SEO articles.
Holidays, world events
24. The process of elimination, think
of it as Logger.log();
25. Navigation Same Navigation
Heading 10% Different Heading 10%
Same
Lead generation Form Lead generation Form
Sales text Same Sales text
Same
Footer Footer
26. Now I know what the
problem is, and I have data
to explain.
http://www.diffnow.com/
27. These pages were 90%
identical to each other AND
Panda 3.8 & 3.9 updates
rolled out around the time
of loss.
28. Step 3
Form a hypothesis that might
explain the problem
29. If we make these pages at
least 50% unique, they
won’t be considered
duplicate and will regain
traffic after a Panda refresh.