SlideShare a Scribd company logo
1 of 38
Hadoop and Big Data to Drive  Web Analytics Raghu Kashyap & Michael Wetta @ Orbitz Worldwide
About Us Raghu Kashyap -  Director Web Analytics  Twitter:  @ragskashyap Blog:  http://kashyaps.com Email:  [email_address] Michael Wetta  -  Marketing Strategy & Analytics Email:  [email_address]
Overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What is Web Analytics? ,[object Object],[object Object],[object Object],[object Object]
 
[object Object],[object Object],[object Object],[object Object],[object Object],Behavioral attributes
Web Analytics History Early1990s – Hit counters Reference - http://www.theedifier.com
Web Analytics History 1993 – Web server logs (Webtrends) 213.60.233.243 - - [25/May/2004:00:17:09 +1200] "GET /internet/index.html HTTP/1.1"  200 6792 "http://www.mediacollege.com/video/streaming/http.html" "Mozilla/5.0  (X11; U; Linux i686; es-ES; rv:1.6) Gecko/20040413 Debian/1.6-5” 151.44.15.252 - - [25/May/2004:00:17:20 +1200] "GET /cgi-bin/forum/commentary.pl /noframes/read/209 HTTP/1.1" 200 6863 "http://search.virgilio.it/search/cgi/search.cgi ?qs=download+video+illegal+Berg&lr=&dom=s&offset=0&hits=10&switch=0&f=us” "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; Hotbar 4.4.7.0)”
Web Analytics History ,[object Object],[object Object]
Web Analytics History 2005 – Google Analytics Reference - http://www.theedifier.com
Web Analytics History ,[object Object],[object Object]
Web Analytics today ,[object Object],[object Object],[object Object],[object Object]
Site Analytics  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi Variate Testing (MVT) ,[object Object],[object Object],[object Object]
Voice of Customer (VOC) ,[object Object],[object Object],[object Object]
Competitive Intelligence ,[object Object],[object Object]
About Orbitz Worldwide
Challenges ,[object Object],[object Object],[object Object],[object Object],[object Object]
continued…. ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Web Analytics & Big Data ,[object Object],[object Object],[object Object]
Big Data Infrastructure ,[object Object],[object Object],[object Object],[object Object]
Processing of Web Analytics Data
Aggregating data into Data Warehouse
Data Analysis Jobs ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Categories ,[object Object],[object Object],[object Object],[object Object],[object Object]
Shifting from Innovation to Mainstream Consumption
Crossing the Chasm: Shifting from Innovation to Mainstream Consumption Adapted from Geoffrey A. Moore – Technology Adoption Lifecycle ,[object Object],[object Object],[object Object]
Crossing the Chasm: Shifting from Innovation to Mainstream Consumption Adapted from Geoffrey A. Moore – Technology Adoption Lifecycle
Innovators Visionaries Mainstream Adapted from Geoffrey A. Moore – Technology Adoption Lifecycle Crossing the Chasm: Shifting from Innovation to Mainstream Consumption
Crossing the Chasm: Shifting from Innovation to Mainstream Consumption Adapted from Geoffrey A. Moore – Technology Adoption Lifecycle ,[object Object],[object Object],[object Object],[object Object],Key Components Adoption:
Centralized Decentralization Web Analytics team + SEO team + Hotel optimization team
Model for success ,[object Object],[object Object],[object Object]
Should everyone do this? ,[object Object],[object Object],[object Object],[object Object]
Other Key Projects ,[object Object],[object Object],[object Object],[object Object]
Where else? Amazon  - Was Amazon's recommendation engine crucial to the company's success? Facebook  – A Petabyte Scale Data Warehouse using Hadoop EBay  – The power of the Elephant Apple  – iAds, UX and Data analytics
Conclusion ,[object Object],[object Object],[object Object],[object Object]
Reference ,[object Object],[object Object],[object Object],[object Object]
Questions? ,[object Object]

More Related Content

What's hot

Hooduku - Big data analytics - case study
Hooduku - Big data analytics - case studyHooduku - Big data analytics - case study
Hooduku - Big data analytics - case studySudhi Seshachala
 
Latest corp big data and acme
Latest corp   big data and acmeLatest corp   big data and acme
Latest corp big data and acmehooduku
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfullyAdir Sharabi
 
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike GualtieriSpark Summit
 
Rethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubRethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubCloudera, Inc.
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB
 
How to implement hadoop successfuly
How to implement hadoop successfulyHow to implement hadoop successfuly
How to implement hadoop successfulyAdir Sharabi
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BIDeZyre
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data PlatformAndrei Savu
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
 
Recipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big DataRecipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big DataFadi Yousuf
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18Harvinder Atwal
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteCaserta
 
Reinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapRReinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapRLilia Gutnik
 
2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey Results2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey ResultsAtScale
 
Make data simple in the cognitive era
Make data simple in the cognitive eraMake data simple in the cognitive era
Make data simple in the cognitive eraIBM Analytics
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsDatawatchCorporation
 
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
BreizhJUG - Janvier 2014 - Big Data -  Dataiku - Pages JaunesBreizhJUG - Janvier 2014 - Big Data -  Dataiku - Pages Jaunes
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages JaunesDataiku
 

What's hot (20)

Hooduku - Big data analytics - case study
Hooduku - Big data analytics - case studyHooduku - Big data analytics - case study
Hooduku - Big data analytics - case study
 
Latest corp big data and acme
Latest corp   big data and acmeLatest corp   big data and acme
Latest corp big data and acme
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfully
 
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
 
Rethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubRethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data Hub
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
 
How to implement hadoop successfuly
How to implement hadoop successfulyHow to implement hadoop successfuly
How to implement hadoop successfuly
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BI
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data Platform
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
 
Three Big Data Case Studies
Three Big Data Case StudiesThree Big Data Case Studies
Three Big Data Case Studies
 
Recipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big DataRecipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big Data
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 
Reinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapRReinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapR
 
2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey Results2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey Results
 
Make data simple in the cognitive era
Make data simple in the cognitive eraMake data simple in the cognitive era
Make data simple in the cognitive era
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of Things
 
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
BreizhJUG - Janvier 2014 - Big Data -  Dataiku - Pages JaunesBreizhJUG - Janvier 2014 - Big Data -  Dataiku - Pages Jaunes
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
 

Viewers also liked

AutoCAD 2011 Apertura Creazione Salvataggio
AutoCAD 2011 Apertura Creazione SalvataggioAutoCAD 2011 Apertura Creazione Salvataggio
AutoCAD 2011 Apertura Creazione SalvataggioLaura Camellini
 
Big Data Analytics from a Practitioners View
Big Data Analytics from a Practitioners ViewBig Data Analytics from a Practitioners View
Big Data Analytics from a Practitioners ViewRaghu Kashyap
 
Moodle 2.7 - amministrazione base - corsi - backup
Moodle 2.7 - amministrazione base - corsi - backupMoodle 2.7 - amministrazione base - corsi - backup
Moodle 2.7 - amministrazione base - corsi - backupLaura Camellini
 
July17 Dale Calvert MPB Today Team Training Slides
July17 Dale Calvert MPB Today Team Training SlidesJuly17 Dale Calvert MPB Today Team Training Slides
July17 Dale Calvert MPB Today Team Training SlidesLiving Good
 
Big Data redefines Enterprise Data Warehouse @Bangalore
Big Data redefines Enterprise Data Warehouse @BangaloreBig Data redefines Enterprise Data Warehouse @Bangalore
Big Data redefines Enterprise Data Warehouse @BangaloreRaghu Kashyap
 
Moodle 2.7 - amministrazione base - importare utenti da database
Moodle 2.7 - amministrazione base - importare utenti da databaseMoodle 2.7 - amministrazione base - importare utenti da database
Moodle 2.7 - amministrazione base - importare utenti da databaseLaura Camellini
 
AutoCAD 2011 Area Di Lavoro E Guida
AutoCAD 2011 Area Di Lavoro E GuidaAutoCAD 2011 Area Di Lavoro E Guida
AutoCAD 2011 Area Di Lavoro E GuidaLaura Camellini
 
Accelerate 2012 chicago - orbitz
Accelerate   2012 chicago - orbitzAccelerate   2012 chicago - orbitz
Accelerate 2012 chicago - orbitzRaghu Kashyap
 
Cinque dita (Omaggio fino al 22 Aprile 015)
Cinque dita (Omaggio fino al 22 Aprile 015)Cinque dita (Omaggio fino al 22 Aprile 015)
Cinque dita (Omaggio fino al 22 Aprile 015)Amerigo Mancini
 
Moodle 2.7 - corsi - Tracciamento
Moodle 2.7 - corsi - TracciamentoMoodle 2.7 - corsi - Tracciamento
Moodle 2.7 - corsi - TracciamentoLaura Camellini
 
Jatin sharma profile host,stand up comic
Jatin sharma profile host,stand up comicJatin sharma profile host,stand up comic
Jatin sharma profile host,stand up comicJatin Sharma
 
Auto CAD 2011 Strumenti Di Disegno
Auto CAD 2011 Strumenti Di DisegnoAuto CAD 2011 Strumenti Di Disegno
Auto CAD 2011 Strumenti Di DisegnoLaura Camellini
 
Is BI/Analytics and Agile an Oxymoron?
Is BI/Analytics and Agile an Oxymoron?Is BI/Analytics and Agile an Oxymoron?
Is BI/Analytics and Agile an Oxymoron?Raghu Kashyap
 
Moodle 2.7 - amministrazione - corsi - messaggistica
Moodle 2.7 - amministrazione - corsi - messaggisticaMoodle 2.7 - amministrazione - corsi - messaggistica
Moodle 2.7 - amministrazione - corsi - messaggisticaLaura Camellini
 
Moodle 2.7 - corsi - valutazioni
Moodle 2.7 - corsi - valutazioniMoodle 2.7 - corsi - valutazioni
Moodle 2.7 - corsi - valutazioniLaura Camellini
 
Visual learning Jenny Knox
Visual learning   Jenny KnoxVisual learning   Jenny Knox
Visual learning Jenny KnoxJennyKnox
 

Viewers also liked (19)

AutoCAD 2011 Apertura Creazione Salvataggio
AutoCAD 2011 Apertura Creazione SalvataggioAutoCAD 2011 Apertura Creazione Salvataggio
AutoCAD 2011 Apertura Creazione Salvataggio
 
Big Data Analytics from a Practitioners View
Big Data Analytics from a Practitioners ViewBig Data Analytics from a Practitioners View
Big Data Analytics from a Practitioners View
 
Moodle 2.7 - amministrazione base - corsi - backup
Moodle 2.7 - amministrazione base - corsi - backupMoodle 2.7 - amministrazione base - corsi - backup
Moodle 2.7 - amministrazione base - corsi - backup
 
Organic Flux Capacitor
Organic Flux CapacitorOrganic Flux Capacitor
Organic Flux Capacitor
 
July17 Dale Calvert MPB Today Team Training Slides
July17 Dale Calvert MPB Today Team Training SlidesJuly17 Dale Calvert MPB Today Team Training Slides
July17 Dale Calvert MPB Today Team Training Slides
 
Big Data redefines Enterprise Data Warehouse @Bangalore
Big Data redefines Enterprise Data Warehouse @BangaloreBig Data redefines Enterprise Data Warehouse @Bangalore
Big Data redefines Enterprise Data Warehouse @Bangalore
 
Moodle 2.7 - amministrazione base - importare utenti da database
Moodle 2.7 - amministrazione base - importare utenti da databaseMoodle 2.7 - amministrazione base - importare utenti da database
Moodle 2.7 - amministrazione base - importare utenti da database
 
AutoCAD 2011 Area Di Lavoro E Guida
AutoCAD 2011 Area Di Lavoro E GuidaAutoCAD 2011 Area Di Lavoro E Guida
AutoCAD 2011 Area Di Lavoro E Guida
 
Accelerate 2012 chicago - orbitz
Accelerate   2012 chicago - orbitzAccelerate   2012 chicago - orbitz
Accelerate 2012 chicago - orbitz
 
Cinque dita (Omaggio fino al 22 Aprile 015)
Cinque dita (Omaggio fino al 22 Aprile 015)Cinque dita (Omaggio fino al 22 Aprile 015)
Cinque dita (Omaggio fino al 22 Aprile 015)
 
Moodle 2.7 - corsi - Tracciamento
Moodle 2.7 - corsi - TracciamentoMoodle 2.7 - corsi - Tracciamento
Moodle 2.7 - corsi - Tracciamento
 
Literalma fórum mundial
Literalma fórum mundialLiteralma fórum mundial
Literalma fórum mundial
 
Jatin sharma profile host,stand up comic
Jatin sharma profile host,stand up comicJatin sharma profile host,stand up comic
Jatin sharma profile host,stand up comic
 
Auto CAD 2011 Strumenti Di Disegno
Auto CAD 2011 Strumenti Di DisegnoAuto CAD 2011 Strumenti Di Disegno
Auto CAD 2011 Strumenti Di Disegno
 
Is BI/Analytics and Agile an Oxymoron?
Is BI/Analytics and Agile an Oxymoron?Is BI/Analytics and Agile an Oxymoron?
Is BI/Analytics and Agile an Oxymoron?
 
Moodle 2.7 - amministrazione - corsi - messaggistica
Moodle 2.7 - amministrazione - corsi - messaggisticaMoodle 2.7 - amministrazione - corsi - messaggistica
Moodle 2.7 - amministrazione - corsi - messaggistica
 
Moodle 2.7 - corsi - valutazioni
Moodle 2.7 - corsi - valutazioniMoodle 2.7 - corsi - valutazioni
Moodle 2.7 - corsi - valutazioni
 
Visual learning Jenny Knox
Visual learning   Jenny KnoxVisual learning   Jenny Knox
Visual learning Jenny Knox
 
Norme uni en iso 9000
Norme uni en iso 9000Norme uni en iso 9000
Norme uni en iso 9000
 

Similar to Web analyticsandbigdata techweek2011

Architecture of Big Data Solutions
Architecture of Big Data SolutionsArchitecture of Big Data Solutions
Architecture of Big Data SolutionsGuido Schmutz
 
Data Driven Design: Using Web Analytics to Improve Information Architectures
Data Driven Design: Using Web Analytics to Improve Information ArchitecturesData Driven Design: Using Web Analytics to Improve Information Architectures
Data Driven Design: Using Web Analytics to Improve Information ArchitecturesAndrea Wiggins
 
Thought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserveThought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserveRon Krzoska
 
Big Data Meetup: Analytical Systems Evolution
Big Data Meetup: Analytical Systems EvolutionBig Data Meetup: Analytical Systems Evolution
Big Data Meetup: Analytical Systems EvolutionProvectus
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester WebinarCloudera, Inc.
 
Webinar: Transforming Customer Experience Through an Always-On Data Platform
Webinar: Transforming Customer Experience Through an Always-On Data PlatformWebinar: Transforming Customer Experience Through an Always-On Data Platform
Webinar: Transforming Customer Experience Through an Always-On Data PlatformDataStax
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Keynote: Future of IT - future of enterprise it Canada
Keynote: Future of IT - future of enterprise it CanadaKeynote: Future of IT - future of enterprise it Canada
Keynote: Future of IT - future of enterprise it CanadaAmazon Web Services
 
Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...
Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...
Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...Trivadis
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunitiesBigdata Meetup Kochi
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital TransformationMukund Babbar
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantageAmazon Web Services
 
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Jonathan Seidman
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
February 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWSFebruary 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWSAmazon Web Services
 
Big Data for the CMO
Big Data for the CMOBig Data for the CMO
Big Data for the CMOBruno Aziza
 

Similar to Web analyticsandbigdata techweek2011 (20)

Architecture of Big Data Solutions
Architecture of Big Data SolutionsArchitecture of Big Data Solutions
Architecture of Big Data Solutions
 
Data Driven Design: Using Web Analytics to Improve Information Architectures
Data Driven Design: Using Web Analytics to Improve Information ArchitecturesData Driven Design: Using Web Analytics to Improve Information Architectures
Data Driven Design: Using Web Analytics to Improve Information Architectures
 
Thought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserveThought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserve
 
Big Data Meetup: Analytical Systems Evolution
Big Data Meetup: Analytical Systems EvolutionBig Data Meetup: Analytical Systems Evolution
Big Data Meetup: Analytical Systems Evolution
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Webinar: Transforming Customer Experience Through an Always-On Data Platform
Webinar: Transforming Customer Experience Through an Always-On Data PlatformWebinar: Transforming Customer Experience Through an Always-On Data Platform
Webinar: Transforming Customer Experience Through an Always-On Data Platform
 
KNIME Meetup 2016-04-16
KNIME Meetup 2016-04-16KNIME Meetup 2016-04-16
KNIME Meetup 2016-04-16
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Keynote: Future of IT - future of enterprise it Canada
Keynote: Future of IT - future of enterprise it CanadaKeynote: Future of IT - future of enterprise it Canada
Keynote: Future of IT - future of enterprise it Canada
 
Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...
Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...
Trivadis TechEvent 2016 Customer Event Hub - the modern Customer 360° view by...
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital Transformation
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
February 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWSFebruary 2016 Webinar Series - 451 Research and AWS
February 2016 Webinar Series - 451 Research and AWS
 
Big Data for the CMO
Big Data for the CMOBig Data for the CMO
Big Data for the CMO
 

Recently uploaded

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 

Recently uploaded (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 

Web analyticsandbigdata techweek2011

  • 1. Hadoop and Big Data to Drive Web Analytics Raghu Kashyap & Michael Wetta @ Orbitz Worldwide
  • 2. About Us Raghu Kashyap - Director Web Analytics Twitter: @ragskashyap Blog: http://kashyaps.com Email: [email_address] Michael Wetta - Marketing Strategy & Analytics Email: [email_address]
  • 3.
  • 4.
  • 5.  
  • 6.
  • 7. Web Analytics History Early1990s – Hit counters Reference - http://www.theedifier.com
  • 8. Web Analytics History 1993 – Web server logs (Webtrends) 213.60.233.243 - - [25/May/2004:00:17:09 +1200] "GET /internet/index.html HTTP/1.1" 200 6792 "http://www.mediacollege.com/video/streaming/http.html" "Mozilla/5.0 (X11; U; Linux i686; es-ES; rv:1.6) Gecko/20040413 Debian/1.6-5” 151.44.15.252 - - [25/May/2004:00:17:20 +1200] "GET /cgi-bin/forum/commentary.pl /noframes/read/209 HTTP/1.1" 200 6863 "http://search.virgilio.it/search/cgi/search.cgi ?qs=download+video+illegal+Berg&lr=&dom=s&offset=0&hits=10&switch=0&f=us” "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; Hotbar 4.4.7.0)”
  • 9.
  • 10. Web Analytics History 2005 – Google Analytics Reference - http://www.theedifier.com
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Processing of Web Analytics Data
  • 23. Aggregating data into Data Warehouse
  • 24.
  • 25.
  • 26. Shifting from Innovation to Mainstream Consumption
  • 27.
  • 28. Crossing the Chasm: Shifting from Innovation to Mainstream Consumption Adapted from Geoffrey A. Moore – Technology Adoption Lifecycle
  • 29. Innovators Visionaries Mainstream Adapted from Geoffrey A. Moore – Technology Adoption Lifecycle Crossing the Chasm: Shifting from Innovation to Mainstream Consumption
  • 30.
  • 31. Centralized Decentralization Web Analytics team + SEO team + Hotel optimization team
  • 32.
  • 33.
  • 34.
  • 35. Where else? Amazon - Was Amazon's recommendation engine crucial to the company's success? Facebook – A Petabyte Scale Data Warehouse using Hadoop EBay – The power of the Elephant Apple – iAds, UX and Data analytics
  • 36.
  • 37.
  • 38.

Editor's Notes

  1. Welcome everyone. This presentation is about Hadoop and Big data to drive web analytics 2. The goal of this presentation is to explain how we are shaping up web analytics and big data to optimize the data driven decisions at Orbitz World wide 3. I will also be talking about the process model on how we are effectively utilizing the brains and man power across the organization towards a common goal 4. Between me and Wetta we promise to give you some thought provoking details about analytics and Big data :-)
  2. I met someone at the train station who asked me what I do? I said I work in the web analytics field and I help shape up the strategy and vision at Orbitz worldwide and enable our business teams to get insights on the performance of our site and act upon it. So he said, Ah you do reporting :-) 2. I started thinking why web analytics is hard for people to get and started evangelizing both within and outside Orbitz 3. I manage the webanalytics team at Orbitz worldwide I also try to help out non-profit organizations while I am not busy with my wife and 2 sons.
  3. Here is the Agenda on what we would be talking. We will be breaking the sections in web analytics, challenges with web analytics, how Big data is helping us overcome these challenges and the process. 3. Michael will providing the business side of the story. 4. Finally we will open up for questions.
  4. .1 So what is web analytics? 2, Read the definition . It tells you exactly why someone came to your site and what kind of impact they had on the bottom line of your revenue 3. Read the definition . You need to immerse yourself in data to understand the story it's telling 4. Read the definition . Focus on Customer. Customer is the king. You need to listen and act upon their feedback 5. Read the definition. Test Test and Test. If you want to prove or disprove a HIPPO's opinion you need to perform tests on your site 6. btw HIPPO is a common terminology in the industry. It stands for Highest Income Paid person's opinion :-)
  5. 1. A website is just like a store 2. You have millions of people visiting you every day and shopping on your site 3. So where does Web Analytics fit here? 4. Web Analytics is the invisible shopper who goes around the store and watches everyone's behavior. 5. Web Analytics takes the behavioral attributes and helps business with insights!
  6. So how do we fit the puzzle? By learning the behavior of the customer and focusing on key attributes Know the travel details such as how many travelers, what kind of travelers, any preferred carrier or hotels? 4. Understand the shopping patterns. Does he want to shop only on weekends or else only on Thursdays. 5. Focus on Visit Patterns. How many times does he come to the site before he buys anything 6. Learn the page navigation. I.e does he see 100 pages every time he comes or does he know exactly what to look at 7. Master the Demand source. Anyone who's worked in the marketing side knows that attribution is a holy war. Deciding which demand source gets the credit for conversion is something people will argue to death Just like the IDE war between VIM, EMACS, Intellij and Eclipse :-) 8. So now that I think you understand what webanalytics is and what you can do with it lets focus a bit on its history
  7. So who remembers the glory days of hit counters? Early 1990's if you had more hits to the site than that was a wonderful thing. People would measure the traffic using hit counters. As Michael says nothing's worse than bringing cheap and crappy traffic to the site.
  8. So for people who are familiar with webtrends know that they were one of the first companies to parse server side logs to provide web analytics tools 2. This was kind of a gateway to the wonderful world of site analytics. 3. Technology folks know that nothing is easy with log file parsing :-)
  9. The reason webanalytics became popular along the business marketing teams was due to the fact that it was easy to implement javascript tags and use one of the SAAS tools to analyze data 2. There has been numerous articles, discussions and debates on which approach is better server side or client side tagging 3. I think this will continue in the near future.
  10. Google Analytics made Web Analytics sexy, easy and cheap Prior to the arrival of Google analytics the big vendors in this area were Omniture, Webtrends, Coremetrics The cost for these tools ranges anywhere from quarter million dollars to over a million dollars. Google changed the map of web analytics with the introduction of GA
  11. So this finally brings us to the new era of web analytics with Big Data In the early days of 2009 there were a lot of acquisitions in this area and now we see most of the vendors consolidating their businesses to support the big data market. 3. The early adaptors of Big Data were in the likes of IBM, Facebook and Orbitz 4. We will be seeing more the Data warehouse vendors moving in this direction
  12. What exactly is Web Analytics today and what type of data we collect and funnel into our Big Data infrastructure. 2. The four pillars of web analytics are Site Analytics, Voice of Customer, Multi Variate testing(MVT in short) and finally Competitive intelligence 3. Let's now focus on what these four key areas are about and understand the importance and usage within Orbitz
  13. Site Analytics provides the "What" of web analytics This really helps us understand the visitor behavior pattern It also helps us measure the conversion and track the demand source
  14. This helps us answer why users drop off from a search result page Do people like round button or a square button. Things that you never imagined would have an impact on a customer will be surfaced by doing MVT testing 4. If you want to have some fun and see your customer understanding skills check out www.whichtestwon.com There are every day tests on which you can vote.
  15. You don't know why a customer behaved the way he did on your site. The only way to understand certain things with customer behavior is by asking them why. VOC helps you understand the customer needs and behavior by listening to them through surveys and feedback mechanisms
  16. so lets say your business is seeing an upward trend and revenue has been soaring Do you know if this is because of the changes that you did on your site? Or do you know if there is a upward trend in the market and everyone including your competitors are growing 4. If you competitor is growing at 25% rate and you are at 5% then you really are not growing. 5. CI will help you understand all the these aspects of business so that you can make educated data driven decisions
  17. This brings us to Orbitz and how we are using all the aspects if WA along with Big Data. OWW Operates multiple brands across the globe In US we have Orbitz, CheapTickets, The Away Network and Orbitz for Business Internationally we have ebookers, HotelClub (includes RatestoGo and Asia Hotels) We went public in 2007 and registered as OWW on NYSE
  18. So with so many brands and so much data we had quite a few challenges? For starters we couldn't easily do multi dimensional analysis with the tools. With data spread across in multiple tools it was hard to picture the whole 9 yards obviously tools cost money Harder for people to understand where to look at for data With Analytics you need direction rather than precision to take action and get insights
  19. In the Big Data front we didn't have a good infrastructure where we could house all this data in a cost effective way. 2. Data extraction was NOT an easy task 3. Focusing on the key differences on when you need testing v/s when you need reporting. 4. Earlier I mentioned that you need to do rigorous outcome analysis. However, with all the challenges we faced it was not an easy task.
  20. We realized that with all the challenges we had, we had to innovate and experiment new ways to enable successful web analytics at OWW 2. We generate hundreds of GB of log data per day. How can we effectively store this massive data and how can we mine this data and make sense out of it? 3. Our existing DW was not intended to support such large sets of data and more importantly process this data We also needed to make sure that we don't spend huge money to store this data set. 4. Big data infrastructure with Hadoop has been a huge success at Orbitz and at other organizations
  21. So what does this buy us? We can now store data for a long period of time without worrying too much about the space Analysts and developers have access to this data set Developers can run adhoc queries to support our business needs. While the core web analytics team focuses on the company standards and metrics
  22. Here is an example of how we process our site analytics data today. We FTP the log files into our Hadoop infrastructure daily. The files are LZO compressed for better storage utilization. Developers then write Map reduce jobs against these raw log files to output data into HIVE tables. HIVE is a DW equivalent of Hadoop Most of the MR jobs are written using Java and scripting languages such as Python, Ruby, BASH. Business teams however, have skillset to run queries against HIVE tables.
  23. Since the market on Big Data is not that mature there are no good ways to build visualization on top of HIVE 2. Due to this and for other reasons we need to bring a subset of this data into our warehouse. 3. So in essence the data that are in HIVE will make it into the warehouse. 4. There are companies such as Karmaspehe, Datamere who are in the initial stages of bridging the gap between business needs and Hadoop access. 5. However, its too early to say if this will be the norm
  24. We focused on some key areas of our business such as demand source and campaigns as our pilot and worked with our business partners to enable the analytics on Big Data 2. We have developers writing Map Reduce jobs which run every day and populate HIVE tables We generate more than 25 million records for a month for the pilot use case that we worked on This only show cases the sheer magnitude and power of analytics within the Big Data framework
  25. Here are some of the areas where we are utilizing the infrastructure we have built in to extract data and provide additional analysis 2. Traffic acquisition helps us understand the demand and flow into our websites 3. provide platform for better marketing optimizations. 4. Better understand the user engagement 5. Provide better ad optimization framework 6. finally understand the user behavior 7. So all this is pretty cool stuff from technology and analytics stand point of view. Lets now turn our focus to Michael wetta and learn more from the expert on how business is leveraging the Big Data and web analytics to drive business decisions.
  26. So how do you organizationally structure yourself and Big Data so that you can be effective both in terms of resource utilization and setting the platform for success 2. This is what we call the Centralized Decentralization. 3. With this approach the core web analytics team controls and supports the individual teams when it comes to data extraction and modeling. 4. This prevents one team from being the bottle neck with data extraction and analytics 5. If you have ever worked in the Data Warehouse side of the world you will know the challenges and delays in getting the data
  27. With the core process of centralized decentralization and being agile how do you succeed? You can't manage if you can't measure. But once you measure make sure you fail fast Every team needs to be thinking of analytics with every feature they work on Dimensional modeling is great but like someone wise said 'All models are wrong but some are useful" :-) My point here is data without analysis is like a Ferrari without gas. If you Make it a point to extract smaller chunks of data and tie this effort to your business objectives. You are sure to succeed
  28. Here are some key learning's from our experience and some thoughts for you to consider If you have the strength of technology go for it. This needs heavy investment from time and resource perspective Like I mentioned many times data without analysis is worthless
  29. We at Orbitz use Big Data and Hadoop for numerous other projects some of them being Machine Learning, page load performance analysis and data cache analysis
  30. I couldn't end this session without telling who else is doing something similar. read slides
  31. In conclusion I would like to say: Invest in people and tools empower individual teams in your organization to manage their own analytics on Big Data Focus on analysis and not just data extraction.
  32. Here are some good references
  33. Thanks again for listening to our story and we would be available for any further questions you may have. Also if you are interested in applying for a job at orbitz please check out the career site