SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
The evolution of Business Intelligence at Viadeo


                                           Techdays 22/11/2012
Agenda


 What is Business Intelligence?
 Key Roles
 Viadeo Data
 Technical Solutions : a short history
What is Business Intelligence ?


  Application Stack
  Awareness Stack
  Insights
   Application
   Awareness
   Insights
  Actions
   Actions
                         Act          Marketing Actions, Business
                         ion          Strategies, Operations
                          s
                                           Forecasting, Predicting,
                                           Statistics, Competitor
                       Insights            Information, Analysis




Feedback              Awareness
                                                    Reports, Dashboards



                                                              Meta Data, KPI’s,
                                                              Visual Templates,
                  Application Stack                           Security, Information
                                                              Dissemination,
                                                              Scheduling
                                                                        Plumbing of
                                                                        structured and
               Data Warehouse & ETL                                     unstructured
                                                                        data, logic to
                                                                        persists data
Key Roles : the Business Analyst




                                             Functional
       Simple                                  Web
                        BI      Followup     (Challenge
      (Metrics)
  BI Dashboards    Informatio                 Product
                    Dashboard   Analysis        PO)
   Specification    n Access
                        s                   Specificatio
                                                 n
                    (Scalars)



                                             Technical
      Complex
                                Proactive     (Enforce
     (Data viz)
                     Direct                 data quality)
                     (SQL,
                   Datameer)
Key Roles : the Big Data Engineer




                                             Implement
        Real Time    REST/Scala/Java APIs       Data
                                            Awarenes
                                            Visualization
     Data plumbing   Expose to Apps
                                                 s




                                              Enforce
         Batch           JDBC/ODBC
                                            data quality
Viadeo data : The Dynamics

                  • 45 million members
                  • Worldwide presence
                     • China, India, Russia, Mexico,..
                  • Mobile App, Web, API
                  • B2B / B2C

 Mining

                                     User
          Usage                   Engagement
Viadeo data : Graph
Technical solutions : The Beginnings
   Phase 1: 2006-2008          Phase 2 : 2008-2010




           Mysql                       Mysql
       Server name : Peach         Server name : Lakitu




      Internal tool to allow
      C-Level, Sales,…
      Access data
Technical solutions : A better
architecture
                     Phase 3: 2010 - 2012




                           MySQL

                           Server name :
                           « Unfied ODS »




                           Mysql
           Server name :                    Server name :
             ODS Live                         ODS Live
             Cluster 1                        Cluster 2




           Server name :                     Server name :
             ODS Live                          ODS Live
             Cluster 3                         Cluster 5
Technical solutions : 2 new internal
products
Scala-centric, Play! framework
 Cross-channel messaging system
    Email, Mobile, Social
    Flexible content management
    Flexible targeting of recipients
    Content testing strategies : A/B, multivariate
    Event-driven : web app events, mobile events, ad hoc events
    Automation, scheduling, frequency capping

 Analytics
    Data visualization : based on Javascript D3.js, processing.js etc.
    Tabular Reports, OLAP navigation
    Pluggable alerts : business activity monitoring

 A common requirement : scalability!!!
    Viadeo data is Big
    Processing performance is not an option, it is mandatory
Technical solutions : a new
architecture
based on CQRS pattern
Technical solutions : a new
architecture
• Master dataset :
  • Historical data stored in HBase
  • Provided as a service by architects team

• Datamarts :
   • Built on HDFS using MapReduce jobs
   • MapReduce eased by use of Cascading library
      and Scala DSL (Scalding)
   • Pushed to in-memory distributed storage
   • Elastic Search, Riak
Technical solutions : A better
architecture




             P
            O
           O
         SQ
 MySQL
Conclusion

• Many scalable data storage solutions
• Rapid application development frameworks and low-risk
  programming languages on the JVM
• Custom analytics = what we implement is what we use
   • Analytical needs are very well identified
• Blend data stream and batch processing to answer
  different needs
• Pluggable Data mining R&D
• Analytics for Viadeo members/recruiters/companies :
  Social Media Monitoring as a Complex Event
  Processing topic
?
Thanks !




           flelay@viadeoteam.com
           Tél : 01 75 70 12 93

Weitere ähnliche Inhalte

Mehr von Olivier DASINI

MySQL 8.0.21 - New Features Summary
MySQL 8.0.21 - New Features SummaryMySQL 8.0.21 - New Features Summary
MySQL 8.0.21 - New Features SummaryOlivier DASINI
 
MySQL 8.0.19 - New Features Summary
MySQL 8.0.19 - New Features SummaryMySQL 8.0.19 - New Features Summary
MySQL 8.0.19 - New Features SummaryOlivier DASINI
 
MySQL 8.0.18 - New Features Summary
MySQL 8.0.18 - New Features SummaryMySQL 8.0.18 - New Features Summary
MySQL 8.0.18 - New Features SummaryOlivier DASINI
 
MySQL 8.0.17 - New Features Summary
MySQL 8.0.17 - New Features SummaryMySQL 8.0.17 - New Features Summary
MySQL 8.0.17 - New Features SummaryOlivier DASINI
 
MySQL 8.0.16 New Features Summary
MySQL 8.0.16 New Features SummaryMySQL 8.0.16 New Features Summary
MySQL 8.0.16 New Features SummaryOlivier DASINI
 
MySQL Day Paris 2018 - Introduction & The State of the Dolphin
MySQL Day Paris 2018 - Introduction & The State of the DolphinMySQL Day Paris 2018 - Introduction & The State of the Dolphin
MySQL Day Paris 2018 - Introduction & The State of the DolphinOlivier DASINI
 
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirementsMySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirementsOlivier DASINI
 
MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0
MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0
MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0Olivier DASINI
 
MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...
MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...
MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...Olivier DASINI
 
MySQL Day Paris 2018 - MySQL JSON Document Store
MySQL Day Paris 2018 - MySQL JSON Document StoreMySQL Day Paris 2018 - MySQL JSON Document Store
MySQL Day Paris 2018 - MySQL JSON Document StoreOlivier DASINI
 
MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?
MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?
MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?Olivier DASINI
 
MySQL 8.0, what's new ? - Forum PHP 2018
MySQL 8.0, what's new ? - Forum PHP 2018MySQL 8.0, what's new ? - Forum PHP 2018
MySQL 8.0, what's new ? - Forum PHP 2018Olivier DASINI
 
MySQL JSON Document Store - A Document Store with all the benefits of a Trans...
MySQL JSON Document Store - A Document Store with all the benefits of a Trans...MySQL JSON Document Store - A Document Store with all the benefits of a Trans...
MySQL JSON Document Store - A Document Store with all the benefits of a Trans...Olivier DASINI
 
MySQL 8.0 - What's New ?
MySQL 8.0 - What's New ?MySQL 8.0 - What's New ?
MySQL 8.0 - What's New ?Olivier DASINI
 
MySQL Document Store - A Document Store with all the benefts of a Transactona...
MySQL Document Store - A Document Store with all the benefts of a Transactona...MySQL Document Store - A Document Store with all the benefts of a Transactona...
MySQL Document Store - A Document Store with all the benefts of a Transactona...Olivier DASINI
 
MySQL 5.7 InnoDB Cluster (Jan 2018)
MySQL 5.7 InnoDB Cluster (Jan 2018)MySQL 5.7 InnoDB Cluster (Jan 2018)
MySQL 5.7 InnoDB Cluster (Jan 2018)Olivier DASINI
 
MySQL InnoDB Cluster - A complete High Availability solution for MySQL
MySQL InnoDB Cluster - A complete High Availability solution for MySQLMySQL InnoDB Cluster - A complete High Availability solution for MySQL
MySQL InnoDB Cluster - A complete High Availability solution for MySQLOlivier DASINI
 
MySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP Paris
MySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP ParisMySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP Paris
MySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP ParisOlivier DASINI
 
MySQL Day Paris 2016 - MySQL Enterprise Edition
MySQL Day Paris 2016 - MySQL Enterprise EditionMySQL Day Paris 2016 - MySQL Enterprise Edition
MySQL Day Paris 2016 - MySQL Enterprise EditionOlivier DASINI
 
MySQL Day Paris 2016 - Introducing Oracle MySQL Cloud Service
MySQL Day Paris 2016 - Introducing Oracle MySQL Cloud ServiceMySQL Day Paris 2016 - Introducing Oracle MySQL Cloud Service
MySQL Day Paris 2016 - Introducing Oracle MySQL Cloud ServiceOlivier DASINI
 

Mehr von Olivier DASINI (20)

MySQL 8.0.21 - New Features Summary
MySQL 8.0.21 - New Features SummaryMySQL 8.0.21 - New Features Summary
MySQL 8.0.21 - New Features Summary
 
MySQL 8.0.19 - New Features Summary
MySQL 8.0.19 - New Features SummaryMySQL 8.0.19 - New Features Summary
MySQL 8.0.19 - New Features Summary
 
MySQL 8.0.18 - New Features Summary
MySQL 8.0.18 - New Features SummaryMySQL 8.0.18 - New Features Summary
MySQL 8.0.18 - New Features Summary
 
MySQL 8.0.17 - New Features Summary
MySQL 8.0.17 - New Features SummaryMySQL 8.0.17 - New Features Summary
MySQL 8.0.17 - New Features Summary
 
MySQL 8.0.16 New Features Summary
MySQL 8.0.16 New Features SummaryMySQL 8.0.16 New Features Summary
MySQL 8.0.16 New Features Summary
 
MySQL Day Paris 2018 - Introduction & The State of the Dolphin
MySQL Day Paris 2018 - Introduction & The State of the DolphinMySQL Day Paris 2018 - Introduction & The State of the Dolphin
MySQL Day Paris 2018 - Introduction & The State of the Dolphin
 
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirementsMySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
MySQL Day Paris 2018 - MySQL & GDPR; Privacy and Security requirements
 
MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0
MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0
MySQL Day Paris 2018 - Upgrade from MySQL 5.7 to MySQL 8.0
 
MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...
MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...
MySQL Day Paris 2018 - MySQL InnoDB Cluster; A complete High Availability sol...
 
MySQL Day Paris 2018 - MySQL JSON Document Store
MySQL Day Paris 2018 - MySQL JSON Document StoreMySQL Day Paris 2018 - MySQL JSON Document Store
MySQL Day Paris 2018 - MySQL JSON Document Store
 
MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?
MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?
MySQL Day Paris 2018 - What’s New in MySQL 8.0 ?
 
MySQL 8.0, what's new ? - Forum PHP 2018
MySQL 8.0, what's new ? - Forum PHP 2018MySQL 8.0, what's new ? - Forum PHP 2018
MySQL 8.0, what's new ? - Forum PHP 2018
 
MySQL JSON Document Store - A Document Store with all the benefits of a Trans...
MySQL JSON Document Store - A Document Store with all the benefits of a Trans...MySQL JSON Document Store - A Document Store with all the benefits of a Trans...
MySQL JSON Document Store - A Document Store with all the benefits of a Trans...
 
MySQL 8.0 - What's New ?
MySQL 8.0 - What's New ?MySQL 8.0 - What's New ?
MySQL 8.0 - What's New ?
 
MySQL Document Store - A Document Store with all the benefts of a Transactona...
MySQL Document Store - A Document Store with all the benefts of a Transactona...MySQL Document Store - A Document Store with all the benefts of a Transactona...
MySQL Document Store - A Document Store with all the benefts of a Transactona...
 
MySQL 5.7 InnoDB Cluster (Jan 2018)
MySQL 5.7 InnoDB Cluster (Jan 2018)MySQL 5.7 InnoDB Cluster (Jan 2018)
MySQL 5.7 InnoDB Cluster (Jan 2018)
 
MySQL InnoDB Cluster - A complete High Availability solution for MySQL
MySQL InnoDB Cluster - A complete High Availability solution for MySQLMySQL InnoDB Cluster - A complete High Availability solution for MySQL
MySQL InnoDB Cluster - A complete High Availability solution for MySQL
 
MySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP Paris
MySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP ParisMySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP Paris
MySQL InnoDB Cluster - Meetup Oracle MySQL / AFUP Paris
 
MySQL Day Paris 2016 - MySQL Enterprise Edition
MySQL Day Paris 2016 - MySQL Enterprise EditionMySQL Day Paris 2016 - MySQL Enterprise Edition
MySQL Day Paris 2016 - MySQL Enterprise Edition
 
MySQL Day Paris 2016 - Introducing Oracle MySQL Cloud Service
MySQL Day Paris 2016 - Introducing Oracle MySQL Cloud ServiceMySQL Day Paris 2016 - Introducing Oracle MySQL Cloud Service
MySQL Day Paris 2016 - Introducing Oracle MySQL Cloud Service
 

L'évolution de l'infrastructure BI Viadeo par François Le Lay

  • 1. The evolution of Business Intelligence at Viadeo Techdays 22/11/2012
  • 2. Agenda What is Business Intelligence? Key Roles Viadeo Data Technical Solutions : a short history
  • 3. What is Business Intelligence ? Application Stack Awareness Stack Insights Application Awareness Insights Actions Actions Act Marketing Actions, Business ion Strategies, Operations s Forecasting, Predicting, Statistics, Competitor Insights Information, Analysis Feedback Awareness Reports, Dashboards Meta Data, KPI’s, Visual Templates, Application Stack Security, Information Dissemination, Scheduling Plumbing of structured and Data Warehouse & ETL unstructured data, logic to persists data
  • 4. Key Roles : the Business Analyst Functional Simple Web BI Followup (Challenge (Metrics) BI Dashboards Informatio Product Dashboard Analysis PO) Specification n Access s Specificatio n (Scalars) Technical Complex Proactive (Enforce (Data viz) Direct data quality) (SQL, Datameer)
  • 5. Key Roles : the Big Data Engineer Implement Real Time REST/Scala/Java APIs Data Awarenes Visualization Data plumbing Expose to Apps s Enforce Batch JDBC/ODBC data quality
  • 6. Viadeo data : The Dynamics • 45 million members • Worldwide presence • China, India, Russia, Mexico,.. • Mobile App, Web, API • B2B / B2C Mining User Usage Engagement
  • 8. Technical solutions : The Beginnings Phase 1: 2006-2008 Phase 2 : 2008-2010 Mysql Mysql Server name : Peach Server name : Lakitu Internal tool to allow C-Level, Sales,… Access data
  • 9. Technical solutions : A better architecture Phase 3: 2010 - 2012 MySQL Server name : « Unfied ODS » Mysql Server name : Server name : ODS Live ODS Live Cluster 1 Cluster 2 Server name : Server name : ODS Live ODS Live Cluster 3 Cluster 5
  • 10. Technical solutions : 2 new internal products Scala-centric, Play! framework Cross-channel messaging system Email, Mobile, Social Flexible content management Flexible targeting of recipients Content testing strategies : A/B, multivariate Event-driven : web app events, mobile events, ad hoc events Automation, scheduling, frequency capping Analytics Data visualization : based on Javascript D3.js, processing.js etc. Tabular Reports, OLAP navigation Pluggable alerts : business activity monitoring A common requirement : scalability!!! Viadeo data is Big Processing performance is not an option, it is mandatory
  • 11. Technical solutions : a new architecture based on CQRS pattern
  • 12. Technical solutions : a new architecture • Master dataset : • Historical data stored in HBase • Provided as a service by architects team • Datamarts : • Built on HDFS using MapReduce jobs • MapReduce eased by use of Cascading library and Scala DSL (Scalding) • Pushed to in-memory distributed storage • Elastic Search, Riak
  • 13. Technical solutions : A better architecture P O O SQ MySQL
  • 14. Conclusion • Many scalable data storage solutions • Rapid application development frameworks and low-risk programming languages on the JVM • Custom analytics = what we implement is what we use • Analytical needs are very well identified • Blend data stream and batch processing to answer different needs • Pluggable Data mining R&D • Analytics for Viadeo members/recruiters/companies : Social Media Monitoring as a Complex Event Processing topic
  • 15. ?
  • 16. Thanks ! flelay@viadeoteam.com Tél : 01 75 70 12 93