Big Data Real Time Analytics - A Facebook Case Study

Real Time Analytics for Big Data Lessons from Facebook..

The Real Time Boom.. ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved Google Real Time Web Analytics Google Real Time Search Facebook Real Time Social Analytics Twitter paid tweet analytics SaaS Real Time User Tracking New Real Time Analytics Startups..

The data resolution & processing models

Traditional analytics applications ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

CEP – Complex Event Processing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

In Memory Data Grid ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

NoSQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

Goals ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

The solution.. PTail Scribe Puma Hbase HDFS Real Time Long Term Batch 1.5 Sec 10,000 write/sec per server FACEBOOK Log FACEBOOK Log FACEBOOK Log

Step 1: Use memory.. ,[object Object],[object Object],[object Object],[object Object],[object Object],® Copyright 2011 Gigaspaces Ltd. All Rights Reserved Events Memory Grid Data Grid Data Grid Data Grid FACEBOOK FACEBOOK FACEBOOK

Step 2 – Collocate ,[object Object],Events Processing Grid Data Grid Data Grid Data Grid FACEBOOK FACEBOOK FACEBOOK

Step 2 – Collocate ,[object Object],Events Processing Grid Data Grid Data Grid Data Grid FACEBOOK FACEBOOK FACEBOOK @EventDriven @Polling public class SimpleListener { @EventTemplate Data unprocessedData () { Data template = new Data (); template . setProcessed ( false ); return template ; } @SpaceDataEvent public Data eventListener ( Data event ) { //process Data here } }

Step 3 – Write behind to SQL/NoSQL Events Processing Grid Open Long Term persistency Write Behind FACEBOOK FACEBOOK FACEBOOK Data Grid Data Grid Data Grid

Putting it all together Analytic Application Event Sources Write behind ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Generate Patterns

Putting it all together Analytic Application Event Sources Write behind ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Generate Patterns Real Time Map/Reduce R Script script = new StaticScritpt( “groovy”,”println hi; return 0”) Query q = em.createNativeQuery( “execute ?”); q.setParamter(1, script); Integer result = query.getSingleResult();

5x better performance per server! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Event injector Up to 128 threads GigaSpaces/ (Other Msg Server) App Services Up to 128 threads Other Giga 50,000 write/sec per server

Live demo Inter Day Activity (Real Time) Monthly Trend Analysis

Summary Big Data Development Made Simple: Focus on your business logic, Use Big Data platform for dealing scalability, performance, continues availability ,.. Its Open: Use Any Stack : Avoid Lockin Any database (RDBMS or NoSQL); Any Cloud, Use common API’s & Frameworks . All While Minimizing Cost Use Memory & Disk for optimum cost/performance . Built-in Automation and management - Reduces operational costs Elasticity – reduce over provisioning cost

Thank YOU! @natishalom http://blog.gigaspaces.com

Big Data Real Time Analytics - A Facebook Case Study

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (6)

Ähnlich wie Big Data Real Time Analytics - A Facebook Case Study

Ähnlich wie Big Data Real Time Analytics - A Facebook Case Study (20)

Mehr von Nati Shalom

Mehr von Nati Shalom (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Big Data Real Time Analytics - A Facebook Case Study

Hinweis der Redaktion