Big Data in REAL TIME
Ron Zavner
We’re Living in a Real Time World…
        Social
        User Tracking & Engagement
        Homeland Security
        eCommerce
        Financial Services
        Real Time Search

2                 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
The Flavors of Big Data Analytics
       Counting
       Correlating
       Research

Twitter in Numbers (March 2011)
     It takes a week for users to send 1 billion tweets
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

Twitter in Numbers (March 2011)
     On average, 140 million tweets get sent every day
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

Twitter in Numbers (March 2011)
     The highest throughput to date is 6,939 tweets/sec.
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

Twitter in Numbers (March 2011)
     460,000 new accounts are created daily
                                                       Source: http://blog.twitter.com/2011/03/numbers.html
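
As a rough sanity check on the tweet-volume figures quoted above, the daily average can be converted to a per-second rate and compared with the peak. The sketch below uses only the numbers from the slides; the variable names are illustrative.

```python
# Figures quoted on the slides (Twitter, March 2011).
TWEETS_PER_DAY = 140_000_000      # average daily volume
PEAK_TWEETS_PER_SEC = 6_939       # highest throughput to date
SECONDS_PER_DAY = 24 * 60 * 60

# Average rate implied by the daily volume.
avg_tweets_per_sec = TWEETS_PER_DAY / SECONDS_PER_DAY
# How far the recorded peak exceeds that average.
peak_to_avg = PEAK_TWEETS_PER_SEC / avg_tweets_per_sec
# Cross-check against the "1 billion tweets in a week" slide.
days_to_one_billion = 1_000_000_000 / TWEETS_PER_DAY

print(f"average rate: {avg_tweets_per_sec:,.0f} tweets/sec")
print(f"peak is {peak_to_avg:.1f}x the average")
print(f"1 billion tweets takes about {days_to_one_billion:.1f} days")
```

The average works out to roughly 1,600 tweets per second, so the recorded peak is a little more than four times the average. A system sized only for the average rate would fall behind during bursts.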

Challenge – Word Count
       [Diagram labels: Tweets, ?, Count, Count, Word:Count]

Analyze the Problem
       Thousands of tweets per second to process
       Aggregate counters for each word
       Latency – less than a second
       System needs to scale linearly
       System needs to be fault tolerant
       Querying & Persisting Data
       Managing the system
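
The requirements above can be illustrated with a minimal in-memory sketch: words are routed to partitions by a hash of the word, so each counter lives in exactly one partition and partitions can be updated independently. This is not the GigaSpaces data grid API. The class `PartitionedWordCounter`, its methods, and the tokenizing rule are assumptions made for illustration, and fault tolerance and persistence are left out.

```python
import re
import zlib
from collections import Counter


class PartitionedWordCounter:
    """Hypothetical in-memory stand-in for word counters spread over partitions."""

    def __init__(self, num_partitions=4):
        # One Counter per partition; in a real deployment each partition
        # would live on a separate node.
        self.partitions = [Counter() for _ in range(num_partitions)]

    def _partition_for(self, word):
        # Stable hash so the same word always maps to the same partition.
        return zlib.crc32(word.encode("utf-8")) % len(self.partitions)

    def process_tweet(self, text):
        # Assumed tokenizer: lowercase runs of letters and apostrophes.
        for word in re.findall(r"[a-z']+", text.lower()):
            self.partitions[self._partition_for(word)][word] += 1

    def count(self, word):
        # A single-word query only touches the owning partition.
        return self.partitions[self._partition_for(word)][word]

    def top(self, n):
        # Scatter-gather query: merge per-partition counts, then rank.
        merged = Counter()
        for partition in self.partitions:
            merged.update(partition)
        return merged.most_common(n)


counter = PartitionedWordCounter(num_partitions=4)
for tweet in ["Just found a great cafe", "Love it, just love it", "Just landed"]:
    counter.process_tweet(tweet)

print(counter.count("just"))
print(counter.top(1))
```

Because a word always hashes to the same partition, adding partitions spreads the update load without cross-partition coordination on writes. Only the ranking query has to gather results from every partition.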




Tier Based Architecture?




Data Grid 




Putting it all together




The 3 Most Popular Words on Twitter?
                  1. Just
                  2. Found
                  3. Love
                  - August 2012

Q&A
       RonZ@gigaspaces.com


