BigData - ...what is it all about?
Daniel Koller, @dakoller, http://blog.dakoller.net
Data has always been there
          http://www.flickr.com/photos/charlestilford/2552654321/in/photostream/
A Boeing jet produces 20 TB of data in flight ...
    http://www.flickr.com/photos/idfonline/5707336691/in/photostream/
A Boeing jet produces 20 TB of data in flight ... PER HOUR!
    http://www.flickr.com/photos/idfonline/5707336691/in/photostream/
New dimensions of data
•   [company logo] handles more than 1 million customer transactions every
    hour, feeding more than 2.5 petabytes of data: the equivalent of 167 times
    the information contained in all the books in the US Library of Congress.

•   [company logo] handles 40 billion photos from its user base.

•   Decoding the human genome originally took 10 years to process; now it can
    be achieved in one week.
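
To make the scale of these figures easier to grasp, the arithmetic behind two of them can be sketched as follows. This is only a back-of-the-envelope check, assuming decimal units (1 PB = 10^15 bytes) and a year of 365.25 days; the variable names are illustrative and not from the slides.

```python
# Back-of-the-envelope arithmetic on the figures quoted on the slide.
# Assumption: decimal units (1 PB = 10**15 bytes, 1 TB = 10**12 bytes).
PETABYTE = 10**15
TERABYTE = 10**12

database_bytes = 2.5 * PETABYTE      # "> 2.5 petabytes of data"
library_equivalents = 167            # "167 times the ... Library of Congress"

# Implied size of one "Library of Congress" unit, in terabytes (roughly 15 TB).
tb_per_library = database_bytes / library_equivalents / TERABYTE

# Speed-up implied by "10 years" shrinking to "one week".
days_before = 10 * 365.25            # assumption: average year length
days_now = 7
speedup = days_before / days_now     # a bit over 500x

print(f"One Library of Congress unit is about {tb_per_library:.1f} TB")
print(f"Processing got about {speedup:.0f} times faster")
```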
...a kind of summary:
"Tools and techniques to manage different types of data, in high volume, in
high velocity, with varied requirements to mine them"
Size
  Scale up and scale out: Terabyte, Petabyte …

Stream
  Torrent of real-time information

Structure
  Structured
  Unstructured: Audio, Video, Text, GeoSpatial
  Schema-less structures

Operation
  Massively Parallel Processing (MPP)
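
The "scale out" and MPP idea above boils down to a pattern: split the data into partitions, process each partition independently, then combine the partial results. A minimal sketch of that pattern follows. It uses threads on a single machine purely for illustration, whereas real MPP systems spread partitions across many machines; the function names and the sample data are assumptions, not part of the talk.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(records, n_parts):
    """Split records into n_parts slices (round-robin)."""
    return [records[i::n_parts] for i in range(n_parts)]


def partial_sum(chunk):
    """Work done independently on one partition."""
    return sum(chunk)


def parallel_total(records, n_workers=4):
    """Sum all records by combining per-partition partial sums."""
    chunks = partition(records, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial_sum, chunks))
    return sum(partials)  # the "combine" step


# Hypothetical data: one million transaction amounts 1, 2, ..., 1_000_000.
transactions = list(range(1, 1_000_001))
total = parallel_total(transactions)
print(total)
```

Because each partition is processed without looking at the others, adding more workers (or machines) is what lets this pattern grow with the data.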
Which techniques can you use to handle it?
•   Machine Learning
•   Natural Language Processing (NLP)
•   Cohort Analysis
•   Network or Path Analysis
•   Predictive Models
•   Crowd Sourcing
•   Regression Models
•   Sentiment Analysis
•   Processing Signals
•   Spatial Analytics
•   Visualization
•   Time-series Analysis
Techniques: Machine learning
•   "A computer program is said to learn from experience E with respect to
    some class of tasks T and performance measure P, if its performance at
    tasks in T, as measured by P, improves with experience E" (Tom M. Mitchell)

•   Supervised / unsupervised learning

•   Use cases: computer vision, recommender systems (Netflix Prize),
    self-driving cars

•   http://en.wikipedia.org/wiki/Machine_learning
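
The definition above can be made concrete with a toy supervised example. In this sketch the task T is labelling a number as "low" or "high", the experience E is a list of labelled examples, and the performance measure P is accuracy on a fixed test set. The 1-nearest-neighbour rule, the data, and all names are illustrative assumptions, not something taken from the talk.

```python
def predict(train, x):
    """1-nearest-neighbour: return the label of the closest training example."""
    nearest = min(train, key=lambda example: abs(example[0] - x))
    return nearest[1]


def accuracy(train, test):
    """Performance measure P: share of test points labelled correctly."""
    correct = sum(1 for x, label in test if predict(train, x) == label)
    return correct / len(test)


# Experience E: labelled examples (hypothetical data; true rule is x < 5 -> "low").
examples = [(1.0, "low"), (7.0, "high"), (4.0, "low"), (6.0, "high")]

# Fixed test set for task T, labelled with the same true rule.
test_points = (0.5, 2.0, 3.5, 4.5, 4.9, 5.1, 5.5, 6.5, 8.0, 9.5)
test_set = [(x, "low" if x < 5 else "high") for x in test_points]

acc_small = accuracy(examples[:2], test_set)  # little experience -> 0.8
acc_large = accuracy(examples, test_set)      # more experience   -> 1.0
print(acc_small, acc_large)
```

With only two examples the learner places the boundary too low and misclassifies 4.5 and 4.9. With four examples it gets every test point right, which is exactly "P improves with E" on this toy task.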
Techniques: Natural Language Processing (NLP)
•   Covers:
    •  stopword identification,
    •  entity recognition,
    •  machine translation,
    •  parsing & chunking of sentences
•   Useful wherever user-generated content comes up
•   Very good support for English, good support for European languages,
    limited support for other languages
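
As a small illustration of the first item in the list above, stopword identification can be sketched with nothing but the standard library. The stopword set here is a tiny hypothetical list chosen for the example; real NLP toolkits ship much larger, language-specific lists.

```python
import re

# Hypothetical, deliberately tiny English stopword list (an assumption).
STOPWORDS = {"the", "a", "an", "is", "it", "of", "and", "to", "in"}


def tokenize(text):
    """Lower-case the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def remove_stopwords(tokens):
    """Keep only tokens that carry content, dropping the stopwords."""
    return [token for token in tokens if token not in STOPWORDS]


# Example of user-generated content: a short review.
tokens = tokenize("The plot of the film is thin, and it drags.")
content_words = remove_stopwords(tokens)
print(content_words)  # ['plot', 'film', 'thin', 'drags']
```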
Techniques: Spatial Analytics
•   Discover geographic contexts in an information source

•   Requires localizable data (e.g. location names, coordinates) of some
    quality

•   Examples: visualizing social networks, the Black Death in Europe (shown
    on the right), Google Flu Trends
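
A rough sketch of what "localizable data" makes possible: place names found in a text are mapped to coordinates, and the coordinates can then be compared. The gazetteer below is a hypothetical two-entry lookup table with approximate city coordinates, and the sample sentence is invented; the distance uses the standard haversine formula with a mean Earth radius of 6371 km.

```python
import math

# Hypothetical mini-gazetteer: place name -> approximate (latitude, longitude).
GAZETTEER = {
    "munich": (48.14, 11.58),
    "berlin": (52.52, 13.40),
}


def find_places(text):
    """Return (name, coordinates) for every gazetteer name found in the text."""
    words = text.lower().replace(",", " ").replace(".", " ").split()
    return [(word, GAZETTEER[word]) for word in words if word in GAZETTEER]


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


# Invented sample text standing in for an information source.
places = find_places("Flu reports rose in Munich, then in Berlin.")
distance = haversine_km(places[0][1], places[1][1])
print(places)
print(f"{distance:.0f} km apart")
```

The quality caveat from the slide shows up immediately in practice: a misspelled or ambiguous place name simply would not match the lookup table.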
Which techniques can you use to handle it?
•   Machine Learning
•   Natural Language Processing (NLP)
•   Cohort Analysis
•   Network or Path Analysis
•   Predictive Models
•   Crowd Sourcing
•   Regression Models
•   Sentiment Analysis
•   Processing Signals
•   Spatial Analytics
•   Visualization
•   Time-series Analysis
3 Items to take home
•   You can now solve tasks that previously were simply not possible because
    of limited resources.

•   State your business problem before looking at the data.

•   Try a combination of different techniques to optimize the result.
