Mirko Kämpf

Meetup - Hadoop User Group - Munich : 2013-05-22

by Instructor and Data Analyst at University of Halle-Wittenberg on 27 May 2013


Abstract: A Hadoop cluster is the tool of choice for many large-scale analytics applications. A wide variety of commercial tools is available for typical SQL-like or data warehouse applications, but how should one deal with networks and time series?
How should data for social media analysis be collected and stored, and what are good practices for working with libraries like Mahout and Giraph?
The sample use case uses a data set from Wikipedia to illustrate how to combine multiple public data sources with personal data collections, e.g. from Twitter or even personal mailboxes. We discuss efficient approaches to data organisation, data preprocessing, and time-dependent graph analysis.

Presentation Transcript