Personal Information
Unternehmen/Arbeitsplatz
London, United Kingdom United Kingdom
Beruf
Data Science and Big Data
Branche
Technology / Software / Internet
Info
Problem Solver. Python/Hadoop Coder. I have done end to end work involving development, administration and Data Science in Big Data.
I have set up Hadoop clusters, built ETL pipelines by writing MapReduce/Spark code and have worked on data science problems. I have used a variety of technologies including Spark, Hive, Pig, HBase, R, etc.
I look at Big Data everyday and use map reduce features of Hadoop to solve big data problems and extract useful information from them. I have done expert work in search quality by analyzing millions of queries searched by users everyday.
Here are some Data Science problems I have worked on solving so far
1) Understand the relationships between users wh...
Tags
newbie
pycon
python
programming
pycon2010
Mehr anzeigen
Präsentationen
(2)Dokumente
(1)Gefällt mir
(24)Netezza Architecture and Administration
Braja Krishna Das
•
Vor 7 Jahren
Netezza Deep Dives
Rush Shah
•
Vor 7 Jahren
Notes from Coursera Deep Learning courses by Andrew Ng
Tess Ferrandez
•
Vor 6 Jahren
Strata NYC 2015: Sketching Big Data with Spark: randomized algorithms for large-scale data analytics
Databricks
•
Vor 8 Jahren
Developing Real-Time Data Pipelines with Apache Kafka
Joe Stein
•
Vor 8 Jahren
Scala - The Simple Parts, SFScala presentation
Martin Odersky
•
Vor 9 Jahren
Pragmatic Real-World Scala (short version)
Jonas Bonér
•
Vor 15 Jahren
Scala Data Pipelines @ Spotify
Neville Li
•
Vor 8 Jahren
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
•
Vor 9 Jahren
Hive tuning
Michael Zhang
•
Vor 10 Jahren
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks
•
Vor 8 Jahren
Spark Summit East 2015 Advanced Devops Student Slides
Databricks
•
Vor 9 Jahren
DTCC '14 Spark Runtime Internals
Cheng Lian
•
Vor 10 Jahren
Tuning and Debugging in Apache Spark
Databricks
•
Vor 9 Jahren
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
Vor 9 Jahren
Why Scala Is Taking Over the Big Data World
Dean Wampler
•
Vor 9 Jahren
storm at twitter
Krishna Gade
•
Vor 10 Jahren
Collaborative Filtering with Spark
Chris Johnson
•
Vor 9 Jahren
DataFu @ ApacheCon 2014
William Vaughan
•
Vor 10 Jahren
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Paco Nathan
•
Vor 9 Jahren
Hadoop World 2011: Advanced HBase Schema Design - Lars George, Cloudera
Cloudera, Inc.
•
Vor 12 Jahren
HBase schema design Big Data TechCon Boston
amansk
•
Vor 11 Jahren
HBaseCon 2012 | HBase Schema Design - Ian Varley, Salesforce
Cloudera, Inc.
•
Vor 11 Jahren
The 21 Coolest Internet Of Things Gadgets
Bernard Marr
•
Vor 9 Jahren
Personal Information
Unternehmen/Arbeitsplatz
London, United Kingdom United Kingdom
Beruf
Data Science and Big Data
Branche
Technology / Software / Internet
Info
Problem Solver. Python/Hadoop Coder. I have done end to end work involving development, administration and Data Science in Big Data.
I have set up Hadoop clusters, built ETL pipelines by writing MapReduce/Spark code and have worked on data science problems. I have used a variety of technologies including Spark, Hive, Pig, HBase, R, etc.
I look at Big Data everyday and use map reduce features of Hadoop to solve big data problems and extract useful information from them. I have done expert work in search quality by analyzing millions of queries searched by users everyday.
Here are some Data Science problems I have worked on solving so far
1) Understand the relationships between users wh...
Tags
newbie
pycon
python
programming
pycon2010
Mehr anzeigen