Personal Information
Company/Workplace
San Francisco Bay Area, QC United States
Occupation
Data scientist at Stitch Fix
Industry
Retail
Info
Data paranoid, failed entrepreneur, ex-stock trader, father, Canadian in the US, Shanghainese.
Programming since 13 (QBasic in DOS on a 386 PC with a 5" floppy disk). Once studied physics, then went to Canada to learn more about business. Built a company, then got hit by the financial crisis. Got married and moved to the US. Moved to Silicon Valley with my wife when she got a job there.
Love freedom and enjoy all the randomness in life.
Highest Kaggle rank: 1076th / 300k https://www.kaggle.com/piggybox
http://stackoverflow.com/users/2102764/piggybox
https://github.com/piggybox
Tags
database
time-series
functional programming
inventory
spark redshift data-engineering spark-summit
spark
redshift
data quality
data cleansing
machine learning
etl
data munging
data wrangling
Presentations (4)
Likes (7)
Kubernetes on AWS at Zalando: Failures & Learnings - DevOps NRW
Henning Jacobs • 6 years ago
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Cloudera, Inc. • 8 years ago
(BDT303) Running Spark and Presto on the Netflix Big Data Platform
Amazon Web Services • 8 years ago
Spark shuffle introduction
colorant • 9 years ago
Streaming SQL
Julian Hyde • 8 years ago
Choosing an HDFS data storage format- Avro vs. Parquet and more - StampedeCon 2015
StampedeCon • 8 years ago
Effective testing for spark programs Strata NY 2015
Holden Karau • 8 years ago