Personal Information
Unternehmen/Arbeitsplatz
San Francisco Bay Area United States
Beruf
Engineering at LinkedIn
Webseite
www.linkedin.com
Info
Objective: Engineer systems & algorithms to help users get to the content they need.
Summary:
Hands-on experience with distributed systems for both online and offline data processing.
Designed and implemented low-latency high-throughput online retrieval systems from scratch, doing micro and millisecond latencies for few hundred QPS per node (without caching).
Designed and implemented simple & extensible data-infrastructure for offline data processing pipelines on hadoop. These range from simple search-index building pipelines, to non-trivial pipelines to do machine learning algorithms. Using tools like plain java map/reduce, pig, hive, spark, scalding and so forth (ordered by familiari...
Tags
concurrency
jvm
stm
multi-core
lock-free
transactional memory
overview
presto
mapreduce
hadoop
hdfs
spark
big data
Mehr anzeigen
Präsentationen
(2)Gefällt mir
(3)Invokedynamic in 45 Minutes
Charles Nutter
•
Vor 11 Jahren
Distributed Consensus A.K.A. "What do we eat for lunch?"
Konrad Malawski
•
Vor 9 Jahren
DocValues aka. Column Stride Fields in Lucene 4.0 - By Willnauer Simon
lucenerevolution
•
Vor 12 Jahren
Personal Information
Unternehmen/Arbeitsplatz
San Francisco Bay Area United States
Beruf
Engineering at LinkedIn
Webseite
www.linkedin.com
Info
Objective: Engineer systems & algorithms to help users get to the content they need.
Summary:
Hands-on experience with distributed systems for both online and offline data processing.
Designed and implemented low-latency high-throughput online retrieval systems from scratch, doing micro and millisecond latencies for few hundred QPS per node (without caching).
Designed and implemented simple & extensible data-infrastructure for offline data processing pipelines on hadoop. These range from simple search-index building pipelines, to non-trivial pipelines to do machine learning algorithms. Using tools like plain java map/reduce, pig, hive, spark, scalding and so forth (ordered by familiari...
Tags
concurrency
jvm
stm
multi-core
lock-free
transactional memory
overview
presto
mapreduce
hadoop
hdfs
spark
big data
Mehr anzeigen