Personal Information
Unternehmen/Arbeitsplatz
San Francisco Bay Area, CA United States
Beruf
Data Expert with System Architecture Insight
Branche
Technology / Software / Internet
Webseite
goldenorbit.wordpress.com
Info
With the thorough understandings of data, application & network architecture, Eric has developed & proven a set of approaches to improve the performance & ROI by 50%~200% based on the company's existing DW/BI infrastructure.
His 1st philosophy is to make the best use of the tools and to create better tools, as he has witnessed many poor project results simply because everyone expects the out-of-box features to satisfy all the requirements, yet few are willing to to deep dive into the tool and explore its full potential.
We often debates about which tool is the best, yet Eric believes that it is crucial to provide the valuable consulting and eduction to enable more team members and clien...
Tags
hadoop
incremental
upsert
time travel
data warehouse
hive
hudi
delta
iceberg
data lake
big data
json
etl
nosql
sql
elt
jdbc
fastload
mapreduce
tdch
teradata
Mehr anzeigen
Präsentationen
(4)Gefällt mir
(67)Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Tristan Baker
•
Vor 2 Jahren
Spark SQL Bucketing at Facebook
Databricks
•
Vor 4 Jahren
Modernizing Big Data Workload Using Amazon EMR & AWS Glue
Noritaka Sekiyama
•
Vor 4 Jahren
How to test infrastructure code: automated testing for Terraform, Kubernetes, Docker, Packer and more
Yevgeniy Brikman
•
Vor 4 Jahren
Presto Strata London 2019: Cost-Based Optimizer for interactive SQL on anything
Piotr Findeisen
•
Vor 4 Jahren
Trillion Dollar Coach Book (Bill Campbell)
Eric Schmidt
•
Vor 5 Jahren
"Smooth Operator" [Bay Area NewSQL meetup]
Kevin Xu
•
Vor 5 Jahren
Dynamic pricing of Lyft rides using streaming
Amar Pai
•
Vor 5 Jahren
YugaByte DB Internals - Storage Engine and Transactions
Yugabyte
•
Vor 5 Jahren
What’s new in Apache Spark 2.3
DataWorks Summit
•
Vor 5 Jahren
ORC improvement in Apache Spark 2.3
DataWorks Summit
•
Vor 6 Jahren
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
Vor 6 Jahren
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
Vor 6 Jahren
Apache Arrow: In Theory, In Practice
Dremio Corporation
•
Vor 6 Jahren
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginners | Edureka
Edureka!
•
Vor 6 Jahren
Top 5 Deep Learning and AI Stories - October 6, 2017
NVIDIA
•
Vor 6 Jahren
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Rosen, Databricks)
Spark Summit
•
Vor 8 Jahren
Handling Data Skew Adaptively In Spark Using Dynamic Repartitioning
Spark Summit
•
Vor 7 Jahren
Scala Reflection & Runtime MetaProgramming
Meir Maor
•
Vor 7 Jahren
What to Expect for Big Data and Apache Spark in 2017
Databricks
•
Vor 7 Jahren
Hive: Loading Data
Benjamin Leonhardi
•
Vor 8 Jahren
Tuning Java for Big Data
Scott Seighman
•
Vor 9 Jahren
Deep Dive Into Catalyst: Apache Spark 2.0'S Optimizer
Spark Summit
•
Vor 7 Jahren
Introducing Neo4j 3.0
Neo4j
•
Vor 7 Jahren
File Format Benchmark - Avro, JSON, ORC & Parquet
DataWorks Summit/Hadoop Summit
•
Vor 7 Jahren
Dongwon Kim – A Comparative Performance Evaluation of Flink
Flink Forward
•
Vor 8 Jahren
Why apache Flink is the 4G of Big Data Analytics Frameworks
Slim Baltagi
•
Vor 8 Jahren
Apache Hive Hook
Minwoo Kim
•
Vor 10 Jahren
Spark etl
Imran Rashid
•
Vor 8 Jahren
Hive tuning
Michael Zhang
•
Vor 10 Jahren
Personal Information
Unternehmen/Arbeitsplatz
San Francisco Bay Area, CA United States
Beruf
Data Expert with System Architecture Insight
Branche
Technology / Software / Internet
Webseite
goldenorbit.wordpress.com
Info
With the thorough understandings of data, application & network architecture, Eric has developed & proven a set of approaches to improve the performance & ROI by 50%~200% based on the company's existing DW/BI infrastructure.
His 1st philosophy is to make the best use of the tools and to create better tools, as he has witnessed many poor project results simply because everyone expects the out-of-box features to satisfy all the requirements, yet few are willing to to deep dive into the tool and explore its full potential.
We often debates about which tool is the best, yet Eric believes that it is crucial to provide the valuable consulting and eduction to enable more team members and clien...
Tags
hadoop
incremental
upsert
time travel
data warehouse
hive
hudi
delta
iceberg
data lake
big data
json
etl
nosql
sql
elt
jdbc
fastload
mapreduce
tdch
teradata
Mehr anzeigen