Personal Information
Company/Workplace
San Francisco, CA United States
Occupation
Senior Data Engineer at Workday
Industry
Technology / Software / Internet
Website
github.com/erenavsarogullari
Info
Eren is a highly motivated senior software developer and an enthusiast of JVM-based technologies.
His areas of interest are Scala, Akka, Apache Spark, Apache Hadoop, Big Data, Distributed & Parallel Computing, High Availability & Scalability.
He holds a B.Sc. degree in Electrical & Electronics Engineering and an M.Sc. degree in Control & Automation Engineering.
Technical Articles: https://dzone.com/users/938353/eren_avsarogullari.html
GitHub: https://github.com/erenavsarogullari
Tags
apache spark
batch processing
spark on yarn
springone
spring
spring integration
multi tenancy
spark sql metrics
apache spark upgrade
etl
sql on hadoop
distributed computing engine
sql
distributed sql engine
gc policy
storage level
job scheduling
data locality
data skew
serialization
checkpointing
event sourcing
partitioning
persistency
data structures
best practices
apache pulsar
stream processing
streaming
data processing patterns
data pipelines
rdd persistency
catalyst optimizer
tungsten
spark job lifecycle
spark ecosystem
spark internals
dataset
dataframe
rdd
hazelcast
Presentations (6)