apache spark batch processing spark on yarn springone spring spring integration multi tenancy spark sql metrics apache spark upgrade etl sql on hadoop distributed computing engine sql distributed sql engine gc policy storage level job scheduling data locality data skew serialization checkpointing event sourcing partitioning persistency data structures best practices apache pulsar stream processing streaming data processing patterns data pipelines rdd persistency catalyst optimizer tungsten spark job lifecycle spark ecosystem spark internals dataset dataframe rdd hazelcast
Mehr anzeigen