Manage your data infrastructure like this:
(i) don’t drown the infra teams in domain-specific data details
(ii) build robust low-latency lookup facilities to feed online services
(iii) always take stress out of the equation
At Klarna Bank we make online decisions on risk, fraud, and ID. Over a hundred data sources are processed by over a hundred analysts and over a hundred batch jobs. Three data infrastructure engineering teams operate and develop this data lake: the core team, the apps team, and the performance team. The total head count is less than a dozen.
To keep afloat, we’ve distilled the following practices: (i) the immutability and recomputation properties of the Lambda/Kappa architectures, (ii) continuously delivered and automated infrastructure, (iii) tooling that empowers producers and consumers of data to be accountable and self-sufficient, and (iv) proactive work to improve the efficiency of data users.
We’ll talk about some of these practices and the tools we have built during several years of running banking applications on Hortonworks Hadoop. Ecosystem components we’ll touch on include Kafka, Avro, Hive, Oozie, ELK, Ranger, and Ansible. Tools we have developed include HiveRunner, tooling for data import, and continuous delivery of data pipelines.
Speakers
Erik Zeitler, Senior Data Engineer, PhD, Klarna Bank
Per Ullberg, Lead Software Engineer, Klarna Bank
Who are we?
Online payment provider
Over a decade’s worth of experience in payments
Erik
Interaction is valued - questions are welcome - we’ll see how far we get
Pelle
Linear projection - little domain logic: append only; queryability; downstream performance; validate/invalidate, route, and normalize data.
Single source aggregation - more domain knowledge: velocity, sessions, etc.; may be incremental, since a single source makes it easy to understand what has changed.
General data processing - massive domain knowledge: F(all data) across multiple sources; big performance gains.
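To make the first tier concrete, here is a minimal sketch of a linear projection stage as a Kafka Streams topology: stateless validate, route, and normalize, with append-only output. The topic names and the validation and normalization rules are placeholders, not our production logic.

```java
// Sketch of the first tier as a Kafka Streams topology: a stateless
// validate -> route -> normalize pass with append-only output topics.
// Topic names and the validation/normalization rules are placeholders.
import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.streams.kstream.KStream;

import java.util.Properties;

public class LinearProjection {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "linear-projection");
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(StreamsConfig.DEFAULT_KEY_SERDE_CLASS_CONFIG, Serdes.String().getClass());
        props.put(StreamsConfig.DEFAULT_VALUE_SERDE_CLASS_CONFIG, Serdes.String().getClass());

        StreamsBuilder builder = new StreamsBuilder();
        KStream<String, String> raw = builder.stream("raw-events");

        // Validate: a cheap structural check splits the stream in two.
        KStream<String, String>[] branches = raw.branch(
                (k, v) -> v != null && v.startsWith("{"), // looks like a record
                (k, v) -> true);                          // everything else

        // Normalize (trivially, here) and append to the clean topic.
        branches[0].mapValues(String::trim).to("valid-events");
        // Route rejects to a quarantine topic so producers can inspect them.
        branches[1].to("invalid-events");

        // No joins, no state: the stage stays linear and cheap to rerun.
        new KafkaStreams(builder.build(), props).start();
    }
}
```

Because the stage holds no state, it can be replayed from the start of the log at any time - which is what makes the immutability and recomputation properties of Lambda/Kappa cheap to exploit.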
Pelle
Transaction logs on Kafka
Ingestion from the cloud via mirrored Kafka topics.
Beware of your data model! If all rows change every day, you gain nothing from using the change-capture log.
Write in the cloud - read on prem
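As an illustration of the read side, the sketch below materializes current state from a mirrored change-capture topic. The broker address, topic name, and the full-row-with-tombstones convention are assumptions for this sketch, not a description of our actual setup.

```java
// Sketch of materializing current state on prem from a mirrored
// change-capture topic. Broker, group, and topic names are invented;
// values are assumed to be full rows, with null meaning delete
// (the usual log-compaction convention).
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.KafkaConsumer;

import java.time.Duration;
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;

public class CdcMaterializer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "onprem-broker:9092"); // mirrored from the cloud
        props.put("group.id", "cdc-materializer");
        props.put("auto.offset.reset", "earliest");
        props.put("key.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");

        Map<String, String> state = new HashMap<>(); // on-prem view, keyed by primary key

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(Collections.singletonList("cloud.transactions.cdc"));
            while (true) {
                for (ConsumerRecord<String, String> rec : consumer.poll(Duration.ofSeconds(1))) {
                    if (rec.value() == null) {
                        state.remove(rec.key());           // tombstone: row deleted upstream
                    } else {
                        state.put(rec.key(), rec.value()); // upsert the new row version
                    }
                }
                // The caveat above in code terms: if every key changes every day,
                // a day of this log is as large as a full snapshot, and the
                // change-capture log buys you nothing.
            }
        }
    }
}
```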
Erik
Domain-agnostic infra teams
Operate infra on a skeleton crew
Storage
Processing
Frameworks
Self-service infra
Redirect questions
Producer requirements
Infra is no middleman
Producer awareness
“API discovery”
This can be a conflict!
Provide self-service
Build more tools
Document
Slack support channel
Dedup
Guaranteed delivery
Binary to queryable
Closed partitions
Dataset discovery
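As one example of this self-service tooling, a producer can test pipeline logic with HiveRunner, our Hive test framework mentioned in the abstract. The sketch below checks a dedup step; the database, table, and query are invented for illustration.

```java
// A sketch of producer-side testing with HiveRunner (our Hive test
// framework). The database, table, and the dedup query under test are
// made up for illustration.
import com.klarna.hiverunner.HiveShell;
import com.klarna.hiverunner.StandaloneHiveRunner;
import com.klarna.hiverunner.annotations.HiveSQL;
import org.junit.Assert;
import org.junit.Test;
import org.junit.runner.RunWith;

import java.util.List;

@RunWith(StandaloneHiveRunner.class)
public class DedupTest {

    @HiveSQL(files = {})
    private HiveShell shell;

    @Test
    public void duplicateDeliveriesCollapseToOneRow() {
        shell.execute("CREATE DATABASE source_db");
        shell.execute("CREATE TABLE source_db.events (id STRING, payload STRING)");
        // Two rows with the same id simulate an at-least-once redelivery.
        shell.execute("INSERT INTO source_db.events VALUES ('a', 'x'), ('a', 'x'), ('b', 'y')");

        // Dedup with a window function keyed on the event id.
        List<String> rows = shell.executeQuery(
                "SELECT id FROM ("
                + " SELECT id, ROW_NUMBER() OVER (PARTITION BY id ORDER BY id) AS rn"
                + " FROM source_db.events) t WHERE rn = 1");

        Assert.assertEquals(2, rows.size());
    }
}
```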
Erik
Proactive professional services!
I know this sounds like lame consultant speak.
But we really needed this: we have to solve the kind of performance problems you saw on the previous slide before they turn into incidents.
To this end, we needed tooling. We use ELK to store, search, and display performance data.
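As a sketch (not our actual pipeline), shipping one document per executed query to Elasticsearch via its low-level REST client is enough to search and graph runtimes in Kibana. The host, index, and document fields below are invented.

```java
// Sketch only: ship a per-query performance sample to Elasticsearch with
// the low-level REST client, to be searched and graphed in Kibana.
// Host, index name, and document fields are invented for illustration.
import org.apache.http.HttpHost;
import org.elasticsearch.client.Request;
import org.elasticsearch.client.RestClient;

public class QueryMetricsShipper {
    public static void main(String[] args) throws Exception {
        try (RestClient client = RestClient.builder(
                new HttpHost("elasticsearch", 9200, "http")).build()) {
            Request req = new Request("POST", "/query-metrics/_doc");
            req.setJsonEntity("{\"user\":\"analyst42\",\"query_id\":\"q-123\","
                    + "\"runtime_ms\":95000,\"bytes_read\":123456789}");
            client.performRequest(req); // one document per executed query
        }
    }
}
```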
Optimization is worthless if the output is incorrect.
We needed a validation tool that works at scale: compare two Hive databases, checking the output of the original queries against the output of the optimized queries.
So we built one: Difftång.
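Difftång’s internals are beyond this abstract, but the core idea can be sketched: the symmetric difference between the original and the optimized output tables should be empty. Below is a JDBC sketch against HiveServer2, with hypothetical connection details, tables, and columns throughout.

```java
// Not Difftång itself - just the core idea: find rows that occur a
// different number of times in the original and optimized result tables.
// All connection details, table names, and columns are hypothetical.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class OutputDiff {
    public static void main(String[] args) throws Exception {
        String diff =
                "SELECT id, amount FROM ("
                + " SELECT id, amount, 'orig' AS src FROM original_db.result"
                + " UNION ALL"
                + " SELECT id, amount, 'opt' AS src FROM optimized_db.result) u"
                + " GROUP BY id, amount"
                + " HAVING SUM(CASE WHEN src = 'orig' THEN 1 ELSE 0 END)"
                + "     <> SUM(CASE WHEN src = 'opt' THEN 1 ELSE 0 END)";

        try (Connection conn = DriverManager.getConnection(
                     "jdbc:hive2://hiveserver:10000/default");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(diff)) {
            if (rs.next()) {
                System.out.println("Mismatch, e.g. id=" + rs.getString("id"));
            } else {
                System.out.println("Outputs agree."); // symmetric difference is empty
            }
        }
    }
}
```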