Apache flink 1.0.0 overview

•Download as PPTX, PDF•

1 like•502 views

Apache Flink Overview presented by Jamie Grier in Flink Meetup at MaR Headquarters. https://community.mapr.com/docs/DOC-1503/edit?draftID=3848

Technology

What’s new in
Apache FlinkTM 1.0
Kostas Tzoumas
@kostas_tzoumas

Flink 1.0
• March 8, 2016
• First release in 1.x.y series
• Initiates backwards compatibility for selected APIs
• More than 64 contributors
• More than 450 JIRAs resolved
2

Flink 1.0: major features
• Out of core state
• Savepoints
• CEP library
• Improved monitoring & Kafka 0.9 support
3

Out of core state
• Alternative to in-memory state
• Powered by RocksDB instances in Flink TMs
• Enabled by using the RocksDBStateBackend
• State limited by disk space only
• State checkpoints save RocksDB databases in
reliable store
6

Production deployments
• Maintaining stateful applications in production
settings comes with its own challenges
• Failures, code upgrades, cluster maintenance, …
• Streaming jobs cannot be simply stopped and
restarted
8

Reminder: fault tolerance
• At least once, at most once, exactly once
• Flink guarantees exactly-once processing
• Flink guarantees end to end exactly-once with
selected sources and sinks
• e.g., Kafka —> Flink —> HDFS

How? Checkpoints
• Flink guarantees fault tolerance by regularly taking checkpoints
of the application state without ever stopping the execution
• At failure, input stream is rewinded to the logical time of the last
checkpoint
10

Introducing savepoints
• A savepoint is a Flink checkpoint that (1) is taken by
the user, (2) is accessible externally, and (3) never
expires
• Command line save & resume interface
• Save: flink savepoint <JobID>
• Resume: flink run -s
<path/to/savepoint> <jobJar>
11

Savepoints and versions
• A savepoint saves a version of a stateful application at a
well-defined time
• E.g.: take snapshots of one application at well-defined
times
12

“Like git for state”
• Branch off from savepoints creating a tree of
running application versions
13

Essential for production
deployments
• Application code upgrades
• Flink version upgrades
• Maintenance, migration, debugging
• What-if simulations
• A/B testing
• Time travel
14

FlinkCEP
• What is Complex Event Processing?
• A catch-all term
• In our context: easily detect patterns in streams
16

Other features in 1.0
• Support for Kafka 0.9 API (and hence MapR
Streams)
• Monitoring console: job submission, checkpoint
statistics, detecting bottlenecks
• See
http://flink.apache.org/news/2016/03/08/release-
1.0.0.html
21

Summary
• Flink 1.0: Initiating backwards compatibility and
pushing the envelope even further for production
streaming deployments
23

What’s next
• SQL
• Dynamic scaling (+ savepoints)
• Hybrid in-memory/out-of-core state backend
• Query-able state
• Support for Apache Mesos
• More connectors and sinks (Kinesis, Cassandra, …)
24

Join the community
• Follow: @ApacheFlink, @dataArtisans
• Read: flink.apache.org/blog, data-artisans.com/blog
• Subscribe: (news | dev | user)@flink.apache.org

What's hot

Flink Streaming @BudapestDataGyula Fóra

Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4Flink Forward

Stephan Ewen - Scaling to large StateFlink Forward

Stream Loops on Flink - Reinventing the wheel for the streaming eraParis Carbone

Continuous Processing with Apache Flink - Strata London 2016Stephan Ewen

Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015Till Rohrmann

What's new in 1.9.0 blink planner - Kurt Young, AlibabaFlink Forward

Tech Talk @ Google on Flink Fault Tolerance and HAParis Carbone

Data Stream Processing with Apache FlinkFabian Hueske

Flink Forward Berlin 2017: Maciek Próchniak - TouK Nussknacker - creating Fli...Flink Forward

Click-Through Example for Flink’s KafkaConsumer CheckpointingRobert Metzger

Marton Balassi – Stateful Stream ProcessingFlink Forward

Flink Forward Berlin 2017: Fabian Hueske - Using Stream and Batch Processing ...Flink Forward

Apache Flink@ Strata & Hadoop World LondonStephan Ewen

Tzu-Li (Gordon) Tai - Stateful Stream Processing with Apache FlinkVerverica

Stateful Distributed Stream ProcessingGyula Fóra

Streaming in the Wild with Apache FlinkKostas Tzoumas

Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...Flink Forward

Flink internals web Kostas Tzoumas

Flink Forward Berlin 2017: Boris Lublinsky, Stavros Kontopoulos - Introducing...Flink Forward

What's hot (20)

Flink Streaming @BudapestData

Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4

Stephan Ewen - Scaling to large State

Stream Loops on Flink - Reinventing the wheel for the streaming era

Continuous Processing with Apache Flink - Strata London 2016

Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015

What's new in 1.9.0 blink planner - Kurt Young, Alibaba

Tech Talk @ Google on Flink Fault Tolerance and HA

Data Stream Processing with Apache Flink

Flink Forward Berlin 2017: Maciek Próchniak - TouK Nussknacker - creating Fli...

Click-Through Example for Flink’s KafkaConsumer Checkpointing

Marton Balassi – Stateful Stream Processing

Flink Forward Berlin 2017: Fabian Hueske - Using Stream and Batch Processing ...

Apache Flink@ Strata & Hadoop World London

Tzu-Li (Gordon) Tai - Stateful Stream Processing with Apache Flink

Stateful Distributed Stream Processing

Streaming in the Wild with Apache Flink

Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...

Flink internals web

Flink Forward Berlin 2017: Boris Lublinsky, Stavros Kontopoulos - Introducing...

Viewers also liked

CMMi journey at small OrganizationsJyoti Chopra

Good practices: Livestock production with minimal use of antibioticsEkaterina Bessonova

1 conceptos básicos administraciónClaudia Demierre

Zaira Vicente - Full Life CoachingCarmen Amil Vena

Budapest - Deuterium content of water and glucose tolerance: potential role f...sorlov

The Editor as EAP InstructorLawrie Hunter

3. NECS 2016 _ Trade aspects and Northeast connectivity_ Dr.Deeparghya MukherjeeFICCINorthEast

24. NECS 2016 Connectivity through inland waterways_ Mr.Mahboob AhmedFICCINorthEast

Seismic data processing 13 stacking&migrationAmin khalil

Deep learning勉強会20121214ochiOhsawa Goodfellow

Rd preparacion de clases - judicialesELVIN VEGA ESPINOZA

Phpcon2015Hiroshi Tokumaru

WebRTC開発者向けプラットフォーム SkyWayの裏側Yusuke Naka

Accenture Technology Vision for Bankingaccenture

Texto Marcelo LagosLara Caravaca

Prof Ekram Hossain on DLT 2013 in IndonesiaArief Gunawan

Parent child relationshipsAdventures Soul

Investment Opportunities in FinTech - Techsauce Summit Bangkoktryb

Viewers also liked (18)

CMMi journey at small Organizations

Good practices: Livestock production with minimal use of antibiotics

1 conceptos básicos administración

Zaira Vicente - Full Life Coaching

Budapest - Deuterium content of water and glucose tolerance: potential role f...

The Editor as EAP Instructor

3. NECS 2016 _ Trade aspects and Northeast connectivity_ Dr.Deeparghya Mukherjee

24. NECS 2016 Connectivity through inland waterways_ Mr.Mahboob Ahmed

Seismic data processing 13 stacking&migration

Deep learning勉強会20121214ochi

Rd preparacion de clases - judiciales

Phpcon2015

WebRTC開発者向けプラットフォーム SkyWayの裏側

Accenture Technology Vision for Banking

Texto Marcelo Lagos

Prof Ekram Hossain on DLT 2013 in Indonesia

Parent child relationships

Investment Opportunities in FinTech - Techsauce Summit Bangkok

Similar to Apache flink 1.0.0 overview

Flink 1.0-slidesJamie Grier

Flink 0.10 - Upcoming FeaturesAljoscha Krettek

Consensus in Apache Kafka: From Theory to Production.pdfGuozhang Wang

Better Kafka Performance Without Changing Any Code | Simon Ritter, AzulHostedbyConfluent

Flink forward-2017-netflix keystones-paasMonal Daxini

Serverless design with Fn projectSiva Rama Krishna Chunduru

Apache flink 1.7 and BeyondTill Rohrmann

Flink at netflix paypal speaker seriesMonal Daxini

Realtime traffic analyserAlex Moskvin

IBM XL Compilers Performance Tuning 2016-11-18Yaoqing Gao

QCon London - Stream Processing with Apache FlinkRobert Metzger

Tips and Tricks for Operating Apache KafkaAll Things Open

Apache Kafkaemreakis

Streaming Processing with a Distributed Commit LogJoe Stein

The Twelve Factor App - Pivotal Trackerlauriepino

Ippevent : openshift Introductionkanedafromparis

Stream Processing @ LyftJamie Grier

GOTO Night Amsterdam - Stream processing with Apache FlinkRobert Metzger

Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)Apache Flink Taiwan User Group

Similar to Apache flink 1.0.0 overview (20)

Flink 1.0-slides

Flink 0.10 - Upcoming Features

Consensus in Apache Kafka: From Theory to Production.pdf

Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul

Flink forward-2017-netflix keystones-paas

Serverless design with Fn project

Apache flink 1.7 and Beyond

Flink at netflix paypal speaker series

Realtime traffic analyser

IBM XL Compilers Performance Tuning 2016-11-18

QCon London - Stream Processing with Apache Flink

Tips and Tricks for Operating Apache Kafka

Apache Kafka

Streaming Processing with a Distributed Commit Log

The Twelve Factor App - Pivotal Tracker

Ippevent : openshift Introduction

Stream Processing @ Lyft

GOTO Night Amsterdam - Stream processing with Apache Flink

Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)

Recently uploaded

Manual 508 Accessibility Compliance AuditSkynet Technologies

UiPath Community: Communication Mining from Zero to HeroUiPathCommunity

From Family Reminiscence to Scholarly Archive .Alan Dix

Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes

Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

Data governance with Unity Catalog PresentationKnoldus Inc.

A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3

What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3

How to write a Business Continuity PlanDatabarracks

Rise of the Machines: Known As Drones...Rick Flair

TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey

Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery

Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein

Scale your database traffic with Read & Write split using MySQL RouterMydbops

[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra

Recently uploaded (20)

Manual 508 Accessibility Compliance Audit

UiPath Community: Communication Mining from Zero to Hero

From Family Reminiscence to Scholarly Archive .

Assure Ecommerce and Retail Operations Uptime with ThousandEyes

Testing tools and AI - ideas what to try with some tool examples

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

Data governance with Unity Catalog Presentation

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx

What is DBT - The Ultimate Data Build Tool.pdf

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx

How to write a Business Continuity Plan

Rise of the Machines: Known As Drones...

TeamStation AI System Report LATAM IT Salaries 2024

Emixa Mendix Meetup 11 April 2024 about Mendix Native development

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...

Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24

Scale your database traffic with Read & Write split using MySQL Router

[Webinar] SpiraTest - Setting New Standards in Quality Assurance

Apache flink 1.0.0 overview

1. What’s new in Apache FlinkTM 1.0 Kostas Tzoumas @kostas_tzoumas

2. Flink 1.0 • March 8, 2016 • First release in 1.x.y series • Initiates backwards compatibility for selected APIs • More than 64 contributors • More than 450 JIRAs resolved 2

3. Flink 1.0: major features • Out of core state • Savepoints • CEP library • Improved monitoring & Kafka 0.9 support 3

4. Interface stability 4

5. Out of core state 5

6. Out of core state • Alternative to in-memory state • Powered by RocksDB instances in Flink TMs • Enabled by using the RocksDBStateBackend • State limited by disk space only • State checkpoints save RocksDB databases in reliable store 6

7. Savepoints 7

8. Production deployments • Maintaining stateful applications in production settings comes with its own challenges • Failures, code upgrades, cluster maintenance, … • Streaming jobs cannot be simply stopped and restarted 8

9. Reminder: fault tolerance • At least once, at most once, exactly once • Flink guarantees exactly-once processing • Flink guarantees end to end exactly-once with selected sources and sinks • e.g., Kafka —> Flink —> HDFS

10. How? Checkpoints • Flink guarantees fault tolerance by regularly taking checkpoints of the application state without ever stopping the execution • At failure, input stream is rewinded to the logical time of the last checkpoint 10

11. Introducing savepoints • A savepoint is a Flink checkpoint that (1) is taken by the user, (2) is accessible externally, and (3) never expires • Command line save & resume interface • Save: flink savepoint <JobID> • Resume: flink run -s <path/to/savepoint> <jobJar> 11

12. Savepoints and versions • A savepoint saves a version of a stateful application at a well-defined time • E.g.: take snapshots of one application at well-defined times 12

13. “Like git for state” • Branch off from savepoints creating a tree of running application versions 13

14. Essential for production deployments • Application code upgrades • Flink version upgrades • Maintenance, migration, debugging • What-if simulations • A/B testing • Time travel 14

15. Complex Event Processing 15

16. FlinkCEP • What is Complex Event Processing? • A catch-all term • In our context: easily detect patterns in streams 16

17. 17

18. Pattern API 18

19. 19

20. 20

21. Other features in 1.0 • Support for Kafka 0.9 API (and hence MapR Streams) • Monitoring console: job submission, checkpoint statistics, detecting bottlenecks • See http://flink.apache.org/news/2016/03/08/release- 1.0.0.html 21

22. Closing 22

23. Summary • Flink 1.0: Initiating backwards compatibility and pushing the envelope even further for production streaming deployments 23

24. What’s next • SQL • Dynamic scaling (+ savepoints) • Hybrid in-memory/out-of-core state backend • Query-able state • Support for Apache Mesos • More connectors and sinks (Kinesis, Cassandra, …) 24

25. Join the community • Follow: @ApacheFlink, @dataArtisans • Read: flink.apache.org/blog, data-artisans.com/blog • Subscribe: (news | dev | user)@flink.apache.org

Apache flink 1.0.0 overview

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (18)

Similar to Apache flink 1.0.0 overview

Similar to Apache flink 1.0.0 overview (20)

More from MapR Technologies

More from MapR Technologies (20)

Recently uploaded

Recently uploaded (20)

Apache flink 1.0.0 overview