SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
Dr. Konstantin Boudnik
VP Open Source Development
WANdisco
Dr. Konstantin Boudnik
VP Open Source Development
WANdisco
Benefits of Coming Out of the Closet
Open-Source In-Memory PlatformsOpen-Source In-Memory Platforms
Coming out of the closet: benefits of FOSS
Short bio:Short bio:
Got addicted to Linux back in 1994
Member of Linux Foundation
Member of Apache Software Foundation
Apache Bigtop project founder;Apache Hadoop committer
Committer, PMC member, contributor to may ASF projects
Mentor of multiple Apache Incubator projects
Ignite, Geode, Groovy, Zeppelin
Background in compilers, JVM, distrubuted computing, system
integration & architecture
2015: got invited to IMCS to talk about FOSS
Coming out of the closet: benefits of FOSS
Open-source / Open communityOpen-source / Open community
• Open source is easy
• Jump to a “social development” site (Bitbucket,
Github)
• Pick up a license you like: (L)GPL, ASL, MIT, BSD
• while true; do <code>; done
• You might be lucky to recruite a lot of volunteers
• A selected group mostly owns the project road map
• Adoption might be an issue as the future is unknown
Coming out of the closet: benefits of FOSS
Open-source / Open communityOpen-source / Open community
• Open community is way harder
• Place to collaborate; meritocracy
• Consensus building / Conflicts resolution
• Continuity: avoiding 'hit by the bus' situations
• Protection of project brand (under some licenses)
• Legal shielding and takeover protection
• Infrastructure management
• Projects cross-polination
• “Community over code”
Coming out of the closet: benefits of FOSS
In-memory: what's FOSS'ing?In-memory: what's FOSS'ing?
• Was sorta quiet up to pretty much 2012
• Spark appeared & gained momentum quickly
• Great improvement on MapReduce
• Solved many shortcomings of MR
• Then nothing spectacular was happening until
• 2014: Apache Ignite (incubating) from GridGain
• 2015: Apache Geode (incubating) from Pivotal
Coming out of the closet: benefits of FOSS
open-source or open communityopen-source or open community
• FOSS foundations facilitate open communities
• Spark: from a relatively small GitHub project
to the most active Apache BigData project
in 2 years
• Apache Ignite: doubling committer base in 5
months; quadrupling the user base
• Apache Geode: check the talk @IMCS!
Coming out of the closet: benefits of FOSS
Apache Bigtop:Apache Bigtop:
From #BigData to #FastDataFrom #BigData to #FastData
Apache Bigtop:Apache Bigtop:
From #BigData to #FastDataFrom #BigData to #FastData
Apache Bigdata Stack.nextApache Bigdata Stack.nextApache Bigdata Stack.nextApache Bigdata Stack.next
Coming out of the closet: benefits of FOSS
#BigData#BigData#BigData#BigData
Solving the complexitySolving the complexitySolving the complexitySolving the complexity
Coming out of the closet: benefits of FOSS
Apache Bigtop primerApache Bigtop primer
• A project, environment, and a phylosophy to:
• Define and create software stacks (think Debian)
• Deploy and validate actual software in the real world
• Configuration management
• Guarantees of consistency and compatiblity
• Empirical vs Rational
• don't rely on someone's hearsay
• don't assume an environment: contol it
One stack to rule them all
Coming out of the closet: benefits of FOSS
Apache Bigdata stackApache Bigdata stack
• Bigtop is the cutting edge of Apache Bigdata stack
• Delivers:
• A ready data processing stack
• Dev. env. for anyone to create their own
• Framework for easy
integration/deployment/validation
• “It works on my laptop” isn't cool anymore
• 0.x release series was focused on Hadoop ecosystem
Coming out of the closet: benefits of FOSS
10K view of Bigdata10K view of Bigdata
• There's more than just Hadoop
• Hadoop is mere 5-10% of all Bigdata usecases
• Good for processing data in parallel
• Analytics and ML
• But it is NOT ideal...
• Suboptimal resource scheduling
• Batch oriented (mostly)
Coming out of the closet: benefits of FOSS
What's missingWhat's missing
• Hadoop is all about batch
• MR is slow and heavyly IO-bound
• 2nd
generation of tools might be a bit more interactive
• SQL is the most popular data access interface
• yet immature in Hadoop ecosystem
• Supporting transactions is very hard
• Almost everything is HDFS-bound
• Performance... performance... performance
• Scarce In-Memory Computing presence
Coming out of the closet: benefits of FOSS
IMC: what is that?IMC: what is that?
• technically, any computing gets done in memory,
but...
“IMC: middleware software that stores data in 
RAM, across a cluster of computers, and 
process it in parallel”
• Why In-Memory Computing?
• RAM is about 5,000 faster than HDD
• RAM is about 1,500-2,000 faster than SSD
Coming out of the closet: benefits of FOSS
#FastData#FastData
Apache In-Memory ComputingApache In-Memory ComputingApache In-Memory ComputingApache In-Memory Computing
Coming out of the closet: benefits of FOSS
Let's get serious about IMCLet's get serious about IMC
• Bigtop boards more & more IMC(-like) components
• Provides transitional tech for legacy MR-based users
HDFS acceleration
• MR acceleration
• Uses RAM as inter-component data media
• Crossing component boundaries w/o leaving RAM
• Advanced clustering and service models
Coming out of the closet: benefits of FOSS
Connecting the stackConnecting the stack
• Bigtop Data Fabric Core:
• Works with
HDFS/RDBMS/MR/Hive/Hbase/Spark/Storm/SQL
• Cluster memory is a natural media to exchange data
• A probable usecase:
• Kafka --> Data Fabric --> HBase --> Data Fabric -->
SQL querying --> Spark --> A service Singlethon
--> Data Fabric --> RDBMS or FS
Coming out of the closet: benefits of FOSS
Data Fabric: what is that?Data Fabric: what is that?
Coming out of the closet: benefits of FOSS
Data Fabric: customizeData Fabric: customize
Coming out of the closet: benefits of FOSS
Data Fabric: ... some moreData Fabric: ... some more
Coming out of the closet: benefits of FOSS
Transitory legacy supportTransitory legacy support
Coming out of the closet: benefits of FOSS
Direct StreamingDirect Streaming
Coming out of the closet: benefits of FOSS
ML and NoSQL on fabricML and NoSQL on fabric
Coming out of the closet: benefits of FOSS
Analysing w/ 3Analysing w/ 3rdrd
party toolsparty tools
Coming out of the closet: benefits of FOSS
Deploy nodes everywhereDeploy nodes everywhere
Coming out of the closet: benefits of FOSS
Connecting the ...Connecting the ...
Coming out of the closet: benefits of FOSS
Live DemoLive Demo
• Deploy Apache Ignite (incubating)
• Run MR Pi on YARN
• Run same MR Pi against Data Frabric:
• Only client config needs to be changed
• Gasp at the difference
Coming out of the closet: benefits of FOSS
Final recapFinal recap
• Build your project in the open
• Open community helps in many ways
• Find a good foundation to be your home
• Be inclusive and welcoming
• a developer from a competitor can be a
great contributor and a friend
• There's no “boss” in open source
• Keep coding: your code is your best resume!
Coming out of the closet: benefits of FOSS
Q & AQ & A
Coming out of the closet: benefits of FOSS
Dr. Konstantin Boudnik
@c0sin
cos@apache.org
Dr. Konstantin Boudnik
@c0sin
cos@apache.org
Open-Source In-Memory PlatformsOpen-Source In-Memory Platforms

Weitere ähnliche Inhalte

Was ist angesagt?

Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Spark Summit
 
Interactive Analytics using Apache Spark
Interactive Analytics using Apache SparkInteractive Analytics using Apache Spark
Interactive Analytics using Apache Spark
Sachin Aggarwal
 
How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...
How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...
How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...
DataWorks Summit
 
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
DataWorks Summit
 

Was ist angesagt? (20)

Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
 
Apache Spark: Lightning Fast Cluster Computing
Apache Spark: Lightning Fast Cluster ComputingApache Spark: Lightning Fast Cluster Computing
Apache Spark: Lightning Fast Cluster Computing
 
Keynote - Hosted PostgreSQL: An Objective Look
Keynote - Hosted PostgreSQL: An Objective LookKeynote - Hosted PostgreSQL: An Objective Look
Keynote - Hosted PostgreSQL: An Objective Look
 
Apache Ignite - Distributed Database Orchestration
Apache Ignite - Distributed Database OrchestrationApache Ignite - Distributed Database Orchestration
Apache Ignite - Distributed Database Orchestration
 
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data AnalyticsApache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
 
Solving Real Problems with Apache Spark: Archiving, E-Discovery, and Supervis...
Solving Real Problems with Apache Spark: Archiving, E-Discovery, and Supervis...Solving Real Problems with Apache Spark: Archiving, E-Discovery, and Supervis...
Solving Real Problems with Apache Spark: Archiving, E-Discovery, and Supervis...
 
Camel Riders in the Cloud
Camel Riders in the CloudCamel Riders in the Cloud
Camel Riders in the Cloud
 
Managing PostgreSQL with Ansible
 Managing PostgreSQL with Ansible Managing PostgreSQL with Ansible
Managing PostgreSQL with Ansible
 
Interactive Analytics using Apache Spark
Interactive Analytics using Apache SparkInteractive Analytics using Apache Spark
Interactive Analytics using Apache Spark
 
Your Guide to Streaming - The Engineer's Perspective
Your Guide to Streaming - The Engineer's PerspectiveYour Guide to Streaming - The Engineer's Perspective
Your Guide to Streaming - The Engineer's Perspective
 
Spark introduction and architecture
Spark introduction and architectureSpark introduction and architecture
Spark introduction and architecture
 
How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...
How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...
How to Use Innovative Data Handling and Processing Techniques to Drive Alpha ...
 
Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi
 Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi
Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi
 
Accelerating Big Data Insights
Accelerating Big Data InsightsAccelerating Big Data Insights
Accelerating Big Data Insights
 
Secured (Kerberos-based) Spark Notebook for Data Science: Spark Summit East t...
Secured (Kerberos-based) Spark Notebook for Data Science: Spark Summit East t...Secured (Kerberos-based) Spark Notebook for Data Science: Spark Summit East t...
Secured (Kerberos-based) Spark Notebook for Data Science: Spark Summit East t...
 
Deep Dive - Usage of on premises data gateway for hybrid integration scenarios
Deep Dive - Usage of on premises data gateway for hybrid integration scenariosDeep Dive - Usage of on premises data gateway for hybrid integration scenarios
Deep Dive - Usage of on premises data gateway for hybrid integration scenarios
 
Apache deep learning 202 Washington DC - DWS 2019
Apache deep learning 202   Washington DC - DWS 2019Apache deep learning 202   Washington DC - DWS 2019
Apache deep learning 202 Washington DC - DWS 2019
 
S3Guard: What's in your consistency model?
S3Guard: What's in your consistency model?S3Guard: What's in your consistency model?
S3Guard: What's in your consistency model?
 
Does Anyone Really Need RAC?
 Does Anyone Really Need RAC? Does Anyone Really Need RAC?
Does Anyone Really Need RAC?
 
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
 

Andere mochten auch

IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...
IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...
IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...
In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
In-Memory Computing Summit
 
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
In-Memory Computing Summit
 

Andere mochten auch (20)

IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...
IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...
IMCSummit 2015 - Day 1 Developer Track - In-memory Computing for Iterative CP...
 
IMCSummit 2015 - Day 1 - General Session - From Big Data to Fast Data
IMCSummit 2015 - Day 1 - General Session - From Big Data to Fast DataIMCSummit 2015 - Day 1 - General Session - From Big Data to Fast Data
IMCSummit 2015 - Day 1 - General Session - From Big Data to Fast Data
 
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
 
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
 
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
 
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
 
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
 
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
 
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
 
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
 
IMCSummit 2015 - Day 1 IT Business Track - Designing a Big Data Analytics Pla...
IMCSummit 2015 - Day 1 IT Business Track - Designing a Big Data Analytics Pla...IMCSummit 2015 - Day 1 IT Business Track - Designing a Big Data Analytics Pla...
IMCSummit 2015 - Day 1 IT Business Track - Designing a Big Data Analytics Pla...
 
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
 
IMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage Tier
IMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage TierIMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage Tier
IMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage Tier
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
 
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
 
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
 
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
 
In memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGainIn memory computing principles by Mac Moore of GridGain
In memory computing principles by Mac Moore of GridGain
 

Ähnlich wie IMCSummit 2015 - Day 1 Developer Track - Open-Source In-Memory Platforms: Benefits of Coming Out of the Closet

Midwest php 2013 deploying php on paas- why & how
Midwest php 2013   deploying php on paas- why & howMidwest php 2013   deploying php on paas- why & how
Midwest php 2013 deploying php on paas- why & how
dotCloud
 
Deploying PHP on PaaS: Why and How?
Deploying PHP on PaaS: Why and How?Deploying PHP on PaaS: Why and How?
Deploying PHP on PaaS: Why and How?
Docker, Inc.
 
Hadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University TalksHadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University Talks
yhadoop
 
Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...
Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...
Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...
Data Con LA
 

Ähnlich wie IMCSummit 2015 - Day 1 Developer Track - Open-Source In-Memory Platforms: Benefits of Coming Out of the Closet (20)

Trend Micro Big Data Platform and Apache Bigtop
Trend Micro Big Data Platform and Apache BigtopTrend Micro Big Data Platform and Apache Bigtop
Trend Micro Big Data Platform and Apache Bigtop
 
Midwest php 2013 deploying php on paas- why & how
Midwest php 2013   deploying php on paas- why & howMidwest php 2013   deploying php on paas- why & how
Midwest php 2013 deploying php on paas- why & how
 
Unified Batch and Real-Time Stream Processing Using Apache Flink
Unified Batch and Real-Time Stream Processing Using Apache FlinkUnified Batch and Real-Time Stream Processing Using Apache Flink
Unified Batch and Real-Time Stream Processing Using Apache Flink
 
PyData Boston 2013
PyData Boston 2013PyData Boston 2013
PyData Boston 2013
 
Deploying PHP on PaaS: Why and How?
Deploying PHP on PaaS: Why and How?Deploying PHP on PaaS: Why and How?
Deploying PHP on PaaS: Why and How?
 
Hadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University TalksHadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University Talks
 
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache ApexMaking sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
 
From a student to an apache committer practice of apache io tdb
From a student to an apache committer  practice of apache io tdbFrom a student to an apache committer  practice of apache io tdb
From a student to an apache committer practice of apache io tdb
 
Python the lingua franca of FEWS
Python the lingua franca of FEWSPython the lingua franca of FEWS
Python the lingua franca of FEWS
 
Stackato v5
Stackato v5Stackato v5
Stackato v5
 
Building cloud-enabled genomics workflows with Luigi and Docker
Building cloud-enabled genomics workflows with Luigi and DockerBuilding cloud-enabled genomics workflows with Luigi and Docker
Building cloud-enabled genomics workflows with Luigi and Docker
 
Stackato
StackatoStackato
Stackato
 
How bigtop leveraged docker for build automation and one click hadoop provis...
How bigtop leveraged docker for build automation and  one click hadoop provis...How bigtop leveraged docker for build automation and  one click hadoop provis...
How bigtop leveraged docker for build automation and one click hadoop provis...
 
A Tour of Open Source on the Mainframe
A Tour of Open Source on the MainframeA Tour of Open Source on the Mainframe
A Tour of Open Source on the Mainframe
 
Lightning Fast Dataframes with Polars
Lightning Fast Dataframes with PolarsLightning Fast Dataframes with Polars
Lightning Fast Dataframes with Polars
 
Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...
Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...
Data Con LA 2022-Open Source or Open Core in Your Data Layer? What Needs to B...
 
Ceph & OpenStack - Boston Meetup
Ceph & OpenStack - Boston MeetupCeph & OpenStack - Boston Meetup
Ceph & OpenStack - Boston Meetup
 
Ceph and openstack at the boston meetup
Ceph and openstack at the boston meetupCeph and openstack at the boston meetup
Ceph and openstack at the boston meetup
 
Not Just Another Overview of Apache Hadoop
Not Just Another Overview of Apache HadoopNot Just Another Overview of Apache Hadoop
Not Just Another Overview of Apache Hadoop
 
Pharo Consortium: A roadmap to solid evolution
Pharo Consortium: A roadmap to solid evolutionPharo Consortium: A roadmap to solid evolution
Pharo Consortium: A roadmap to solid evolution
 

Mehr von In-Memory Computing Summit

IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
In-Memory Computing Summit
 
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
In-Memory Computing Summit
 

Mehr von In-Memory Computing Summit (12)

IMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent Memory
IMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent MemoryIMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent Memory
IMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent Memory
 
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise GradeIMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
 
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of StatelessnessIMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
 
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
 
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
 
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
 
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
 
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
 
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
 
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
 
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
 
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
 

KĂźrzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

KĂźrzlich hochgeladen (20)

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 

IMCSummit 2015 - Day 1 Developer Track - Open-Source In-Memory Platforms: Benefits of Coming Out of the Closet

  • 1. Dr. Konstantin Boudnik VP Open Source Development WANdisco Dr. Konstantin Boudnik VP Open Source Development WANdisco Benefits of Coming Out of the Closet Open-Source In-Memory PlatformsOpen-Source In-Memory Platforms
  • 2. Coming out of the closet: benefits of FOSS Short bio:Short bio: Got addicted to Linux back in 1994 Member of Linux Foundation Member of Apache Software Foundation Apache Bigtop project founder;Apache Hadoop committer Committer, PMC member, contributor to may ASF projects Mentor of multiple Apache Incubator projects Ignite, Geode, Groovy, Zeppelin Background in compilers, JVM, distrubuted computing, system integration & architecture 2015: got invited to IMCS to talk about FOSS
  • 3. Coming out of the closet: benefits of FOSS Open-source / Open communityOpen-source / Open community • Open source is easy • Jump to a “social development” site (Bitbucket, Github) • Pick up a license you like: (L)GPL, ASL, MIT, BSD • while true; do <code>; done • You might be lucky to recruite a lot of volunteers • A selected group mostly owns the project road map • Adoption might be an issue as the future is unknown
  • 4. Coming out of the closet: benefits of FOSS Open-source / Open communityOpen-source / Open community • Open community is way harder • Place to collaborate; meritocracy • Consensus building / Conflicts resolution • Continuity: avoiding 'hit by the bus' situations • Protection of project brand (under some licenses) • Legal shielding and takeover protection • Infrastructure management • Projects cross-polination • “Community over code”
  • 5. Coming out of the closet: benefits of FOSS In-memory: what's FOSS'ing?In-memory: what's FOSS'ing? • Was sorta quiet up to pretty much 2012 • Spark appeared & gained momentum quickly • Great improvement on MapReduce • Solved many shortcomings of MR • Then nothing spectacular was happening until • 2014: Apache Ignite (incubating) from GridGain • 2015: Apache Geode (incubating) from Pivotal
  • 6. Coming out of the closet: benefits of FOSS open-source or open communityopen-source or open community • FOSS foundations facilitate open communities • Spark: from a relatively small GitHub project to the most active Apache BigData project in 2 years • Apache Ignite: doubling committer base in 5 months; quadrupling the user base • Apache Geode: check the talk @IMCS!
  • 7. Coming out of the closet: benefits of FOSS Apache Bigtop:Apache Bigtop: From #BigData to #FastDataFrom #BigData to #FastData Apache Bigtop:Apache Bigtop: From #BigData to #FastDataFrom #BigData to #FastData Apache Bigdata Stack.nextApache Bigdata Stack.nextApache Bigdata Stack.nextApache Bigdata Stack.next
  • 8. Coming out of the closet: benefits of FOSS #BigData#BigData#BigData#BigData Solving the complexitySolving the complexitySolving the complexitySolving the complexity
  • 9. Coming out of the closet: benefits of FOSS Apache Bigtop primerApache Bigtop primer • A project, environment, and a phylosophy to: • Define and create software stacks (think Debian) • Deploy and validate actual software in the real world • Configuration management • Guarantees of consistency and compatiblity • Empirical vs Rational • don't rely on someone's hearsay • don't assume an environment: contol it One stack to rule them all
  • 10. Coming out of the closet: benefits of FOSS Apache Bigdata stackApache Bigdata stack • Bigtop is the cutting edge of Apache Bigdata stack • Delivers: • A ready data processing stack • Dev. env. for anyone to create their own • Framework for easy integration/deployment/validation • “It works on my laptop” isn't cool anymore • 0.x release series was focused on Hadoop ecosystem
  • 11. Coming out of the closet: benefits of FOSS 10K view of Bigdata10K view of Bigdata • There's more than just Hadoop • Hadoop is mere 5-10% of all Bigdata usecases • Good for processing data in parallel • Analytics and ML • But it is NOT ideal... • Suboptimal resource scheduling • Batch oriented (mostly)
  • 12. Coming out of the closet: benefits of FOSS What's missingWhat's missing • Hadoop is all about batch • MR is slow and heavyly IO-bound • 2nd generation of tools might be a bit more interactive • SQL is the most popular data access interface • yet immature in Hadoop ecosystem • Supporting transactions is very hard • Almost everything is HDFS-bound • Performance... performance... performance • Scarce In-Memory Computing presence
  • 13. Coming out of the closet: benefits of FOSS IMC: what is that?IMC: what is that? • technically, any computing gets done in memory, but... “IMC: middleware software that stores data in  RAM, across a cluster of computers, and  process it in parallel” • Why In-Memory Computing? • RAM is about 5,000 faster than HDD • RAM is about 1,500-2,000 faster than SSD
  • 14. Coming out of the closet: benefits of FOSS #FastData#FastData Apache In-Memory ComputingApache In-Memory ComputingApache In-Memory ComputingApache In-Memory Computing
  • 15. Coming out of the closet: benefits of FOSS Let's get serious about IMCLet's get serious about IMC • Bigtop boards more & more IMC(-like) components • Provides transitional tech for legacy MR-based users HDFS acceleration • MR acceleration • Uses RAM as inter-component data media • Crossing component boundaries w/o leaving RAM • Advanced clustering and service models
  • 16. Coming out of the closet: benefits of FOSS Connecting the stackConnecting the stack • Bigtop Data Fabric Core: • Works with HDFS/RDBMS/MR/Hive/Hbase/Spark/Storm/SQL • Cluster memory is a natural media to exchange data • A probable usecase: • Kafka --> Data Fabric --> HBase --> Data Fabric --> SQL querying --> Spark --> A service Singlethon --> Data Fabric --> RDBMS or FS
  • 17. Coming out of the closet: benefits of FOSS Data Fabric: what is that?Data Fabric: what is that?
  • 18. Coming out of the closet: benefits of FOSS Data Fabric: customizeData Fabric: customize
  • 19. Coming out of the closet: benefits of FOSS Data Fabric: ... some moreData Fabric: ... some more
  • 20. Coming out of the closet: benefits of FOSS Transitory legacy supportTransitory legacy support
  • 21. Coming out of the closet: benefits of FOSS Direct StreamingDirect Streaming
  • 22. Coming out of the closet: benefits of FOSS ML and NoSQL on fabricML and NoSQL on fabric
  • 23. Coming out of the closet: benefits of FOSS Analysing w/ 3Analysing w/ 3rdrd party toolsparty tools
  • 24. Coming out of the closet: benefits of FOSS Deploy nodes everywhereDeploy nodes everywhere
  • 25. Coming out of the closet: benefits of FOSS Connecting the ...Connecting the ...
  • 26. Coming out of the closet: benefits of FOSS Live DemoLive Demo • Deploy Apache Ignite (incubating) • Run MR Pi on YARN • Run same MR Pi against Data Frabric: • Only client config needs to be changed • Gasp at the difference
  • 27. Coming out of the closet: benefits of FOSS Final recapFinal recap • Build your project in the open • Open community helps in many ways • Find a good foundation to be your home • Be inclusive and welcoming • a developer from a competitor can be a great contributor and a friend • There's no “boss” in open source • Keep coding: your code is your best resume!
  • 28. Coming out of the closet: benefits of FOSS Q & AQ & A
  • 29. Coming out of the closet: benefits of FOSS Dr. Konstantin Boudnik @c0sin cos@apache.org Dr. Konstantin Boudnik @c0sin cos@apache.org Open-Source In-Memory PlatformsOpen-Source In-Memory Platforms