SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
A Microservice Architecture
for Big Data Pipelines
BigData.be Meetup June 2016
Let’s face it: Big Data is no longer a Big Deal
2
Image © User:Kleiner / Wikimedia Commons / CC BY-SA 3.0
www.realimpactanalytics.com
Yardsticks of Software Development:
1. Create Modularity
2. Ensure Quality
3. Scale Development
4. Painless Deployment
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
Image © User:Guma89 / Wikimedia Commons / CC BY-SA 3.0
www.realimpactanalytics.com
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
Modularity is Imperative for On Premise Deployments:
RealImpact
Product Product Product
Client Client Client
The Promised Land
5
Image http://hanciong.deviantart.com/art/old-world-map-253195357
www.realimpactanalytics.com
Micro Services: Maximal Modularity
1. No shared state
2. Minimal coupling
3. Separation of concerns
4. Mix & match
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
www.realimpactanalytics.com
Micro Services: Scalable Development
1. Team responsibility
2. Less code = faster ramp up
3. Technology independence
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
www.realimpactanalytics.com
Micro Services: Painless Deployment
1. Reproducible environments
2. Versioned APIs
3. Installation = docker-compose up
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
Prod
Dev
www.realimpactanalytics.com
Micro Services: QA Friendly
1. Three levels of testing
• Class / function level
• Service level
• Integration level
2. Staging is no big deal
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
www.realimpactanalytics.com
Translation to Big Data Pipelines…
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
TrendingAnalysis
Twitter Data
TopTweeters
Recommend
www.realimpactanalytics.com
container
Translation to Big Data Pipelines…
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
TrendingAnalysis
manifest.yaml
run.sh
jar
runtime
www.realimpactanalytics.com
container
Translation to Big Data Pipelines…
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
TrendingAnalysis
datasources:
- twitter
outputs:
-
id: daily-trends
fields:
-
name: keyword
type: string
-
name: relevance
type: integer
parameters:
…
manifest.yaml
run.sh
jar
runtime
www.realimpactanalytics.com
Translation to Big Data Pipelines…
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
TrendingAnalysis
HDFS
Input Data
Result
Parameters
Demo
14
www.realimpactanalytics.com
Data Modules: QA Friendly?
1. Three levels of testing ✔
• Class / function level
• Module level
• Integration level
2. Staging is no big deal ✔
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
www.realimpactanalytics.com
Data Modules: Painless Deployment?
1. Reproducible environments (✔)
2. Versioned APIs ✔
3. Installation = docker-compose up (✔)
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
Prod
Dev
www.realimpactanalytics.com
Data Modules: Scalable Development?
1. Team responsibility ✔
2. Less code = faster ramp up ✔
3. Technology independence ✔
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
www.realimpactanalytics.com
Data Modules: Modularity?
1. No shared state (well…)
2. Minimal coupling ✔
3. Separation of concerns ✔
4. Mix & match ✔
1. Challenge:

Big Data in Production
2. Zen of Micro Services
3. Data Modules
4. Conclusion
Conclusion
19
Brussels Office
5, Place du Champ de Mars
1050 Brussels
Belgium
Cape Town Office
Sovereign Quay, 34 Somerset Road

8005, Green Point, Cape Town 

South Africa
São Paulo Office
93, Rua Doutor Andrade Pertence
Vila Olímpia, São Paulo
Brazil
Luxembourg Office
691, rue de Neudorf
2220 Luxembourg
Grand Duché du Luxembourg
www.realimpactanalytics.com
Kuala Lumpur Office
28-01, Integra Tower 348 Jalan

Tun Razak, 50400 Kuala Lumpur
Malaysia

Weitere ähnliche Inhalte

Was ist angesagt?

NoSQL for the SQL Server Pro
NoSQL for the SQL Server ProNoSQL for the SQL Server Pro
NoSQL for the SQL Server Pro
Lynn Langit
 
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Altan Khendup
 

Was ist angesagt? (20)

How To Use Kafka and Druid to Tame Your Router Data (Rachel Pedreschi, Imply ...
How To Use Kafka and Druid to Tame Your Router Data (Rachel Pedreschi, Imply ...How To Use Kafka and Druid to Tame Your Router Data (Rachel Pedreschi, Imply ...
How To Use Kafka and Druid to Tame Your Router Data (Rachel Pedreschi, Imply ...
 
Sidecars and a Microservices Mesh
Sidecars and a Microservices MeshSidecars and a Microservices Mesh
Sidecars and a Microservices Mesh
 
Microsoft: Building a Massively Scalable System with DataStax and Microsoft's...
Microsoft: Building a Massively Scalable System with DataStax and Microsoft's...Microsoft: Building a Massively Scalable System with DataStax and Microsoft's...
Microsoft: Building a Massively Scalable System with DataStax and Microsoft's...
 
NoSQL Database- cassandra column Base DB
NoSQL Database- cassandra column Base DBNoSQL Database- cassandra column Base DB
NoSQL Database- cassandra column Base DB
 
Real Time Analytics with Apache Cassandra - Cassandra Day Munich
Real Time Analytics with Apache Cassandra - Cassandra Day MunichReal Time Analytics with Apache Cassandra - Cassandra Day Munich
Real Time Analytics with Apache Cassandra - Cassandra Day Munich
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
NoSQL for the SQL Server Pro
NoSQL for the SQL Server ProNoSQL for the SQL Server Pro
NoSQL for the SQL Server Pro
 
What every software engineer should know about streams and tables in kafka ...
What every software engineer should know about streams and tables in kafka   ...What every software engineer should know about streams and tables in kafka   ...
What every software engineer should know about streams and tables in kafka ...
 
Cloud Big Data Architectures
Cloud Big Data ArchitecturesCloud Big Data Architectures
Cloud Big Data Architectures
 
Work is a Stream of Applications (Audun Strand, NAV) Kafka Summit London 2019
Work is a Stream of Applications (Audun Strand, NAV) Kafka Summit London 2019Work is a Stream of Applications (Audun Strand, NAV) Kafka Summit London 2019
Work is a Stream of Applications (Audun Strand, NAV) Kafka Summit London 2019
 
Fast Data: A Customer’s Journey to Delivering a Compelling Real-Time Solution
Fast Data: A Customer’s Journey to Delivering a Compelling Real-Time SolutionFast Data: A Customer’s Journey to Delivering a Compelling Real-Time Solution
Fast Data: A Customer’s Journey to Delivering a Compelling Real-Time Solution
 
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platformNatalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
 
The Future of ETL - Strata Data New York 2018
The Future of ETL - Strata Data New York 2018The Future of ETL - Strata Data New York 2018
The Future of ETL - Strata Data New York 2018
 
Single View of Well, Production and Assets
Single View of Well, Production and AssetsSingle View of Well, Production and Assets
Single View of Well, Production and Assets
 
Oracle Panel: Expert Insights into Faster Oracle SOA Suite Project Delivery
Oracle Panel: Expert Insights into Faster Oracle SOA Suite Project DeliveryOracle Panel: Expert Insights into Faster Oracle SOA Suite Project Delivery
Oracle Panel: Expert Insights into Faster Oracle SOA Suite Project Delivery
 
Building Event-Driven Microservices using Kafka Streams (Stathis Souris, Thou...
Building Event-Driven Microservices using Kafka Streams (Stathis Souris, Thou...Building Event-Driven Microservices using Kafka Streams (Stathis Souris, Thou...
Building Event-Driven Microservices using Kafka Streams (Stathis Souris, Thou...
 
Reference architecture for Internet of Things
Reference architecture for Internet of ThingsReference architecture for Internet of Things
Reference architecture for Internet of Things
 
Monitoring at scale - Sensu Kafka Kafka-connect Cassandra PrestoDB
Monitoring at scale - Sensu Kafka Kafka-connect Cassandra PrestoDBMonitoring at scale - Sensu Kafka Kafka-connect Cassandra PrestoDB
Monitoring at scale - Sensu Kafka Kafka-connect Cassandra PrestoDB
 
Getting Started with Real-time Analytics
Getting Started with Real-time AnalyticsGetting Started with Real-time Analytics
Getting Started with Real-time Analytics
 
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
 

Andere mochten auch

Andere mochten auch (20)

Apache Olingo - from Incubation to a real Olingo (Apache TLP)
Apache Olingo - from Incubation to a real Olingo (Apache TLP)Apache Olingo - from Incubation to a real Olingo (Apache TLP)
Apache Olingo - from Incubation to a real Olingo (Apache TLP)
 
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
 
AWSome Day - Milan, July 24th 2014
AWSome Day - Milan, July 24th 2014AWSome Day - Milan, July 24th 2014
AWSome Day - Milan, July 24th 2014
 
2011_Herbstcampus_Rapid_Cloud_Development_with_Spring_Roo
2011_Herbstcampus_Rapid_Cloud_Development_with_Spring_Roo2011_Herbstcampus_Rapid_Cloud_Development_with_Spring_Roo
2011_Herbstcampus_Rapid_Cloud_Development_with_Spring_Roo
 
Resume Building for Teens
Resume Building for TeensResume Building for Teens
Resume Building for Teens
 
okspring3x
okspring3xokspring3x
okspring3x
 
Oracle Cloud Café IOT 12 avril 2016
Oracle Cloud Café IOT 12 avril 2016Oracle Cloud Café IOT 12 avril 2016
Oracle Cloud Café IOT 12 avril 2016
 
E learning: kansen en risico's
E learning: kansen en risico'sE learning: kansen en risico's
E learning: kansen en risico's
 
GoAzure 2015 Azure AD for Developers
GoAzure 2015 Azure AD for DevelopersGoAzure 2015 Azure AD for Developers
GoAzure 2015 Azure AD for Developers
 
Pre-Con Ed: Discover the New CA App Experience Analytics 16.3 - The Omnichann...
Pre-Con Ed: Discover the New CA App Experience Analytics 16.3 - The Omnichann...Pre-Con Ed: Discover the New CA App Experience Analytics 16.3 - The Omnichann...
Pre-Con Ed: Discover the New CA App Experience Analytics 16.3 - The Omnichann...
 
Dino Product Overview
Dino Product OverviewDino Product Overview
Dino Product Overview
 
Vasilis Bankov & Calin Iliescu AEGON
Vasilis Bankov & Calin Iliescu AEGONVasilis Bankov & Calin Iliescu AEGON
Vasilis Bankov & Calin Iliescu AEGON
 
Running Business Critical Workloads on AWS
Running Business Critical Workloads on AWS Running Business Critical Workloads on AWS
Running Business Critical Workloads on AWS
 
Node.JS error handling best practices
Node.JS error handling best practicesNode.JS error handling best practices
Node.JS error handling best practices
 
Silicon Valley Grade IT and Cloud Maturity Assessment for Startup Ecosystem i...
Silicon Valley Grade IT and Cloud Maturity Assessment for Startup Ecosystem i...Silicon Valley Grade IT and Cloud Maturity Assessment for Startup Ecosystem i...
Silicon Valley Grade IT and Cloud Maturity Assessment for Startup Ecosystem i...
 
Fun git hub
Fun git hubFun git hub
Fun git hub
 
Gaining visibility into your Openshift application container platform with Dy...
Gaining visibility into your Openshift application container platform with Dy...Gaining visibility into your Openshift application container platform with Dy...
Gaining visibility into your Openshift application container platform with Dy...
 
Cloud Camp Azure概要
Cloud Camp Azure概要Cloud Camp Azure概要
Cloud Camp Azure概要
 
GDPR. Et alors?
GDPR. Et alors?GDPR. Et alors?
GDPR. Et alors?
 
Native XML processing in C++ (BoostCon'11)
Native XML processing in C++ (BoostCon'11)Native XML processing in C++ (BoostCon'11)
Native XML processing in C++ (BoostCon'11)
 

Ähnlich wie A Microservice Architecture for Big Data Pipelines

CAMP IT Slides - Skytap - Brian White
CAMP IT Slides - Skytap - Brian White CAMP IT Slides - Skytap - Brian White
CAMP IT Slides - Skytap - Brian White
Skytap Cloud
 
The Growth Of Data Centers
The Growth Of Data CentersThe Growth Of Data Centers
The Growth Of Data Centers
Gina Buck
 
Bimodal IT and EDW Modernization
Bimodal IT and EDW ModernizationBimodal IT and EDW Modernization
Bimodal IT and EDW Modernization
Robert Gleave
 
Productionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best PracticesProductionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best Practices
MapR Technologies
 

Ähnlich wie A Microservice Architecture for Big Data Pipelines (20)

DevOps: Find Solutions, Not More Defects
DevOps: Find Solutions, Not More DefectsDevOps: Find Solutions, Not More Defects
DevOps: Find Solutions, Not More Defects
 
Black_Friday_Sales_Trushita
Black_Friday_Sales_TrushitaBlack_Friday_Sales_Trushita
Black_Friday_Sales_Trushita
 
Pragmatic approach to Microservice Architecture: Role of Middleware
Pragmatic approach to Microservice Architecture: Role of MiddlewarePragmatic approach to Microservice Architecture: Role of Middleware
Pragmatic approach to Microservice Architecture: Role of Middleware
 
Bigdata-Intro.pptx
Bigdata-Intro.pptxBigdata-Intro.pptx
Bigdata-Intro.pptx
 
CAMP IT Slides - Skytap - Brian White
CAMP IT Slides - Skytap - Brian White CAMP IT Slides - Skytap - Brian White
CAMP IT Slides - Skytap - Brian White
 
MA Microservices Meetup: Move fast and make things
MA Microservices Meetup: Move fast and make thingsMA Microservices Meetup: Move fast and make things
MA Microservices Meetup: Move fast and make things
 
apidays LIVE Hong Kong 2021 - Modernizing Monolith Applications with API Arch...
apidays LIVE Hong Kong 2021 - Modernizing Monolith Applications with API Arch...apidays LIVE Hong Kong 2021 - Modernizing Monolith Applications with API Arch...
apidays LIVE Hong Kong 2021 - Modernizing Monolith Applications with API Arch...
 
Ship code like a keptn
Ship code like a keptnShip code like a keptn
Ship code like a keptn
 
The Growth Of Data Centers
The Growth Of Data CentersThe Growth Of Data Centers
The Growth Of Data Centers
 
Microservices and the Modern IT Stack: Trends of Tomorrow - AppSphere16
Microservices and the Modern IT Stack: Trends of Tomorrow - AppSphere16Microservices and the Modern IT Stack: Trends of Tomorrow - AppSphere16
Microservices and the Modern IT Stack: Trends of Tomorrow - AppSphere16
 
Accelerate User Driven Innovation [Webinar]
Accelerate User Driven Innovation [Webinar]Accelerate User Driven Innovation [Webinar]
Accelerate User Driven Innovation [Webinar]
 
DataOps - Production ML
DataOps - Production MLDataOps - Production ML
DataOps - Production ML
 
Bimodal IT and EDW Modernization
Bimodal IT and EDW ModernizationBimodal IT and EDW Modernization
Bimodal IT and EDW Modernization
 
Faster, Simpler, Better - MongoDB to the rescue
Faster, Simpler, Better - MongoDB to the rescue Faster, Simpler, Better - MongoDB to the rescue
Faster, Simpler, Better - MongoDB to the rescue
 
Productionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best PracticesProductionizing Hadoop: 7 Architectural Best Practices
Productionizing Hadoop: 7 Architectural Best Practices
 
The Enterprise Guide to Building a Data Mesh - Introducing SpecMesh
The Enterprise Guide to Building a Data Mesh - Introducing SpecMeshThe Enterprise Guide to Building a Data Mesh - Introducing SpecMesh
The Enterprise Guide to Building a Data Mesh - Introducing SpecMesh
 
Migrate to microservices
Migrate to microservicesMigrate to microservices
Migrate to microservices
 
[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...
 
JUNIPER: Towards Modeling Approach Enabling Efficient Platform for Heterogene...
JUNIPER: Towards Modeling Approach Enabling Efficient Platform for Heterogene...JUNIPER: Towards Modeling Approach Enabling Efficient Platform for Heterogene...
JUNIPER: Towards Modeling Approach Enabling Efficient Platform for Heterogene...
 
Presentaion final
Presentaion finalPresentaion final
Presentaion final
 

Kürzlich hochgeladen

一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
q6pzkpark
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Abortion pills in Riyadh +966572737505 get cytotec
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
wsppdmt
 

Kürzlich hochgeladen (20)

Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 

A Microservice Architecture for Big Data Pipelines

  • 1. A Microservice Architecture for Big Data Pipelines BigData.be Meetup June 2016
  • 2. Let’s face it: Big Data is no longer a Big Deal 2 Image © User:Kleiner / Wikimedia Commons / CC BY-SA 3.0
  • 3. www.realimpactanalytics.com Yardsticks of Software Development: 1. Create Modularity 2. Ensure Quality 3. Scale Development 4. Painless Deployment 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion Image © User:Guma89 / Wikimedia Commons / CC BY-SA 3.0
  • 4. www.realimpactanalytics.com 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion Modularity is Imperative for On Premise Deployments: RealImpact Product Product Product Client Client Client
  • 5. The Promised Land 5 Image http://hanciong.deviantart.com/art/old-world-map-253195357
  • 6. www.realimpactanalytics.com Micro Services: Maximal Modularity 1. No shared state 2. Minimal coupling 3. Separation of concerns 4. Mix & match 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion
  • 7. www.realimpactanalytics.com Micro Services: Scalable Development 1. Team responsibility 2. Less code = faster ramp up 3. Technology independence 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion
  • 8. www.realimpactanalytics.com Micro Services: Painless Deployment 1. Reproducible environments 2. Versioned APIs 3. Installation = docker-compose up 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion Prod Dev
  • 9. www.realimpactanalytics.com Micro Services: QA Friendly 1. Three levels of testing • Class / function level • Service level • Integration level 2. Staging is no big deal 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion
  • 10. www.realimpactanalytics.com Translation to Big Data Pipelines… 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion TrendingAnalysis Twitter Data TopTweeters Recommend
  • 11. www.realimpactanalytics.com container Translation to Big Data Pipelines… 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion TrendingAnalysis manifest.yaml run.sh jar runtime
  • 12. www.realimpactanalytics.com container Translation to Big Data Pipelines… 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion TrendingAnalysis datasources: - twitter outputs: - id: daily-trends fields: - name: keyword type: string - name: relevance type: integer parameters: … manifest.yaml run.sh jar runtime
  • 13. www.realimpactanalytics.com Translation to Big Data Pipelines… 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion TrendingAnalysis HDFS Input Data Result Parameters
  • 15. www.realimpactanalytics.com Data Modules: QA Friendly? 1. Three levels of testing ✔ • Class / function level • Module level • Integration level 2. Staging is no big deal ✔ 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion
  • 16. www.realimpactanalytics.com Data Modules: Painless Deployment? 1. Reproducible environments (✔) 2. Versioned APIs ✔ 3. Installation = docker-compose up (✔) 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion Prod Dev
  • 17. www.realimpactanalytics.com Data Modules: Scalable Development? 1. Team responsibility ✔ 2. Less code = faster ramp up ✔ 3. Technology independence ✔ 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion
  • 18. www.realimpactanalytics.com Data Modules: Modularity? 1. No shared state (well…) 2. Minimal coupling ✔ 3. Separation of concerns ✔ 4. Mix & match ✔ 1. Challenge:
 Big Data in Production 2. Zen of Micro Services 3. Data Modules 4. Conclusion
  • 20. Brussels Office 5, Place du Champ de Mars 1050 Brussels Belgium Cape Town Office Sovereign Quay, 34 Somerset Road
 8005, Green Point, Cape Town 
 South Africa São Paulo Office 93, Rua Doutor Andrade Pertence Vila Olímpia, São Paulo Brazil Luxembourg Office 691, rue de Neudorf 2220 Luxembourg Grand Duché du Luxembourg www.realimpactanalytics.com Kuala Lumpur Office 28-01, Integra Tower 348 Jalan
 Tun Razak, 50400 Kuala Lumpur Malaysia