SlideShare ist ein Scribd-Unternehmen logo
1 von 4
Downloaden Sie, um offline zu lesen
A couple of days ago I came across the article "Mapping AWS, Google Cloud, Azure Services to Big
Data Warehouse Architecture" here. I do know a bit about data warehousing, and even big data
warehouse architecture. However, what interests me is actually a "map of various cloud services
against the big data warehouse architecture". More precisely, cloud services from "the three most
popular cloud platforms: Microsoft Azure, Google Cloud Platform, and Amazon AWS" are mapped to
their open source origination and/or counterparts. As a technical IBMer, my primary area is Big Data &
Advanced Analytics, but I happen to know a little about the IBM Bluemix platform. So for (more)
completeness, here it comes Bluemix! - Note though, here only Bluemix services involved in big data
warehouse architecture are listed. To explore more, see Bluemix website.
Disclaimer
1.While I'm employed by IBM this article represents completely my personal viewpoints.
Furthermore, I've tried my best but still I can't guarantee the 100% completeness, accuracy,
and/or potential services changes.
2.The original author of the article aforementioned own(s) the copyright and by no means I'm
modifying the content. Neither do I agree nor disagree with the author on the content. However,
for convenience, I'm putting the original table (or map) along with their IBM Bluemix
counterparts side by side.
PS, Due to space limitation, all the open source stuff in the Bluemix column refers to the cloud service
provisioned by IBM Bluemix rather than the original open source software, e.g., HDFS/Hadoop/Hive,
etc. means the individual component within BigInsights for Apache Hadoop or BigInsights for Apache
Hadoop (Subscription) service and PostgreSQL refers to ElephantSQL and/or Compose for
PostgreSQL service.
Open Source Amazon AWS Microsoft Azure Google Cloud IBM Bluemix
Batch Ingest
Sqoop
File Transfer
Flume
StreamSets
AWS Data Transfer
Services (various
options)
Import/Export
Service
Data Factory
Cloud DataFlow
Sqoop
File Transfer
Lift (Aspera)
Flume
Various services
Streaming Ingest
Flume
StreamSets
Amazon Kinesis
Firehose
Event Hubs
IOT Hub
Cloud DataFlow
Flume, Spark
Streaming Analytics
Persistent
Storage
HDFS
RDBMS
S3, Glacier
RDS
Storage Blob
HDFS
SQL Database
Persistent Disk
Google Cloud
Storage
Cloud SQL
HDFS
RDBMS (IBM
Proprietary: Db2,
dashDB, Informix ...
open source: MySQL,
PostgreSQL ...
NoSQL: MongoDB,
Redis, Cloudant ...
Block Storage, Cloud
Object Storage, File
Storage, CDN, etc.
Transient Storage Kafka Kinesis
Event Hubs
IOT Hub
HDInsight (Kafka)
Cloud Pub/Sub
Cloud IoT Core
Kafka, Message Hub
Batch Processing
Hive
Flink, Spark
MapReduce
PostgreSQL
EMR Spark
EMR Hadoop
EMR Presto
AWS Batch
Redshift
Azure Batch
HDInisght
(Spark/Map Reduce)
SQL Data
Warehouse
Data Lake Analytics
Cloud Dataflow
(open source
Apache Beam)
Cloud DataProc
(Spark, Hadoop)
Hive, Spark,
MapReduce, MySQL,
PostgreSQL
Db2, Information Server
on Cloud, etc.
Stream
Processing
Flink
Spark
Beam
Amazon Kinesis
Streams
Amazon Kinesis
Analytics
EMR Spark
Stream Analytics
HDInsight (Storm,
Spark)
Cloud Dataflow
(open source
Apache Beam)
DataProc (Spark,
Hadoop)
Spark
Streaming Analytics
Machine
Learning
Scikit
Tensorflow
Spark MLLib
Lex
Polly
Recognition
Azure ML
Cognitive Services
Natural
Language
SpeechTranslati
Data Science
Experience (includes
TensorFlow
etc.
Huge number
of libraries
Amazon Machine
Learning
on
Vision
Video
ML Engine
support for R, Python
with scikit, TensorFlow,
Spark with MLLib, etc.)
Watson Machine
Learning
Serving Storage
Graph
JanusGraph
N/A Marketplace
Only, e.g. OrientDB
N/A Marketplace
only, e.g OrientDB
N/A IBM Graph
Serving Storage
BI/EDW
Impala +
Kudu
Redshift
Athena
SQL Data
Warehouse
BigQuery
Db2 for Warehouse
BigSQL
Serving Storage
Search (keywords
+ facets)
Solr
Amazon
CloudSearch
Amazon
Elasticsearch
Azure Search
N/A
Marketplace,
e.g. Solr
Solr, Compose for
ElasticSearch
Serving Storage
RDBMS
PostgreSQL RDS SQL DB Cloud SQL
IBM Proprietary: Db2,
dashDB, Informix ...
and open source:
MySQL, PostgreSQL ...
Serving Storage
NoSQL
HBase DynamoDB
HDInsight (HBase)
CosmosDB
BigTable
Spanner
DataStore
NoSQL: HBase,
MongoDB, Redis,
Cloudant, Redis ...
Sandboxes
Notebook
Zeppelin EMR Zeppelin Azure Notebooks Cloud Datalab
Data Science
Experience (Juypter)
Spark
Sandboxes Data
Science or
Preparation
Platform
Dataiku DSS
Community
Edition (not
open source)
N/A Marketplace
only, e.g. Dataiku
DSS
N/A Marketplace
only, e.g. Dataiku
DSS
Cloud DataPrep
(beta). Under the
hood this is
Trifacta.
Data Science
Experience
Clients/Data
Apps
Superset (BI) Quicksight PowerBI
Google Data
Studio
Data Science
Experience
Watson Machine
Learning
Decision Optimization
Orchestration Airflow AWS Data Pipeline Data Factory
N/A
Marketplace
Workload Scheduler (?)
ETL Tool N/A AWS Glue (beta) Data Factory N/A
Marketplace
Data Connect
Information Server on
Cloud
MDM Hub N/A N/A Marketplace N/A Marketplace
N/A
Marketplace
MDM on Cloud
Lineage N/A AWS Glue (beta) N/A N/A
Information Server on
Cloud
Catalog N/A AWS Glue (beta) Data Catalog
N/A
Marketplace
Information Server on
Cloud

Weitere ähnliche Inhalte

Kürzlich hochgeladen

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 

Kürzlich hochgeladen (20)

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 

Empfohlen

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Empfohlen (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Cloud platform aws-gcp-azure-bluemix

  • 1. A couple of days ago I came across the article "Mapping AWS, Google Cloud, Azure Services to Big Data Warehouse Architecture" here. I do know a bit about data warehousing, and even big data warehouse architecture. However, what interests me is actually a "map of various cloud services against the big data warehouse architecture". More precisely, cloud services from "the three most popular cloud platforms: Microsoft Azure, Google Cloud Platform, and Amazon AWS" are mapped to their open source origination and/or counterparts. As a technical IBMer, my primary area is Big Data & Advanced Analytics, but I happen to know a little about the IBM Bluemix platform. So for (more) completeness, here it comes Bluemix! - Note though, here only Bluemix services involved in big data warehouse architecture are listed. To explore more, see Bluemix website. Disclaimer 1.While I'm employed by IBM this article represents completely my personal viewpoints. Furthermore, I've tried my best but still I can't guarantee the 100% completeness, accuracy, and/or potential services changes. 2.The original author of the article aforementioned own(s) the copyright and by no means I'm modifying the content. Neither do I agree nor disagree with the author on the content. However, for convenience, I'm putting the original table (or map) along with their IBM Bluemix counterparts side by side. PS, Due to space limitation, all the open source stuff in the Bluemix column refers to the cloud service provisioned by IBM Bluemix rather than the original open source software, e.g., HDFS/Hadoop/Hive, etc. means the individual component within BigInsights for Apache Hadoop or BigInsights for Apache Hadoop (Subscription) service and PostgreSQL refers to ElephantSQL and/or Compose for PostgreSQL service.
  • 2. Open Source Amazon AWS Microsoft Azure Google Cloud IBM Bluemix Batch Ingest Sqoop File Transfer Flume StreamSets AWS Data Transfer Services (various options) Import/Export Service Data Factory Cloud DataFlow Sqoop File Transfer Lift (Aspera) Flume Various services Streaming Ingest Flume StreamSets Amazon Kinesis Firehose Event Hubs IOT Hub Cloud DataFlow Flume, Spark Streaming Analytics Persistent Storage HDFS RDBMS S3, Glacier RDS Storage Blob HDFS SQL Database Persistent Disk Google Cloud Storage Cloud SQL HDFS RDBMS (IBM Proprietary: Db2, dashDB, Informix ... open source: MySQL, PostgreSQL ... NoSQL: MongoDB, Redis, Cloudant ... Block Storage, Cloud Object Storage, File Storage, CDN, etc. Transient Storage Kafka Kinesis Event Hubs IOT Hub HDInsight (Kafka) Cloud Pub/Sub Cloud IoT Core Kafka, Message Hub Batch Processing Hive Flink, Spark MapReduce PostgreSQL EMR Spark EMR Hadoop EMR Presto AWS Batch Redshift Azure Batch HDInisght (Spark/Map Reduce) SQL Data Warehouse Data Lake Analytics Cloud Dataflow (open source Apache Beam) Cloud DataProc (Spark, Hadoop) Hive, Spark, MapReduce, MySQL, PostgreSQL Db2, Information Server on Cloud, etc. Stream Processing Flink Spark Beam Amazon Kinesis Streams Amazon Kinesis Analytics EMR Spark Stream Analytics HDInsight (Storm, Spark) Cloud Dataflow (open source Apache Beam) DataProc (Spark, Hadoop) Spark Streaming Analytics Machine Learning Scikit Tensorflow Spark MLLib Lex Polly Recognition Azure ML Cognitive Services Natural Language SpeechTranslati Data Science Experience (includes
  • 3. TensorFlow etc. Huge number of libraries Amazon Machine Learning on Vision Video ML Engine support for R, Python with scikit, TensorFlow, Spark with MLLib, etc.) Watson Machine Learning Serving Storage Graph JanusGraph N/A Marketplace Only, e.g. OrientDB N/A Marketplace only, e.g OrientDB N/A IBM Graph Serving Storage BI/EDW Impala + Kudu Redshift Athena SQL Data Warehouse BigQuery Db2 for Warehouse BigSQL Serving Storage Search (keywords + facets) Solr Amazon CloudSearch Amazon Elasticsearch Azure Search N/A Marketplace, e.g. Solr Solr, Compose for ElasticSearch Serving Storage RDBMS PostgreSQL RDS SQL DB Cloud SQL IBM Proprietary: Db2, dashDB, Informix ... and open source: MySQL, PostgreSQL ... Serving Storage NoSQL HBase DynamoDB HDInsight (HBase) CosmosDB BigTable Spanner DataStore NoSQL: HBase, MongoDB, Redis, Cloudant, Redis ... Sandboxes Notebook Zeppelin EMR Zeppelin Azure Notebooks Cloud Datalab Data Science Experience (Juypter) Spark Sandboxes Data Science or Preparation Platform Dataiku DSS Community Edition (not open source) N/A Marketplace only, e.g. Dataiku DSS N/A Marketplace only, e.g. Dataiku DSS Cloud DataPrep (beta). Under the hood this is Trifacta. Data Science Experience Clients/Data Apps Superset (BI) Quicksight PowerBI Google Data Studio Data Science Experience Watson Machine Learning Decision Optimization Orchestration Airflow AWS Data Pipeline Data Factory N/A Marketplace Workload Scheduler (?) ETL Tool N/A AWS Glue (beta) Data Factory N/A Marketplace Data Connect Information Server on
  • 4. Cloud MDM Hub N/A N/A Marketplace N/A Marketplace N/A Marketplace MDM on Cloud Lineage N/A AWS Glue (beta) N/A N/A Information Server on Cloud Catalog N/A AWS Glue (beta) Data Catalog N/A Marketplace Information Server on Cloud