THE ROLE OF AWS IN THE DATA LANDSCAPE
OF A FAST-GROWING STARTUP
September 2020
2
Cluno is “your car”
without the hassle of
really owning it
CLUNO IS YOUR ONE-STOP
SHOP.
Your monthly subscription fee includes everything.*
Just drive – Cluno takes care of everything else.
* The following services are included in the monthly package price: car registration, liability insurance, partial and fully comprehensive insurance with deductible, car tax, GEZ fees, maintenance, winter-ready tires, inspections and general inspections. The monthly package price does not include: fuel, electricity, AdBlue, windscreen washer fluid, motor oil.
3
Cluno gives you the
freedom to walk away
anytime
ALWAYS DRIVE THE CAR THAT
FITS YOUR DAY-TO-DAY LIFE.
Cluno is highly flexible:
Drive as long as you want with a 6-month minimum term per car.
4
Cluno turns car ownership
into a superior, digital
user experience
CLUNO IS YOUR LIFETIME
MOBILITY COMPANION.
Subscribe in 3 minutes:
Get approved once (ID and solvency
check) and sign via the app or web.
5
Importance of a data platform even at an early stage of the company
High complexity through
- Many services
- Flexibility for the customer
- Digital experience
Requirements for growing fast and successfully
- Make the right decisions
- Act and react fast
- Learn fast
Data culture
- Data-driven decisions
- A data platform that gives insights, helps make data-driven decisions, and automates complex processes through data products
Requirements for the data platform
- Scalable: more data points, data sources, and consumers
- Flexible: additional use cases and requirements
- Low initial costs in the early startup phase
- Quick value
Challenge Solution
6
Initial thoughts on Architecture
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: topic, queue, data lake, BI service, endpoint, AI, self service.]
7
Step 0: Most pragmatic spike
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, CSV files, manual download, manual import, Excel tables, PowerPoint slides, observing.]
8
Step 1: Automate ingest and storage of data
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, AWS Lambda, Amazon S3, CSV files, manual import, PowerPoint slides, observing.]
9
Step 1: Automate ingest and storage of data
WHAT
- Lambdas import a full snapshot of the data source once per day
- S3 as storage for raw CSV files and as the source for further analysis
WHY
- Lambda
  - Managed service (fast start)
  - Pay as you use (free tier)
- S3
  - Low cost
  - Batch storage
  - File-based external tables (enabled by the Glue metastore)
- CSV
  - Lowest development time
RESULT
- Automated import
- Single data source
- Faster analysis
- More time for the analyst, which can be used for building reports
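The daily snapshot import from Step 1 could look roughly like the following sketch. It is not Cluno's actual code: the bucket name, the key layout, the table name and the placeholder rows are all assumptions made for illustration, and a real function would read the rows from the SQL DB instead.

```python
import csv
import io
from datetime import date


def snapshot_key(table: str, day: date) -> str:
    """Build a date-partitioned S3 key for one daily snapshot (hypothetical layout)."""
    return f"raw/{table}/snapshot_date={day.isoformat()}/{table}.csv"


def rows_to_csv(columns, rows) -> bytes:
    """Serialize a full table snapshot as UTF-8 CSV with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(columns)
    writer.writerows(rows)
    return buffer.getvalue().encode("utf-8")


def store_snapshot(s3_client, bucket, table, columns, rows, day):
    """Upload one snapshot; s3_client is a boto3 S3 client or a stand-in."""
    key = snapshot_key(table, day)
    s3_client.put_object(Bucket=bucket, Key=key, Body=rows_to_csv(columns, rows))
    return key


def handler(event, context):
    """Lambda entry point, meant to be triggered once per day by a schedule."""
    import boto3  # imported lazily so the helpers above work without AWS access

    # Placeholder data (assumption): a real function would query the SQL DB here.
    columns, rows = ["id", "status"], [(1, "active"), (2, "returned")]
    return store_snapshot(
        boto3.client("s3"), "example-data-lake", "subscriptions",
        columns, rows, date.today(),
    )
```

Writing each day's full snapshot under its own key keeps older snapshots available and gives file-based external tables a predictable folder structure to point at.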
10
Step 2: Accessibility for other users
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, AWS Lambda, Amazon S3, AWS Glue, Amazon Athena, Amazon QuickSight, CSV files, manual import, observing.]
11
Step 2: Accessibility for other users
WHAT
- Parquet files saved in S3 instead of raw CSV files
- Glue used as metadata store
- Athena as SQL interface
- QuickSight as dashboarding tool
WHY
- Parquet
  - Structured format that is quick to develop with
  - Memory efficient (columnar storage)
- Glue + Athena
  - Automatic schema definition
  - Fast setup (serverless, SQL connector)
- QuickSight
  - Pay as you use
  - Fast setup (connector to Athena)
  - In-memory optimized calculation engine (SPICE)
RESULT
- Further automation (schema, dashboards)
- Faster results (dashboards update every morning)
- More time for the analyst, which can be used for deeper analysis
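To illustrate how Athena acts as the SQL interface in Step 2, here is a minimal sketch that starts a query and polls until it finishes. The helper name `run_athena_query` and its parameters are assumptions for this example. The client passed in is expected to behave like `boto3.client("athena")`, whose `start_query_execution` and `get_query_execution` calls are used here.

```python
import time

# States after which Athena will not change the query status any more.
FINAL_STATES = ("SUCCEEDED", "FAILED", "CANCELLED")


def run_athena_query(athena, sql, database, output_location, poll_seconds=1.0):
    """Start a query and wait until Athena reports a final state.

    athena: a boto3 Athena client (or a stand-in with the same two methods).
    database: name of the Glue/Athena database that holds the table.
    output_location: S3 URI where Athena writes the result files.
    """
    started = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        execution = athena.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in FINAL_STATES:
            return query_id, state
        time.sleep(poll_seconds)
```

A call might look like `run_athena_query(boto3.client("athena"), "SELECT COUNT(*) FROM subscriptions", "example_db", "s3://example-athena-results/")`, where the table, database and bucket names are placeholders. The sketch has no timeout, so production code would likely add one.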
12
Step 3: Improve Accessibility for other users
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, AWS Lambda, Amazon S3, AWS Glue, Amazon Athena, Amazon Athena view, Tableau (main dashboards), Amazon API Gateway (automated request), observing.]
13
Step 3: Improve Accessibility for other users
WHAT
- Views created on top of Athena tables
- Tableau instead of QuickSight
- API Gateway as interface for data consumers
WHY
- Views
  - Analysts can own views
  - Analysts know the required business logic better
- Tableau
  - Separation of metadata and data (replacing datasets, sharing calculated fields)
  - Visualization options (personalization to CI, wider range of diagrams)
  - Bigger community
- API Gateway
  - Machine-readable insights, accessible from an external network into the network of the data sources
RESULT
- Datasets and insights are generated closer to the teams that hold the business knowledge
- Scalable architecture for creating dashboards
- Automated processes (for the website or internal tools)
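The API Gateway interface from Step 3 can be pictured as a Lambda function behind a proxy integration that returns an insight as JSON. The sketch below is illustrative only: the `model` query parameter, the in-memory `AVAILABLE_CARS_VIEW` rows and their values are invented stand-ins for the result of an analyst-owned Athena view.

```python
import json

# Hypothetical stand-in for rows read from an analyst-owned Athena view.
AVAILABLE_CARS_VIEW = [
    {"model": "compact", "available": 12},
    {"model": "suv", "available": 4},
]


def lookup(model):
    """Return the view row for one model, or None if the model is unknown."""
    for row in AVAILABLE_CARS_VIEW:
        if row["model"] == model:
            return row
    return None


def _response(status, payload):
    """Build the response shape used by the Lambda proxy integration."""
    return {
        "statusCode": status,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(payload),
    }


def handler(event, context):
    """API Gateway passes the HTTP request to the function as `event`."""
    # queryStringParameters is None when the request has no query string.
    params = event.get("queryStringParameters") or {}
    model = params.get("model")
    if model is None:
        return _response(400, {"error": "query parameter 'model' is required"})
    row = lookup(model)
    if row is None:
        return _response(404, {"error": f"unknown model: {model}"})
    return _response(200, row)
```

Keeping the business logic in the view and the handler thin matches the idea that analysts own the views while the API only exposes their results to the website or internal tools.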
14
Step 4: Getting realtime input
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, Amazon SNS, Amazon SQS, AWS Lambda, Amazon Kinesis Data Firehose, Amazon DynamoDB, Amazon S3, AWS Glue, Amazon Athena, Amazon Athena view, Tableau (main dashboards), Amazon API Gateway (automated request), observing.]
15
Step 4: Getting realtime input
WHAT
- SNS/SQS as event hub
- Lambda as a node and for deduplication
- DynamoDB holding the most recent records
- Kinesis as a batch stream to save change events in S3
WHY
- SNS/SQS
  - One producer to many consumers
  - Serverless
  - Scalable
- Lambda
  - One node instead of multiple queues
  - Logic can be included (deduplication)
- Kinesis
  - Collects data until it is written to an S3 file
- DynamoDB
  - No major adjustments to the data structure needed (similar to Athena)
RESULT
- The realtime DB can be used from existing APIs for realtime data products
- Basis for more granular data points (changes of dimensions)
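A rough sketch of the Lambda node from Step 4 follows. It assumes SNS notifications arrive through SQS without raw message delivery, so each SQS body is an SNS envelope with a `Message` field. The payload fields `event_id`, `entity_id` and `updated_at` are hypothetical, and the in-memory `seen_event_ids` set is a simplification, since real deduplication state would have to survive across invocations.

```python
import json


def parse_sqs_records(event):
    """Unwrap SNS notifications that were delivered through an SQS queue."""
    for record in event.get("Records", []):
        envelope = json.loads(record["body"])  # SNS envelope as JSON text
        yield json.loads(envelope["Message"])  # the producer's own payload


def deduplicate(changes, seen_event_ids):
    """Drop changes whose event_id was already processed."""
    fresh = []
    for change in changes:
        if change["event_id"] not in seen_event_ids:
            seen_event_ids.add(change["event_id"])
            fresh.append(change)
    return fresh


def latest_per_entity(changes):
    """Keep only the newest change per entity (ISO-8601 timestamps assumed)."""
    latest = {}
    for change in changes:
        current = latest.get(change["entity_id"])
        if current is None or change["updated_at"] > current["updated_at"]:
            latest[change["entity_id"]] = change
    return latest


def handle(event, table, firehose, stream_name, seen_event_ids):
    """Forward new changes to Firehose and store the newest state in DynamoDB.

    table: a boto3 DynamoDB Table resource; firehose: a boto3 Firehose client.
    """
    fresh = deduplicate(parse_sqs_records(event), seen_event_ids)
    for change in fresh:
        # Firehose buffers these records and later writes them to S3 in batches.
        data = (json.dumps(change) + "\n").encode("utf-8")
        firehose.put_record(DeliveryStreamName=stream_name, Record={"Data": data})
    for item in latest_per_entity(fresh).values():
        table.put_item(Item=item)
    return len(fresh)
```

The sketch only compares timestamps within one batch. A fuller version would probably guard `put_item` with a condition so that an older event arriving late cannot overwrite a newer record.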
16
Step 5: Redefine storing and serving layer
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, Amazon SNS, Amazon SQS, AWS Lambda, Amazon Kinesis Data Firehose, Amazon DynamoDB, Amazon S3, AWS Glue, Amazon Athena, Amazon Athena view, Amazon API Gateway, packaged code, Tableau (main dashboards), Tableau (self service).]
17
Step 5: Redefine storing and serving layer
WHAT
- Metrics are calculated iteratively and dimensions are historized with realtime events
- Other teams are enabled to create their own Tableau dashboards
- Packaged code provided instead of an API with logic
WHY
- Events as source for metrics
  - Scalable
  - Realtime history
  - More granular information about changes
- Tableau enablement
  - Data resources are no longer a bottleneck
- Packaged code
  - Ownership of the expert domain only
RESULT
- More granular insights
- Faster decisions (realtime and self service)
- More stable data product environment
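One way to read "metrics calculated iteratively and dimensions historized" in Step 5 is as a fold over change events: each event updates a running metric and closes the previous version of a dimension value. The following sketch shows that idea under stated assumptions. The event fields (`subscription_id`, `status`, `at`), the metric `active_subscriptions` and the in-order arrival of events are all invented for illustration and are not taken from the slides.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class DimensionVersion:
    value: str
    valid_from: str
    valid_to: Optional[str] = None  # None marks the currently valid version


@dataclass
class State:
    active_subscriptions: int = 0
    status_history: Dict[str, List[DimensionVersion]] = field(default_factory=dict)


def apply_event(state: State, event: dict) -> State:
    """Fold one change event into the running metric and the dimension history."""
    history = state.status_history.setdefault(event["subscription_id"], [])
    previous = history[-1].value if history else None
    new = event["status"]
    if new == previous:
        return state  # nothing changed, so neither metric nor history moves
    if history:
        history[-1].valid_to = event["at"]  # close the previous version
    history.append(DimensionVersion(new, event["at"]))
    # Update the metric incrementally instead of recounting all records.
    if new == "active":
        state.active_subscriptions += 1
    elif previous == "active":
        state.active_subscriptions -= 1
    return state
```

Because the metric is adjusted per event, its value at any point in time can be reconstructed from the event stream, which is what makes the history more granular than a daily snapshot.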
18
Next step: Data platform as service platform
[Architecture diagram. Labels: Data Producers, Data Platform (Ingest, Store, Serve), Technical Consumers, Business Consumers. Components shown: SQL DB, Amazon SNS, Amazon SQS, AWS Lambda, Amazon Kinesis Data Firehose, Amazon DynamoDB, Amazon S3, AWS Glue, Amazon Athena, Amazon Athena view, Amazon API Gateway, packaged code, Tableau (main dashboards), Tableau (self service). Additional labels: owner, business logic, analysts, independence, data engineers.]
19
Lessons learned
Priority
- The earlier the stage in an area, the more important quick value is
- The later the stage, the more important a clean architecture is, to stay flexible and scalable
=> Constantly switch between generating new features and structuring the architecture around them
Toolset
- You constantly need to re-evaluate your chosen toolset
- It can make sense to implement a tool knowing that you will decommission it later
- Keep in mind that tools will change
=> Don't commit too much to one tool
Team setup
- Move from a centralized team of experts to cross-functional teams as soon as team size and maturity are big enough
- Move ownership to business teams as soon as possible
=> Ownership needs to be where the business knowledge is
JOIN US FOR THE RIDE
Max Ehrlich
Head of Data
max.ehrlich@cluno.com Cluno GmbH
www.cluno.com
