SlideShare a Scribd company logo
1 of 25
Download to read offline
Creating a modern data architecture
September 28, 2016
Ben Sharma | CEO
ben@zaloni.com
•  Award-winning provider of enterprise data lake
management solutions:
Integrated data lake management platform
Self-service data preparation
•  Data Lake Design and Implementation Services:
POC, Pilot, Production, Operations, Training
•  Data Science Professional Services
3 Zaloni Proprietary
Increased
Agility
New
Insights
Improved
Scalability
Data lakes are central to the modern data architecture
4 Zaloni Proprietary
•  Store all types of data in its raw format
•  Create Refined, Standardized, Trusted datasets for various use
cases
•  Store data for longer periods of time to enable historical analysis
•  Query and Access the data using a variety of methods
•  Manage streaming and batch data in a converged platform
•  Provide shorter time-to-insight with proper data management and
governance
The data lake promise
5 Zaloni Proprietary
Data architecture modernizationTraditionalModern
Data Lake
Sources ETL EDW
Derived
(Transformed)
Discovery Sandbox
EDW
Streaming
Unstructured Data
Various Sources
Data Discovery
Analytics BI
Data Science
Data Discovery
Analytics BI
6 Zaloni Proprietary
Data lake – the challenges and the solution
•  Ingestion
•  Lack of Visibility
•  Privacy and Compliance
•  Quality Issues
•  Reliance on IT
•  Reusability
•  Rate of Change
•  Skills Gap
•  Complexity
Managing: Delivering:Building:
7 Zaloni Proprietary
Data Lake Reference Architecture
•  Data required for LOB specific views - transformed
from existing certified data
•  Consumers are anyone with appropriate role-based access
•  Standardized on corporate governance/ quality policies
•  Consumers are anyone with appropriate role-based access
•  Single version of truth
Transient
Landing Zone Raw Zone
Refined Zone
Trusted Zone
Sandbox
Data Lake
•  Temporary store of
source data
•  Consumers are IT,
Data Stewards
•  Implemented in highly
regulation industries
•  Original source data
ready for consumption
•  Consumers are ETL
developers, data
stewards, some data
scientists
•  Single source of truth
with history
•  Data required for LOB specific views - transformed
from existing certified data
•  Consumers are anyone with appropriate role-based access
Sensors
(or other time series data)
Relational Data
Stores (OLTP/ODS/
DW)
Logs
(or other unstructured
data)
Social and
shared data
8 Zaloni Proprietary
Inputs:
•  Sources: RDBMS, File, Streaming, Structure/
Unstructured, External Data
Processes:
•  Data transfer and intake: Managed and scheduled
•  Discover metadata
•  Register in the catalog
•  Apply Zone specific policies
•  Capture operational metrics and monitoring
•  Post-ingestion validations and clean up
Outputs:
•  Data transfer to Raw Zone
Policies:
•  Data privacy – tokenization, masking
•  Data security – user access
•  Data quality – profiling, entity level checks
•  Data lifecycle management – short lived, temporary
Transient Landing Zone
Transient
Landing Zone
•  Temporary stores
source data
•  Limited access
•  Consumers are IT,
Data Stewards
•  Implemented in
highly regulation
industries
•  This zone collapses
with Raw Zone if
security is not needed
Transient
Landing Zone
Raw Zone Refined Zone
Trusted Zone
Sandbox
Data Lake
9 Zaloni Proprietary
Inputs:
•  Output from TLZ (Hcatalog entries)
•  Source inputs if TLZ is skipped
Policies:
•  Data privacy – tokenization, masking
•  Data security – user access
•  Data Quality – profiling, entity level, field level
•  Data transformations required for Single View of Truth
•  Combine attributes to one entity
•  Change data formats (e.g. VSAM binary of JSON)
•  Derived columns
•  Field mappings
•  Drop columns
•  Data lifecycle management
•  Could be a candidate for S3 or Object store
Outputs:
•  Data transfer to Trusted Zone
•  Data transfer to Sandbox
Processes:
•  Register and update catalog
•  Apply zone specific policies
•  Operational metrics and monitoring
Raw Zone
Transient
Landing Zone
Raw Zone Refined Zone
Trusted Zone
Sandbox
Data Lake
Raw Zone
•  Original source data
•  Ready for consumption
•  Treated for basic
validation and privacy
•  Metadata available to
everyone but data access
limited based on role
•  Consumers are ETL
developers, data
stewards, some data
scientists
•  Single source of truth
with history
10 Zaloni Proprietary
Inputs:
•  Output from Raw Zone
Processes:
•  Register and update catalog
•  Apply zone specific policies
•  Data transformation required for refined use cases
for LOB such as
•  Customer360 view
•  Periodic snapshots of revenue
Outputs:
•  Data transfer to Refined
Policies:
•  Data security – user access
•  Data lifecycle management
•  Lifetime of use case
•  Use case specific
Trusted Zone
Raw Zone Refined Zone
Trusted Zone
Sandbox
Data Lake
•  Standardized on
corporate
governance/ quality
policies
•  Consumers are
anyone with
appropriate
role-based access
•  Metadata catalog
available to all
•  Single version
of truth
Trusted Zone
Transient
Landing Zone
11 Zaloni Proprietary
Inputs:
•  Output from Trusted Zone
•  Output from Raw Zone for LOB-specific use cases
Processes:
•  LOB specific transformations
•  Aggregates
•  De-normalized
•  Apply zone specific policies
•  Model building for reports
•  Optionally a cube generation
Outputs:
•  Transformed data can be saved back to
Refined Zone
•  Applications such as BI tools
•  Transferred to sandbox if required
Policies:
•  Data security – user access
•  Data lifecycle management
•  Lifetime of use case
•  Use case specific
•  De tokenization if required (based on access)
Refined Zone
Raw Zone Refined Zone
Trusted Zone
Sandbox
Data Lake
Transient
Landing Zone
Refined Zone
•  Data required for
LOB specific views -
transformed from
existing certified
data
•  Consumers are
anyone with
appropriate role-
based access
•  Metadata catalog
available to all
12 Zaloni Proprietary
Inputs:
•  Output from Raw, Trusted and Refined Zones
•  Self-service ingestion
Sandbox
Raw Zone Refined Zone
Trusted Zone
Sandbox
Data Lake
Transient
Landing Zone
Processes:
•  Data scientists drive analysis
•  Self-service for ad-hoc
Outputs:
•  Models that can later be operationalized
•  Optionally, results/data can be sent back to
the Raw Zone
Policies:
•  Data security – user access
•  Data lifecycle management
•  Lifetime of use case
•  Use case specific
•  Data required for
LOB specific views -
transformed from
existing certified
data
•  Consumers are
anyone with
appropriate role-
based access
•  Metadata catalog
available to all
Sandbox
13 Zaloni Proprietary
Data lake Reference Architecture with Zaloni
Consumption ZoneSource
System
File Data
DB Data
ETL Extracts
Streaming
Transient
Landing Zone Raw Zone
Refined
Zone
Trusted
Zone
Sandbox
APIs
Metadata
Management
Data Quality Data Catalog Security
Data Lake
Business Analysts
Researchers
Data Scientists
DATA LAKE MANAGEMENT
& GOVERNANCE PLATFORM
Sensors
(or other time series data)
Relational Data
Stores (OLTP/ODS/
DW)
Logs
(or other unstructured
data)
Social and
shared data
14 Zaloni Proprietary
Data Lake 360°: A holistic approach to actionable big data
1. Enable the lake
2. Govern the data
3. Engage the business
•  Foster a data-driven business
through self-service data
discovery and preparation
•  Safeguard sensitive data and
enable regulatory compliance
•  Improve data visibility, reliability
and quality to reduce time-to-
insight
•  Leverage the full power of a scale-out
architecture with an actionable,
scalable data lake
15 Zaloni Proprietary
•  Managed Ingestion
§  Ability to ingest vast amounts of data
§  Ability to handle a wide variety of formats
(streaming, files, custom) and sources
§  Build in repeatability through automation to pick up incoming data
and apply pre-defined processing
•  Metadata Management
§  Capture and manage operational, technical and business metadata
§  Provides visibility and reliability – key to finding data in the lake
§  Reduced time to insight for analytics
§  File and record level watermarking provides data lineage, enables
audit and traceability
Enable the lake
16 Zaloni Proprietary
•  Data Lineage
§  See how data moves and how it is consumed in the data lake.
§  Safeguard data and reduce risk, always knowing where data
has come from, where it is, and how it is being used.
•  Data Quality
§  Rules based Data validation
§  Integration with the Managed Data Pipeline
§  Stats and metrics for reporting and actions
Govern the data
17 Zaloni Proprietary
•  Data Security and Privacy
§  Differing permissions require enhanced data security
§  Mask or tokenize data before published in the lake for consumption
§  Policy-based security
•  Data lifecycle management across tiered storage environments
§  Hot -> Warm -> Cold on an entity level based on policies/SLAs
§  Across on-premise and cloud environments
§  Provide data management features to automate scheduling and
orchestration of data movement between heterogeneous storage
environments
Govern the data
18 Zaloni Proprietary
Engage the business
•  Data Catalog
§  See what data is available across your enterprise
§  Contribute valuable business information to improve
search and usage
§  Use a shopping cart experience to create sandbox for ad-
hoc and exploratory analytics
•  Self-service Data Preparation
§  Blend data in the lake without a costly IT project
§  Perform interactive data-driven transformations
§  Collaborate and share data assets and transformations
with peers
19 Zaloni Proprietary
•  Rapid increase of Data Lake platforms in the Cloud
•  Hybrid cloud and multi-cloud considerations
•  Support sensitive data on premise and external data
in the cloud
(e.g. client data, machine-generated)
•  Key challenges:
§  Leverage Cloud native features
§  Consistence Data Management and Governance
Emergence of cloud-based and hybrid data lakes
GOVERNANCE
VISIBILITY
20 Zaloni Proprietary
•  How do you create a cloud agnostic data
lake platform?
•  How deploy a cost-effective compute layer?
§  Elastic compute layer
§  Batch and near real-time
•  How do you optimize storage?
§  Support polyglot persistence
§  DLM
•  How do you optimize network connectivity
between Ground to Cloud?
•  How do you meet enterprise security
requirements?
Considerations for data lake in the cloud
CLOUD and HYBRID
ENVIRONMENTS
21 Zaloni Proprietary
Cloud Data Lake Maturity model
Lift and Shift
Cloud Native
features
Multi and
Hybrid Cloud
Replicate on-
premise Data Lake
in the cloud
Leverage Object
stores, Transient
compute platforms,
Messaging systems
Abstraction over
multiple clouds,
consistent Data
Management and
Governance
22 Zaloni Proprietary
Building your blueprint
1. Questions 2. Inputs 3. Outcomes
Business Drivers
AND Business
Questions:
e.g. Where is fraud
occurring? How do I
optimize inventory?
Data Use Cases Platform
Subject Areas
Source System
Capabilities,
Process
Ingest, Organize,
Enrich, Explore
Roadmap
Managed
Data Lake
Analytics
Strategy
=
++
23 Zaloni Proprietary
Typical data lake implementation timeline
POC
Weeks Weeks
Production Data Lake Platform
Proof of Concept:
ü  Demonstrate technical
capabilities of the
platform in the context
of selected use cases
Data Lake Implementation:
ü  Planning, Installation, Training
ü  Sample data sets ingested
ü  Pilot uses cases created
Business Use Case Delivered:
ü  Engage business
stakeholders to identify
production use cases at scale
ü  Review learnings and
optimize the data lake
Data Lake Use Case
Implement Business Use Case
Varies by
Use Case
DATA LAKE MANAGEMENT
AND GOVERNANCE PLATFORM
SELF-SERVICE DATA PREPARATION
FREE T-SHIRT!
Building a Modern Data Architecture
Ben Sharma, CEO and Founder, Zaloni
Wednesday, 2:05 p.m. – 1 E 09
Demo and FREE
copy of book
“Architecting Data Lakes”
Speaking Sessions:
Cloud Computing and Big Data
Ben Sharma, CEO and Founder, Zaloni
Tuesday, 9:30 a.m. – 1B 01/02
Visit Booth #644 for these giveaways!

More Related Content

What's hot

Data quality architecture
Data quality architectureData quality architecture
Data quality architectureanicewick
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...DataScienceConferenc1
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaScyllaDB
 
MDM Strategy & Roadmap
MDM Strategy & RoadmapMDM Strategy & Roadmap
MDM Strategy & Roadmapvictorlbrown
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureDATAVERSITY
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
 
Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...Burak S. Arikan
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Customer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer ExperiencesCustomer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer ExperiencesInformatica
 
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...DATAVERSITY
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureDatabricks
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseJames Serra
 
Build Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingBuild Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingDatabricks
 
Large Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured StreamingLarge Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured StreamingDatabricks
 

What's hot (20)

Data quality architecture
Data quality architectureData quality architecture
Data quality architecture
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
 
MDM Strategy & Roadmap
MDM Strategy & RoadmapMDM Strategy & Roadmap
MDM Strategy & Roadmap
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management Functionality
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Customer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer ExperiencesCustomer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer Experiences
 
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
 
Data Sharing with Snowflake
Data Sharing with SnowflakeData Sharing with Snowflake
Data Sharing with Snowflake
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data Warehouse
 
Build Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingBuild Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks Streaming
 
Large Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured StreamingLarge Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured Streaming
 

Similar to Creating a Modern Data Architecture

Strata San Jose 2017 - Ben Sharma Presentation
Strata San Jose 2017 - Ben Sharma PresentationStrata San Jose 2017 - Ben Sharma Presentation
Strata San Jose 2017 - Ben Sharma PresentationZaloni
 
Webinar -Data Warehouse Augmentation: Cut Costs, Increase Power
Webinar -Data Warehouse Augmentation: Cut Costs, Increase PowerWebinar -Data Warehouse Augmentation: Cut Costs, Increase Power
Webinar -Data Warehouse Augmentation: Cut Costs, Increase PowerZaloni
 
Harness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data LakeHarness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data LakeSaurabh K. Gupta
 
Cloud Computing and Big Data
Cloud Computing and Big DataCloud Computing and Big Data
Cloud Computing and Big DataZaloni
 
Jumbune data analyzer
Jumbune data analyzerJumbune data analyzer
Jumbune data analyzerPrachi Gupta
 
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...Denodo
 
Houd controle over uw data
Houd controle over uw dataHoud controle over uw data
Houd controle over uw dataICT-Partners
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationDATAVERSITY
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesDATAVERSITY
 
Various Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptVarious Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptRafiulHasan19
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothAdaryl "Bob" Wakefield, MBA
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameCloudera, Inc.
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Denodo
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amirydatastack
 

Similar to Creating a Modern Data Architecture (20)

Strata San Jose 2017 - Ben Sharma Presentation
Strata San Jose 2017 - Ben Sharma PresentationStrata San Jose 2017 - Ben Sharma Presentation
Strata San Jose 2017 - Ben Sharma Presentation
 
Webinar -Data Warehouse Augmentation: Cut Costs, Increase Power
Webinar -Data Warehouse Augmentation: Cut Costs, Increase PowerWebinar -Data Warehouse Augmentation: Cut Costs, Increase Power
Webinar -Data Warehouse Augmentation: Cut Costs, Increase Power
 
Operationalizing your Data Lake: Get Ready for Advanced Analytics
Operationalizing your Data Lake: Get Ready for Advanced AnalyticsOperationalizing your Data Lake: Get Ready for Advanced Analytics
Operationalizing your Data Lake: Get Ready for Advanced Analytics
 
Harness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data LakeHarness the power of Data in a Big Data Lake
Harness the power of Data in a Big Data Lake
 
Cloud Computing and Big Data
Cloud Computing and Big DataCloud Computing and Big Data
Cloud Computing and Big Data
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Jumbune data analyzer
Jumbune data analyzerJumbune data analyzer
Jumbune data analyzer
 
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
 
Houd controle over uw data
Houd controle over uw dataHoud controle over uw data
Houd controle over uw data
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use Cases
 
Various Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptVarious Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.ppt
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
 
Datawarehouse org
Datawarehouse orgDatawarehouse org
Datawarehouse org
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the Same
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiry
 
DW (1).ppt
DW (1).pptDW (1).ppt
DW (1).ppt
 

Recently uploaded

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Creating a Modern Data Architecture

  • 1. Creating a modern data architecture September 28, 2016 Ben Sharma | CEO ben@zaloni.com
  • 2. •  Award-winning provider of enterprise data lake management solutions: Integrated data lake management platform Self-service data preparation •  Data Lake Design and Implementation Services: POC, Pilot, Production, Operations, Training •  Data Science Professional Services
  • 3. 3 Zaloni Proprietary Increased Agility New Insights Improved Scalability Data lakes are central to the modern data architecture
  • 4. 4 Zaloni Proprietary •  Store all types of data in its raw format •  Create Refined, Standardized, Trusted datasets for various use cases •  Store data for longer periods of time to enable historical analysis •  Query and Access the data using a variety of methods •  Manage streaming and batch data in a converged platform •  Provide shorter time-to-insight with proper data management and governance The data lake promise
  • 5. 5 Zaloni Proprietary Data architecture modernizationTraditionalModern Data Lake Sources ETL EDW Derived (Transformed) Discovery Sandbox EDW Streaming Unstructured Data Various Sources Data Discovery Analytics BI Data Science Data Discovery Analytics BI
  • 6. 6 Zaloni Proprietary Data lake – the challenges and the solution •  Ingestion •  Lack of Visibility •  Privacy and Compliance •  Quality Issues •  Reliance on IT •  Reusability •  Rate of Change •  Skills Gap •  Complexity Managing: Delivering:Building:
  • 7. 7 Zaloni Proprietary Data Lake Reference Architecture •  Data required for LOB specific views - transformed from existing certified data •  Consumers are anyone with appropriate role-based access •  Standardized on corporate governance/ quality policies •  Consumers are anyone with appropriate role-based access •  Single version of truth Transient Landing Zone Raw Zone Refined Zone Trusted Zone Sandbox Data Lake •  Temporary store of source data •  Consumers are IT, Data Stewards •  Implemented in highly regulation industries •  Original source data ready for consumption •  Consumers are ETL developers, data stewards, some data scientists •  Single source of truth with history •  Data required for LOB specific views - transformed from existing certified data •  Consumers are anyone with appropriate role-based access Sensors (or other time series data) Relational Data Stores (OLTP/ODS/ DW) Logs (or other unstructured data) Social and shared data
  • 8. 8 Zaloni Proprietary Inputs: •  Sources: RDBMS, File, Streaming, Structure/ Unstructured, External Data Processes: •  Data transfer and intake: Managed and scheduled •  Discover metadata •  Register in the catalog •  Apply Zone specific policies •  Capture operational metrics and monitoring •  Post-ingestion validations and clean up Outputs: •  Data transfer to Raw Zone Policies: •  Data privacy – tokenization, masking •  Data security – user access •  Data quality – profiling, entity level checks •  Data lifecycle management – short lived, temporary Transient Landing Zone Transient Landing Zone •  Temporary stores source data •  Limited access •  Consumers are IT, Data Stewards •  Implemented in highly regulation industries •  This zone collapses with Raw Zone if security is not needed Transient Landing Zone Raw Zone Refined Zone Trusted Zone Sandbox Data Lake
  • 9. 9 Zaloni Proprietary Inputs: •  Output from TLZ (Hcatalog entries) •  Source inputs if TLZ is skipped Policies: •  Data privacy – tokenization, masking •  Data security – user access •  Data Quality – profiling, entity level, field level •  Data transformations required for Single View of Truth •  Combine attributes to one entity •  Change data formats (e.g. VSAM binary of JSON) •  Derived columns •  Field mappings •  Drop columns •  Data lifecycle management •  Could be a candidate for S3 or Object store Outputs: •  Data transfer to Trusted Zone •  Data transfer to Sandbox Processes: •  Register and update catalog •  Apply zone specific policies •  Operational metrics and monitoring Raw Zone Transient Landing Zone Raw Zone Refined Zone Trusted Zone Sandbox Data Lake Raw Zone •  Original source data •  Ready for consumption •  Treated for basic validation and privacy •  Metadata available to everyone but data access limited based on role •  Consumers are ETL developers, data stewards, some data scientists •  Single source of truth with history
  • 10. 10 Zaloni Proprietary Inputs: •  Output from Raw Zone Processes: •  Register and update catalog •  Apply zone specific policies •  Data transformation required for refined use cases for LOB such as •  Customer360 view •  Periodic snapshots of revenue Outputs: •  Data transfer to Refined Policies: •  Data security – user access •  Data lifecycle management •  Lifetime of use case •  Use case specific Trusted Zone Raw Zone Refined Zone Trusted Zone Sandbox Data Lake •  Standardized on corporate governance/ quality policies •  Consumers are anyone with appropriate role-based access •  Metadata catalog available to all •  Single version of truth Trusted Zone Transient Landing Zone
  • 11. 11 Zaloni Proprietary Inputs: •  Output from Trusted Zone •  Output from Raw Zone for LOB-specific use cases Processes: •  LOB specific transformations •  Aggregates •  De-normalized •  Apply zone specific policies •  Model building for reports •  Optionally a cube generation Outputs: •  Transformed data can be saved back to Refined Zone •  Applications such as BI tools •  Transferred to sandbox if required Policies: •  Data security – user access •  Data lifecycle management •  Lifetime of use case •  Use case specific •  De tokenization if required (based on access) Refined Zone Raw Zone Refined Zone Trusted Zone Sandbox Data Lake Transient Landing Zone Refined Zone •  Data required for LOB specific views - transformed from existing certified data •  Consumers are anyone with appropriate role- based access •  Metadata catalog available to all
  • 12. 12 Zaloni Proprietary Inputs: •  Output from Raw, Trusted and Refined Zones •  Self-service ingestion Sandbox Raw Zone Refined Zone Trusted Zone Sandbox Data Lake Transient Landing Zone Processes: •  Data scientists drive analysis •  Self-service for ad-hoc Outputs: •  Models that can later be operationalized •  Optionally, results/data can be sent back to the Raw Zone Policies: •  Data security – user access •  Data lifecycle management •  Lifetime of use case •  Use case specific •  Data required for LOB specific views - transformed from existing certified data •  Consumers are anyone with appropriate role- based access •  Metadata catalog available to all Sandbox
  • 13. 13 Zaloni Proprietary Data lake Reference Architecture with Zaloni Consumption ZoneSource System File Data DB Data ETL Extracts Streaming Transient Landing Zone Raw Zone Refined Zone Trusted Zone Sandbox APIs Metadata Management Data Quality Data Catalog Security Data Lake Business Analysts Researchers Data Scientists DATA LAKE MANAGEMENT & GOVERNANCE PLATFORM Sensors (or other time series data) Relational Data Stores (OLTP/ODS/ DW) Logs (or other unstructured data) Social and shared data
  • 14. 14 Zaloni Proprietary Data Lake 360°: A holistic approach to actionable big data 1. Enable the lake 2. Govern the data 3. Engage the business •  Foster a data-driven business through self-service data discovery and preparation •  Safeguard sensitive data and enable regulatory compliance •  Improve data visibility, reliability and quality to reduce time-to- insight •  Leverage the full power of a scale-out architecture with an actionable, scalable data lake
  • 15. 15 Zaloni Proprietary •  Managed Ingestion §  Ability to ingest vast amounts of data §  Ability to handle a wide variety of formats (streaming, files, custom) and sources §  Build in repeatability through automation to pick up incoming data and apply pre-defined processing •  Metadata Management §  Capture and manage operational, technical and business metadata §  Provides visibility and reliability – key to finding data in the lake §  Reduced time to insight for analytics §  File and record level watermarking provides data lineage, enables audit and traceability Enable the lake
  • 16. 16 Zaloni Proprietary •  Data Lineage §  See how data moves and how it is consumed in the data lake. §  Safeguard data and reduce risk, always knowing where data has come from, where it is, and how it is being used. •  Data Quality §  Rules based Data validation §  Integration with the Managed Data Pipeline §  Stats and metrics for reporting and actions Govern the data
  • 17. 17 Zaloni Proprietary •  Data Security and Privacy §  Differing permissions require enhanced data security §  Mask or tokenize data before published in the lake for consumption §  Policy-based security •  Data lifecycle management across tiered storage environments §  Hot -> Warm -> Cold on an entity level based on policies/SLAs §  Across on-premise and cloud environments §  Provide data management features to automate scheduling and orchestration of data movement between heterogeneous storage environments Govern the data
  • 18. 18 Zaloni Proprietary Engage the business •  Data Catalog §  See what data is available across your enterprise §  Contribute valuable business information to improve search and usage §  Use a shopping cart experience to create sandbox for ad- hoc and exploratory analytics •  Self-service Data Preparation §  Blend data in the lake without a costly IT project §  Perform interactive data-driven transformations §  Collaborate and share data assets and transformations with peers
  • 19. 19 Zaloni Proprietary •  Rapid increase of Data Lake platforms in the Cloud •  Hybrid cloud and multi-cloud considerations •  Support sensitive data on premise and external data in the cloud (e.g. client data, machine-generated) •  Key challenges: §  Leverage Cloud native features §  Consistence Data Management and Governance Emergence of cloud-based and hybrid data lakes GOVERNANCE VISIBILITY
  • 20. 20 Zaloni Proprietary •  How do you create a cloud agnostic data lake platform? •  How deploy a cost-effective compute layer? §  Elastic compute layer §  Batch and near real-time •  How do you optimize storage? §  Support polyglot persistence §  DLM •  How do you optimize network connectivity between Ground to Cloud? •  How do you meet enterprise security requirements? Considerations for data lake in the cloud CLOUD and HYBRID ENVIRONMENTS
  • 21. 21 Zaloni Proprietary Cloud Data Lake Maturity model Lift and Shift Cloud Native features Multi and Hybrid Cloud Replicate on- premise Data Lake in the cloud Leverage Object stores, Transient compute platforms, Messaging systems Abstraction over multiple clouds, consistent Data Management and Governance
  • 22. 22 Zaloni Proprietary Building your blueprint 1. Questions 2. Inputs 3. Outcomes Business Drivers AND Business Questions: e.g. Where is fraud occurring? How do I optimize inventory? Data Use Cases Platform Subject Areas Source System Capabilities, Process Ingest, Organize, Enrich, Explore Roadmap Managed Data Lake Analytics Strategy = ++
  • 23. 23 Zaloni Proprietary Typical data lake implementation timeline POC Weeks Weeks Production Data Lake Platform Proof of Concept: ü  Demonstrate technical capabilities of the platform in the context of selected use cases Data Lake Implementation: ü  Planning, Installation, Training ü  Sample data sets ingested ü  Pilot uses cases created Business Use Case Delivered: ü  Engage business stakeholders to identify production use cases at scale ü  Review learnings and optimize the data lake Data Lake Use Case Implement Business Use Case Varies by Use Case
  • 24. DATA LAKE MANAGEMENT AND GOVERNANCE PLATFORM SELF-SERVICE DATA PREPARATION
  • 25. FREE T-SHIRT! Building a Modern Data Architecture Ben Sharma, CEO and Founder, Zaloni Wednesday, 2:05 p.m. – 1 E 09 Demo and FREE copy of book “Architecting Data Lakes” Speaking Sessions: Cloud Computing and Big Data Ben Sharma, CEO and Founder, Zaloni Tuesday, 9:30 a.m. – 1B 01/02 Visit Booth #644 for these giveaways!