SlideShare ist ein Scribd-Unternehmen logo
1 von 41
Downloaden Sie, um offline zu lesen
Data Architecture
Best Practices for
Advanced Analytics
Presented by: William McKnight
“#1 Global Influencer in Big Data” Thinkers360
President, McKnight Consulting Group
A 2 time Inc. 5000 Company
@williammcknight
www.mcknightcg.com
(214) 514-1444
Second Thursday of Every Month, at 2:00 ET
With William McKnight
Image Goes
Here
Know Better®
May 12, 2022
© 2022 ChaosSearch, Inc.
ChaosSearch Activates the Data Lake
for Analytics at Scale
The ChaosSearch Data Lake Platform
eliminates complexity and helps organizations
overcome the challenges posed by costly,
siloed analytics solutions.
Users perform both search and SQL queries
concurrently and in situ from their cloud
object storage, without data pipelining,
transformation, or movement.
The end result: Dramatic reductions in time,
cost and complexity.
FEATURED CUSTOMERS
© 2022 ChaosSearch, Inc.
The ChaosSearch Approach: Connect Seamlessly, Eliminate Complexity
Constantly Scale and Reduce Cost
3
* This is a roadmap item and subject to change.
BENEFICIAL OUTCOMES
✓ One unified data lake for analytics at scale
✓ Combined search, SQL insights (PLG, security, etc.)
✓ No data pipelines or data movement
✓ No schema management, sharding or managing
server clusters
✓ Simple, open API access to analytics tools of choice
✓ Scale and performance for analytic workloads with up
to 80% cost savings
✓ Unprecedented simplicity
© 2022 ChaosSearch, Inc.
Optimize Cloud Apps, Infrastructure and Security
Optimize CloudOps/DevOps
• Unlimited retention to optimize troubleshooting and
performance of increasingly complex cloud architectures
• Better log coverage to shorten time to resolution
• Eliminate administrative toil, reduce operational costs
4
Optimize SecOps
• Affordable long-term retention for in-depth forensics
• Centralize logs in a security data lake for end-to-end
visibility and monitoring
• Simpler, more cost-effective compliance
“With the move to ChaosSearch, 98% of all
operational burdens have been lifted from us,
allowing us to focus on Blackboard-specific
tasks.”
Joel Snook, Director, DevOps Engineering
© 2022 ChaosSearch, Inc. 6
Single Access Pane on Global Deployment
Case Study
Challenge
Need to reduce Elasticsearch costs & improve
stability
• Architectural challenges at scale caused system
downtime requiring support
• Challenges with infrastructure planning due to
unpredictable log spikes -- especially during
COVID-19 when demand grew 3,000%
• SREs spent 10-15+ hours per week maintaining
environment
Solution
• GDPR-compliant ChaosSearch log analytics solution provides
single access pane to storage across multiple environments
• True managed service with 99.999% availability
• Use cases: visibility of cloud environments at scale, long-term
app troubleshooting & alerting, root cause analysis
Impact
• Increased uptime with decreased management overhead
• Retention more than doubled at 50% cost
• No more data duplication
• Querying activities scale on demand without manual
reconfigurations or cost increases
• SREs spend time on value-add work vs. ELK stack
maintenance
Need access to long-term data for resolving
academic disputes, compliance purposes, etc.
© 2022 ChaosSearch, Inc.
Before: Amazon ES
Single Access Pane on Global Deployment
© 2021 ChaosSearch, Inc. 7
Case Study
• Multiple environments per region per team (3 teams)
• Unstable environment with massive admin pain & multiple silos with failure points
(master nodes)
• $2M per year
• 1-3 days retention
AZ2
AZ1
…
…
ca-central-1
AZ2
AZ1
…
…
us-east-2
AZ2
AZ1
…
…
ap-northeast-2
AZ2
AZ1
…
…
eu-central-1
AZ2
AZ1
…
…
AZ2
AZ1
…
Master
Master
Master
Data
Node
Data
Node
Data
Node
Data
Node
Master
Master
Master
Data
Node
Data
Node
Data
Node
Data
Node
Master
Master
Master
Data
Node
Data
Node
Data
Node
Data
Node
Master
Master
Master
Data
Node
Data
Node
Data
Node
Data
Node
Master
Master
Master
Data
Node
Data
Node
Data
Node
Data
Node
Master
Master
Master
Data
Node
Data
Node
Data
Node
Data
Node …
ap-southeast-2
ap-southeast-1
With ChaosSearch
Single access pane,
multiple environments for
storage & access
isolation (e.g., GDPR)
$950k per year
(50%+ discount)
True managed service
with 5 9s availability
7 days retention
(could be unlimited)
© 2022 ChaosSearch, Inc.
A Question for Our Audience Today…
8
What are your greatest challenges when it comes to deriving insight from all your
data?
• Expertise
• Resources
• Technology
• Time
• Cost
• Other
Thank you!
Learn more
● ChaosSearch.io
● Free Trial - chaossearch.io/trial
● Demo Webinar - chaossearch.io/demo-webinar
● 2022 Cloud Data & Analytics Survey Report
© 2022 ChaosSearch, Inc. 10
Connect to any
and all data in
your cloud object
storage
Index into highly
compressed,
unified
representation of
data that never
leaves your
storage
Prepare your
data views for
discovery and
analytics with no
data movement
Use your existing
tools for
• Log analytics
• Operational BI
queries
• Dashboards
• Monitoring and
alerting
ChaosSearch is Built for Operational Analytics at Scale
Index
2 Refine
3 Analyze
4
Store
1
© 2022 ChaosSearch, Inc.
Log Analytics Transformed
Before: Elasticsearch (ELK stack)
DevO
ps
SecO
ps
LOB
???
• Limited retention
• Expensive to scale
• Management and
configuration
challenges
• Downtime created by
instability at scale
• Multiple data silos
created due to the
limits above
Cloud Object Storage
i.e., Google GCS, AWS S3
Dev
Ops
Sec
Ops
LOB ???
PUBLISHED
ELASTIC API
One unified data lake
Unlimited scale and retention.
Save up to 80% on Managed Service with 99.99% uptime.
With ChaosSearch
11
Appendix
© 2022 ChaosSearch, Inc. 13
Partially In-Situ architectures do
work and add incremental value.
But fall short of a true
breakthrough.
In-Situ
INFORMATION
In-Situ
PROCESSING
• Underlying representation
of data is native to use
case
• Multi-model access
• Distributed
processing
• Leveraging the
power of cloud
computing
elasticity
In-Situ
ALGORITHM
• Low memory
execution
• Stream based
operations
A fully In-Situ Architecture is needed to
unlock the full spectrum of In-Situ advantages
A Unified Data Lake Architecture for Log and SQL Analytics
14
ChaosSearch uniquely solves for known and unknown data and queries
* Source: Gartner. DW = data warehouse
The Zone of Confusion Within the Data
and Analytics Infrastructure Model
Expanding
Understanding and
Investigating
Founational
Core
Innovation
and Exploration
Establish
Value
Traditional DW
Data Lake
Zone of Confusion
Questions
Known Unknown
Data
Unknown
Known
Bring together search and relational analytics
• Eliminate pipelines, ETL, data movements
• Faster insights
A unified data lake architecture that supports:
• Innovation & exploration
• Investigative queries
• Operational analytics
Disrupts the economics of Big Data
A Unified Data Lake Architecture for Log and SQL Analytics
15
ChaosSearch uniquely solves for broadest scope of analytics needs
Bring together search and relational analytics
• Eliminate pipelines, ETL, data movements
• Faster insights
A unified data lake architecture that supports:
• Innovation & exploration
• Investigative queries
• Operational analytics
Disrupts the economics of Big Data
* Source: Gartner. DW = data warehouse
The Zone of Confusion Within the Data
and Analytics Infrastructure Model
Expanding
Understanding and
Investigating
Founational
Core
Innovation
and Exploration
Establish
Value
Traditional DW
Data Lake
Questions
Known Unknown
Data
Unknown
Known
William McKnight
President, McKnight Consulting Group
• Consulted to Pfizer, Scotiabank, Fidelity, TD
Ameritrade, Teva Pharmaceuticals, Verizon, and many
other Global 1000 companies
• Frequent keynote speaker and trainer internationally
• Hundreds of articles, blogs and white papers in
publication
• Focused on delivering business value and solving
business problems utilizing proven, streamlined
approaches to information management
• Former Database Engineer, Fortune 50 Information
Technology executive and Ernst&Young Entrepreneur
of Year Finalist
• Owner/consultant: Data strategy and implementation
consulting firm
William McKnight
The Savvy Manager’s Guide
The
Savvy
Manager’s
Guide
Information
Management
Information Management
Strategies for Gaining a
Competitive Advantage with Data
2
3
All Data Under
Management
Best Practice:
Get all data
under
management
Data is Under Management when it is…
• In a leveragable platform
• In an appropriate platform for its profile and
usage
• With high non-functionals (Availability,
performance, scalability, stability, durability,
secure)
• Data is captured at the most granular level
• Data is at a data quality standard (as
defined by Data Governance)
• Enables self-service
4
Best Practice:
Enpower
everyone with
true self-service
“80% of analysts’ time
is spent simply discovering
and preparing data.”
What’s Your Data Strategy, Thomas Davenport, HBR 2017
Best Practice:
Start getting
concerned with
the tools and
processes of the
analyst
The Relational Database Data Page
© McKnight Consulting Group, 2010
Page Header
Page
Footer
Row IDs
Records
1120MCG William McKnight President
214-514-1444 wmcknight@mcknightcg.com
1121Stolt Offshore MS Ltd Joe Tyron Director 226-5555-
1269 jtyron@stoltoffshore.com
1122Medtronic, Inc. Mark Smith Principle Database Administrator
763-555-2557
mark.smith@medtronic.com
Columnar Orientation
7
Best Practice:
Make all
analytic
structure
columnar
Data Lakes
• P
8
Parquet format
Best Practice:
Put big data in
data lakes
Best Practice:
Index the data
lake
Data Lakes
• Common & centralized storage for the enterprise
• No defined data model into which the data is
formed
• No relationships between the datasets
• Historical data retention
• All data formats
• For big data
• Analytical processing
• Data scientists and analysts
• Less governance/quality than data warehouse
– Focus: Ingestion
9
Graph Databases
Bridge
vertex
Bridge
vertex
10
• Subject: John R Peterson Predicate: Knows Object: Frank T
Smith
• Subject: Triple #1 Predicate: Confidence Percent Object: 70
• Subject: Triple #1 Predicate: Provenance Object: Mary L Jones
Best Practice:
Use graph
databases for
sizable
connected data
Data Virtualization
“The right answer is not
always to centralize the
data. Data Virtualization
will be of utmost
importance as the
‘perpetual short-term’
solution to the need.”
11
Data Warehouses
Marts & Cubes Operational
Data Stores Transactional
Sources
File Systems
Big Data
Enterprise Data
Virtualization
Best Practice:
Enable data
virtualization for
edge and
temporary
needs
Enterprise Analytic Stack
• Dedicated Compute
• Storage
• Data Integration
• Streaming
• Analytics
• Data Exploration
• Data Lake
• Business Intelligence
• Machine Learning
• Identity Management
• Data Catalog
• Data Virtualization
Best Practice:
Leverage best of
breed for your
analytics stack
• Autonomous Administration
• Lack of Platform Features Leads to Increased
Configuration and Management
– stored procedures, referential integrity and uniqueness
capabilities
– mission critical options for backup and disaster recovery, which
typically includes a standby database
– full ANSI-SQL compliance
• Performance
Total Cost of Ownership is More Than Just
Cloud Costs
Best Practice:
Get a strong
handle on your
cloud costs
Capabilities for Data Integration for
Enterprise Data
• Comprehensive Native Connectivity
• Multi-Latency Data Ingestion
• Data Integration (in ETL, ELT, Streaming)
• Data Quality and Data Governance
• Data Cataloging and Metadata
Management
• Enterprise Trust, Enterprise Scale (or Class)
• AI Intelligence and Automation
• Ecosystem and Multi-cloud
Data Integration Options
Project Technical Environments Recommended For
Consideration Project Scope
Heterogenous:
Cloudera Any Any
IBM Any Any
Informatica Any Any
Talend Any Any
Specialist:
AWS (Glue) Environments on AWS with core of Redshift, EMR Any
Azure (Azure Data Factory) Environments on Azure with core of Synapse, HDInsight Any
FiveTran Any Contained scope
Google Environments on GCP with core of BQ, DataProc Any
Matillion Any Contained scope
Oracle Environments with Oracle database Any
SAP SAP-only environments SAP projects
Best Practice:
Use fit-for-
purpose data
integration
Competitive Analytic
Architectures
Architecture Component Needs
• Security and Privacy
• Governance and Compliance
• Availability
• Backup and Recovery
• Performance
• Scalability
• Licensing
17
Analytics Reference Architecture
Logs
(Apps, Web,
Devices)
User tracking
Operational
Metrics
Offload
data
Raw Data Topics
JSON, AVRO
Processed
Data Topics
Sensors
and
/ or
Transactiona
l/ Context
Data
OLTP/ODS
ETL
Or
EL with
T in Spark
Batch
Low
Latency
Applications
Files
In-
database
analytics
Reach
through
or ETL/ELT
or
Stream
Processing
or
Stream
Processing
Q
Q
Data
Warehouse
Data Lake
Data Lakehouse
19
and
/ or
In-
database
analytics
Reach
through
Q
Q
Data
Warehouse
Data Lake
Data Mesh
Logs
(Apps, Web,
Devices)
User tracking
Operational
Metrics
Offload
data
Sensors
and
/ or
Transactiona
l/ Context
Data
OLTP/ODS
Batch
Low
Latency
Applications
Files
In-
database
analytics
Reach
through
or ETL/ELT
or
Stream
Processing
or
Stream
Processing
Q
Q
Data
Warehouse
Data Lake
ETL
Or
EL with
T in Spark
Raw Data Topics
JSON, AVRO
Processed
Data Topics
Data Fabric
Logs
(Apps, Web,
Devices)
User tracking
Operational
Metrics
Raw Data Topics
JSON, AVRO
Processed
Data Topics
Sensors
and
/ or
Transactiona
l/ Context
Data
OLTP/ODS
ETL
Or
EL with
T in Spark
Batch
Low
Latency
Applications
Files
In-
database
analytics
or
Stream
Processing
Data
Warehouse
Data Lake
Best Practice: Pursue
mesh and fabric
architectures to the
degree possible
Data Cloud (Snowflake)
Logs
(Apps, Web,
Devices)
User tracking
Operational
Metrics
Raw Data Topics
JSON, AVRO
Processed
Data Topics
Sensors
and
/ or
Transactiona
l/ Context
Data
OLTP/ODS
ETL
Or
EL with
T in Spark
Batch
Low
Latency
Applications
Files
In-
database
analytics
or
Stream
Processing
Data
Warehouse
Data Lake
Summary
• Get all enterprise data under management
• RDBMS (/columnar), Cloud Storage/Parquet, Graph cover most analytic
platform needs
• Cost of ownership is more than the cloud costs
• Data Integration is vital to Data Architecture for Modern Analytics
• The Data Mesh and Data Fabric are decentralizing the architecture
24
Upcoming Topics
• Is Our Information Management Mature?
• The Future based on AI & Analytics
• Organizational Change Management for Data & Analytics
Driven Projects
• Graph Database Use Cases
• Assessing New Databases: Translytical Use Cases
25
Second Thursday of Every Month, at 2:00 ET
Data Architecture
Best Practices for
Advanced Analytics
Presented by: William McKnight
“#1 Global Influencer in Big Data” Thinkers360
President, McKnight Consulting Group
A 2 time Inc. 5000 Company
@williammcknight
www.mcknightcg.com
(214) 514-1444
Second Thursday of Every Month, at 2:00 ET
#AdvAnalytics

Weitere ähnliche Inhalte

Was ist angesagt?

Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesDATAVERSITY
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Microsoft Data Platform - What's included
Microsoft Data Platform - What's includedMicrosoft Data Platform - What's included
Microsoft Data Platform - What's includedJames Serra
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogDATAVERSITY
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationDenodo
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture DesignKujambu Murugesan
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)DATAVERSITY
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookJames Serra
 
The Business Value of Metadata for Data Governance
The Business Value of Metadata for Data GovernanceThe Business Value of Metadata for Data Governance
The Business Value of Metadata for Data GovernanceRoland Bullivant
 
Data Governance
Data GovernanceData Governance
Data GovernanceRob Lux
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesDATAVERSITY
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...DataScienceConferenc1
 

Was ist angesagt? (20)

Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Microsoft Data Platform - What's included
Microsoft Data Platform - What's includedMicrosoft Data Platform - What's included
Microsoft Data Platform - What's included
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture Design
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
The Business Value of Metadata for Data Governance
The Business Value of Metadata for Data GovernanceThe Business Value of Metadata for Data Governance
The Business Value of Metadata for Data Governance
 
Data Governance
Data GovernanceData Governance
Data Governance
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 

Ähnlich wie Data Architecture Best Practices for Advanced Analytics

2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
 
Houd controle over uw data
Houd controle over uw dataHoud controle over uw data
Houd controle over uw dataICT-Partners
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeDenodo
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesDATAVERSITY
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Kent Graziano
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationDATAVERSITY
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
ADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic SolutionsADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic SolutionsDATAVERSITY
 
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014Amazon Web Services
 
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)Denodo
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Denodo
 
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the CloudHow Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the CloudTyler Wishnoff
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse OptimizationCloudera, Inc.
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 

Ähnlich wie Data Architecture Best Practices for Advanced Analytics (20)

2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
Houd controle over uw data
Houd controle over uw dataHoud controle over uw data
Houd controle over uw data
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Accelerating Data Warehouse Modernization
Accelerating Data Warehouse ModernizationAccelerating Data Warehouse Modernization
Accelerating Data Warehouse Modernization
 
ADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic SolutionsADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic Solutions
 
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
 
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
 
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the CloudHow Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse Optimization
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 

Mehr von DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...DATAVERSITY
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceDATAVERSITY
 

Mehr von DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business Intelligence
 

Kürzlich hochgeladen

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 

Kürzlich hochgeladen (20)

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 

Data Architecture Best Practices for Advanced Analytics

  • 1. Data Architecture Best Practices for Advanced Analytics Presented by: William McKnight “#1 Global Influencer in Big Data” Thinkers360 President, McKnight Consulting Group A 2 time Inc. 5000 Company @williammcknight www.mcknightcg.com (214) 514-1444 Second Thursday of Every Month, at 2:00 ET With William McKnight
  • 3. © 2022 ChaosSearch, Inc. ChaosSearch Activates the Data Lake for Analytics at Scale The ChaosSearch Data Lake Platform eliminates complexity and helps organizations overcome the challenges posed by costly, siloed analytics solutions. Users perform both search and SQL queries concurrently and in situ from their cloud object storage, without data pipelining, transformation, or movement. The end result: Dramatic reductions in time, cost and complexity. FEATURED CUSTOMERS
  • 4. © 2022 ChaosSearch, Inc. The ChaosSearch Approach: Connect Seamlessly, Eliminate Complexity Constantly Scale and Reduce Cost 3 * This is a roadmap item and subject to change. BENEFICIAL OUTCOMES ✓ One unified data lake for analytics at scale ✓ Combined search, SQL insights (PLG, security, etc.) ✓ No data pipelines or data movement ✓ No schema management, sharding or managing server clusters ✓ Simple, open API access to analytics tools of choice ✓ Scale and performance for analytic workloads with up to 80% cost savings ✓ Unprecedented simplicity
  • 5. © 2022 ChaosSearch, Inc. Optimize Cloud Apps, Infrastructure and Security Optimize CloudOps/DevOps • Unlimited retention to optimize troubleshooting and performance of increasingly complex cloud architectures • Better log coverage to shorten time to resolution • Eliminate administrative toil, reduce operational costs 4 Optimize SecOps • Affordable long-term retention for in-depth forensics • Centralize logs in a security data lake for end-to-end visibility and monitoring • Simpler, more cost-effective compliance
  • 6. “With the move to ChaosSearch, 98% of all operational burdens have been lifted from us, allowing us to focus on Blackboard-specific tasks.” Joel Snook, Director, DevOps Engineering
  • 7. © 2022 ChaosSearch, Inc. 6 Single Access Pane on Global Deployment Case Study Challenge Need to reduce Elasticsearch costs & improve stability • Architectural challenges at scale caused system downtime requiring support • Challenges with infrastructure planning due to unpredictable log spikes -- especially during COVID-19 when demand grew 3,000% • SREs spent 10-15+ hours per week maintaining environment Solution • GDPR-compliant ChaosSearch log analytics solution provides single access pane to storage across multiple environments • True managed service with 99.999% availability • Use cases: visibility of cloud environments at scale, long-term app troubleshooting & alerting, root cause analysis Impact • Increased uptime with decreased management overhead • Retention more than doubled at 50% cost • No more data duplication • Querying activities scale on demand without manual reconfigurations or cost increases • SREs spend time on value-add work vs. ELK stack maintenance Need access to long-term data for resolving academic disputes, compliance purposes, etc.
  • 8. © 2022 ChaosSearch, Inc. Before: Amazon ES Single Access Pane on Global Deployment © 2021 ChaosSearch, Inc. 7 Case Study • Multiple environments per region per team (3 teams) • Unstable environment with massive admin pain & multiple silos with failure points (master nodes) • $2M per year • 1-3 days retention AZ2 AZ1 … … ca-central-1 AZ2 AZ1 … … us-east-2 AZ2 AZ1 … … ap-northeast-2 AZ2 AZ1 … … eu-central-1 AZ2 AZ1 … … AZ2 AZ1 … Master Master Master Data Node Data Node Data Node Data Node Master Master Master Data Node Data Node Data Node Data Node Master Master Master Data Node Data Node Data Node Data Node Master Master Master Data Node Data Node Data Node Data Node Master Master Master Data Node Data Node Data Node Data Node Master Master Master Data Node Data Node Data Node Data Node … ap-southeast-2 ap-southeast-1 With ChaosSearch Single access pane, multiple environments for storage & access isolation (e.g., GDPR) $950k per year (50%+ discount) True managed service with 5 9s availability 7 days retention (could be unlimited)
  • 9. © 2022 ChaosSearch, Inc. A Question for Our Audience Today… 8 What are your greatest challenges when it comes to deriving insight from all your data? • Expertise • Resources • Technology • Time • Cost • Other
  • 10. Thank you! Learn more ● ChaosSearch.io ● Free Trial - chaossearch.io/trial ● Demo Webinar - chaossearch.io/demo-webinar ● 2022 Cloud Data & Analytics Survey Report
  • 11. © 2022 ChaosSearch, Inc. 10 Connect to any and all data in your cloud object storage Index into highly compressed, unified representation of data that never leaves your storage Prepare your data views for discovery and analytics with no data movement Use your existing tools for • Log analytics • Operational BI queries • Dashboards • Monitoring and alerting ChaosSearch is Built for Operational Analytics at Scale Index 2 Refine 3 Analyze 4 Store 1
  • 12. © 2022 ChaosSearch, Inc. Log Analytics Transformed Before: Elasticsearch (ELK stack) DevO ps SecO ps LOB ??? • Limited retention • Expensive to scale • Management and configuration challenges • Downtime created by instability at scale • Multiple data silos created due to the limits above Cloud Object Storage i.e., Google GCS, AWS S3 Dev Ops Sec Ops LOB ??? PUBLISHED ELASTIC API One unified data lake Unlimited scale and retention. Save up to 80% on Managed Service with 99.99% uptime. With ChaosSearch 11
  • 14. © 2022 ChaosSearch, Inc. 13 Partially In-Situ architectures do work and add incremental value. But fall short of a true breakthrough. In-Situ INFORMATION In-Situ PROCESSING • Underlying representation of data is native to use case • Multi-model access • Distributed processing • Leveraging the power of cloud computing elasticity In-Situ ALGORITHM • Low memory execution • Stream based operations A fully In-Situ Architecture is needed to unlock the full spectrum of In-Situ advantages
  • 15. A Unified Data Lake Architecture for Log and SQL Analytics 14 ChaosSearch uniquely solves for known and unknown data and queries * Source: Gartner. DW = data warehouse The Zone of Confusion Within the Data and Analytics Infrastructure Model Expanding Understanding and Investigating Founational Core Innovation and Exploration Establish Value Traditional DW Data Lake Zone of Confusion Questions Known Unknown Data Unknown Known Bring together search and relational analytics • Eliminate pipelines, ETL, data movements • Faster insights A unified data lake architecture that supports: • Innovation & exploration • Investigative queries • Operational analytics Disrupts the economics of Big Data
  • 16. A Unified Data Lake Architecture for Log and SQL Analytics 15 ChaosSearch uniquely solves for broadest scope of analytics needs Bring together search and relational analytics • Eliminate pipelines, ETL, data movements • Faster insights A unified data lake architecture that supports: • Innovation & exploration • Investigative queries • Operational analytics Disrupts the economics of Big Data * Source: Gartner. DW = data warehouse The Zone of Confusion Within the Data and Analytics Infrastructure Model Expanding Understanding and Investigating Founational Core Innovation and Exploration Establish Value Traditional DW Data Lake Questions Known Unknown Data Unknown Known
  • 17. William McKnight President, McKnight Consulting Group • Consulted to Pfizer, Scotiabank, Fidelity, TD Ameritrade, Teva Pharmaceuticals, Verizon, and many other Global 1000 companies • Frequent keynote speaker and trainer internationally • Hundreds of articles, blogs and white papers in publication • Focused on delivering business value and solving business problems utilizing proven, streamlined approaches to information management • Former Database Engineer, Fortune 50 Information Technology executive and Ernst&Young Entrepreneur of Year Finalist • Owner/consultant: Data strategy and implementation consulting firm William McKnight The Savvy Manager’s Guide The Savvy Manager’s Guide Information Management Information Management Strategies for Gaining a Competitive Advantage with Data 2
  • 18. 3 All Data Under Management Best Practice: Get all data under management
  • 19. Data is Under Management when it is… • In a leveragable platform • In an appropriate platform for its profile and usage • With high non-functionals (Availability, performance, scalability, stability, durability, secure) • Data is captured at the most granular level • Data is at a data quality standard (as defined by Data Governance) • Enables self-service 4 Best Practice: Enpower everyone with true self-service
  • 20. “80% of analysts’ time is spent simply discovering and preparing data.” What’s Your Data Strategy, Thomas Davenport, HBR 2017 Best Practice: Start getting concerned with the tools and processes of the analyst
  • 21. The Relational Database Data Page © McKnight Consulting Group, 2010 Page Header Page Footer Row IDs Records 1120MCG William McKnight President 214-514-1444 wmcknight@mcknightcg.com 1121Stolt Offshore MS Ltd Joe Tyron Director 226-5555- 1269 jtyron@stoltoffshore.com 1122Medtronic, Inc. Mark Smith Principle Database Administrator 763-555-2557 mark.smith@medtronic.com
  • 22. Columnar Orientation 7 Best Practice: Make all analytic structure columnar
  • 23. Data Lakes • P 8 Parquet format Best Practice: Put big data in data lakes Best Practice: Index the data lake
  • 24. Data Lakes • Common & centralized storage for the enterprise • No defined data model into which the data is formed • No relationships between the datasets • Historical data retention • All data formats • For big data • Analytical processing • Data scientists and analysts • Less governance/quality than data warehouse – Focus: Ingestion 9
  • 25. Graph Databases Bridge vertex Bridge vertex 10 • Subject: John R Peterson Predicate: Knows Object: Frank T Smith • Subject: Triple #1 Predicate: Confidence Percent Object: 70 • Subject: Triple #1 Predicate: Provenance Object: Mary L Jones Best Practice: Use graph databases for sizable connected data
  • 26. Data Virtualization “The right answer is not always to centralize the data. Data Virtualization will be of utmost importance as the ‘perpetual short-term’ solution to the need.” 11 Data Warehouses Marts & Cubes Operational Data Stores Transactional Sources File Systems Big Data Enterprise Data Virtualization Best Practice: Enable data virtualization for edge and temporary needs
  • 27. Enterprise Analytic Stack • Dedicated Compute • Storage • Data Integration • Streaming • Analytics • Data Exploration • Data Lake • Business Intelligence • Machine Learning • Identity Management • Data Catalog • Data Virtualization Best Practice: Leverage best of breed for your analytics stack
  • 28. • Autonomous Administration • Lack of Platform Features Leads to Increased Configuration and Management – stored procedures, referential integrity and uniqueness capabilities – mission critical options for backup and disaster recovery, which typically includes a standby database – full ANSI-SQL compliance • Performance Total Cost of Ownership is More Than Just Cloud Costs Best Practice: Get a strong handle on your cloud costs
  • 29. Capabilities for Data Integration for Enterprise Data • Comprehensive Native Connectivity • Multi-Latency Data Ingestion • Data Integration (in ETL, ELT, Streaming) • Data Quality and Data Governance • Data Cataloging and Metadata Management • Enterprise Trust, Enterprise Scale (or Class) • AI Intelligence and Automation • Ecosystem and Multi-cloud
  • 30. Data Integration Options Project Technical Environments Recommended For Consideration Project Scope Heterogenous: Cloudera Any Any IBM Any Any Informatica Any Any Talend Any Any Specialist: AWS (Glue) Environments on AWS with core of Redshift, EMR Any Azure (Azure Data Factory) Environments on Azure with core of Synapse, HDInsight Any FiveTran Any Contained scope Google Environments on GCP with core of BQ, DataProc Any Matillion Any Contained scope Oracle Environments with Oracle database Any SAP SAP-only environments SAP projects Best Practice: Use fit-for- purpose data integration
  • 32. Architecture Component Needs • Security and Privacy • Governance and Compliance • Availability • Backup and Recovery • Performance • Scalability • Licensing 17
  • 33. Analytics Reference Architecture Logs (Apps, Web, Devices) User tracking Operational Metrics Offload data Raw Data Topics JSON, AVRO Processed Data Topics Sensors and / or Transactiona l/ Context Data OLTP/ODS ETL Or EL with T in Spark Batch Low Latency Applications Files In- database analytics Reach through or ETL/ELT or Stream Processing or Stream Processing Q Q Data Warehouse Data Lake
  • 35. Data Mesh Logs (Apps, Web, Devices) User tracking Operational Metrics Offload data Sensors and / or Transactiona l/ Context Data OLTP/ODS Batch Low Latency Applications Files In- database analytics Reach through or ETL/ELT or Stream Processing or Stream Processing Q Q Data Warehouse Data Lake ETL Or EL with T in Spark Raw Data Topics JSON, AVRO Processed Data Topics
  • 36. Data Fabric Logs (Apps, Web, Devices) User tracking Operational Metrics Raw Data Topics JSON, AVRO Processed Data Topics Sensors and / or Transactiona l/ Context Data OLTP/ODS ETL Or EL with T in Spark Batch Low Latency Applications Files In- database analytics or Stream Processing Data Warehouse Data Lake
  • 37. Best Practice: Pursue mesh and fabric architectures to the degree possible
  • 38. Data Cloud (Snowflake) Logs (Apps, Web, Devices) User tracking Operational Metrics Raw Data Topics JSON, AVRO Processed Data Topics Sensors and / or Transactiona l/ Context Data OLTP/ODS ETL Or EL with T in Spark Batch Low Latency Applications Files In- database analytics or Stream Processing Data Warehouse Data Lake
  • 39. Summary • Get all enterprise data under management • RDBMS (/columnar), Cloud Storage/Parquet, Graph cover most analytic platform needs • Cost of ownership is more than the cloud costs • Data Integration is vital to Data Architecture for Modern Analytics • The Data Mesh and Data Fabric are decentralizing the architecture 24
  • 40. Upcoming Topics • Is Our Information Management Mature? • The Future based on AI & Analytics • Organizational Change Management for Data & Analytics Driven Projects • Graph Database Use Cases • Assessing New Databases: Translytical Use Cases 25 Second Thursday of Every Month, at 2:00 ET
  • 41. Data Architecture Best Practices for Advanced Analytics Presented by: William McKnight “#1 Global Influencer in Big Data” Thinkers360 President, McKnight Consulting Group A 2 time Inc. 5000 Company @williammcknight www.mcknightcg.com (214) 514-1444 Second Thursday of Every Month, at 2:00 ET #AdvAnalytics