SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Downloaden Sie, um offline zu lesen
DEEP DIVE
Building a Streaming-Enabled Architecture
www.dmradio.biz
Featured Speakers
 Hardware (network, storage, servers)
 Data Sources
 Data Staging
 Data Volumes
 Data Flow
 Data Governance
 Data Usage
 Data Structures
 Schema Definition
 Ingest Speeds
 Data Workloads
Everything Is In Flux
The Impact of Parallelism
We used to see 10x performance
improvement every 6 years, now we
see 1000x (and that’s just an
approximation) regularly
You Can’t Build Skyscrapers with Bricks & Mortar
A Renaissance in Data Engineering Is Underway
- Web giants innovated to solve their own challenges
- Facebook, Google, LinkedIn, Yahoo! and others…
- By open-sourcing software, these companies
changed how the industry operates, how tech is built
- The result is a new world of scale-out software
- Innovations span the spectrum of functionality:
database, analytics, networking, data flow, security
- Paramount among these in terms of significance is
the world of streaming data and supporting tools
Competition Demands Digital Transformation
Adopted by the EU, but affects the USA
Behemoths are straws in the wind
- They identified huge opportunities
- Upended entire industries
- Built bulletproof infrastructure
- Deconstructed business processes
- Re-architected processes at scale
- Instilled a data-driven approach
The Great Democratizer
Stream First Architecture
Technical Concepts
Streaming as Primary Method
• Must do it: at or near real-time, partial updates with low cycle time
• Events: financial fraud, stock trading, high frequency sensor to controller
(autonomous vehicles)
• Many different items co-mingled (Internet of Things)
• Older examples: Internet packets; digital sensors; computer high density disks
• Boosts business: faster awareness leads to faster response leads to
improved business (consumer activity monitoring)
• Add flexibility and lower Total Cost Ownership (if x and if y and if z)
• Avoid committee-itis
• Faster process-analyze-change cycles
• Allow personnel to address more topics in same time period
Processing
Logic
Processing
Logic
Processing
Logic
Processing
Logic
Processing
Logic
Processing
Logic
End User
End User
End UserAccumulator
Streams parallel processing multiple inputs. Streaming means processing occurs as data flows through.
Straight through or accumulate
• Timing
• Iterative
• Correlation
• Referencing
• Coding
• Real-time
• Near real-
time
• Non-time
Technical Issues: High Level
• Latency: built in delay time caused by many factors. Are you able and
willing to invest in finding and removing these items? Can be
significant cost to do so. Inherent latency in your use case.
• Order misalignment: data arrives or produced out of order
• Errors: detect and correct (don’t underestimate this). Embedded data
quality problems. Processing logic flaws. How to surgically update.
• Power usage: Higher power use per compute period (mobile)
• Storage space: overhead for parallel and streaming look ahead, look
back requires multiple copies. Content mgmt. procedures become
very important.
Technical Issues: Low Level
• Requires deep expertise (hard to find usually)
• Memory: must contain all data needed per calculation including
lookup codes, cross-references, accumulation arrays
• Memory mgmt. is usually weak in high level languages like Java, C#
• Cross-stream data exchange: how to truly separate data for full
independence (age old parallel computing challenge) or how to store
and forward across streams (what is needed, how much, how long)
• Dependencies: state mgmt., server health
• Latency: network slow down, security hand shakes (e.g. TLS),
database freezes, file contention, cluster IO
Buyer Advice: Look for the Special Things
• In memory: of course but especially for engineering of known state-
of-art problems like heap space, garbage collection, swapping
• Integration: both done for you (framework) and well documented
interface APIs using standard computer languages
• Error reporting: this is critical. Your personnel cost will go up
significantly without this because you will (really!!) experience many
crashes. Need meaningful error messages pointing to actual problem.
• Demonstrates knowledge and work in next generation: expanded
memory spaces; integrated memory-compute.
• Shows end-to-end more complicated use cases than yours. (Since
yours will become more complicated than you know quickly).
September 2018
Kevin Petrie
Sr Director, Product Marketing
ADOPTING A STREAMING-
ENABLED ARCHITECTURE
ATTUNITY
2© 2018 Attunity 2© 2017 Attunity
CLOUDSTREAMINGDATA LAKES
ON PREMDATA WAREHOUSE BATCH
MODERN DATA ARCHITECTURES
3© 2018 Attunity 3© 2017 Attunity
#1 provider of change
data capture (CDC)
Support most sources with
best performance and
least impact
#1 cloud database
migration technology
Already moved over
80,000 databases
to Cloud platforms
like AWS
#1 in ease of use
Designed to accelerate
deployments by data
architects and
DBAs instead of
developers
The leading platform for delivering data efficiently and in
real-time to data lake, streaming and cloud architectures
ATTUNITY
4© 2018 Attunity 4© 2017 Attunity
Automated, universal and
real-time data delivery
Accelerate creation of analytics-
ready data structures
ATTUNITY REPLICATE ATTUNITY COMPOSE
ATTUNITY ENTERPRISE MANAGER
Intelligent management, metadata & control
THE ATTUNITY PLATFORM
5© 2018 Attunity 5© 2017 Attunity
Pre-packaged
automation of complex
tasks
Modern user experience
Zero source footprint
Change data
capture (CDC)
Stream live updates
Optimized for high-
performance movement
All major platforms
DB | DW | Hadoop |
Legacy
On Premises | Cloud
SAP | Mainframe
Simplified Real-Time Universal
DATA DELIVERY WITH ATTUNITY REPLICATE
6© 2018 Attunity
Configure, execute and
monitor data flows
Multiple data centers
On premises and cloud
Scale to 1,000s of tasks
OPERATIONS ANALYTICS CONTROL
Historical and real-time
reporting
Capacity planning
Performance monitoring
Visually optimize operations
Create and discover
operational metadata
Microservices integration
via .NET and REST APIs
ATTUNITY ENTERPRISE MANAGER
7© 2018 Attunity 7© 2017 Attunity
WHAT IT IS
ESSENTIAL CHARACTERISTICS OF MODERN DATA STREAMING
Producers and consumers are independent
Communication is asynchronous
Records are persisted for future use
High throughput – e.g., records/second
Records are sent in re-playable, ordered sequence
Geo-distributed replication
Fault tolerance
8© 2018 Attunity 8© 2017 Attunity
WHAT IT IS
APACHE KAFKA
Open source distributed streaming platform for moving, storing and processing
high volumes of data in real time
Developed by Jay Kreps and colleagues to process many continuous data flows at
LinkedIn
More scalable, more fault-tolerant and higher-performance than traditional
message-oriented middleware
Used for building real-time streaming data pipelines and streaming applications
9© 2018 Attunity 9© 2017 Attunity
WHAT IT IS
KAFKA VS. ENTERPRISE SERVICE BUS PREDECESSORS
ActiveMQ, RabbitMQ and IBM MQSeries
Centralized, highly scalable cluster to serve all
applications across large enterprise environments
Persistent storage system for configurable time periods,
including forever
Abstracted stream processing to easily create derived
streams and datasets with minimal coding
10© 2018 Attunity 10© 2017 Attunity
WHY DATA STREAMING MATTERS
“LIFE DOESN’T HAPPEN IN BATCHES”
Data streams enable businesses to react to
events as they happen
Streaming data improves efficiency and
scalability
Can be used for multiple purposes, by
multiple users
11© 2018 Attunity 11© 2017 Attunity
HOW IT WORKS
HIGH LEVEL ARCHITECTURE
Producers send records to brokers to be
read by consumers
Broker persists records to file system on
disk for subsequent usage
Records are grouped into topics for
selected consumer use
Topics can be partitioned to improve
throughput and redundancy
CONSUMER
KAFKA
BROKER
CONSUMER
CONSUMER
PRODUCER
PRODUCER
PRODUCER
DISK
12© 2018 Attunity 12© 2017 Attunity
HOW IT WORKS
KEY COMPONENTS
(a.k.a. message): unit of data, similar to DB row or record
(a.k.a. publisher/writer): process that creates and publishes records
(a.k.a. subscriber/reader): process that reads records
Kafka instance that receives records from producers, persists, provides them to consumers
Group of two or more brokers that provide redundancy and scalability
(a.k.a. stream): category of record to which a consumer subscribes
Subset of topic created to enable redundancy, and parallel reading/writing for higher
performance
RECORD
PRODUCER
CONSUMER
BROKER
CLUSTER
TOPIC
PARTITION
13© 2018 Attunity 13© 2017 Attunity
HOW IT WORKS
DETAILED ARCHITECTURE
Key
TOPIC(S)
Data
0111
1110
PRODUCER(S)
Creates
records with
serialized key
and value
Identifies
location of
write partition
for topic
RECORD
BROKER
0111
1110 0111
1110
Partition
Leader
RECORD
0111
1110
CONSUMER(S)
Deserializes
record to
create
original key
and value
Key
Data
0111
1110
BROKER
0111
1110
Partition
Replica
TOPIC(S)
*Adapted from Enabling Streaming Architectures for Continuous Data and Events with Kafka; Gartner; TJ Craig, Gary Oliffe, Soyeb Barot; 23 May 2018
14© 2018 Attunity 14© 2017 Attunity
HOW IT WORKS
DETAILED ARCHITECTURE
Key
TOPIC(S)
Data
0111
1110
PRODUCER(S)
Creates
records with
serialized key
and value
Identifies
location of
write partition
for topic
RECORD
BROKER
0111
1110 0111
1110
Partition
Leader
RECORD
0111
1110
CONSUMER(S)
Deserializes
record to
create
original key
and value
Key
Data
0111
1110
TOPIC(S)
CHANGE DATA CAPTURE
Eliminates manual scripting to
configure record creation from
source database transactions
*Adapted from Enabling Streaming Architectures for Continuous Data and Events with Kafka; Gartner; TJ Craig, Gary Oliffe, Soyeb Barot; 23 May 2018
BROKER
0111
1110
Partition
Replica
15© 2018 Attunity 15© 2017 Attunity
Generate real-time events
Multi topic, multi partition
One-to-many event
publication
Schema evolution; easy
schema registry integration
DATABASE AS A STREAM UNIVERSAL STREAMING METADATA
Amazon KinesisAzure Event Hub
ATTUNITY AND DATA STREAMING
16© 2018 Attunity 16© 2017 Attunity
USE CASES
STREAMING
INGESTION
MESSAGE/EVENT
BROKER
PREPROCESSING
FOR MACHINE
LEARNING
EVENT STREAM
PROCESSING
DATA PERSISTENCE
REAL-TIME
ANALYTIC
PROCESSING
MICROSERVICES
17© 2018 Attunity 17© 2017 Attunity
KAFKA
STREAMING INGESTION AND MESSAGE BROKER
FORTUNE 100 FOOD PROCESSOR
CDC to HDP data lake
Attunity Replicate feeds HDFS, HBase for
timely reporting and product delivery
Needed real-time view of production
capacity and customer orders
Nightly batch loads couldn’t keep up
=> Fulfilment delays, inaccurate reports
PROBLEM SOLUTION
ATTUNITY
REPLICATE
Log based
CDC
SAP ECC
10 tables
(purchase orders,
production plans)
HDP DATA LAKE
18© 2018 Attunity 18© 2017 Attunity
Copies live transactions without
touching production
Securely transfers them for
client usage on global AWS
microservices platform
Need to efficiently roll out extensive
cloud-based microservices platform
Must minimize latency and security risk
while synchronizing massive
transactional updates globally
PROBLEM SOLUTION
$1
Trillion
ATTUNITY
REPLICATEDB2 z/OS
(on prem)
KINESIS DYNAMO DB
MICROSERVICE
HUB ON RDS
DYNAMO DB
STREAMS
RDS
EMEA
CUSTOMER
APJ
CUSTOMER
AWS CLOUD
CDC
MICROSERVICES
LEADING ASSET MANAGEMENT FIRM
Assetsunder
Management
19© 2018 Attunity 19© 2017 Attunity
EVENT STREAM PROCESSING
FORTUNE 100 PHARMACEUTICAL FIRM
CDC to Kafka to Lambda Architecture
Multi-pronged analysis of clinical data at
scale
Minimal administrative burden; no PROD
impact
Needed efficient, scalable delivery of
clinical data for analytics
Lacked tools for low-impact data capture
PROBLEM SOLUTION
ATTUNITY
REPLICATE
Log based
CDC
Lambda
Architecture
Clinical
Systems
Structured Analysis
Clinical
Systems
Batch Historical
Data
Real Time
Updates
Stream
Processing
Graph Analysis
Natural Language
Processing
Machine Learning
KAFKA
20© 2018 Attunity 20© 2017 Attunity
Improved ease of use
Consistent, 100% automated processes
across end points
Reduced impact on MF production
Improved performance to Kafka
Better TCO
Needed 360 degree customer view for
CSAT initiatives
Inefficient OGG solution
Lack of data consistency and
standardization
PROBLEM ATTUNITY SOLUTION
Fortune 100
company
$100
BILLION
STREAMING INGESTION
FORTUNE 100 HEALTH BENEFITS FIRM
DB2 Z/OS
SQL SERVER
ORACLE
KAFKA
ATTUNITY
REPLICATE
Log based
CDC
HDP DATA LAKE
21© 2018 Attunity
MACHINE LEARNING PRE-PROCESSING
LEADING PAYMENT PROCESSOR
BANK/
MERCHANT
GATEWAY
APPLICATION AND
DATABASE
OPENSCORING.IO
DECISION
SERVICE ENGINE
Credit Check,
Authentication
Decision < 100 MS
CHANGE DATA
CAPTURE
A B
Transactions
Decisions
logged real
time
MACHINE
LEARNING
Data analyzed
over days
Decision
models
published
with new
insights
Data
delivered
in seconds
Data delivered
in minutes
Decision
performance
monitored
Real time decisions
Decision measurement
and tuning
C
D
E
F
A
22© 2018 Attunity
Overall
Rating
Product
Capabilities
Ease of
Deployment
4.5 out of 5 4.4 4.2
4.1 out of 5 4.2 3.7
4.2 out of 5 4.3 4.0
4.1 out of 5 4.3 4.0
VENDOR OF CHOICE
Replicate has been working great for
several years, implementation was a
breeze
- DBA, Retail Industry
Great vendor to work with and an
incredibly easy tool to use
- Senior Member of Technical Staff,
Communications Industry
23© 2018 Attunity 23© 2017 Attunity
Trusted by Microsoft
with 3 OEMs,
bundled inside
SQL Server
Trusted by Amazon
(AWS) with strategic
partnership for cloud
database migration
Trusted by IBM and
Oracle with respective
OEMs of Attunity
technology
Trusted by Teradata
and HP as resellers for
data warehouse and
analytics
Trusted by
global system
integrators
Trusted by over
2000 customers for
commitment, flexibility
and speed
2000+
Trusted by SAP as
certified solution in use
with over 200 SAP
customers
Trusted by big data
leaders for data lake
solutions
Trusted by IBM and
Oracle with respective
OEMs of Attunity
technology
Trusted by Teradata
and HP as resellers for
data warehouse and
analytics
PARTNER OF CHOICE
24© 2018 Attunity
FINANCIAL
SERVICES
MANUFACTURING/
INDUSTRIAL
HEALTH
CARE
GOVERNMENT TECHNOLOGY /
TELECOM
RETAIL OTHER
INDUSTRIES
2000 CUSTOMERS AND HALF THE FORTUNE 100
Thank you
attunity.com
Kevin Petrie
Kevin.Petrie@Attunity.com

Weitere ähnliche Inhalte

Was ist angesagt?

The Need to Know for Information Architects: Big Data to Big Information
The Need to Know for Information Architects: Big Data to Big InformationThe Need to Know for Information Architects: Big Data to Big Information
The Need to Know for Information Architects: Big Data to Big Information
DATAVERSITY
 

Was ist angesagt? (20)

RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data GovernanceRWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
The Need to Know for Information Architects: Big Data to Big Information
The Need to Know for Information Architects: Big Data to Big InformationThe Need to Know for Information Architects: Big Data to Big Information
The Need to Know for Information Architects: Big Data to Big Information
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
 
IDERA Slides: Managing Complex Data Environments
IDERA Slides: Managing Complex Data EnvironmentsIDERA Slides: Managing Complex Data Environments
IDERA Slides: Managing Complex Data Environments
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Data Prep - A Key Ingredient for Cloud-based Analytics
Data Prep - A Key Ingredient for Cloud-based AnalyticsData Prep - A Key Ingredient for Cloud-based Analytics
Data Prep - A Key Ingredient for Cloud-based Analytics
 
Everybody is a Data Steward – Get Over It!
Everybody is a Data Steward – Get Over It!Everybody is a Data Steward – Get Over It!
Everybody is a Data Steward – Get Over It!
 
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL Code
 
RWDG Slides: Data Governance and Three Levels of Metadata Management
RWDG Slides: Data Governance and Three Levels of Metadata ManagementRWDG Slides: Data Governance and Three Levels of Metadata Management
RWDG Slides: Data Governance and Three Levels of Metadata Management
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling Fundamentals
 
Big Challenges in Data Modeling: Modeling Metadata
Big Challenges in Data Modeling: Modeling MetadataBig Challenges in Data Modeling: Modeling Metadata
Big Challenges in Data Modeling: Modeling Metadata
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data Modeling
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
 
Metadata Governance for Vocabularies, Dictionaries, and Data
Metadata Governance for Vocabularies, Dictionaries, and DataMetadata Governance for Vocabularies, Dictionaries, and Data
Metadata Governance for Vocabularies, Dictionaries, and Data
 
Data-Ed: Essential Metadata Strategies
Data-Ed: Essential Metadata StrategiesData-Ed: Essential Metadata Strategies
Data-Ed: Essential Metadata Strategies
 
Approaching Data Quality
Approaching Data QualityApproaching Data Quality
Approaching Data Quality
 

Ähnlich wie DM Radio Webinar: Adopting a Streaming-Enabled Architecture

TidalScale Overview
TidalScale OverviewTidalScale Overview
TidalScale Overview
Pete Jarvis
 
Presentation architecting virtualized infrastructure for big data
Presentation   architecting virtualized infrastructure for big dataPresentation   architecting virtualized infrastructure for big data
Presentation architecting virtualized infrastructure for big data
solarisyourep
 
Architecting virtualized infrastructure for big data presentation
Architecting virtualized infrastructure for big data presentationArchitecting virtualized infrastructure for big data presentation
Architecting virtualized infrastructure for big data presentation
Vlad Ponomarev
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 

Ähnlich wie DM Radio Webinar: Adopting a Streaming-Enabled Architecture (20)

How to Radically Simplify Your Business Data Management
How to Radically Simplify Your Business Data ManagementHow to Radically Simplify Your Business Data Management
How to Radically Simplify Your Business Data Management
 
Solving enterprise challenges through scale out storage &amp; big compute final
Solving enterprise challenges through scale out storage &amp; big compute finalSolving enterprise challenges through scale out storage &amp; big compute final
Solving enterprise challenges through scale out storage &amp; big compute final
 
(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...
(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...
(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...
 
TidalScale Overview
TidalScale OverviewTidalScale Overview
TidalScale Overview
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
 
Presentation architecting virtualized infrastructure for big data
Presentation   architecting virtualized infrastructure for big dataPresentation   architecting virtualized infrastructure for big data
Presentation architecting virtualized infrastructure for big data
 
Presentation architecting virtualized infrastructure for big data
Presentation   architecting virtualized infrastructure for big dataPresentation   architecting virtualized infrastructure for big data
Presentation architecting virtualized infrastructure for big data
 
SaaS Enablement of your existing application (Cloud Slam 2010)
SaaS Enablement of your existing application (Cloud Slam 2010)SaaS Enablement of your existing application (Cloud Slam 2010)
SaaS Enablement of your existing application (Cloud Slam 2010)
 
Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...
 
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Evolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in MotionEvolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in Motion
 
Cloud Computing for Small & Medium Businesses
Cloud Computing for Small & Medium BusinessesCloud Computing for Small & Medium Businesses
Cloud Computing for Small & Medium Businesses
 
Architecting virtualized infrastructure for big data presentation
Architecting virtualized infrastructure for big data presentationArchitecting virtualized infrastructure for big data presentation
Architecting virtualized infrastructure for big data presentation
 
Infrastructure Strategies 2007
Infrastructure Strategies 2007Infrastructure Strategies 2007
Infrastructure Strategies 2007
 
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
 
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storage
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 

Mehr von DATAVERSITY

The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 

Mehr von DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Kürzlich hochgeladen

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 

Kürzlich hochgeladen (20)

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 

DM Radio Webinar: Adopting a Streaming-Enabled Architecture

  • 1. DEEP DIVE Building a Streaming-Enabled Architecture www.dmradio.biz
  • 3.  Hardware (network, storage, servers)  Data Sources  Data Staging  Data Volumes  Data Flow  Data Governance  Data Usage  Data Structures  Schema Definition  Ingest Speeds  Data Workloads Everything Is In Flux
  • 4. The Impact of Parallelism We used to see 10x performance improvement every 6 years, now we see 1000x (and that’s just an approximation) regularly
  • 5. You Can’t Build Skyscrapers with Bricks & Mortar
  • 6. A Renaissance in Data Engineering Is Underway - Web giants innovated to solve their own challenges - Facebook, Google, LinkedIn, Yahoo! and others… - By open-sourcing software, these companies changed how the industry operates, how tech is built - The result is a new world of scale-out software - Innovations span the spectrum of functionality: database, analytics, networking, data flow, security - Paramount among these in terms of significance is the world of streaming data and supporting tools
  • 7.
  • 8. Competition Demands Digital Transformation Adopted by the EU, but affects the USA Behemoths are straws in the wind - They identified huge opportunities - Upended entire industries - Built bulletproof infrastructure - Deconstructed business processes - Re-architected processes at scale - Instilled a data-driven approach
  • 11. Streaming as Primary Method • Must do it: at or near real-time, partial updates with low cycle time • Events: financial fraud, stock trading, high frequency sensor to controller (autonomous vehicles) • Many different items co-mingled (Internet of Things) • Older examples: Internet packets; digital sensors; computer high density disks • Boosts business: faster awareness leads to faster response leads to improved business (consumer activity monitoring) • Add flexibility and lower Total Cost Ownership (if x and if y and if z) • Avoid committee-itis • Faster process-analyze-change cycles • Allow personnel to address more topics in same time period
  • 12. Processing Logic Processing Logic Processing Logic Processing Logic Processing Logic Processing Logic End User End User End UserAccumulator Streams parallel processing multiple inputs. Streaming means processing occurs as data flows through. Straight through or accumulate • Timing • Iterative • Correlation • Referencing • Coding • Real-time • Near real- time • Non-time
  • 13. Technical Issues: High Level • Latency: built in delay time caused by many factors. Are you able and willing to invest in finding and removing these items? Can be significant cost to do so. Inherent latency in your use case. • Order misalignment: data arrives or produced out of order • Errors: detect and correct (don’t underestimate this). Embedded data quality problems. Processing logic flaws. How to surgically update. • Power usage: Higher power use per compute period (mobile) • Storage space: overhead for parallel and streaming look ahead, look back requires multiple copies. Content mgmt. procedures become very important.
  • 14.
  • 15. Technical Issues: Low Level • Requires deep expertise (hard to find usually) • Memory: must contain all data needed per calculation including lookup codes, cross-references, accumulation arrays • Memory mgmt. is usually weak in high level languages like Java, C# • Cross-stream data exchange: how to truly separate data for full independence (age old parallel computing challenge) or how to store and forward across streams (what is needed, how much, how long) • Dependencies: state mgmt., server health • Latency: network slow down, security hand shakes (e.g. TLS), database freezes, file contention, cluster IO
  • 16. Buyer Advice: Look for the Special Things • In memory: of course but especially for engineering of known state- of-art problems like heap space, garbage collection, swapping • Integration: both done for you (framework) and well documented interface APIs using standard computer languages • Error reporting: this is critical. Your personnel cost will go up significantly without this because you will (really!!) experience many crashes. Need meaningful error messages pointing to actual problem. • Demonstrates knowledge and work in next generation: expanded memory spaces; integrated memory-compute. • Shows end-to-end more complicated use cases than yours. (Since yours will become more complicated than you know quickly).
  • 17.
  • 18. September 2018 Kevin Petrie Sr Director, Product Marketing ADOPTING A STREAMING- ENABLED ARCHITECTURE ATTUNITY
  • 19. 2© 2018 Attunity 2© 2017 Attunity CLOUDSTREAMINGDATA LAKES ON PREMDATA WAREHOUSE BATCH MODERN DATA ARCHITECTURES
  • 20. 3© 2018 Attunity 3© 2017 Attunity #1 provider of change data capture (CDC) Support most sources with best performance and least impact #1 cloud database migration technology Already moved over 80,000 databases to Cloud platforms like AWS #1 in ease of use Designed to accelerate deployments by data architects and DBAs instead of developers The leading platform for delivering data efficiently and in real-time to data lake, streaming and cloud architectures ATTUNITY
  • 21. 4© 2018 Attunity 4© 2017 Attunity Automated, universal and real-time data delivery Accelerate creation of analytics- ready data structures ATTUNITY REPLICATE ATTUNITY COMPOSE ATTUNITY ENTERPRISE MANAGER Intelligent management, metadata & control THE ATTUNITY PLATFORM
  • 22. 5© 2018 Attunity 5© 2017 Attunity Pre-packaged automation of complex tasks Modern user experience Zero source footprint Change data capture (CDC) Stream live updates Optimized for high- performance movement All major platforms DB | DW | Hadoop | Legacy On Premises | Cloud SAP | Mainframe Simplified Real-Time Universal DATA DELIVERY WITH ATTUNITY REPLICATE
  • 23. 6© 2018 Attunity Configure, execute and monitor data flows Multiple data centers On premises and cloud Scale to 1,000s of tasks OPERATIONS ANALYTICS CONTROL Historical and real-time reporting Capacity planning Performance monitoring Visually optimize operations Create and discover operational metadata Microservices integration via .NET and REST APIs ATTUNITY ENTERPRISE MANAGER
  • 24. 7© 2018 Attunity 7© 2017 Attunity WHAT IT IS ESSENTIAL CHARACTERISTICS OF MODERN DATA STREAMING Producers and consumers are independent Communication is asynchronous Records are persisted for future use High throughput – e.g., records/second Records are sent in re-playable, ordered sequence Geo-distributed replication Fault tolerance
  • 25. 8© 2018 Attunity 8© 2017 Attunity WHAT IT IS APACHE KAFKA Open source distributed streaming platform for moving, storing and processing high volumes of data in real time Developed by Jay Kreps and colleagues to process many continuous data flows at LinkedIn More scalable, more fault-tolerant and higher-performance than traditional message-oriented middleware Used for building real-time streaming data pipelines and streaming applications
  • 26. 9© 2018 Attunity 9© 2017 Attunity WHAT IT IS KAFKA VS. ENTERPRISE SERVICE BUS PREDECESSORS ActiveMQ, RabbitMQ and IBM MQSeries Centralized, highly scalable cluster to serve all applications across large enterprise environments Persistent storage system for configurable time periods, including forever Abstracted stream processing to easily create derived streams and datasets with minimal coding
  • 27. 10© 2018 Attunity 10© 2017 Attunity WHY DATA STREAMING MATTERS “LIFE DOESN’T HAPPEN IN BATCHES” Data streams enable businesses to react to events as they happen Streaming data improves efficiency and scalability Can be used for multiple purposes, by multiple users
  • 28. 11© 2018 Attunity 11© 2017 Attunity HOW IT WORKS HIGH LEVEL ARCHITECTURE Producers send records to brokers to be read by consumers Broker persists records to file system on disk for subsequent usage Records are grouped into topics for selected consumer use Topics can be partitioned to improve throughput and redundancy CONSUMER KAFKA BROKER CONSUMER CONSUMER PRODUCER PRODUCER PRODUCER DISK
  • 29. 12© 2018 Attunity 12© 2017 Attunity HOW IT WORKS KEY COMPONENTS (a.k.a. message): unit of data, similar to DB row or record (a.k.a. publisher/writer): process that creates and publishes records (a.k.a. subscriber/reader): process that reads records Kafka instance that receives records from producers, persists, provides them to consumers Group of two or more brokers that provide redundancy and scalability (a.k.a. stream): category of record to which a consumer subscribes Subset of topic created to enable redundancy, and parallel reading/writing for higher performance RECORD PRODUCER CONSUMER BROKER CLUSTER TOPIC PARTITION
  • 30. 13© 2018 Attunity 13© 2017 Attunity HOW IT WORKS DETAILED ARCHITECTURE Key TOPIC(S) Data 0111 1110 PRODUCER(S) Creates records with serialized key and value Identifies location of write partition for topic RECORD BROKER 0111 1110 0111 1110 Partition Leader RECORD 0111 1110 CONSUMER(S) Deserializes record to create original key and value Key Data 0111 1110 BROKER 0111 1110 Partition Replica TOPIC(S) *Adapted from Enabling Streaming Architectures for Continuous Data and Events with Kafka; Gartner; TJ Craig, Gary Oliffe, Soyeb Barot; 23 May 2018
  • 31. 14© 2018 Attunity 14© 2017 Attunity HOW IT WORKS DETAILED ARCHITECTURE Key TOPIC(S) Data 0111 1110 PRODUCER(S) Creates records with serialized key and value Identifies location of write partition for topic RECORD BROKER 0111 1110 0111 1110 Partition Leader RECORD 0111 1110 CONSUMER(S) Deserializes record to create original key and value Key Data 0111 1110 TOPIC(S) CHANGE DATA CAPTURE Eliminates manual scripting to configure record creation from source database transactions *Adapted from Enabling Streaming Architectures for Continuous Data and Events with Kafka; Gartner; TJ Craig, Gary Oliffe, Soyeb Barot; 23 May 2018 BROKER 0111 1110 Partition Replica
  • 32. 15© 2018 Attunity 15© 2017 Attunity Generate real-time events Multi topic, multi partition One-to-many event publication Schema evolution; easy schema registry integration DATABASE AS A STREAM UNIVERSAL STREAMING METADATA Amazon KinesisAzure Event Hub ATTUNITY AND DATA STREAMING
  • 33. 16© 2018 Attunity 16© 2017 Attunity USE CASES STREAMING INGESTION MESSAGE/EVENT BROKER PREPROCESSING FOR MACHINE LEARNING EVENT STREAM PROCESSING DATA PERSISTENCE REAL-TIME ANALYTIC PROCESSING MICROSERVICES
  • 34. 17© 2018 Attunity 17© 2017 Attunity KAFKA STREAMING INGESTION AND MESSAGE BROKER FORTUNE 100 FOOD PROCESSOR CDC to HDP data lake Attunity Replicate feeds HDFS, HBase for timely reporting and product delivery Needed real-time view of production capacity and customer orders Nightly batch loads couldn’t keep up => Fulfilment delays, inaccurate reports PROBLEM SOLUTION ATTUNITY REPLICATE Log based CDC SAP ECC 10 tables (purchase orders, production plans) HDP DATA LAKE
  • 35. 18© 2018 Attunity 18© 2017 Attunity Copies live transactions without touching production Securely transfers them for client usage on global AWS microservices platform Need to efficiently roll out extensive cloud-based microservices platform Must minimize latency and security risk while synchronizing massive transactional updates globally PROBLEM SOLUTION $1 Trillion ATTUNITY REPLICATEDB2 z/OS (on prem) KINESIS DYNAMO DB MICROSERVICE HUB ON RDS DYNAMO DB STREAMS RDS EMEA CUSTOMER APJ CUSTOMER AWS CLOUD CDC MICROSERVICES LEADING ASSET MANAGEMENT FIRM Assetsunder Management
  • 36. 19© 2018 Attunity 19© 2017 Attunity EVENT STREAM PROCESSING FORTUNE 100 PHARMACEUTICAL FIRM CDC to Kafka to Lambda Architecture Multi-pronged analysis of clinical data at scale Minimal administrative burden; no PROD impact Needed efficient, scalable delivery of clinical data for analytics Lacked tools for low-impact data capture PROBLEM SOLUTION ATTUNITY REPLICATE Log based CDC Lambda Architecture Clinical Systems Structured Analysis Clinical Systems Batch Historical Data Real Time Updates Stream Processing Graph Analysis Natural Language Processing Machine Learning KAFKA
  • 37. 20© 2018 Attunity 20© 2017 Attunity Improved ease of use Consistent, 100% automated processes across end points Reduced impact on MF production Improved performance to Kafka Better TCO Needed 360 degree customer view for CSAT initiatives Inefficient OGG solution Lack of data consistency and standardization PROBLEM ATTUNITY SOLUTION Fortune 100 company $100 BILLION STREAMING INGESTION FORTUNE 100 HEALTH BENEFITS FIRM DB2 Z/OS SQL SERVER ORACLE KAFKA ATTUNITY REPLICATE Log based CDC HDP DATA LAKE
  • 38. 21© 2018 Attunity MACHINE LEARNING PRE-PROCESSING LEADING PAYMENT PROCESSOR BANK/ MERCHANT GATEWAY APPLICATION AND DATABASE OPENSCORING.IO DECISION SERVICE ENGINE Credit Check, Authentication Decision < 100 MS CHANGE DATA CAPTURE A B Transactions Decisions logged real time MACHINE LEARNING Data analyzed over days Decision models published with new insights Data delivered in seconds Data delivered in minutes Decision performance monitored Real time decisions Decision measurement and tuning C D E F A
  • 39. 22© 2018 Attunity Overall Rating Product Capabilities Ease of Deployment 4.5 out of 5 4.4 4.2 4.1 out of 5 4.2 3.7 4.2 out of 5 4.3 4.0 4.1 out of 5 4.3 4.0 VENDOR OF CHOICE Replicate has been working great for several years, implementation was a breeze - DBA, Retail Industry Great vendor to work with and an incredibly easy tool to use - Senior Member of Technical Staff, Communications Industry
  • 40. 23© 2018 Attunity 23© 2017 Attunity Trusted by Microsoft with 3 OEMs, bundled inside SQL Server Trusted by Amazon (AWS) with strategic partnership for cloud database migration Trusted by IBM and Oracle with respective OEMs of Attunity technology Trusted by Teradata and HP as resellers for data warehouse and analytics Trusted by global system integrators Trusted by over 2000 customers for commitment, flexibility and speed 2000+ Trusted by SAP as certified solution in use with over 200 SAP customers Trusted by big data leaders for data lake solutions Trusted by IBM and Oracle with respective OEMs of Attunity technology Trusted by Teradata and HP as resellers for data warehouse and analytics PARTNER OF CHOICE
  • 41. 24© 2018 Attunity FINANCIAL SERVICES MANUFACTURING/ INDUSTRIAL HEALTH CARE GOVERNMENT TECHNOLOGY / TELECOM RETAIL OTHER INDUSTRIES 2000 CUSTOMERS AND HALF THE FORTUNE 100