SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Big Data and AI
The convergence of Big Data
and Machine Learning
October 18, 2018
Colin MacNaughton
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
INTRODUCTIONS
• Based in Silicon Valley
• Creators of the X Platform™- Memory Oriented
Application Platform
• Passionate about high performance computing
for mission critical enterprises
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
AGENDA
• MACHINE LEARNING: BIG DATA -> BETTER FEATURES
• HARNESSING BIG, FAST DATA IN REAL TIME
• USE CASE: REAL TIME FRAUD DETECTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
BIG DATA AND MACHINE LEARNING
Training
• Deep Learning has risen to the fore recently, and it is data hungry! When looking to make
accurate predictions we need large data sets to train and test our models.
In Production (real-time)
• The more data (features) we can access and aggregate in real time to feed as inputs to our
models, the more accurate our predictive output will be.
• This is an HTAP/HOAP problem: can we assemble this data at scale while it is also being
updated?
• Because models need to evolve continuously, loosely coupled (micro service) architectures
are a good choice, but at the risk of needing to move a lot of data around.
Big Data and Machine Learning go Hand in
Hand
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
TYPES OF APPLICATIONS
• Financial Trading
• IoT Event Processors
• Credit Card Processors
• E-Commerce
 Personalization Engines
 Value Based Pricing
• Ad Exchanges
• …
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
MACHINE LEARNING WORKFLOW
REFINE /
IMPROVE
FEATURE
SELECTION
DATA
AQUISITION
TRAIN
TEST
PRODUCTION
MONITOR
TODAY’S
FOCUS
TODAY’S
FOCUS
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
FEATURE SELECTION
It’s all about the data …but what data?
• Which pieces of data serve as the best predictors of what we are looking
to answer?
• Can I get an accurate (enough) result just from the data in the request a
user sent?
• If not can more data help?
FEATURE
SELECTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Can Big Data in Real Time help us leverage more meaningful features?
 How much better are our predictive models if they can leverage features
based on relevant historical/topical data on a transaction by transaction
basis?
 Can we assemble such data within a meaningful time frame in
production?
 Can we concurrently collect more data that we expect will be useful?
FEATURE
SELECTION
BIG DATA AND BETTER FEATURES
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
BIG DATA AND BETTER FEATURES
Example – Credit Card Fraud Detection
Feature Big Data Enhanced Feature
Amount Skew from median purchase, Amount
charged in last hour.
Merchant # of Prior Purchases by user
Location Distance from last purchase? Distance from
home(s)? Purchased from this location in the
past?
Time Last Purchase Time?
FEATURE
SELECTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
BIG DATA AND BETTER FEATURES
Example – Personalization
Feature Big Data Enhanced Feature
Time Seasonal Interests / Habits … every year
Jane goes snowshoeing in March.
Search Terms / Key words Past Interests / Behavior
Location • The last time John was in Paris, he
was interested in…
• John’s calendar says he’ll be in Paris
next September.
• XYZ is happening here now (or in the
future).
Demographics What are peers clicking on now?
FEATURE
SELECTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
MACHINE LEARNING IN PRODUCTION
Performance and Scale – Lots of data needed in real time
 Can I assemble the normalized feature data needed to feed my model in real time?
 Can I produce results fast enough that the prediction still matters?
Agility – Rapid Change: Models must evolve over time and so must the system
feeding data to it.
 Fail Fast – Ability to rapidly test and discard what doesn’t work.
 A/B testing
 Zero down time deployment, easy deployment to test environments.
High Availability
 No interruptions across Process, Machine or Data Center failure.
Business Logic
 ML isn’t the answer to every problem, can your compute/data infrastructure
handle traditional analytics and ML?
 Cyber Threats – duping the model.
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
PLAN FOR (Evolving) SCALE – COMPUTE + Data + HA
Data Tier
(Transactional
State
Reference Data)
Application
Tier
(Business
Logic)
Messaging
(HTTP, JMS)
Data Grid, RDBMS ...
• Data Update Contention
• Isolation and Ordering
• Data Access Latency
• Transaction coordination between
message and data stream.
• Only scales to a point.
• Complex Routing
• Complex Ordering
• Synchronous
Wrong
Scaling
Strategy
Shared storage for
HA and reliability
Launch more
instances for scale +
HA
Request Load
Balancing
Can you assemble the feature vectors needed to feed your model at
scale?
 Not with the above … Update Contention between threads /
instances prevents the ability to do big data reads.
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
PLAN FOR (Evolving) SCALE – COMPUTE + Data + HA
Data Tier
(Transactional
State
Reference Data)
Application
Tier
(Business
Logic)
Messaging
(Publish -
Subscribe)
In-Memory + Partitioned
Routing Strategy?
Processing Swim-lanes (ordered)
Messaging Fabric
Data And
Application Tier
Collapsed
+ Co-located Function + Data+ Replicated
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Service
2
PLAN FOR (Evolving) SCALE – MICRO SERVICES
Micro Services:
 Each Service owns private state.
 Collaborate asynchronously via messaging
 Easier to scale + less contention on shared state
 Pick up feature data in streaming processing
pipeline.
Benefits
• Reduce Risk -> Increased Agility
• Cost Effective -> Provision to hardware by granular service
needs.
• Resiliency -> Single service failure doesn’t bring down the entire
system.
Service
1
ML B
In
Process
Messaging Fabric
In
Process
Request /
Response
ML A
ML As Service
A/B testing made
simple w/ routing
rules
Business Logic and
Feature Vector Prep
{F1,F2 … Fn}
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
PLAN FOR (Evolving) SCALE – MICRO SERVICES
Data to aggregate across lots of disparate Microservices?
ML B
Messaging Fabric
Request /
Response
ML A
{F1,F2 … Fn}
Service
1
Parallel Fetch (Fork/Join)
• choice of messaging
provider matters, but
modern providers can
handle it.
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
PLAN FOR (Evolving) SCALE – DATA EVOLUTION
What Happens when Services are Updated?
ML B
Messaging Fabric
Request /
Response
ML A
{F1,F2 … Fn}
Service
1
 Choice of message encoding is critical.
 Older versions of services should
still function when new fields
added.
 Efficiency of Encoding Matters!
 Impedance mismatch between
State/Message encoding?
 Organization-wide agreed upon
“Rules of Engagement”
Version
2
Versio
n1
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
DON’T FORGET PLAIN OLD BUSINESS LOGIC
Traditional Analytics are Still Important!
• Not all analytics are best solved with ML … be judicious.
• Deep Neural Networks are a Black Box…
• … so when possible traditional rules/analytics should complement ML,
along with robust monitoring.
Adversarial Inputs
PRODUCTION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
PLAN WORKFLOW FOR REFINEMENT
 Plan for measuring and monitoring ML efficacy
 Behavior changes over time
 Models will need to evolve.
 Getting data out
 Consider infrastructural / security implications of
exposing production data for refinement training of models.
 Continuous training workflows?
DATA
AQUISITION
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
THE X PLATFORM
THE X PLATFORM
The X Platform is a memory oriented platform
for building multi-agent, transactional applications.
Smart Routing + Collocated Data + Business Logic
=
Full Promise of In-Memory Computing
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Message Driven
Stateful
Multi-Agent
 Totally Available
 Horizontally Scalable
 Ultra Performant
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
HA + SCALE ON THE X PLATFORM
Backup P3Backup P2
Smart Routing
(messaging traffic partitioned to align with data partitions)
Pipelined
Replication Backup P1Primary P1
Solace, Kafka, Falcon, JMS 2.0…
Primary P2 Primary P3
PARTITION 1 PARTITION 2 PARTITION 3
/${ENV}/ORDERS/#hash(${customerId},3)
/PROD/ORDERS/3/PROD/ORDERS/2/PROD/ORDERS/1
From Config From Message
Single
Threaded
Logic
KEY TAKEAWAYS
DATA:
• STRIPED – NO UPDATE CONTENTION, HORIZONTAL SCALE
• IN MEMORY – NO DATA ACCESS LATENCY, DISK BASED JOURNAL
BACKED
• PLAIN OLD JAVA OBJECTS– FLEXIBLE, EVOLVABLE ENCODING
MESSAGING
• CONTENT BASED – TRANSPARENT ROUTING TO DATA
• FIRE AND FORGET – EXACTLY ONCE PROCESSING, CONSISTENT
WITH STATE
• PLAIN OLD JAVA OBJECTS– FLEXIBLE, EVOLVABLE ENCODING
HIGH AVAILABILITY
• PIPELINED REPLICATION – NON BLOCKING PIPELINED MEMORY-
TO-MEMORY -> STREAM TRANSACTION PROCESSING
• NO DATA LOSS – ACROSS PROCESS, MACHINE, DATA CENTER
FAILURE
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
ML A ML B
WHAT DOES THIS MEAN FOR ML + BIG DATA IN REAL
TIME?
 SCALABLE
 By Service Partitioning
 FAST
 All Data In Memory (No Remoting)
 No Data Contention (Single Thread)
Service
1
Primar
y
ML B
Messaging Fabric
Request /
Response
(streams)
ML A
ML As Service
A/B testing made
simple w/ routing
rules
Business Logic and Feature Vector Prep
{F1,F2 … Fn}
Servic
e1
Backu
p
Service1
Primary
Service1
Backup
Service
1
Primar
y
Servic
e1
Backu
p
Service2
Primary
Service2
Backup
 AGILE
 Micro Service Architecture
 Trivial evolution of message + data
models
 HIGHLY AVAILABLE
 Memory-Memory Replication (Zero
Down Time)
 Exactly Once Delivery across
failures (Zero Duplication/Loss)
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Getting Data Out…
Journal
Storage
ANALYTICS/
TRAINING
Journal
Storage
In-memory
storage
Application Logic
(Message Handler)
Backup
ASYNCHRONOUS
(i.e. no impact on system throughput)
ASYNCHRONOUS
(i.e. no impact on system throughput)
Messaging Fabric
ASYNCHRONOUS,
Guaranteed
Messaging
Application Logic
(Message Handler)
In-memory
storage
CDC
Primary
Always Local State (POJO)
No Remote Lookup, No Contention,
Single Threaded
Ack
REPLICATION:
Concurrent, background operation
ICR
REMOTE
DATA
CENTER
NO MESSAGING
IN BACKUP ROLE
Change Data Capture:
Stream to Data Warehouse for continued
training.
Inter Cluster Replication:
Stream To Test Env
for Model Testing
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
USE CASE - REAL TIME FRAUD DETECTION
Receive CC Authorization Request
 Identify Card Holder
 Identify Merchant
 Perform Fraud Checks using
 CC Holder Specific Information
 Transaction History
Send CC Authorization Response
Reference Data Aggregation
Hybrid Rule Based Analytics + Machine
Learning
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Flow
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
FRAUD DETECTION
WITH THE X PLATFORM + TENSOR FLOW
50k Credit Cards / Instance
17.5m Transactions / Shard
100k Merchants / Shard
1.2ms median Authorization
Time (36.4 ms max)
Full Scan of two year’s worth of
transactions per card on each
authorization to feed ML
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Performance Summary for 2 Partitions
200k Merchants
100k Credit Cards
35 million Transactions
TensorFlow (no GPU)
2 Partitions, Full HA
7500k auth/sec
Auth Response Time = ~1.2ms
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
HAVE A LOOK FOR YOURSELF
Check Out the Source
https://github.com/neeveresearch/nvx-apps
Getting Started Guide
https://docs.neeveresearch.com
Get in Touch
contact@neeveresearch.com
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Questions
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Subu Sankara
subu@synerzip.com
510-579-9673
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Your trusted partner for
Agile software product development.
Synerzip
Accelerate the delivery of your product roadmap
Address technology skill gaps
Save at least 50% with India software development
Augment your team with optional on-site professionals
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Synerzip Clients
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
33
Connect with Synerzip
@Synerzip
linkedin.com/company/synerzip
facebook.com/Synerzip
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
Next Webinar
Presented by
Pankaj Parkar
Architect and Microsoft MVP, Synerzip
And
Yogesh Patel
Director of Engineering, Synerzip
Angular or React – Don’t Roll the Dice
November 14, 2018 at 10am CT
Copyright © 2018 Neeve Research, LLC, All Rights Reserved
TEXAS | SILICON VALLEY | NYC | INDIA

Weitere ähnliche Inhalte

Was ist angesagt?

Deep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the EnterpriseDeep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the EnterpriseGanesan Narayanasamy
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data editionMark Kerzner
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsightsUse cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsightsGord Sissons
 
Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...DataWorks Summit
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.DataWorks Summit
 
IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17David Spurway
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)DataWorks Summit/Hadoop Summit
 
Change data capture
Change data captureChange data capture
Change data captureJames Deppen
 
Privacy-Preserving AI Network - PlatON 2.0
Privacy-Preserving AI Network - PlatON 2.0 Privacy-Preserving AI Network - PlatON 2.0
Privacy-Preserving AI Network - PlatON 2.0 ShiHeng1
 
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkPolymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkDatabricks
 
Sprint's Data Modernization Journey
Sprint's Data Modernization JourneySprint's Data Modernization Journey
Sprint's Data Modernization JourneyHortonworks
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Jeffrey T. Pollock
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseJeffrey T. Pollock
 
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013Publicis Sapient Engineering
 
Real time trade surveillance in financial markets
Real time trade surveillance in financial marketsReal time trade surveillance in financial markets
Real time trade surveillance in financial marketsHortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 

Was ist angesagt? (20)

Hadoop dev 01
Hadoop dev 01Hadoop dev 01
Hadoop dev 01
 
Deep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the EnterpriseDeep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the Enterprise
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data edition
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsightsUse cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
 
Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.
 
IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
 
Change data capture
Change data captureChange data capture
Change data capture
 
Privacy-Preserving AI Network - PlatON 2.0
Privacy-Preserving AI Network - PlatON 2.0 Privacy-Preserving AI Network - PlatON 2.0
Privacy-Preserving AI Network - PlatON 2.0
 
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkPolymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
 
Sprint's Data Modernization Journey
Sprint's Data Modernization JourneySprint's Data Modernization Journey
Sprint's Data Modernization Journey
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San Jose
 
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
 
Real time trade surveillance in financial markets
Real time trade surveillance in financial marketsReal time trade surveillance in financial markets
Real time trade surveillance in financial markets
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 

Ähnlich wie Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton

Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalVMware Tanzu Korea
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and AnalyticsVMware Tanzu
 
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)Ontico
 
Delivering Mission Assurance in the Federal Government through Cloud Technology
Delivering Mission Assurance in the Federal Government through Cloud TechnologyDelivering Mission Assurance in the Federal Government through Cloud Technology
Delivering Mission Assurance in the Federal Government through Cloud TechnologyAmazon Web Services
 
Modern data integration expert sessions
Modern data integration expert sessionsModern data integration expert sessions
Modern data integration expert sessionsJessicaMurrell3
 
Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar ibi
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDBMongoDB
 
7 Emerging Data & Enterprise Integration Trends in 2022
7 Emerging Data & Enterprise Integration Trends in 20227 Emerging Data & Enterprise Integration Trends in 2022
7 Emerging Data & Enterprise Integration Trends in 2022Safe Software
 
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...Precisely
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?Aerospike, Inc.
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsRick Perret
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life RevolutionCapgemini
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowaleCapgemini
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM
 
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?SnapLogic
 
Open Source and the New Economics of IT - Ingres CIO Doug Harr
Open Source and the New Economics of IT - Ingres CIO Doug HarrOpen Source and the New Economics of IT - Ingres CIO Doug Harr
Open Source and the New Economics of IT - Ingres CIO Doug HarrAlfresco Software
 
Hortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data ScienceHortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data ScienceThiago Santiago
 
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsConnected Data World
 
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint MonitoringGain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint MonitoringInfluxData
 

Ähnlich wie Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton (20)

Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
 
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
 
Delivering Mission Assurance in the Federal Government through Cloud Technology
Delivering Mission Assurance in the Federal Government through Cloud TechnologyDelivering Mission Assurance in the Federal Government through Cloud Technology
Delivering Mission Assurance in the Federal Government through Cloud Technology
 
Modern data integration expert sessions
Modern data integration expert sessionsModern data integration expert sessions
Modern data integration expert sessions
 
Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDB
 
7 Emerging Data & Enterprise Integration Trends in 2022
7 Emerging Data & Enterprise Integration Trends in 20227 Emerging Data & Enterprise Integration Trends in 2022
7 Emerging Data & Enterprise Integration Trends in 2022
 
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & Analytics
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowale
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
 
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
 
Open Source and the New Economics of IT - Ingres CIO Doug Harr
Open Source and the New Economics of IT - Ingres CIO Doug HarrOpen Source and the New Economics of IT - Ingres CIO Doug Harr
Open Source and the New Economics of IT - Ingres CIO Doug Harr
 
Hortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data ScienceHortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data Science
 
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analytics
 
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint MonitoringGain Deep Visibility into APIs and Integrations with Anypoint Monitoring
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring
 

Mehr von Synerzip

HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...Synerzip
 
The QA/Testing Process
The QA/Testing ProcessThe QA/Testing Process
The QA/Testing ProcessSynerzip
 
Test Driven Development – What Works And What Doesn’t
Test Driven Development – What Works And What Doesn’t Test Driven Development – What Works And What Doesn’t
Test Driven Development – What Works And What Doesn’t Synerzip
 
Distributed/Dual-Shore Agile Software Development – Is It Effective?
Distributed/Dual-Shore Agile Software Development – Is It Effective?Distributed/Dual-Shore Agile Software Development – Is It Effective?
Distributed/Dual-Shore Agile Software Development – Is It Effective?Synerzip
 
Using Agile Approach with Fixed Budget Projects
Using Agile Approach with Fixed Budget ProjectsUsing Agile Approach with Fixed Budget Projects
Using Agile Approach with Fixed Budget ProjectsSynerzip
 
QA Role in Agile Teams
QA Role in Agile Teams QA Role in Agile Teams
QA Role in Agile Teams Synerzip
 
Agile For Mobile App Development
Agile For Mobile App Development Agile For Mobile App Development
Agile For Mobile App Development Synerzip
 
Using Agile in Non-Ideal Situations
Using Agile in Non-Ideal SituationsUsing Agile in Non-Ideal Situations
Using Agile in Non-Ideal SituationsSynerzip
 
Accelerating Agile Transformations - Ravi Verma
Accelerating Agile Transformations - Ravi VermaAccelerating Agile Transformations - Ravi Verma
Accelerating Agile Transformations - Ravi VermaSynerzip
 
Agile Product Management Basics
Agile Product Management BasicsAgile Product Management Basics
Agile Product Management BasicsSynerzip
 
Product Portfolio Kanban - by Erik Huddleston
Product Portfolio Kanban - by Erik HuddlestonProduct Portfolio Kanban - by Erik Huddleston
Product Portfolio Kanban - by Erik HuddlestonSynerzip
 
Modern Software Practices - by Damon Poole
Modern Software Practices - by Damon PooleModern Software Practices - by Damon Poole
Modern Software Practices - by Damon PooleSynerzip
 
Context Driven Agile Leadership
Context Driven Agile LeadershipContext Driven Agile Leadership
Context Driven Agile LeadershipSynerzip
 
Adopting TDD - by Don McGreal
Adopting TDD - by Don McGrealAdopting TDD - by Don McGreal
Adopting TDD - by Don McGrealSynerzip
 
Pragmatics of Agility - by Venkat Subramaniam
Pragmatics of Agility - by Venkat SubramaniamPragmatics of Agility - by Venkat Subramaniam
Pragmatics of Agility - by Venkat SubramaniamSynerzip
 
Cross Platform Mobile App Development
Cross Platform Mobile App DevelopmentCross Platform Mobile App Development
Cross Platform Mobile App DevelopmentSynerzip
 
Agile2011 Conference – Key Take Aways
Agile2011 Conference – Key Take AwaysAgile2011 Conference – Key Take Aways
Agile2011 Conference – Key Take AwaysSynerzip
 
Performance Evaluation in Agile
Performance Evaluation in AgilePerformance Evaluation in Agile
Performance Evaluation in AgileSynerzip
 
Scrum And Kanban (for better agile teams)
Scrum And Kanban (for better agile teams)Scrum And Kanban (for better agile teams)
Scrum And Kanban (for better agile teams)Synerzip
 
Managing Technical Debt - by Michael Hall
Managing Technical Debt - by Michael HallManaging Technical Debt - by Michael Hall
Managing Technical Debt - by Michael HallSynerzip
 

Mehr von Synerzip (20)

HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
 
The QA/Testing Process
The QA/Testing ProcessThe QA/Testing Process
The QA/Testing Process
 
Test Driven Development – What Works And What Doesn’t
Test Driven Development – What Works And What Doesn’t Test Driven Development – What Works And What Doesn’t
Test Driven Development – What Works And What Doesn’t
 
Distributed/Dual-Shore Agile Software Development – Is It Effective?
Distributed/Dual-Shore Agile Software Development – Is It Effective?Distributed/Dual-Shore Agile Software Development – Is It Effective?
Distributed/Dual-Shore Agile Software Development – Is It Effective?
 
Using Agile Approach with Fixed Budget Projects
Using Agile Approach with Fixed Budget ProjectsUsing Agile Approach with Fixed Budget Projects
Using Agile Approach with Fixed Budget Projects
 
QA Role in Agile Teams
QA Role in Agile Teams QA Role in Agile Teams
QA Role in Agile Teams
 
Agile For Mobile App Development
Agile For Mobile App Development Agile For Mobile App Development
Agile For Mobile App Development
 
Using Agile in Non-Ideal Situations
Using Agile in Non-Ideal SituationsUsing Agile in Non-Ideal Situations
Using Agile in Non-Ideal Situations
 
Accelerating Agile Transformations - Ravi Verma
Accelerating Agile Transformations - Ravi VermaAccelerating Agile Transformations - Ravi Verma
Accelerating Agile Transformations - Ravi Verma
 
Agile Product Management Basics
Agile Product Management BasicsAgile Product Management Basics
Agile Product Management Basics
 
Product Portfolio Kanban - by Erik Huddleston
Product Portfolio Kanban - by Erik HuddlestonProduct Portfolio Kanban - by Erik Huddleston
Product Portfolio Kanban - by Erik Huddleston
 
Modern Software Practices - by Damon Poole
Modern Software Practices - by Damon PooleModern Software Practices - by Damon Poole
Modern Software Practices - by Damon Poole
 
Context Driven Agile Leadership
Context Driven Agile LeadershipContext Driven Agile Leadership
Context Driven Agile Leadership
 
Adopting TDD - by Don McGreal
Adopting TDD - by Don McGrealAdopting TDD - by Don McGreal
Adopting TDD - by Don McGreal
 
Pragmatics of Agility - by Venkat Subramaniam
Pragmatics of Agility - by Venkat SubramaniamPragmatics of Agility - by Venkat Subramaniam
Pragmatics of Agility - by Venkat Subramaniam
 
Cross Platform Mobile App Development
Cross Platform Mobile App DevelopmentCross Platform Mobile App Development
Cross Platform Mobile App Development
 
Agile2011 Conference – Key Take Aways
Agile2011 Conference – Key Take AwaysAgile2011 Conference – Key Take Aways
Agile2011 Conference – Key Take Aways
 
Performance Evaluation in Agile
Performance Evaluation in AgilePerformance Evaluation in Agile
Performance Evaluation in Agile
 
Scrum And Kanban (for better agile teams)
Scrum And Kanban (for better agile teams)Scrum And Kanban (for better agile teams)
Scrum And Kanban (for better agile teams)
 
Managing Technical Debt - by Michael Hall
Managing Technical Debt - by Michael HallManaging Technical Debt - by Michael Hall
Managing Technical Debt - by Michael Hall
 

Kürzlich hochgeladen

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 

Kürzlich hochgeladen (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 

Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton

  • 1. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Big Data and AI The convergence of Big Data and Machine Learning October 18, 2018 Colin MacNaughton
  • 2. Copyright © 2018 Neeve Research, LLC, All Rights Reserved INTRODUCTIONS • Based in Silicon Valley • Creators of the X Platform™- Memory Oriented Application Platform • Passionate about high performance computing for mission critical enterprises
  • 3. Copyright © 2018 Neeve Research, LLC, All Rights Reserved AGENDA • MACHINE LEARNING: BIG DATA -> BETTER FEATURES • HARNESSING BIG, FAST DATA IN REAL TIME • USE CASE: REAL TIME FRAUD DETECTION
  • 4. Copyright © 2018 Neeve Research, LLC, All Rights Reserved BIG DATA AND MACHINE LEARNING Training • Deep Learning has risen to the fore recently, and it is data hungry! When looking to make accurate predictions we need large data sets to train and test our models. In Production (real-time) • The more data (features) we can access and aggregate in real time to feed as inputs to our models, the more accurate our predictive output will be. • This is an HTAP/HOAP problem: can we assemble this data at scale while it is also being updated? • Because models need to evolve continuously, loosely coupled (micro service) architectures are a good choice, but at the risk of needing to move a lot of data around. Big Data and Machine Learning go Hand in Hand
  • 5. Copyright © 2018 Neeve Research, LLC, All Rights Reserved TYPES OF APPLICATIONS • Financial Trading • IoT Event Processors • Credit Card Processors • E-Commerce  Personalization Engines  Value Based Pricing • Ad Exchanges • …
  • 6. Copyright © 2018 Neeve Research, LLC, All Rights Reserved MACHINE LEARNING WORKFLOW REFINE / IMPROVE FEATURE SELECTION DATA AQUISITION TRAIN TEST PRODUCTION MONITOR TODAY’S FOCUS TODAY’S FOCUS
  • 7. Copyright © 2018 Neeve Research, LLC, All Rights Reserved FEATURE SELECTION It’s all about the data …but what data? • Which pieces of data serve as the best predictors of what we are looking to answer? • Can I get an accurate (enough) result just from the data in the request a user sent? • If not can more data help? FEATURE SELECTION
  • 8. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Can Big Data in Real Time help us leverage more meaningful features?  How much better are our predictive models if they can leverage features based on relevant historical/topical data on a transaction by transaction basis?  Can we assemble such data within a meaningful time frame in production?  Can we concurrently collect more data that we expect will be useful? FEATURE SELECTION BIG DATA AND BETTER FEATURES
  • 9. Copyright © 2018 Neeve Research, LLC, All Rights Reserved BIG DATA AND BETTER FEATURES Example – Credit Card Fraud Detection Feature Big Data Enhanced Feature Amount Skew from median purchase, Amount charged in last hour. Merchant # of Prior Purchases by user Location Distance from last purchase? Distance from home(s)? Purchased from this location in the past? Time Last Purchase Time? FEATURE SELECTION
  • 10. Copyright © 2018 Neeve Research, LLC, All Rights Reserved BIG DATA AND BETTER FEATURES Example – Personalization Feature Big Data Enhanced Feature Time Seasonal Interests / Habits … every year Jane goes snowshoeing in March. Search Terms / Key words Past Interests / Behavior Location • The last time John was in Paris, he was interested in… • John’s calendar says he’ll be in Paris next September. • XYZ is happening here now (or in the future). Demographics What are peers clicking on now? FEATURE SELECTION
  • 11. Copyright © 2018 Neeve Research, LLC, All Rights Reserved MACHINE LEARNING IN PRODUCTION Performance and Scale – Lots of data needed in real time  Can I assemble the normalized feature data needed to feed my model in real time?  Can I produce results fast enough that the prediction still matters? Agility – Rapid Change: Models must evolve over time and so must the system feeding data to it.  Fail Fast – Ability to rapidly test and discard what doesn’t work.  A/B testing  Zero down time deployment, easy deployment to test environments. High Availability  No interruptions across Process, Machine or Data Center failure. Business Logic  ML isn’t the answer to every problem, can your compute/data infrastructure handle traditional analytics and ML?  Cyber Threats – duping the model. PRODUCTION
  • 12. Copyright © 2018 Neeve Research, LLC, All Rights Reserved PLAN FOR (Evolving) SCALE – COMPUTE + Data + HA Data Tier (Transactional State Reference Data) Application Tier (Business Logic) Messaging (HTTP, JMS) Data Grid, RDBMS ... • Data Update Contention • Isolation and Ordering • Data Access Latency • Transaction coordination between message and data stream. • Only scales to a point. • Complex Routing • Complex Ordering • Synchronous Wrong Scaling Strategy Shared storage for HA and reliability Launch more instances for scale + HA Request Load Balancing Can you assemble the feature vectors needed to feed your model at scale?  Not with the above … Update Contention between threads / instances prevents the ability to do big data reads. PRODUCTION
  • 13. Copyright © 2018 Neeve Research, LLC, All Rights Reserved PLAN FOR (Evolving) SCALE – COMPUTE + Data + HA Data Tier (Transactional State Reference Data) Application Tier (Business Logic) Messaging (Publish - Subscribe) In-Memory + Partitioned Routing Strategy? Processing Swim-lanes (ordered) Messaging Fabric Data And Application Tier Collapsed + Co-located Function + Data+ Replicated PRODUCTION
  • 14. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Service 2 PLAN FOR (Evolving) SCALE – MICRO SERVICES Micro Services:  Each Service owns private state.  Collaborate asynchronously via messaging  Easier to scale + less contention on shared state  Pick up feature data in streaming processing pipeline. Benefits • Reduce Risk -> Increased Agility • Cost Effective -> Provision to hardware by granular service needs. • Resiliency -> Single service failure doesn’t bring down the entire system. Service 1 ML B In Process Messaging Fabric In Process Request / Response ML A ML As Service A/B testing made simple w/ routing rules Business Logic and Feature Vector Prep {F1,F2 … Fn} PRODUCTION
  • 15. Copyright © 2018 Neeve Research, LLC, All Rights Reserved PLAN FOR (Evolving) SCALE – MICRO SERVICES Data to aggregate across lots of disparate Microservices? ML B Messaging Fabric Request / Response ML A {F1,F2 … Fn} Service 1 Parallel Fetch (Fork/Join) • choice of messaging provider matters, but modern providers can handle it. PRODUCTION
  • 16. Copyright © 2018 Neeve Research, LLC, All Rights Reserved PLAN FOR (Evolving) SCALE – DATA EVOLUTION What Happens when Services are Updated? ML B Messaging Fabric Request / Response ML A {F1,F2 … Fn} Service 1  Choice of message encoding is critical.  Older versions of services should still function when new fields added.  Efficiency of Encoding Matters!  Impedance mismatch between State/Message encoding?  Organization-wide agreed upon “Rules of Engagement” Version 2 Versio n1 PRODUCTION
  • 17. Copyright © 2018 Neeve Research, LLC, All Rights Reserved DON’T FORGET PLAIN OLD BUSINESS LOGIC Traditional Analytics are Still Important! • Not all analytics are best solved with ML … be judicious. • Deep Neural Networks are a Black Box… • … so when possible traditional rules/analytics should complement ML, along with robust monitoring. Adversarial Inputs PRODUCTION
  • 18. Copyright © 2018 Neeve Research, LLC, All Rights Reserved PLAN WORKFLOW FOR REFINEMENT  Plan for measuring and monitoring ML efficacy  Behavior changes over time  Models will need to evolve.  Getting data out  Consider infrastructural / security implications of exposing production data for refinement training of models.  Continuous training workflows? DATA AQUISITION
  • 19. Copyright © 2018 Neeve Research, LLC, All Rights Reserved THE X PLATFORM THE X PLATFORM The X Platform is a memory oriented platform for building multi-agent, transactional applications. Smart Routing + Collocated Data + Business Logic = Full Promise of In-Memory Computing
  • 20. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Message Driven Stateful Multi-Agent  Totally Available  Horizontally Scalable  Ultra Performant
  • 21. Copyright © 2018 Neeve Research, LLC, All Rights Reserved HA + SCALE ON THE X PLATFORM Backup P3Backup P2 Smart Routing (messaging traffic partitioned to align with data partitions) Pipelined Replication Backup P1Primary P1 Solace, Kafka, Falcon, JMS 2.0… Primary P2 Primary P3 PARTITION 1 PARTITION 2 PARTITION 3 /${ENV}/ORDERS/#hash(${customerId},3) /PROD/ORDERS/3/PROD/ORDERS/2/PROD/ORDERS/1 From Config From Message Single Threaded Logic KEY TAKEAWAYS DATA: • STRIPED – NO UPDATE CONTENTION, HORIZONTAL SCALE • IN MEMORY – NO DATA ACCESS LATENCY, DISK BASED JOURNAL BACKED • PLAIN OLD JAVA OBJECTS– FLEXIBLE, EVOLVABLE ENCODING MESSAGING • CONTENT BASED – TRANSPARENT ROUTING TO DATA • FIRE AND FORGET – EXACTLY ONCE PROCESSING, CONSISTENT WITH STATE • PLAIN OLD JAVA OBJECTS– FLEXIBLE, EVOLVABLE ENCODING HIGH AVAILABILITY • PIPELINED REPLICATION – NON BLOCKING PIPELINED MEMORY- TO-MEMORY -> STREAM TRANSACTION PROCESSING • NO DATA LOSS – ACROSS PROCESS, MACHINE, DATA CENTER FAILURE
  • 22. Copyright © 2018 Neeve Research, LLC, All Rights Reserved ML A ML B WHAT DOES THIS MEAN FOR ML + BIG DATA IN REAL TIME?  SCALABLE  By Service Partitioning  FAST  All Data In Memory (No Remoting)  No Data Contention (Single Thread) Service 1 Primar y ML B Messaging Fabric Request / Response (streams) ML A ML As Service A/B testing made simple w/ routing rules Business Logic and Feature Vector Prep {F1,F2 … Fn} Servic e1 Backu p Service1 Primary Service1 Backup Service 1 Primar y Servic e1 Backu p Service2 Primary Service2 Backup  AGILE  Micro Service Architecture  Trivial evolution of message + data models  HIGHLY AVAILABLE  Memory-Memory Replication (Zero Down Time)  Exactly Once Delivery across failures (Zero Duplication/Loss)
  • 23. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Getting Data Out… Journal Storage ANALYTICS/ TRAINING Journal Storage In-memory storage Application Logic (Message Handler) Backup ASYNCHRONOUS (i.e. no impact on system throughput) ASYNCHRONOUS (i.e. no impact on system throughput) Messaging Fabric ASYNCHRONOUS, Guaranteed Messaging Application Logic (Message Handler) In-memory storage CDC Primary Always Local State (POJO) No Remote Lookup, No Contention, Single Threaded Ack REPLICATION: Concurrent, background operation ICR REMOTE DATA CENTER NO MESSAGING IN BACKUP ROLE Change Data Capture: Stream to Data Warehouse for continued training. Inter Cluster Replication: Stream To Test Env for Model Testing
  • 24. Copyright © 2018 Neeve Research, LLC, All Rights Reserved USE CASE - REAL TIME FRAUD DETECTION Receive CC Authorization Request  Identify Card Holder  Identify Merchant  Perform Fraud Checks using  CC Holder Specific Information  Transaction History Send CC Authorization Response Reference Data Aggregation Hybrid Rule Based Analytics + Machine Learning
  • 25. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Flow
  • 26. Copyright © 2018 Neeve Research, LLC, All Rights Reserved FRAUD DETECTION WITH THE X PLATFORM + TENSOR FLOW 50k Credit Cards / Instance 17.5m Transactions / Shard 100k Merchants / Shard 1.2ms median Authorization Time (36.4 ms max) Full Scan of two year’s worth of transactions per card on each authorization to feed ML
  • 27. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Performance Summary for 2 Partitions 200k Merchants 100k Credit Cards 35 million Transactions TensorFlow (no GPU) 2 Partitions, Full HA 7500k auth/sec Auth Response Time = ~1.2ms
  • 28. Copyright © 2018 Neeve Research, LLC, All Rights Reserved HAVE A LOOK FOR YOURSELF Check Out the Source https://github.com/neeveresearch/nvx-apps Getting Started Guide https://docs.neeveresearch.com Get in Touch contact@neeveresearch.com
  • 29. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Questions
  • 30. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Subu Sankara subu@synerzip.com 510-579-9673
  • 31. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Your trusted partner for Agile software product development. Synerzip Accelerate the delivery of your product roadmap Address technology skill gaps Save at least 50% with India software development Augment your team with optional on-site professionals
  • 32. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Synerzip Clients
  • 33. Copyright © 2018 Neeve Research, LLC, All Rights Reserved 33 Connect with Synerzip @Synerzip linkedin.com/company/synerzip facebook.com/Synerzip
  • 34. Copyright © 2018 Neeve Research, LLC, All Rights Reserved Next Webinar Presented by Pankaj Parkar Architect and Microsoft MVP, Synerzip And Yogesh Patel Director of Engineering, Synerzip Angular or React – Don’t Roll the Dice November 14, 2018 at 10am CT
  • 35. Copyright © 2018 Neeve Research, LLC, All Rights Reserved TEXAS | SILICON VALLEY | NYC | INDIA