Suche senden
Hochladen
Real-time Analytics in Financial
•
3 gefällt mir
•
1,440 views
Yifeng Jiang
Folgen
Use case, architecture and challenges of real-time analytics in financial industry.
Weniger lesen
Mehr lesen
Technologie
Melden
Teilen
Melden
Teilen
1 von 32
Jetzt herunterladen
Downloaden Sie, um offline zu lesen
Empfohlen
Real-time Analytics in Financial: Use Case, Architecture and Challenges
Real-time Analytics in Financial: Use Case, Architecture and Challenges
DataWorks Summit/Hadoop Summit
Hadoop Crash Course
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
Apache Hadoop Crash Course - HS16SJ
Apache Hadoop Crash Course - HS16SJ
DataWorks Summit/Hadoop Summit
Using Hadoop for Cognitive Analytics
Using Hadoop for Cognitive Analytics
DataWorks Summit/Hadoop Summit
Social Media Monitoring with NiFi, Druid and Superset
Social Media Monitoring with NiFi, Druid and Superset
Thiago Santiago
Hortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data Science
Thiago Santiago
Data Aggregation, Curation and analytics for security and situational awareness
Data Aggregation, Curation and analytics for security and situational awareness
DataWorks Summit/Hadoop Summit
Make Streaming IoT Analytics Work for You
Make Streaming IoT Analytics Work for You
Hortonworks
Empfohlen
Real-time Analytics in Financial: Use Case, Architecture and Challenges
Real-time Analytics in Financial: Use Case, Architecture and Challenges
DataWorks Summit/Hadoop Summit
Hadoop Crash Course
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
Apache Hadoop Crash Course - HS16SJ
Apache Hadoop Crash Course - HS16SJ
DataWorks Summit/Hadoop Summit
Using Hadoop for Cognitive Analytics
Using Hadoop for Cognitive Analytics
DataWorks Summit/Hadoop Summit
Social Media Monitoring with NiFi, Druid and Superset
Social Media Monitoring with NiFi, Druid and Superset
Thiago Santiago
Hortonworks - IBM Cognitive - The Future of Data Science
Hortonworks - IBM Cognitive - The Future of Data Science
Thiago Santiago
Data Aggregation, Curation and analytics for security and situational awareness
Data Aggregation, Curation and analytics for security and situational awareness
DataWorks Summit/Hadoop Summit
Make Streaming IoT Analytics Work for You
Make Streaming IoT Analytics Work for You
Hortonworks
Real time trade surveillance in financial markets
Real time trade surveillance in financial markets
Hortonworks
Apache Spark Crash Course
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Impetus Technologies
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
DataWorks Summit/Hadoop Summit
Hilton's enterprise data journey
Hilton's enterprise data journey
DataWorks Summit
Data Science Crash Course
Data Science Crash Course
DataWorks Summit/Hadoop Summit
Hadoop Summit Tokyo HDP Sandbox Workshop
Hadoop Summit Tokyo HDP Sandbox Workshop
DataWorks Summit/Hadoop Summit
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AI
DataWorks Summit
7 Predictive Analytics, Spark , Streaming use cases
7 Predictive Analytics, Spark , Streaming use cases
DataWorks Summit/Hadoop Summit
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
DataWorks Summit/Hadoop Summit
The Implacable advance of the data
The Implacable advance of the data
DataWorks Summit
HDF 3.2 - What's New
HDF 3.2 - What's New
Hortonworks
Data Science Crash Course
Data Science Crash Course
DataWorks Summit
Make Streaming Analytics work for you: The Devil is in the Details
Make Streaming Analytics work for you: The Devil is in the Details
DataWorks Summit/Hadoop Summit
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShare
MapR Technologies
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
DataWorks Summit/Hadoop Summit
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
DataWorks Summit/Hadoop Summit
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dataconomy Media
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
Hortonworks
Oracle® Trading Community Architecture
Oracle® Trading Community Architecture
Oracle Groups
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial Services
Cloudera, Inc.
Weitere ähnliche Inhalte
Was ist angesagt?
Real time trade surveillance in financial markets
Real time trade surveillance in financial markets
Hortonworks
Apache Spark Crash Course
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Impetus Technologies
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
DataWorks Summit/Hadoop Summit
Hilton's enterprise data journey
Hilton's enterprise data journey
DataWorks Summit
Data Science Crash Course
Data Science Crash Course
DataWorks Summit/Hadoop Summit
Hadoop Summit Tokyo HDP Sandbox Workshop
Hadoop Summit Tokyo HDP Sandbox Workshop
DataWorks Summit/Hadoop Summit
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AI
DataWorks Summit
7 Predictive Analytics, Spark , Streaming use cases
7 Predictive Analytics, Spark , Streaming use cases
DataWorks Summit/Hadoop Summit
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
DataWorks Summit/Hadoop Summit
The Implacable advance of the data
The Implacable advance of the data
DataWorks Summit
HDF 3.2 - What's New
HDF 3.2 - What's New
Hortonworks
Data Science Crash Course
Data Science Crash Course
DataWorks Summit
Make Streaming Analytics work for you: The Devil is in the Details
Make Streaming Analytics work for you: The Devil is in the Details
DataWorks Summit/Hadoop Summit
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShare
MapR Technologies
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
DataWorks Summit/Hadoop Summit
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
DataWorks Summit/Hadoop Summit
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dataconomy Media
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
Hortonworks
Was ist angesagt?
(20)
Real time trade surveillance in financial markets
Real time trade surveillance in financial markets
Apache Spark Crash Course
Apache Spark Crash Course
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Hilton's enterprise data journey
Hilton's enterprise data journey
Data Science Crash Course
Data Science Crash Course
Hadoop Summit Tokyo HDP Sandbox Workshop
Hadoop Summit Tokyo HDP Sandbox Workshop
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AI
7 Predictive Analytics, Spark , Streaming use cases
7 Predictive Analytics, Spark , Streaming use cases
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
The Implacable advance of the data
The Implacable advance of the data
HDF 3.2 - What's New
HDF 3.2 - What's New
Data Science Crash Course
Data Science Crash Course
Make Streaming Analytics work for you: The Devil is in the Details
Make Streaming Analytics work for you: The Devil is in the Details
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShare
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
A Tale of Two Regulations: Cross-Border Data Protection For Big Data Under GD...
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
Andere mochten auch
Oracle® Trading Community Architecture
Oracle® Trading Community Architecture
Oracle Groups
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial Services
Cloudera, Inc.
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Perficient, Inc.
Analytics in Financial Services: Keynote Presentation for TDWI and NY Tech Co...
Analytics in Financial Services: Keynote Presentation for TDWI and NY Tech Co...
Fitzgerald Analytics, Inc.
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
Capgemini
[SmartNews] Globally Scalable Web Document Classification Using Word2Vec
[SmartNews] Globally Scalable Web Document Classification Using Word2Vec
Kouhei Nakaji
Moving from data to insights: How to effectively drive business decisions & g...
Moving from data to insights: How to effectively drive business decisions & g...
Cloudera, Inc.
Big Data Analytics for Banking, a Point of View
Big Data Analytics for Banking, a Point of View
Pietro Leo
word2vec, LDA, and introducing a new hybrid algorithm: lda2vec
word2vec, LDA, and introducing a new hybrid algorithm: lda2vec
👋 Christopher Moody
Andere mochten auch
(9)
Oracle® Trading Community Architecture
Oracle® Trading Community Architecture
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial Services
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Analytics in Financial Services: Keynote Presentation for TDWI and NY Tech Co...
Analytics in Financial Services: Keynote Presentation for TDWI and NY Tech Co...
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
[SmartNews] Globally Scalable Web Document Classification Using Word2Vec
[SmartNews] Globally Scalable Web Document Classification Using Word2Vec
Moving from data to insights: How to effectively drive business decisions & g...
Moving from data to insights: How to effectively drive business decisions & g...
Big Data Analytics for Banking, a Point of View
Big Data Analytics for Banking, a Point of View
word2vec, LDA, and introducing a new hybrid algorithm: lda2vec
word2vec, LDA, and introducing a new hybrid algorithm: lda2vec
Ähnlich wie Real-time Analytics in Financial
Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Ana...
Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Ana...
DataWorks Summit
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
Hortonworks
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
DataWorks Summit
Big Data vs. Big Risk: Real-Time Trade Surveillance in Financial Markets
Big Data vs. Big Risk: Real-Time Trade Surveillance in Financial Markets
Arcadia Data
Credit fraud prevention on hwx stack
Credit fraud prevention on hwx stack
Kirk Haslbeck
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
Denodo
Unlocking insights in streaming data
Unlocking insights in streaming data
Carolyn Duby
Powering the Future of Data
Powering the Future of Data
Bilot
Big data in Private Banking
Big data in Private Banking
Jérôme Kehrli
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4
Hortonworks
Taming Big Data With Modern Software Architecture
Taming Big Data With Modern Software Architecture
Big Data User Group Karlsruhe/Stuttgart
Forrester Wave - Big data streaming analytics platforms
Forrester Wave - Big data streaming analytics platforms
IBM Software India
Spark-Zeppelin-ML on HWX
Spark-Zeppelin-ML on HWX
Kirk Haslbeck
Hadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash Course
DataWorks Summit/Hadoop Summit
The Double win business transformation and in-year ROI and TCO reduction
The Double win business transformation and in-year ROI and TCO reduction
MongoDB
Time-series data analysis and persistence with Druid
Time-series data analysis and persistence with Druid
Raúl Marín
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
Hortonworks
Machine Learning Applied to Real Time Scoring in Manufacturing and Energy Uti...
Machine Learning Applied to Real Time Scoring in Manufacturing and Energy Uti...
Kai Wähner
Big Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter Jönsson
IBM Danmark
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analytics
Connected Data World
Ähnlich wie Real-time Analytics in Financial
(20)
Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Ana...
Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Ana...
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
IIoT + Predictive Analytics: Solving for Disruption in Oil & Gas and Energy &...
Big Data vs. Big Risk: Real-Time Trade Surveillance in Financial Markets
Big Data vs. Big Risk: Real-Time Trade Surveillance in Financial Markets
Credit fraud prevention on hwx stack
Credit fraud prevention on hwx stack
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
Unlocking insights in streaming data
Unlocking insights in streaming data
Powering the Future of Data
Powering the Future of Data
Big data in Private Banking
Big data in Private Banking
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4
Taming Big Data With Modern Software Architecture
Taming Big Data With Modern Software Architecture
Forrester Wave - Big data streaming analytics platforms
Forrester Wave - Big data streaming analytics platforms
Spark-Zeppelin-ML on HWX
Spark-Zeppelin-ML on HWX
Hadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash Course
The Double win business transformation and in-year ROI and TCO reduction
The Double win business transformation and in-year ROI and TCO reduction
Time-series data analysis and persistence with Druid
Time-series data analysis and persistence with Druid
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
Machine Learning Applied to Real Time Scoring in Manufacturing and Energy Uti...
Machine Learning Applied to Real Time Scoring in Manufacturing and Energy Uti...
Big Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter Jönsson
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analytics
Mehr von Yifeng Jiang
Hive spark-s3acommitter-hbase-nfs
Hive spark-s3acommitter-hbase-nfs
Yifeng Jiang
introduction-to-apache-kafka
introduction-to-apache-kafka
Yifeng Jiang
Hive2 Introduction -- Interactive SQL for Big Data
Hive2 Introduction -- Interactive SQL for Big Data
Yifeng Jiang
Introduction to Streaming Analytics Manager
Introduction to Streaming Analytics Manager
Yifeng Jiang
HDF 3.0 IoT Platform for Everyone
HDF 3.0 IoT Platform for Everyone
Yifeng Jiang
Hortonworks Data Cloud for AWS 1.11 Updates
Hortonworks Data Cloud for AWS 1.11 Updates
Yifeng Jiang
Spark Security
Spark Security
Yifeng Jiang
Introduction to Hortonworks Data Cloud for AWS
Introduction to Hortonworks Data Cloud for AWS
Yifeng Jiang
sparksql-hive-bench-by-nec-hwx-at-hcj16
sparksql-hive-bench-by-nec-hwx-at-hcj16
Yifeng Jiang
Nifi workshop
Nifi workshop
Yifeng Jiang
Sub-second-sql-on-hadoop-at-scale
Sub-second-sql-on-hadoop-at-scale
Yifeng Jiang
Yifeng hadoop-present-public
Yifeng hadoop-present-public
Yifeng Jiang
Hive-sub-second-sql-on-hadoop-public
Hive-sub-second-sql-on-hadoop-public
Yifeng Jiang
Yifeng spark-final-public
Yifeng spark-final-public
Yifeng Jiang
Kinesis vs-kafka-and-kafka-deep-dive
Kinesis vs-kafka-and-kafka-deep-dive
Yifeng Jiang
Hive present-and-feature-shanghai
Hive present-and-feature-shanghai
Yifeng Jiang
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise Hadoop
Yifeng Jiang
Apache Hiveの今とこれから
Apache Hiveの今とこれから
Yifeng Jiang
HDFS Deep Dive
HDFS Deep Dive
Yifeng Jiang
Hadoop Trends & Hadoop on EC2
Hadoop Trends & Hadoop on EC2
Yifeng Jiang
Mehr von Yifeng Jiang
(20)
Hive spark-s3acommitter-hbase-nfs
Hive spark-s3acommitter-hbase-nfs
introduction-to-apache-kafka
introduction-to-apache-kafka
Hive2 Introduction -- Interactive SQL for Big Data
Hive2 Introduction -- Interactive SQL for Big Data
Introduction to Streaming Analytics Manager
Introduction to Streaming Analytics Manager
HDF 3.0 IoT Platform for Everyone
HDF 3.0 IoT Platform for Everyone
Hortonworks Data Cloud for AWS 1.11 Updates
Hortonworks Data Cloud for AWS 1.11 Updates
Spark Security
Spark Security
Introduction to Hortonworks Data Cloud for AWS
Introduction to Hortonworks Data Cloud for AWS
sparksql-hive-bench-by-nec-hwx-at-hcj16
sparksql-hive-bench-by-nec-hwx-at-hcj16
Nifi workshop
Nifi workshop
Sub-second-sql-on-hadoop-at-scale
Sub-second-sql-on-hadoop-at-scale
Yifeng hadoop-present-public
Yifeng hadoop-present-public
Hive-sub-second-sql-on-hadoop-public
Hive-sub-second-sql-on-hadoop-public
Yifeng spark-final-public
Yifeng spark-final-public
Kinesis vs-kafka-and-kafka-deep-dive
Kinesis vs-kafka-and-kafka-deep-dive
Hive present-and-feature-shanghai
Hive present-and-feature-shanghai
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise Hadoop
Apache Hiveの今とこれから
Apache Hiveの今とこれから
HDFS Deep Dive
HDFS Deep Dive
Hadoop Trends & Hadoop on EC2
Hadoop Trends & Hadoop on EC2
Kürzlich hochgeladen
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Principled Technologies
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
RTylerCroy
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
gurkirankumar98700
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Enterprise Knowledge
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Delhi Call girls
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Slack Application Development 101 Slides
Slack Application Development 101 Slides
praypatel2
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
Pooja Nehwal
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Katpro Technologies
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
Delhi Call girls
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
The Digital Insurer
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Michael W. Hawkins
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
Results
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
Malak Abu Hammad
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Allon Mureinik
Kürzlich hochgeladen
(20)
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Slack Application Development 101 Slides
Slack Application Development 101 Slides
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Real-time Analytics in Financial
1.
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time Analytics in Financial Use Case, Architecture and Challenges 蒋
逸峰(しょう いつほう/Yifeng Jiang) Solutions Engineer, Hortonworks @uprush October 26, 2016
2.
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved The YAP Map by Google M
3.
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved http://www.wondermondo.com
4.
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Today’s Money & Financial Service moneymoney Financial Service 0110010100
0110010100
5.
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Every Financial Service is a Big Data Service Ã
Financial services are BIG – Too big to fail à Every financial service is eventually a big data service – Number of transactions – Number of jobs – Third party data
6.
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved How Big is Big Data in Financial? Ã
Millions to billions transactions per day – Hundreds to tens of thousands transactions per second à Big Data in banking, payment, security, etc.
7.
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Big Data Use Case in Financial http://www.forbes.com/sites/bernardmarr/2016/09/09/big-data-in-banking-how-citibank-delivers-real-business-benefits-with-their-data-first-approach/#7759859f75ed
8.
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Why Real-time Analytics in Financial? Can you detect fraud from millions to billions transactions per day in real-time
? “The costs resulting from these anomalies is far easier to correct if spotted quickly – or even before it happens – through predictive modeling. ”
9.
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved 最近気になったニュース http://gendai.ismedia.jp/articles/-/48832 http://mainichi.jp/articles/20161012/k00/00e/040/243000c
10.
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
https://roboteer-tokyo.com/archives/4415
11.
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time Analytics in Financial Use Case, Architecture & Challenging
12.
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved A Simple Use Case –
Real-time Surveillance Detect abnormal transactions in Stock Exchange à Trigger alert if – A customer buy / sell amount exceeds 500M JPY in 3 minutes à 300K transactions per second à Abnormal must be detected within 10s Alert
13.
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time
Surveillance Architecture v1 Trading Data (real-time) Message Bus (Kafka) Enricher (Storm) Aggregator (Storm) Master data, raw & aggregated trade (HBase+Phoenix) Surveillance Rule Engine Surveillance Alerts master data look up Insert trade (raw & aggregated) Architecture v1 how?
14.
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time Data Ingestion Ã
From transaction database – Change Data Capture (CDC) – Not practical for most financial system à From gateway system – Receive data from gateway system – Send data to Kafka (as Kafka producer)
15.
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time
Surveillance Architecture v1 Trading Data (real-time) Message Bus (Kafka) Enricher (Storm) Aggregator (Storm) Master data, raw & aggregated trade (HBase+Phoenix) Surveillance Rule Engine Surveillance Alerts master data look up Insert trade (raw & aggregated) via CDC or gateway Architecture v1 overhead?
16.
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time Data Lookup Ã
Low latency data store – Master data – NoSQL database: HBase (+Phoenix), Redis à Use local Cache – LRU cache in Storm bolts
17.
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time
Surveillance Architecture v2 Trading Data (real-time) Message Bus (Kafka) Enricher (Storm) Aggregator (Storm) Master data, raw & aggregated trade (HBase+Phoenix) Surveillance Rule Engine Surveillance Alerts local master data cache master data look up Insert trade (raw & aggregated) via CDC or gateway Architecture v2 – With Cache exactly-once? exactly-once?
18.
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Exactly-Once? Message delivery semantics Ã
At-most-once: may lose data but no duplication à At-least-once: no data loss, but may duplicate à Exactly-once: no data loss, no duplication
19.
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Exactly-Once! NO real exactly-once message delivery in distributed system Ã
There is no such thing as exactly-once delivery à Exactly-once is an end-to-end requirement But… people like exactly-once, especially in financial service system
20.
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Exactly-once semantics
(better phrase “effectively-once”) with at-least-once + idempotent operations à Kafka & Storm guarantee at-least-once à De-duplicate by ensuring idempotent in your application Effectively-once
21.
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved De-duplication in window computation Most window computations can achieve idempotent Ã
Examples: aggregation, counting, etc. Ã De-duplicate messages in the window – Using local in-memory state store, e.g. a Set class Trading Events in Kafka IDRegistry (local in-memory) 2. lookup trade_id 3. count de-duplicated events 5. output aggregated data Aggregated Trade Data Aggregator (Storm) 4. Insert trade_id 1. Pull data in 3m window
22.
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved De-duplication in non-idempotent computation Ã
Exactly-once in non-idempotent computation – Example: join continuous data streams – Global state store required: HBase, Redis – Batching can help reduce number of ID lookup. Ã Exactly-once is expensive, avoid it at the best Click Logs in Kafka IDRegistry (external NoSQL) 2. Lookup click_id 5. Output joined click Joined Click Logs Joiner (Storm) 4. Insert click_id 1. Pull data continuously Query Logs 3. Lookup query
23.
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real-time
Surveillance Architecture v3 Trading Data (real-time) Message Bus (Kafka) Enricher (Storm) Aggregator (Storm) Master data, raw & aggregated trade (HBase+Phoenix) Surveillance Rule Engine Surveillance Alerts master data look up IDRegistry look up / insert, de-duplicate in window local master data cache IDRegistry (local in-memory) Insert trade (raw & aggregated) via CDC or gateway Architecture v3 – effectively-once order?
24.
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Late Messages http://www.slideshare.net/HadoopSummit/apache-beam-a-unified-model-for-batch-and-stream-processing-data
25.
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Handling Late Messages Ã
Expect late messages – Streaming application needs to handle out of order events, e.g., emits late messages to a special Kafka topic à Use source generated timestamp à Storm’s late message support in window computation (BaseWindowedBolt) – withTimestampField(String fieldName) – withLag(Duration duration)
26.
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Can I trust the data? Duplications! Out of order late messages! Data loss?
27.
27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Monitor Data Processing Pipeline Quality Approaches to monitor data pipeline quality Ã
Audit completeness à Output duplicated and late messages to logs for auditing. à Define service level objective (SLO) of data quality and monitor the SLO.
28.
28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Define Data Processing Pipeline SLO Design practical SLO for the pipeline Ã
Process 99.9999% events within a few seconds à and 100% events within a few hours à At-most-once semantics at any point of time à Near exactly-once semantics in near real-time à And exactly-once semantics eventually
29.
29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved The Rule Engine & The Architecture Hundreds of rules Ã
A stock trading price jump up / down > k% and total amount > m% in K minutes à Single ATM cash withdrawal > k% and number of ATM > m in K minutes Many of these rules fit into this simple architecture! Rule Engine ✓✗? ? ? ✓ Rule base only?
30.
30 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Architecture --
with Predictive Analytics Real-time Surveillance Architecture — with Predicate Engine Trading Data (real-time) Message Bus (Kafka) Enricher (Storm) Aggregator (Storm) Master data, raw & aggregated trade (HBase+Phoenix) Surveillance Rule Engine Surveillance Alerts master data look up IDRegistry look up / insert, de-duplicate in window local master data cache IDRegistry (local in-memory) Insert trade (raw & aggregated) Financial Data Lake Train Machine Learning Model (Spark) load ML model Surveillance Predicate Engine (Storm) via CDC or gateway
31.
31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Lifecycle of Big Data Adoption in Financial Service Industry 2. Business Intelligence Data mining and visualization software that reveals trends and useful information 1. Data Pooling and Processing Connect data and create structure by merging, conditioning streams and archived data 3. Predictive Analytics Automated analytics integrated into workflow that unlock data value and improve profitability Hadoop enabled Big Data Platform Customers typically “Start Small, Think Big”
32.
32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved THANK YOU
Jetzt herunterladen