From Fraud Detection to Big Data Platform:
Bringing Hadoop to the Enterprise at
Fiducia & GAD IT AG
Daniel Schmitt & Florian Herrmann
October 25th 2016
About us
9/15/2016, World of Watson 2016
Daniel Schmitt (1985)
• Karlsruhe, Germany
• Business Intelligence Dep. at Fiducia & GAD IT AG
since 2009
Topics
Business Analytics Design and Implementation, Reporting, Planning
and all topics related to Analytics
Experience
Apache Hadoop, Cognos BI, Cognos TM1, GeoInformation Systems,
Cognos Enterprise Planning etc
Florian Herrmann (1988)
• Karlsruhe, Germany
• Database Development Dep. at Fiducia & GAD IT AG
since 2013
Topics
Data Modelling and Database Design for core banking system,
Performance Optimization and in-house consulting for all topics related
to DBs
Experience
Apache Hadoop, Database Systems (DB2-Family, Oracle) etc
1. The Challenge
2. The Solution
3. The Lessons Learned
4. The Blueprint
1. The Challenge
Make the fraudsters shiver
Fiducia & GAD IT AG at a glance
• Computer Center Services
• Integration Platform
• Competence Center
• Leading Banking System
167,000 workstations in banks
6,300m accounting entries per year
79m active accounts
36,000 self-service terminals
550m ATM cash withdrawals per year
Fraud Detection for online banking
Requirements
• Evaluation of all user-initiated online transactions for suspected fraud
• Integration into the core banking system and existing banking processes (Fiducia & GAD is only the service provider, not the owner!)
• Model based on customer behavior
• Flexible system design for fast reaction to new fraud patterns
Millions of transactions per day
Up to 100 transactions per second
Evaluation in less than 100 milliseconds
System adjustment in minutes
Be prepared for new data sources or formats
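As a rough sanity check on these figures, the following minimal sketch applies Little's law (an assumption of this example, not something the slides mention) to estimate how many evaluations are in flight at peak load, and what daily volume a sustained peak would imply. The function names are hypothetical; only the two constants come from the slide.

```python
# Figures from the slide; everything else is illustrative.
PEAK_TPS = 100            # up to 100 transactions per second
LATENCY_BUDGET_MS = 100   # evaluation in less than 100 milliseconds


def max_in_flight(tps, latency_ms):
    """Little's law: items in the system = arrival rate * time in system."""
    return tps * latency_ms / 1000


def daily_volume_at_peak(tps):
    """Upper bound on daily volume if the peak rate were sustained all day."""
    return tps * 60 * 60 * 24


print("Concurrent evaluations at peak:", max_in_flight(PEAK_TPS, LATENCY_BUDGET_MS))
print("Transactions per day at sustained peak:", daily_volume_at_peak(PEAK_TPS))
```

Under these assumptions, roughly ten evaluations run concurrently at peak, and a sustained peak would stay within "millions of transactions per day".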
Transaction handling
• Evaluate Transaction
• Accounting and Forwarding
Fraud Detection System
• Development of evaluation models
• Storage of all transactions
• Evaluation in milliseconds
• Flexible adjustment
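The split between transaction handling and the fraud detection system can be sketched as follows. This is a toy illustration, not the authors' model: the scoring rule (deviation from the customer's usual amounts), the threshold, the in-memory store, and the "held for review" outcome are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from statistics import mean, pstdev


@dataclass
class CustomerProfile:
    """Hypothetical per-customer history used as a behavior model."""
    amounts: list = field(default_factory=list)

    def score(self, amount):
        """Deviation of this amount from the customer's usual amounts."""
        if len(self.amounts) < 2:
            return 0.0  # not enough history to judge
        spread = pstdev(self.amounts)
        if spread == 0:
            return 0.0 if amount == self.amounts[0] else float("inf")
        return abs(amount - mean(self.amounts)) / spread


class FraudDetectionSystem:
    def __init__(self, threshold=3.0):
        self.threshold = threshold  # can be changed at runtime (flexible adjustment)
        self.profiles = {}          # in-memory stand-in for the transaction store

    def evaluate(self, customer_id, amount):
        profile = self.profiles.setdefault(customer_id, CustomerProfile())
        suspicious = profile.score(amount) > self.threshold
        profile.amounts.append(amount)  # every transaction is stored
        return suspicious


def handle_transaction(fds, customer_id, amount):
    """Transaction handling: evaluate first, then account and forward."""
    if fds.evaluate(customer_id, amount):
        return "held for review"  # assumed outcome; the slides do not specify it
    return "accounted and forwarded"


fds = FraudDetectionSystem()
for amount in [50, 60, 55, 45, 52]:
    handle_transaction(fds, "customer-1", amount)
print(handle_transaction(fds, "customer-1", 5000))
```

Note that this sketch also stores suspicious transactions in the profile, which would skew later scores; a real system would need a deliberate policy for that.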
2. The Solution
One elephant to rule them all
The Solution
Velocity: Real-time evaluation of incoming data. Access to large data volumes within milliseconds.
Variety: Transactional data alone won't be enough in the foreseeable future.
Volume: Store millions of transaction details each day, over years.
Flexibility: Quick response to changing fraud patterns. Integration of complex data structures. Model development based on current events.
Data Access: Pig, Spark, Hive, Storm, Phoenix, HBase
Governance: Sqoop, Kafka, Flume
Security: Ranger, Knox
Operations: Oozie, ZooKeeper, Ambari
Hadoop & YARN
Architecture diagram (components shown): Cognos BI, Fidoop Gateway, Big SQL, Kafka, Core Banking System, Storm, Realtime Processing, Datasources, R-Studio, Hive, HBase, Spark, Jobs, …, Java App, Ambari, Knox, Ranger, …, Hadoop (IOP)
Potential Use Cases
Big Data Platform (Master, Worker)
• Fraud Detection
• Use case 2
• Use case 3
3. The Lessons Learned
What a year with the elephant taught us
The Lessons Learned
Hadoop is complex
- one has to manage things like hardware configuration, network architecture, disk sizes, security and more
- even acquiring the development skills can take a lot of time (not to mention understanding a distributed system)
- there are a lot of components to get used to
Support is a key to success
Support means
- vendor support
- external support
- (and maybe) internal support
Even standard tasks can require a lot of effort or cause a deadlock
- the advantage of fast feature availability comes at a price
- some features are theoretically available but not enterprise-ready
- Hadoop is not an “out-of-the-box” tool
Open source within a distribution comes at a price
Advantages:
stability, component interoperability, easy installation, support …
The price:
- Seeing issues fixed in a project but not yet in your distribution can be frustrating
- Bugs and feature requests are difficult to handle because both a distributor and an open source project are involved
New technologies require a change of thinking
- a distribution of open source projects isn’t a single-vendor tool
- establishing a distributed platform can require new processes or procedures
- sometimes building up a new thing can help to get rid of old junk
Establishing Hadoop as a platform generates significant initial costs
Costs:
- hardware: for a cluster you have to buy servers
- software support: open source is free but not “for free”
- external support: if you don’t have all the skills (and you’ll need a lot)
- integration: a new platform has to be integrated properly
4. The Blueprint
How to get the elephant started
The Blueprint
Take a simple use case (if possible)
Related lesson: Hadoop is complex
Use as few components as possible
Related lessons: Support is a key to success; Hadoop is complex
In the beginning, start with a security setup that is as simple as possible
Related lesson: Hadoop is complex
Try to be agile in development, as building up a platform will be complicated
Related lesson: Even standard tasks can require a lot of effort or cause a deadlock
Be sure to have good management support for budget decisions and escalations
Related lessons: Establishing Hadoop as a platform generates significant initial costs; New technologies require a change of thinking
Concentrate on the relevant parts and avoid too much additional effort where possible (build tools etc.)
Related lesson: Establishing Hadoop as a platform generates significant initial costs
Plan for training time and bug fixing
Related lessons: Even standard tasks can require a lot of effort or cause a deadlock; Hadoop is complex
Thank You
