This is an updated slide set based on my ACFE presentation in 2011. The idea is to present a concept to use Data Analytics in Fraud Investigations. For more information feel free to contact me via www.corma.de.
3. Jörn Weber
Certified Fraud Investigator
19 years experience - German law
enforcement
since1999 Managing Partner at
corma GmbH:
Solution provider
Partner for corporate security
About me
3
4. About corma GmbH
4
Stops suspects by:
analytical investigations
operative investigations
Saves time by:
online research
online monitoring
Increases efficiency &
saves money by:
data analytics
global intelligence
solutions
6. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
6
15. 1. Chain of Custody
• Record all your steps
Software: Hunchly https://www.hunch.ly/
Plain document
• Store original data in a secure area
• Create “digital fingerprints”: MD5 Hash
• Work only with a copy of the original data
corma Workflow
15
16. 2. Identify data format
• Research
http://www.file-extensions.org
http://www.filext.com
http://www.fileinfo.com
.gpi
.bqy
.blb
Understanding data
16
Garmin Point of Interest file
BrioQuery database file
ACT! database file
17. 2. Identify data format
• View (read only)
http://www.uvviewsoft.com
Understanding data
17
18. 2. Identify data format
• Deep view (editable)
http://www.ultraedit.com
Understanding data
18
19. 3. From raw data to smart structured data
Understanding data
19
Develop first ideas for analytical approach
22. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
22
23. Challenges
High data quality required for good
analysis results
Constantly increasing data quantity
Cleansing/Standardizing data
23
25. Why should data be cleansed:
Reliable analysis results are required
Saves time that otherwise would come
up during the analysis process
Reduces unwanted deviations &
variations
Identify entities (person, organization,
address)
Insights often lead to further findings
Cleansing/Standardizing data
25
26. Fast and flexible handling of large
quantities of data
Flexible import for various data sources
Intuitive research
Analyses, calculations, statistics
Business Intelligence
Ad-hoc reporting
26
Solution
27. Combine different data formats
Fix data quality issues
Identify missing data
Better link analysis results
Application of different tools for
standardized data cleansing
27
Solution
29. 29
Benefits
Develop workflow for recurring
processes
Standardize processes (templates)
Benefits:
Time saving
Flexible
Maximize effectiveness
Team “compatibility”
Easy to learn
30. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
30
39. Address verification – service
provider or software for large amounts of
data
AddressDoctor
http://www.addressdoctor.com
Experian
http://www.qas-experian.com.au
Data validation & enrichment
39
40. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
40
44. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
44
45. Analytics … yes … but structured:
Identify needed analytical steps
Develop „questions“ to data
What has prompted the need for the
analysis?
What is the key question that needs to be
answered?
„Create“ evidence out of data
What can you prove?
What do you want to prove?
Visualize your thinking!
Analyzing data
45
46. Analytical techniques
Chronologies and timelines (understand
timing and sequence of events)
Sorting (categorizing & hypothesis
generation)
Ranking, scoring, prioritizing (determine
which items are most important)
Network analysis – analyze relationships
between entities (people, organizations,
objects)
Analyzing data
46
47. Supporting tools:
Documenting processes in intranet/wiki
Select the right tool for each task
Train the users
Keep the users “busy”
Analyzing data
47
48. Query - an investigative question,
converted into database search
Analysis Sample: i2 IBM
48
55. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
55
58. Workflow
Understanding data
Cleansing / Standardizing data
Data validation & enrichment
Importing data
Analyzing data
Reporting
Monitoring
What are “Smart Solutions”?
58
Reasons for success:
team of investigators and analysts
sharing corma workflow with you
What is out there?
Which data am I looking at?
How much data?
Timeframe?
What do I see?
Which patterns emerge?
i.e. customer & transactional data
Company & address data
Product data
What seems to be missing?
Imagine what happens if you receive this data
Summary of what I have seen
Is it productive/yielding result?
Does it make sense?
Clarify best ideas how to interrogate with this data
Research
www.filext.com Database of file extensions and various programs that use them
http://www.file-extensions.org/filetype/extension/name/database-files
View (read only)
www.uvviewsoft.comAdvanced file viewer for wide range of formats, various plug-ins support additional formats
Deep view (editable)
UltraEdit is a flexible text, HTML, PHP and JavaScript editor:Integrated file viewer, unicode, …
Notes: Details kommen später
Notes: Details kommen später
Notes: schnelle Ergebnisse bei schlechter Datenqualität & großen Datenmengen
Notes:
Unicode issues
Phone numbers (in rot Mobilfunknummer, 0 Pflicht)
Address data
Handout:
Hidden control characters in raw data
Notes: That is why a reliable tool is needed InfoZoom (Übergang zum nächsten Slide)
Warranty fraud sample
Was?
Wie?
Beispiel Einzel vs. Batch
Export aus IZ – Upload – Re-import
Beispiel Einzel vs. Batch
Export aus IZ – Upload – Re-import
Beispiel Einzel vs. Batch
Export aus IZ – Upload – Re-import
Handout: Address Verification Software
Verify, correct and standardize international postal addresses using our Web Service, Online Products or API / Software Library.
Unique identifier
Schaden nr
Vertrags nr
Handout: check list
Handout: mit Details
Welche Alternative haben wir dazu anzubieten?
Timelinemaker, den Herr Moritz hat?
Bitte den mal fragen.