8. 8
Vision Processes are not purely transactional but more and more data- and
information driven
Focus Best-in-class strategy
Domains Advanced Data Analytics
Data Integration & Data Quality
Master Data Management
Business Intelligence
Awards FD Gazelle 2011, 2012, 2013 & 2014
Team JDA curious & can do specialist ranging from front-end developers, hard core
statisticians, mathemagitions, AI experts, MBA/ big four consulting and PHD’s.
We work with
(amongst other)
HDFS
Introduction
Who & What is Jibes….
10. “TODO”
Data Analytics is about analyzing data with the objective to gain new insights and
understand patterns and levers in data and put these to use for in achieving results
and/ or create commercial value.
• Objective: find meaningful insights and knowledge from data
• Results translated into models
• Data mining = the process of modeling
What methods and technology do we use?
•Data Visualization
•Machine learning
•Complex Event Processing
•Process mining
•Advanced text mining & analytics
•Predictive modeling
•Statistical analysis
•Entity extraction
•Natural language processing
What enablers?:
•HDFS
•Graph databases
•Libraries
Our focus areas & passion:
•Competitive Intelligence
•Lead Generation & funnel
management
•Competitive Pricing
•(micro) segmentatie
•Sentiment & koopmotieven
•Campagne frameworks
•R&D/ product ontwikkeling
•Klant & service management
JDA: passionate about topline initiatives & bottom line results
12. Introduction: ‘Big Data ?’
12
Skepticism & reality – it’s complicated…yet doable
Transaction based &
after the fact reporting…
…to real world complexity &
More data dimensions
13. Some statements (1/3): data types
13
Data at rest:
•Stored in databases or file
systems.
•Relatively stabile/ static
•Currenty most big-data
applications are aimed at
data at rest
Data in motion:
•Real-time data flows
•This data is not (always)
stored
•Examples:
• Sensor data
• Phone calls
• Network traffic
• Production process
• …
15. Some statements (3/3): crawl, walk & run
15
DescriptivePredictivePrescriptive
Time
Complexity
Cognitive
Descriptive: (median, frequency
spread, standard deviation,
trends)
Declarative: cross
reference tables,
hypothesis testing,
variance analysis,
percentiles, pareto
Predictive: cluster analysis,
correlations, regression,
discriminants, factor, CHAID,
neural network, associative
techniques etc)
16. Data Evolution
16
Machine generated data. Where does it come from (and where does it go?)
Machine and man generated data
•More sensors in systems
•More online and interconnected
•Structured data is relatively uniform and
small in size – challenge is in integration
•Unstructured data is more difficult to
capture and interpret
We store everything but what do we
do with it? + Who dares to through
something away?
17. The world - today…
17
Process orientated – focus on efficiency & standards based processing
What do we do with all unstructured data?
How to deal with complexity
en diversity?
•Account orientation?
•Pre-defined procedures and
processflow
•Long(er) lead times
•Limited agility in the process to
adapt to dialogue
•Procedure driven communication
(instead of objective driven =
process driven)
•Missed opportunities?
• A more dynamic customer dialogue, self-service
and omni-channel
• Customer specific information and dialogue
• Faster and better transactions (higher service level,
higher customer satisfaction and lower cost)
18. The world of tomorrow…
18
Key principles:
•Network effects !
•Profiling vs. Privacy: TRUST
•Data integrity & quality of data
•Focus & perseverance
•Intelligent interactions
Its about INSIGHTS en ACT accordingly
19. The world of tomorrow…
19
Consistent approach to gain customer insights and behaviour:
•Start small with a pilot group
•Work on data quality
•Expand target group
•Add functionality
•Data integration
•Analytics!
•Create a feedback loop!!
•Loyalty
•Profit
Example TESCO design for the future
A local example: Ahold
• New Loyalty system (app)
• Integration BOL, Albert & AH
Theme now: ‘the Customer Journey’
20. The container ‘Big Data’; a break down
What is different in ‘Big Data’?
New technology which enables massive parallel data processing in
distributed systems and highly scalable platforms. Examples:
• Apache Hadoop
• MapReduce
• NoSQL databases
Increased capabilities analytical tooling:
•Emergence of “data science”
•Predictive Analytics
•Data Mining
The emergence of data democracy:
• data more easily available (e.g. Cloud Computing)
• Business in the driver seat and not IT and/ or BI
Big Data? a new generation of software – better price/ performance
22. • Vacations (based on foreign ATM’s): an upward
trend
• New unemployment benefits per month. Note
an upward trend in January 2013.
• Salarisincrease & –decrease (April 2013):clearly
more decrease than increase.
Advanced Data Analytics
22
Banking: life events from transaction data understand and model
Key analysis to detect and understand life events:
• Salary increase and decrease
• Benefits
• Retirement
• Vacation money,bonus
• Vakantion
This is based on (anonimised) transaction data
onl, so not using text descroptions. This enables
very accuarate spotting and modelling of
events.
Life events ontransactional data
23. Many customers show regular and predictable
behaviour that is suitable for modeling.
Predicts balance with a reliability interval (red
lines)
Advanced Data Analytics
23
Banking: from understanding and modeling to prediction
• By using time series methods we can predict
balances for a large number of accounts.
• We can predict very reliable for over 70% of the
accounts.
• These methods can also be used to spot life
events that show strong iregularities from the
predicted course of action.
Predict account balance
Outlyers are easy to spot using statistical
methods.
24. The broadcasting model is losing ground to 2nd
screen, on
demand and pay TV.
Advanced Data Analytics
24
Media: from linear broadcasting to online on demand
25. Advanced Data Analytics
25
Other applications
1. BOL: cost reduction and high performance processing
• Lower cost due to hybrid platform (Hadoop and Oracle)
• More real time processing power to feed the recommendation engine
• Better pre-processing of data for analysts (higher productivity)
• Higher conversion, better margins
2. Scania: advanced mobility and fleet management
• Integrated and connected systems (data from trucks to central database
connected with history and other dimensions (geo, weather, demographics)
• Profiles; what is the best approach considering the circumstances
• Route support
• Predictive maintenance based on type of drivers, damage and fuel
consumption
3. UMCU: connect clinical data and research data (now 2 worlds):
• Better and bigger datasets through text analytics
• Build a bigger data catalogue of related illnesses and treatments to design
profiles and understand patterns (more data dimensions)
• Better statistical models and predictive models
Data Obesit?
27. Advanced Data Analytics &
27
Many data dimensions – How to make it consumable? Data Visualization
Instant Insights
28. Data Visualization
28
Data Visualisation; not new but of paramount importance
In 1869 Charles Joseph Minard (1781 – 1870) made this infographic about
Napoleons military campaign from Paris to Moscow and back depicting the losses of
Napoleons army in 1812. The campaign starts at the Polish border and the thick
band represents the size of the army en route to Moscow per geography, the thin
dark band shows the way back which also includes a timescale and a temperature.
30. 30
Not just visualization but also dynamic navigation
Instant Insights™
Jibes Instant Insights™ - Analytics as a Service
Powered by
Jibes Instant Insights; de cloud
oplossing voor SAS® Visual Analytics &
SAS® Visual Statistics.
•Een abonnement op Analytics
•100% variabele kosten
•Software direct beschikbaar in de cloud
•Eenvoudig data uploaden, visualiseren
en analyseren
•Advies & realisatie door Jibes Data
Scientists
•Visueel, geschikt voor ad hoc reporting &
Self service BI
•Toepasbare bewezen statistiek & eigen
modellen
•Geen aanschaf van een licentie en
hardware
•Niet zelf modellen bouwen en beheren
•24x7, anytime, any place (browser
based)
•Abonnementen vanaf 1.250,- per maand
•Secure (ISO 27001, NEN 7510)
•Schaalbaar & binnen een dag op te
leveren
•Per maand op te zeggen
38. Go data driven
38
Data is better than opinions: measure users
T/I = %tried of % invited (user interest)
Rn/T= %of retained users after n weeks (user retention)
Rn+1/Rn= slope of user gain / loss over time (user loyalty)
This is the trend
we need for succes
This is where products die
41. Wrapping up
Use (text) analytics to combine
structured and unstructured data
Analytics is a cycle, not an end point
Define new processes and built
Smart apps on new insights
Visualization is Big Data’s best friend
42. Wrapping up
Visit us at
D 05
or www.jibes.nl
or pvdhulst@jibes.nl
Tel. 06 3466 0007
Hinweis der Redaktion
Binnen 5 jaar zal het gele olifantje, genaamd Hadoop, de wereld van data-opslag beheersen…
Hadoop is een gedistribueerde omgeving voor data opslag en processing, veelal gebruikmakend van commodity, lage kosten, hardware.
…zal het volledig geaccepteerd zijn om voorspellende analytics direct en geautomatiseerd tot acties om te zetten zodat ze veel meer ‘voorschrijvend’ worden dan alleen voorspellend.