This document outlines an event held by Dolead and Google in July 2016. It discusses the history of big data, how to define big data, and how organizations can find value in data. Specifically, it describes how data volume, velocity, and variety have increased dramatically in recent years. It also provides examples of how metrics can be used to measure the impact of new product features and how data-driven decisions can boost business performance. The overall goal is to explain how organizations can maximize the potential of large, diverse, and fast-growing data sources.
The Big Data Revolution, by Hadrien Baradel @ "Play with Data" event by Dolead x Google
1. DOLEAD & GOOGLE EVENT – JUL 2016
WWW.DOLEAD.COM
Play with Data (1)
Dolead & Google
2. DOLEAD & GOOGLE EVENT – JUL 2016
Good news: our brain’s memory capacity is 10 times larger than we thought
3. DOLEAD & GOOGLE EVENT – JUL 2016
Good news: our brain’s memory capacity is 10 times larger than we thought
Basically, enough to hold the whole Internet
Source: Salk Institute, 20 January 2016
4. DOLEAD & GOOGLE EVENT – JUL 2016
Plan
Hadrien Baradel, Product Manager at Dolead
A short history of Big Data
How to define Big Data
How do we find value in data
5. DOLEAD & GOOGLE EVENT – JUL 2016
WHAT IS DATA?
DATA = EVENT + CONTEXT
VALUE-DRIVING DATA = EVENT + CONTEXT + METRICS
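One way to read these two equations, as a minimal sketch: an event is something that happened, the context is the set of attributes that describe it, and a metric is a number computed over many such records. The `Record` class, its field names, the `click_through_rate` helper and the sample values below are hypothetical illustrations, not something taken from the talk.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    """DATA = EVENT + CONTEXT (hypothetical field names)."""
    event: str                                   # what happened, e.g. "ad_click"
    context: dict = field(default_factory=dict)  # who, when, where, ...


def click_through_rate(records):
    """A metric turns raw records into one number a decision can rely on."""
    impressions = sum(1 for r in records if r.event == "ad_impression")
    clicks = sum(1 for r in records if r.event == "ad_click")
    return clicks / impressions if impressions else 0.0


# Made-up sample records, for illustration only.
records = [
    Record("ad_impression", {"channel": "search", "day": "2016-07-01"}),
    Record("ad_impression", {"channel": "search", "day": "2016-07-01"}),
    Record("ad_click", {"channel": "search", "day": "2016-07-01"}),
]
ctr = click_through_rate(records)
print(f"click-through rate: {ctr:.0%}")
```

The records alone are only data; it is the metric computed on top of them that makes them value-driving.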
6. DOLEAD & GOOGLE EVENT – JUL 2016
A SHORT HISTORY of Big Data
7. DOLEAD & GOOGLE EVENT – JUL 2016
IT ALL BEGINS WITH INFORMATION AND LIBRARIES
300 BCE - 48 AD: the Library of Alexandria is the world’s largest data storage center
8. DOLEAD & GOOGLE EVENT – JUL 2016
IS BIG DATA REALLY NEW?
“Information Explosion”
A term first used in 1941 (according to the Oxford English Dictionary)
9. DOLEAD & GOOGLE EVENT – JUL 2016
IT ALL BEGINS WITH INFORMATION AND LIBRARIES
1944 - Fremont Rider speculates that by 2040 the Yale Library will contain 200 million books, stored on 6,000 miles of shelves
10. DOLEAD & GOOGLE EVENT – JUL 2016
JUST MISSED SOMETHING
1991 - The birth of the Internet
11. DOLEAD & GOOGLE EVENT – JUL 2016
IS BIG DATA REALLY NEW?
1989: “Big Data”
Early use of the term in a magazine article by the author Erik Larson, commenting on advertisers’ use of data to target customers
12. DOLEAD & GOOGLE EVENT – JUL 2016
IS BIG DATA REALLY NEW?
2010, Eric Schmidt:
“As much data is now being created every two days as was created from the beginning of human civilization to the year 2003”
13. DOLEAD & GOOGLE EVENT – JUL 2016
IS BIG DATA REALLY NEW?
2015: information doubles
More data has been created in the past two years than in the entire previous history of the human race
14. DOLEAD & GOOGLE EVENT – JUL 2016
CHARACTERIZATION
of Big Data
15. DOLEAD & GOOGLE EVENT – JUL 2016
BIG DATA DEFINITION: THE 3 Vs
Volume: data quantity
Velocity: data speed
Variety: data types
16. DOLEAD & GOOGLE EVENT – JUL 2016
WHAT IS A ZETTABYTE?
1 zettabyte =
1 000 exabytes
1 000 000 petabytes
1 000 000 000 terabytes
1 000 000 000 000 gigabytes
1 terabyte = 250 DVDs
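The unit ladder can be checked with a few lines of arithmetic. The sketch below assumes decimal (SI) prefixes, where each step is a factor of 1 000, and reuses the slide's figure of 250 DVDs per terabyte; the names `UNITS`, `BYTES_PER_UNIT` and `zettabyte_in` are made up for this example.

```python
# Decimal (SI) prefixes: each step up the ladder is a factor of 1 000.
UNITS = ["gigabyte", "terabyte", "petabyte", "exabyte", "zettabyte"]
BYTES_PER_UNIT = {name: 10 ** (9 + 3 * i) for i, name in enumerate(UNITS)}

DVDS_PER_TERABYTE = 250  # figure from the slide (implies about 4 GB per DVD)


def zettabyte_in(unit):
    """Number of `unit`s in one zettabyte, as an exact integer."""
    return BYTES_PER_UNIT["zettabyte"] // BYTES_PER_UNIT[unit]


# Walk down the ladder from exabytes to gigabytes.
for unit in reversed(UNITS[:-1]):
    print(f"1 zettabyte = {zettabyte_in(unit):,} {unit}s")

dvds_per_zettabyte = zettabyte_in("terabyte") * DVDS_PER_TERABYTE
print(f"1 zettabyte = {dvds_per_zettabyte:,} DVDs")
```

With the slide's DVD figure, one zettabyte corresponds to 250 billion DVDs.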
17. DOLEAD & GOOGLE EVENT – JUL 2016
HOW BIG IS BIG DATA?
[Chart: size of total data, enterprise-managed data and enterprise-created data. Source: IDC]
18. DOLEAD & GOOGLE EVENT – JUL 2016
HOW BIG IS BIG DATA?
2010: 10 gigabytes
Today: 500 terabytes / day
Today: 240 terabytes / flight
19. DOLEAD & GOOGLE EVENT – JUL 2016
HOW FAST IS BIG DATA?
20. DOLEAD & GOOGLE EVENT – JUL 2016
Structured data? An example
Before:
Structured data
Generated by companies
Updated every month
SQL
21. DOLEAD & GOOGLE EVENT – JUL 2016
Unstructured data?
After:
Unstructured data
Generated by users
Real time
NoSQL
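The contrast between the two models can be sketched in a few lines. This is only an illustration under assumptions: the table name, the field names and the sample values are made up, SQLite stands in for "SQL", and plain JSON documents stand in for what a NoSQL document store would hold (strictly speaking, JSON is semi-structured rather than fully unstructured).

```python
import json
import sqlite3

# Structured: every row must fit a schema declared up front.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE monthly_sales (month TEXT, product TEXT, revenue REAL)")
conn.execute("INSERT INTO monthly_sales VALUES ('2016-06', 'widget', 1200.0)")
rows = conn.execute("SELECT month, product, revenue FROM monthly_sales").fetchall()
conn.close()

# Unstructured (here: semi-structured): each document carries its own shape.
documents = [
    json.loads('{"user": "anna", "type": "comment", "text": "Great talk!"}'),
    json.loads('{"user": "ben", "type": "photo", "tags": ["event", "data"]}'),
]
# The set of fields is only known after looking at the documents themselves.
fields_seen = sorted({key for doc in documents for key in doc})

print(rows)
print(fields_seen)
```

In the first case the shape of the data is decided before anything is stored; in the second it has to be discovered afterwards, which is part of what makes user-generated, real-time data harder to exploit.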
22. DOLEAD & GOOGLE EVENT – JUL 2016
THIS EXPLOSION OF DECENTRALISED DATA MEANS
[Chart, 2009 to 2014: unstructured file-based data storage vs. structured block-based data storage. Source: IDC]
Before: structured data, generated by companies, updated every month, SQL
23. DOLEAD & GOOGLE EVENT – JUL 2016
THIS EXPLOSION OF DECENTRALISED DATA MEANS
[Chart, 2009 to 2014: unstructured file-based data storage vs. structured block-based data storage. Source: IDC]
After: unstructured data, generated by users, real time, NoSQL
24. DOLEAD & GOOGLE EVENT – JUL 2016
HOW DO WE FIND VALUE
in Big Data
25. DOLEAD & GOOGLE EVENT – JUL 2016
HOW IS IT GROWING?
Data production will be 44 times greater in 2020 than it was in 2009
70% of data is created by users, 80% is held by companies
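To make the "44 times" figure easier to picture, it can be converted into a yearly rate. The sketch below assumes a constant growth rate across the 11 years from 2009 to 2020, which is a simplification and not a claim made in the talk; the variable names are illustrative.

```python
GROWTH_FACTOR = 44     # from the slide: 2020 volume divided by 2009 volume
YEARS = 2020 - 2009    # 11 yearly steps

# Constant yearly multiplier r such that r ** YEARS == GROWTH_FACTOR.
yearly_multiplier = GROWTH_FACTOR ** (1 / YEARS)
yearly_growth_pct = (yearly_multiplier - 1) * 100

print(f"implied growth: about {yearly_growth_pct:.0f}% per year")
```

Under that assumption, data production grows by roughly 41% per year, which means it doubles about every two years.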
26. DOLEAD & GOOGLE EVENT – JUL 2016
DATA FOR ORGANISATIONS AND BUSINESSES: YOU ALREADY HAVE DATA
Internalized Data
27. DOLEAD & GOOGLE EVENT – JUL 2016
DATA FOR ORGANISATIONS AND BUSINESSES: YOU ALREADY HAVE DATA
External and non-structured data
Internalized data
28. DOLEAD & GOOGLE EVENT – JUL 2016
DATA FOR ORGANISATIONS AND BUSINESSES: YOU ALREADY HAVE DATA
External and non-structured data
Internalized data
External structured data
29. DOLEAD & GOOGLE EVENT – JUL 2016
DATA FOR ORGANISATIONS AND BUSINESSES: YOU ALREADY HAVE DATA
External and non-structured data
Internalized data
External structured data
Data for organisations and businesses
30. DOLEAD & GOOGLE EVENT – JUL 2016
YOU HAVE A LOT OF DECISIONS TO MAKE
Marketing channel
Budget
Targeting
New product development
31. DOLEAD & GOOGLE EVENT – JUL 2016
USING DATA IS GOOD FOR YOUR BUSINESS
64% of the companies that invest in “analytics” outperform the others on average (S&P 500)
32. DOLEAD & GOOGLE EVENT – JUL 2016
HOW DO WE FIND VALUE IN DATA
DATA IS NOT THE GOAL
33. DOLEAD & GOOGLE EVENT – JUL 2016
How do we know that we made a great feature?
“If you want to be a long-term success, build a great product”
Sam Altman, Y Combinator
34. DOLEAD & GOOGLE EVENT – JUL 2016
How do we track metrics? And how can we be sure that all departments use the same metrics?
35. DOLEAD & GOOGLE EVENT – JUL 2016
What is a key metric? How do we choose it?
36. DOLEAD & GOOGLE EVENT – JUL 2016
What is a key metric? How do we choose it?
37. DOLEAD & GOOGLE EVENT – JUL 2016
What we had before the new feature
38. DOLEAD & GOOGLE EVENT – JUL 2016
What we got with the ability to apply keywords to a group
39. DOLEAD & GOOGLE EVENT – JUL 2016
And what we have learned
Opportunity Type          < 1 w    1        2        3        4        5
Add Keywords              85.71%   42.86%   28.57%   28.57%   14.29%   14.29%
Feb 23rd, 2016 - Apr 11th, 2016

Opportunity Type          < 1 w    1        2        3        4        5
Add Adgrouping Keywords   88.24%   58.82%   47.06%   47.06%   41.18%   35.29%
Apr 13th, 2016 - Jun 18th, 2016
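The two rows are easier to compare as differences. The sketch below copies the percentages from the slide and subtracts them column by column; reading the columns as successive weeks, and the percentages as the share of users still using the feature, is an assumption, since the slide does not label them.

```python
PERIODS = ["< 1 w", "1", "2", "3", "4", "5"]

# Percentages copied from the two rows of the slide.
before = [85.71, 42.86, 28.57, 28.57, 14.29, 14.29]  # "Add Keywords"
after = [88.24, 58.82, 47.06, 47.06, 41.18, 35.29]   # "Add Adgrouping Keywords"

# Difference in percentage points for each period.
uplift = [round(a - b, 2) for a, b in zip(after, before)]

for period, points in zip(PERIODS, uplift):
    print(f"{period:>6}: {points:+.2f} points")
```

The gap is small in the first column and widens in the later ones, which is the pattern a metric like this is meant to surface when judging whether a feature was worth building.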
40. DOLEAD & GOOGLE EVENT – JUL 2016
TO CONCLUDE: A BIG DATA DEFINITION
Technology
Maximising computation power and algorithmic accuracy to gather, analyse, link and compare large data sets
41. DOLEAD & GOOGLE EVENT – JUL 2016
TO CONCLUDE: A BIG DATA DEFINITION
Technology
Maximising computation power and algorithmic accuracy to gather, analyse, link and compare large data sets
Analysis
Identify patterns
42. DOLEAD & GOOGLE EVENT – JUL 2016
TO CONCLUDE: A BIG DATA DEFINITION
Technology
Maximising computation power and algorithmic accuracy to gather, analyse, link and compare large data sets
Analysis
Identify patterns
Mythology?
The widespread belief that large data sets offer a higher form of intelligence and knowledge that can generate insights that were previously impossible, with the aura of truth, objectivity and accuracy
43. DOLEAD & GOOGLE EVENT – JUL 2016
DOLEAD: We make digital advertising easy