SlideShare a Scribd company logo
1 of 28
Datapreneurs
&
Building Data Centric Business
http://www.slideshare.net/ssood/datapreneurs
linkedin.com/in/sureshsood
@soody
Areas for Discussion
1.) Data Driving Trends – Big Data & Machine Engineering
2.) Building Data Centric Business
3.) Future of Professions
4.) Talent Scarcity
5.) Democratisation of Data Science
http://www.marketingdistillery.com/2014/11/29/is-data-science-a-buzzword-modern-data-scientist-
defined/
2020 Global Data Forecast (Bytes)
2020 estimates suggest four times more digital data than all the grains of sand on Earth
Source: Pg. 4, Building a Digital Analytics Organization: Create Value by Integrating Analytical Processes,
Technology, and People into Business Operations by Judah Phillips, FT Press, 30 Jul 2013
Data Origination Trends
1) Mobile = multiple sensors comprising camera, microphone, GPS, accelerometer
2) Sensors everywhere including “eyes in the sky” via drones, satellites and roads
3) Online customer interactions generate IP addresses, time, geocode, page visits
4) Large scale data curation e.g. Airbnb, Google (Art, Street View, N Gram, Gdelt** ),
Guardian, Million Songs, OpenCalifornia, Openflights, OpenStreetMap,
Planethunters, Pandora, Shazam, Wikipedia
5) Data fusion e.g. LA Times Homicide Blog using coroner reports
6) Reviews e.g. Tripadvisor, Amazon, Yelp
7) Open Data e.g. data.gov and O Data protocol (http://www.odata.org/)
**The GDELT Project pushes the boundaries of “big data,” weighing in at over a quarter-billion rows with
59 fields for each record, spanning the geography of the entire planet, and covering a time horizon of
more than 35 years. GDELT is the largest open-access database on human society in existence. Its
archives contain nearly 400M latitude/longitude geographic coordinates spanning over 12,900 days,
making it one of the largest open-access spatio-temporal datasets as well.
Internet of Things “trillion sensors”
Source: www.tsensorssummit.org
Black Box Insurance
• Big data transforms actuarial insurance from using probability methods to estimate premiums into dynamic risk management
using real data generating individually tailored premiums
• Estimate 20 km work or home journey, data point acquired every min and journey captures 12 points per km. Assume 1000 km
per month driving or generating 12,000 points per month resulting in 144,000 points per car/annum. Hence, 1,000 cars leads to
144 million points per annum.
• Telematics technology (black box) monitor helps assess the driving behavior and prices policy based on true driver centric
premiums by capturing:
– Number of journeys
– Distances travelled
– Types of roads
– Speed
– Time of travel
– Acceleration and braking
– Any accidents
– Location ?
• Benefits low mileage, smooth and safe drivers
• Privacy vs. Saving monies on insurance (Canada ; http://bit.ly/Black_box)
The ANZ Heavy Traffic Index comprises flows
of vehicles weighing more than 3.5 tonnes
(primarily trucks) on 11 selected roads
around NZ. It is contemporaneous with GDP
growth.
The ANZ Light Traffic Index is made up of
light or total traffic flows (primarily cars and
vans) on 10 selected roads around the
country. It gives a six month lead on GDP
growth in normal circumstances (but cannot
predict sudden adverse events such as the
Global Financial Crisis).
http://www.anz.co.nz/about-us/economic-markets-research/truckometer/
ANZ TRUCKOMETER
http://tacocopter.com/
New Sources of Information (Big data) : Social Media + Internet of Things
Innovations
7,919 40,204
2,003,254,102 51
Gridded Data Sources
Variety of Data Types & Big Data Challenge
1. Astronomical
2. Documents
3. Earthquake
4. Email
5. Environmental sensors
6. Fingerprints
7. Health (personal) Images
8. Graph data (social network)
9. Location
10.Marine
11.Particle accelerator
12.Satellite
13.Scanned survey data
14.Sound
15.Text
16.Transactions
17.Video
Big Data consists of extensive datasets primarily in the characteristics of
volume, variety, velocity, and/or variability that require a scalable
architecture for efficient storage, manipulation, and analysis.
Computational portability is the movement of the computation to the location of the data.
HadoopConfigurations(SingleandMulti-Rack)
Adapted from: http://stackiq.com/
Cluster manager e.g. Apache Ambari, Apache Mesos, or Rocks
3 TB drives ,18 data nodes
configuration represents 648 TB
of raw storage HDFS standard
replication factor of 3
216 TB of usable storage
Name/secondary/data nodes – 6 core 96 GB
Management node – 4 core 16 GB
Computer
Data
Program
Output
Computer
Data
Output
Algorithms
Traditional Computing Paradigm & Machine Engineering
Machine learning is a scientific discipline that deals with the construction
and study of algorithms that can learn from data. Such algorithms operate
by building a model based on inputs and using that to make predictions or
decisions, rather than following only explicitly programmed instructions.
http://en.wikipedia.org/wiki/Machine_learning
8 Steps Towards Building the Data Centric Business
1. Put digital service (Vargo & Lusch) at centre of business blurring distinction with
physical products via sensors and apps
2. Identify data and monetisation opportunities using business model canvas
3. Select unique sources of data to help drive innovation
4. Uses data to drive interactions and customer experiences
5. Understand the data lifecycle from creation to storage
6. Value extraction from data (economic or social)
7. Review patterns of big data businesses
8. Got on top of big data technology trends and analytics software
Netflix – A Picture of A Data Driven Company
• ~75 million users
• 8.5 million events per second
• Zero loss?
• 550 billion events per day
• Hundreds of event types
• 1.3 PB/day
• 21GB /sec (peak)
• 37% of peak US internet bandwidth
• Operates on Amazon Web Services
Source : http://techblog.netflix.com/2016/02/evolution-of-netflix-data-pipeline.html
• Next generation radio telescope
• 100 x more sensitive & 1,000,000 X faster
• 5 square km of dish over 3000 km
• Two sites: Western Australia & Karoo Desert RSA
• Worlds most ambitious IT Project
• First real exascale ready application
• Largest global big-data challenge
• SKA SDP exascale systems:
• 100,000 nodes
• 800 cabinets
• consume 20 MW
• Expected failure rates of 300 nodes per week
Square Kilometre Array
http://www.ska.gov.au/
The Future of the Professions
(Susskind & Susskind 2015)
– Tax and audit work replaced by computer assisted techniques
– Technology automating and innovating
– Accounting work reconfiguring
– New business models
– Move from bespoke to “off the peg”
– Mastery of data with new tools and techniques - Big Data
– Diversification
– Shift to proactivity from reactivity
– Professionals replaced by less expert people and high performing systems
– Post-professional society expertise available online
The Future of the Professions How Technology Will Transform the Work of Human Experts, Richard Susskind and Daniel Susskind
(2015)
Adapted From:
The Future of the Professions How Technology Will Transform the Work of Human Experts, Richard Susskind and Daniel Susskind
(2015)
A HEALTH
KNOWLEDGE
COMMONS
New knowledge used
differently: people
managing their own health
information, personalising
their care and creating
new kinds of health
knowledge
https://www.nesta.org.uk/sites/default/files/the-nhs-in-2030.pdf
The Future of the Professions How Technology Will Transform the Work of Human Experts, Richard Susskind and Daniel
Susskind (2015)
We’re sitting on a big data time bomb
Catastrophic loss of transparency. Few IT professionals
have experience managing big data platforms at scale
— a situation that has created a massive skills
shortage in the industry. By 2018, U.S. companies will
be short 1.5 million managers able to make data-
based decisions. A recent McKinsey Quarterly report
estimates that, in order to close this gap, companies
would need to spend 50 percent of their data and
analytics budget on training frontline managers; it also
notes that few companies realize this need.
Source: CAMERON SIM, CREWSPARK, OCTOBER 24, 2015
http://venturebeat.com/2015/10/24/were-sitting-on-a-big-data-time-bomb/
Australia/NZ needs “30,000 data savvy managers by 2018”
• This statement derives from the McKinsey (2011) study “a shortage of talent necessary for organizations
to take advantage of big data. By 2018, the United States requires a talent pool of 140-190,000 deep
analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big
data to make effective decisions”.
• Taking 2% of the US economy as a rule of thumb, in 2018 Australia will require another 30,000 managers
or analysts. However, the shortage commences well before 2018. These numbers do not accommodate
the training of managers or analysts for overseas destinations.
• Another 2011 study from EMC Corporation interviewed nearly 500 data science and business intelligence
professionals globally. Two-thirds of the informants believe demand will outpace supply and 30% from
disciplines outside of computer science. Additionally, the study found the biggest obstacle to data science
as being education and training.
• A 2012 study “Data Equity: Unlocking the value of big data” commissioned by SAS UK and conducted by
the Centre for Economics and Business Research, an independent business research consultancy, found
unlocking big data leads to adding another 58,000 jobs to the UK economy (2012-2017).
• Gartner (2012) estimates by 2015 4.4 Million IT Jobs will be created globally to Support big data or 1.9
million jobs in the United Sates alone.
• Closer afield, the 2013 Hudson study “Tackling the Big Data Challenge” found 78% of the Australian
research informants, “believe organisations do not have the skills and competencies to successfully
undertake a big data project.
• Building on the McKinsey (2011), Gartner (2012) and Hudson (2013) estimations, Australia and the world
requires 3 distinct but related skills. Most specifically, the demand is very strong for data savvy managers
conversant with big data practice.
Australian Demand 2018:
-Deep skills 3,800
- Technology 8,000
- Data Savvy 30,000
India’s high demand for big data workers contrasts
with scarcity of skilled talent
The talent deficit is on two fronts, said Velamakanni: data
scientists who can perform analytics, and analytics consultants
who can understand and use the data. The first, big data
engineers and scientists, are extremely scarce. "In the second
category, we need better quality, and India is going to be short
of a million data consultants soon," he said.
Source: India's high demand for big data workers contrasts with scarcity of
skilled talent, Saritha Rai, June, 2014, http://www.techrepublic.com/article/indias-
high-demand-for-big-data-workers-contrasts-with-scarcity-of-skilled-talent/
Google Trends Worldwide, Australia and New Zealand - Accounting + Analytics
January 2004 - September 2015
Worldwide
Australia
New
Zealand
'The Predictive Accountant’ and Data Centric Practice
1. Data savvy
3. Focus shifts from being reactive to proactive and predictive
4. Leverages accounting data and predictive analytics software to find patterns in data and
insights
5. Uses the tools and dashboards to predict client scenarios before time: maximising
opportunity, limiting risks and proactively advising.
6. Accountant benefit from analytics by adding value when connecting with client
challenges and opportunities to identified customer patterns. Sharing these insights
delivers more value in the accounting conversations and helps tackle the real business
problems facing clients.
The Predictive Accountant Portal: Democratisation of Data Science
The Predictive Accountant Data Sources
Predictive
Analytics
Excel style
dashboard
Connected Practice
Digital Marketing / eNewsletters/ Integrated
business tools software
Apps Marketplace
Accounting Analytic Apps
Education
Analytic Training
“I had come to an entirely erroneous
conclusion which shows, my dear
Watson, how dangerous it is to
reason from insufficient data.”
The Adventure of the Speckled Bird

More Related Content

What's hot

Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data Presentation
Matthew Urdan
 

What's hot (20)

Beyond dashboards
Beyond dashboardsBeyond dashboards
Beyond dashboards
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
NewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big DataNewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big Data
 
Big data characteristics, value chain and challenges
Big data characteristics, value chain and challengesBig data characteristics, value chain and challenges
Big data characteristics, value chain and challenges
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
 
Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data Presentation
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Big data
Big dataBig data
Big data
 
Big Data and Computer Science Education
Big Data and Computer Science EducationBig Data and Computer Science Education
Big Data and Computer Science Education
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Personalized News and Video Recomendation System at LinkSure
Personalized News and Video Recomendation System at LinkSurePersonalized News and Video Recomendation System at LinkSure
Personalized News and Video Recomendation System at LinkSure
 
The promise and challenge of Big Data
The promise and challenge of Big DataThe promise and challenge of Big Data
The promise and challenge of Big Data
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
 
Bigdata analytics
Bigdata analyticsBigdata analytics
Bigdata analytics
 
How to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraHow to design ai functions to the cloud native infra
How to design ai functions to the cloud native infra
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan
 
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven RamageGeospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 

Viewers also liked (13)

Interview workshop guide oct 15
Interview workshop guide oct 15Interview workshop guide oct 15
Interview workshop guide oct 15
 
Calculating the Cost of IT Downtime for Law Firms
Calculating the Cost of  IT Downtime for Law FirmsCalculating the Cost of  IT Downtime for Law Firms
Calculating the Cost of IT Downtime for Law Firms
 
[AU SPC 2011] Backup Restore SharePoint 2010
[AU SPC 2011] Backup Restore SharePoint 2010[AU SPC 2011] Backup Restore SharePoint 2010
[AU SPC 2011] Backup Restore SharePoint 2010
 
Lenymat07
Lenymat07Lenymat07
Lenymat07
 
Bullying
BullyingBullying
Bullying
 
CV-DEBASIS_NARENDRA
CV-DEBASIS_NARENDRACV-DEBASIS_NARENDRA
CV-DEBASIS_NARENDRA
 
Artying05
Artying05Artying05
Artying05
 
Artying10
Artying10Artying10
Artying10
 
Tugas Ekonomi Teknik
Tugas Ekonomi TeknikTugas Ekonomi Teknik
Tugas Ekonomi Teknik
 
Evaluation question 3 ppt
Evaluation question 3 pptEvaluation question 3 ppt
Evaluation question 3 ppt
 
Microprocessor lab
Microprocessor labMicroprocessor lab
Microprocessor lab
 
iOS Multithreading
iOS MultithreadingiOS Multithreading
iOS Multithreading
 
Minimal standard c program
Minimal standard c programMinimal standard c program
Minimal standard c program
 

Similar to Datapreneurs

Information technology by Hiresh Ahluwalia
Information technology by Hiresh AhluwaliaInformation technology by Hiresh Ahluwalia
Information technology by Hiresh Ahluwalia
333jack333
 

Similar to Datapreneurs (20)

QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
future2020
future2020future2020
future2020
 
Smart Data Module 6 d drive the future
Smart Data Module 6 d drive the futureSmart Data Module 6 d drive the future
Smart Data Module 6 d drive the future
 
Identifying the new frontier of big data as an enabler for T&T industries: Re...
Identifying the new frontier of big data as an enabler for T&T industries: Re...Identifying the new frontier of big data as an enabler for T&T industries: Re...
Identifying the new frontier of big data as an enabler for T&T industries: Re...
 
Industrial internet big data usa market study
Industrial internet big data usa market studyIndustrial internet big data usa market study
Industrial internet big data usa market study
 
Industrial internet big data usa market study
Industrial internet big data usa market studyIndustrial internet big data usa market study
Industrial internet big data usa market study
 
Big data and Internet
Big data and InternetBig data and Internet
Big data and Internet
 
Smart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart dataSmart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart data
 
Unlocking Value of Data in a Digital Age
Unlocking Value of Data in a Digital AgeUnlocking Value of Data in a Digital Age
Unlocking Value of Data in a Digital Age
 
Business with Big data
Business with Big dataBusiness with Big data
Business with Big data
 
Big Data Expo 2015 - IBM 5 predictions
Big Data Expo 2015 - IBM 5 predictionsBig Data Expo 2015 - IBM 5 predictions
Big Data Expo 2015 - IBM 5 predictions
 
National Big Data Analytics (BDA) Initiative - //bina/ 2014 conference
National Big Data Analytics (BDA) Initiative - //bina/ 2014 conferenceNational Big Data Analytics (BDA) Initiative - //bina/ 2014 conference
National Big Data Analytics (BDA) Initiative - //bina/ 2014 conference
 
Data sharing between private companies and research facilities
Data sharing between private companies and research facilitiesData sharing between private companies and research facilities
Data sharing between private companies and research facilities
 
Impact of big data on DCMI market
Impact of big data on DCMI marketImpact of big data on DCMI market
Impact of big data on DCMI market
 
Data-Driven Innovation & Competitive Advantage
Data-Driven Innovation & Competitive AdvantageData-Driven Innovation & Competitive Advantage
Data-Driven Innovation & Competitive Advantage
 
Information technology by Hiresh Ahluwalia
Information technology by Hiresh AhluwaliaInformation technology by Hiresh Ahluwalia
Information technology by Hiresh Ahluwalia
 
Big Data Seminar At Broadridge
Big Data Seminar At BroadridgeBig Data Seminar At Broadridge
Big Data Seminar At Broadridge
 
Big data seminar at Broadridge
Big data seminar at BroadridgeBig data seminar at Broadridge
Big data seminar at Broadridge
 
Big Data seminar BR-new
Big Data seminar BR-newBig Data seminar BR-new
Big Data seminar BR-new
 

More from suresh sood

Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016
suresh sood
 
Australian Business Culture
Australian Business Culture Australian Business Culture
Australian Business Culture
suresh sood
 
Transforming instagram data into location intelligence
Transforming instagram data into location intelligenceTransforming instagram data into location intelligence
Transforming instagram data into location intelligence
suresh sood
 

More from suresh sood (20)

Getting to the Edge of the Future - Tools & Trends of Foresight to Nowcasting
Getting to the Edge of the Future - Tools & Trends of Foresight to NowcastingGetting to the Edge of the Future - Tools & Trends of Foresight to Nowcasting
Getting to the Edge of the Future - Tools & Trends of Foresight to Nowcasting
 
Bigdata AI
Bigdata AI Bigdata AI
Bigdata AI
 
Bigdata ai
Bigdata aiBigdata ai
Bigdata ai
 
Data Science Innovations
Data Science InnovationsData Science Innovations
Data Science Innovations
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversation
 
Data science Innovations January 2018
Data science Innovations January 2018Data science Innovations January 2018
Data science Innovations January 2018
 
Data science innovations
Data science innovations Data science innovations
Data science innovations
 
Swarm jobs
Swarm jobsSwarm jobs
Swarm jobs
 
Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016Netnography online course part 1 of 3 17 november 2016
Netnography online course part 1 of 3 17 november 2016
 
Foresight Analytics
Foresight AnalyticsForesight Analytics
Foresight Analytics
 
Datainnovation
DatainnovationDatainnovation
Datainnovation
 
Bigdatahuman
BigdatahumanBigdatahuman
Bigdatahuman
 
Bigdataforesight
BigdataforesightBigdataforesight
Bigdataforesight
 
DBIA
DBIADBIA
DBIA
 
Australian Business Culture
Australian Business Culture Australian Business Culture
Australian Business Culture
 
Cool Tools
Cool Tools Cool Tools
Cool Tools
 
Transforming instagram data into location intelligence
Transforming instagram data into location intelligenceTransforming instagram data into location intelligence
Transforming instagram data into location intelligence
 
Crowdsourcing Social Media
Crowdsourcing Social Media Crowdsourcing Social Media
Crowdsourcing Social Media
 
Crowdsourcing co creation and ideation
Crowdsourcing co creation and ideationCrowdsourcing co creation and ideation
Crowdsourcing co creation and ideation
 
Analytic innovation transforming instagram data into predicitive analytics wi...
Analytic innovation transforming instagram data into predicitive analytics wi...Analytic innovation transforming instagram data into predicitive analytics wi...
Analytic innovation transforming instagram data into predicitive analytics wi...
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Recently uploaded (20)

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

Datapreneurs

  • 1. Datapreneurs & Building Data Centric Business http://www.slideshare.net/ssood/datapreneurs linkedin.com/in/sureshsood @soody
  • 2. Areas for Discussion 1.) Data Driving Trends – Big Data & Machine Engineering 2.) Building Data Centric Business 3.) Future of Professions 4.) Talent Scarcity 5.) Democratisation of Data Science
  • 3.
  • 5. 2020 Global Data Forecast (Bytes) 2020 estimates suggest four times more digital data than all the grains of sand on Earth Source: Pg. 4, Building a Digital Analytics Organization: Create Value by Integrating Analytical Processes, Technology, and People into Business Operations by Judah Phillips, FT Press, 30 Jul 2013
  • 6. Data Origination Trends 1) Mobile = multiple sensors comprising camera, microphone, GPS, accelerometer 2) Sensors everywhere including “eyes in the sky” via drones, satellites and roads 3) Online customer interactions generate IP addresses, time, geocode, page visits 4) Large scale data curation e.g. Airbnb, Google (Art, Street View, N Gram, Gdelt** ), Guardian, Million Songs, OpenCalifornia, Openflights, OpenStreetMap, Planethunters, Pandora, Shazam, Wikipedia 5) Data fusion e.g. LA Times Homicide Blog using coroner reports 6) Reviews e.g. Tripadvisor, Amazon, Yelp 7) Open Data e.g. data.gov and O Data protocol (http://www.odata.org/) **The GDELT Project pushes the boundaries of “big data,” weighing in at over a quarter-billion rows with 59 fields for each record, spanning the geography of the entire planet, and covering a time horizon of more than 35 years. GDELT is the largest open-access database on human society in existence. Its archives contain nearly 400M latitude/longitude geographic coordinates spanning over 12,900 days, making it one of the largest open-access spatio-temporal datasets as well.
  • 7. Internet of Things “trillion sensors” Source: www.tsensorssummit.org
  • 8. Black Box Insurance • Big data transforms actuarial insurance from using probability methods to estimate premiums into dynamic risk management using real data generating individually tailored premiums • Estimate 20 km work or home journey, data point acquired every min and journey captures 12 points per km. Assume 1000 km per month driving or generating 12,000 points per month resulting in 144,000 points per car/annum. Hence, 1,000 cars leads to 144 million points per annum. • Telematics technology (black box) monitor helps assess the driving behavior and prices policy based on true driver centric premiums by capturing: – Number of journeys – Distances travelled – Types of roads – Speed – Time of travel – Acceleration and braking – Any accidents – Location ? • Benefits low mileage, smooth and safe drivers • Privacy vs. Saving monies on insurance (Canada ; http://bit.ly/Black_box)
  • 9. The ANZ Heavy Traffic Index comprises flows of vehicles weighing more than 3.5 tonnes (primarily trucks) on 11 selected roads around NZ. It is contemporaneous with GDP growth. The ANZ Light Traffic Index is made up of light or total traffic flows (primarily cars and vans) on 10 selected roads around the country. It gives a six month lead on GDP growth in normal circumstances (but cannot predict sudden adverse events such as the Global Financial Crisis). http://www.anz.co.nz/about-us/economic-markets-research/truckometer/ ANZ TRUCKOMETER
  • 10. http://tacocopter.com/ New Sources of Information (Big data) : Social Media + Internet of Things Innovations 7,919 40,204 2,003,254,102 51 Gridded Data Sources
  • 11. Variety of Data Types & Big Data Challenge 1. Astronomical 2. Documents 3. Earthquake 4. Email 5. Environmental sensors 6. Fingerprints 7. Health (personal) Images 8. Graph data (social network) 9. Location 10.Marine 11.Particle accelerator 12.Satellite 13.Scanned survey data 14.Sound 15.Text 16.Transactions 17.Video Big Data consists of extensive datasets primarily in the characteristics of volume, variety, velocity, and/or variability that require a scalable architecture for efficient storage, manipulation, and analysis. Computational portability is the movement of the computation to the location of the data.
  • 12. HadoopConfigurations(SingleandMulti-Rack) Adapted from: http://stackiq.com/ Cluster manager e.g. Apache Ambari, Apache Mesos, or Rocks 3 TB drives ,18 data nodes configuration represents 648 TB of raw storage HDFS standard replication factor of 3 216 TB of usable storage Name/secondary/data nodes – 6 core 96 GB Management node – 4 core 16 GB
  • 13. Computer Data Program Output Computer Data Output Algorithms Traditional Computing Paradigm & Machine Engineering Machine learning is a scientific discipline that deals with the construction and study of algorithms that can learn from data. Such algorithms operate by building a model based on inputs and using that to make predictions or decisions, rather than following only explicitly programmed instructions. http://en.wikipedia.org/wiki/Machine_learning
  • 14. 8 Steps Towards Building the Data Centric Business 1. Put digital service (Vargo & Lusch) at centre of business blurring distinction with physical products via sensors and apps 2. Identify data and monetisation opportunities using business model canvas 3. Select unique sources of data to help drive innovation 4. Uses data to drive interactions and customer experiences 5. Understand the data lifecycle from creation to storage 6. Value extraction from data (economic or social) 7. Review patterns of big data businesses 8. Got on top of big data technology trends and analytics software
  • 15. Netflix – A Picture of A Data Driven Company • ~75 million users • 8.5 million events per second • Zero loss? • 550 billion events per day • Hundreds of event types • 1.3 PB/day • 21GB /sec (peak) • 37% of peak US internet bandwidth • Operates on Amazon Web Services Source : http://techblog.netflix.com/2016/02/evolution-of-netflix-data-pipeline.html
  • 16. • Next generation radio telescope • 100 x more sensitive & 1,000,000 X faster • 5 square km of dish over 3000 km • Two sites: Western Australia & Karoo Desert RSA • Worlds most ambitious IT Project • First real exascale ready application • Largest global big-data challenge • SKA SDP exascale systems: • 100,000 nodes • 800 cabinets • consume 20 MW • Expected failure rates of 300 nodes per week Square Kilometre Array http://www.ska.gov.au/
  • 17. The Future of the Professions (Susskind & Susskind 2015) – Tax and audit work replaced by computer assisted techniques – Technology automating and innovating – Accounting work reconfiguring – New business models – Move from bespoke to “off the peg” – Mastery of data with new tools and techniques - Big Data – Diversification – Shift to proactivity from reactivity – Professionals replaced by less expert people and high performing systems – Post-professional society expertise available online
  • 18. The Future of the Professions How Technology Will Transform the Work of Human Experts, Richard Susskind and Daniel Susskind (2015)
  • 19. Adapted From: The Future of the Professions How Technology Will Transform the Work of Human Experts, Richard Susskind and Daniel Susskind (2015) A HEALTH KNOWLEDGE COMMONS New knowledge used differently: people managing their own health information, personalising their care and creating new kinds of health knowledge https://www.nesta.org.uk/sites/default/files/the-nhs-in-2030.pdf
  • 20. The Future of the Professions How Technology Will Transform the Work of Human Experts, Richard Susskind and Daniel Susskind (2015)
  • 21. We’re sitting on a big data time bomb Catastrophic loss of transparency. Few IT professionals have experience managing big data platforms at scale — a situation that has created a massive skills shortage in the industry. By 2018, U.S. companies will be short 1.5 million managers able to make data- based decisions. A recent McKinsey Quarterly report estimates that, in order to close this gap, companies would need to spend 50 percent of their data and analytics budget on training frontline managers; it also notes that few companies realize this need. Source: CAMERON SIM, CREWSPARK, OCTOBER 24, 2015 http://venturebeat.com/2015/10/24/were-sitting-on-a-big-data-time-bomb/
  • 22. Australia/NZ needs “30,000 data savvy managers by 2018” • This statement derives from the McKinsey (2011) study “a shortage of talent necessary for organizations to take advantage of big data. By 2018, the United States requires a talent pool of 140-190,000 deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions”. • Taking 2% of the US economy as a rule of thumb, in 2018 Australia will require another 30,000 managers or analysts. However, the shortage commences well before 2018. These numbers do not accommodate the training of managers or analysts for overseas destinations. • Another 2011 study from EMC Corporation interviewed nearly 500 data science and business intelligence professionals globally. Two-thirds of the informants believe demand will outpace supply and 30% from disciplines outside of computer science. Additionally, the study found the biggest obstacle to data science as being education and training. • A 2012 study “Data Equity: Unlocking the value of big data” commissioned by SAS UK and conducted by the Centre for Economics and Business Research, an independent business research consultancy, found unlocking big data leads to adding another 58,000 jobs to the UK economy (2012-2017). • Gartner (2012) estimates by 2015 4.4 Million IT Jobs will be created globally to Support big data or 1.9 million jobs in the United Sates alone. • Closer afield, the 2013 Hudson study “Tackling the Big Data Challenge” found 78% of the Australian research informants, “believe organisations do not have the skills and competencies to successfully undertake a big data project. • Building on the McKinsey (2011), Gartner (2012) and Hudson (2013) estimations, Australia and the world requires 3 distinct but related skills. Most specifically, the demand is very strong for data savvy managers conversant with big data practice.
  • 23. Australian Demand 2018: -Deep skills 3,800 - Technology 8,000 - Data Savvy 30,000
  • 24. India’s high demand for big data workers contrasts with scarcity of skilled talent The talent deficit is on two fronts, said Velamakanni: data scientists who can perform analytics, and analytics consultants who can understand and use the data. The first, big data engineers and scientists, are extremely scarce. "In the second category, we need better quality, and India is going to be short of a million data consultants soon," he said. Source: India's high demand for big data workers contrasts with scarcity of skilled talent, Saritha Rai, June, 2014, http://www.techrepublic.com/article/indias- high-demand-for-big-data-workers-contrasts-with-scarcity-of-skilled-talent/
  • 25. Google Trends Worldwide, Australia and New Zealand - Accounting + Analytics January 2004 - September 2015 Worldwide Australia New Zealand
  • 26. 'The Predictive Accountant’ and Data Centric Practice 1. Data savvy 3. Focus shifts from being reactive to proactive and predictive 4. Leverages accounting data and predictive analytics software to find patterns in data and insights 5. Uses the tools and dashboards to predict client scenarios before time: maximising opportunity, limiting risks and proactively advising. 6. Accountant benefit from analytics by adding value when connecting with client challenges and opportunities to identified customer patterns. Sharing these insights delivers more value in the accounting conversations and helps tackle the real business problems facing clients.
  • 27. The Predictive Accountant Portal: Democratisation of Data Science The Predictive Accountant Data Sources Predictive Analytics Excel style dashboard Connected Practice Digital Marketing / eNewsletters/ Integrated business tools software Apps Marketplace Accounting Analytic Apps Education Analytic Training
  • 28. “I had come to an entirely erroneous conclusion which shows, my dear Watson, how dangerous it is to reason from insufficient data.” The Adventure of the Speckled Bird