The document discusses the rise of data science and its potential to change business. It notes that the amount of data being generated is growing exponentially and will soon exceed 40 zettabytes. However, most companies feel overwhelmed by the data they have. Data science uses techniques from many fields to extract meaningful insights from vast amounts of data. It has become a critical business asset for companies in almost every industry. The emergence of data science is enabling real-time, predictive analytics beyond what was previously possible.
A next generation introduction to data science and its potential to change business as we know it
1. 4/3/2014
By Health Symmetric, Inc.
by David Smith
David Smith
President
dsmith@socialcare.com
linkedin.com/in/davidsmithaustin
Good News: Big Data is Sexy
http://dilbert.com/strips/comic/2012-09-05/
“Data Scientist”
• Data Scientist: The Sexiest Job of the 21st Century
Harvard Business Review, October 2012
• The “Hot new gig in town”
O’Reilly report
• “The sexy job in the next 10 years will be statisticians” – Hal Varian,
Google Chief Economist
• Geek Chic – Wall Street Journal – new cool kids on campus
• The future belongs to the companies and people that turn data
into products
• “The human expertise to capture and analyze big data is both the
most expensive and the most constraining factor for most
organizations pursuing big data initiatives” – Thomas Davenport
Data Science as a strategic asset
“85% of eBay’s analytic workload is new and
unknown. We are architected for the unknown.”
Oliver Ratzesberger, eBay
• Data exploration – data as the new oil
The exploration for data, rather than the exploration of data
Uncovering pockets of untapped data
Processing the whole data set, without sampling
eBay’s Singularity platform combines transactional data with
behavioral data, which enabled identification of top sellers and
drove increased revenue from those sellers
Data Science as a strategic asset
“Groupon will not be the first or last organization to
compete and win on the power of data. It’s happening
everywhere.”
Reid Hoffman and James Slavet
Greylock Partners
Data harnessing – data as renewable energy
Harnessing naturally occurring data streams
Like harnessing raw energy to be converted into usable energy
Conversion of raw data into usable data
Big Data Numbers
• How much data is there in the world?
• 800 Terabytes, 2000
• 160 Exabytes, 2006
• 500 Exabytes (Internet), 2009
• 2.7 Zettabytes, 2012
• 35 Zettabytes by 2020
• How much data is generated in ONE day?
• 7 TB, Twitter
• 10 TB, Facebook
Big data: The next frontier for innovation, competition, and productivity
McKinsey Global Institute
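The yearly growth rate implied by these totals can be worked out with a few lines of arithmetic. The sketch below is illustrative only: it assumes the quoted figures are comparable point-in-time totals and uses decimal units (1 Zettabyte = 1,000 Exabytes). The function name and the dictionary are made up for this example.

```python
def compound_annual_growth(start_value: float, end_value: float, years: int) -> float:
    """Return the constant yearly growth rate that turns start_value into end_value."""
    return (end_value / start_value) ** (1 / years) - 1


# Totals quoted on the slide, converted to zettabytes (assumes 1 ZB = 1,000 EB).
ZETTABYTES_BY_YEAR = {2006: 0.160, 2012: 2.7, 2020: 35.0}

# Growth already observed between the 2006 and 2012 figures.
historical = compound_annual_growth(
    ZETTABYTES_BY_YEAR[2006], ZETTABYTES_BY_YEAR[2012], 2012 - 2006
)
# Growth implied by the 2020 projection relative to the 2012 figure.
projected = compound_annual_growth(
    ZETTABYTES_BY_YEAR[2012], ZETTABYTES_BY_YEAR[2020], 2020 - 2012
)

print(f"2006-2012: {historical:.0%} per year")
print(f"2012-2020: {projected:.0%} per year")
```

Both rates compound, so even the lower projected rate still means the total multiplies many times over within the decade.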
Growth at the Edge of the Network
[Chart: Petabytes/Day Global by Year; year axis 2003 to 2011 with a separate 2012 label; vertical axis 0 to 4,000; labels: Converged Content, Traditional Computation]
• Mobile
• Device to Device
• Sensors
• Entertainment
• Smart Home
• Distributed Industrial
• Autos/Trucks
• Smart Toys
Internet of Things
• A system ... that would be able to instantaneously identify any kind of object.
• Network of objects ...
• One major next step in this development of the Internet, which is to progressively evolve from a network of interconnected computers to a network of interconnected objects ...
• From communicating people (Internet) ... to communicating items ...
• From human triggered communication ... to event triggered communication
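The shift from human triggered to event triggered communication can be pictured with a small sketch. Everything in it is hypothetical and not from the deck: the `ThresholdSensor` class, the "freezer-1" device, and the threshold value are made up to show an object that sends a message only when something happens to it.

```python
from typing import Callable, List, Tuple


class ThresholdSensor:
    """Hypothetical connected object that reports only when a reading crosses a threshold."""

    def __init__(self, name: str, threshold: float) -> None:
        self.name = name
        self.threshold = threshold
        self._listeners: List[Callable[[str, float], None]] = []

    def subscribe(self, listener: Callable[[str, float], None]) -> None:
        """Register a callback to be notified when an event fires."""
        self._listeners.append(listener)

    def record(self, reading: float) -> None:
        """Take a reading; notify listeners only if it exceeds the threshold."""
        # No person requests this message: the event itself triggers it.
        if reading > self.threshold:
            for listener in self._listeners:
                listener(self.name, reading)


alerts: List[Tuple[str, float]] = []
sensor = ThresholdSensor("freezer-1", threshold=-10.0)
sensor.subscribe(lambda name, reading: alerts.append((name, reading)))

# Two normal readings produce no traffic; the third one crosses the threshold.
for reading in (-18.0, -17.5, -8.0):
    sensor.record(reading)

print(alerts)  # [('freezer-1', -8.0)]
```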
Tapping into the Data
• Data Storage
• Reporting
• Analytics
• Advanced Analytics
– Computing with big datasets is a fundamentally different challenge than doing “big compute” over a small dataset
[Figure labels: Utilized data; Unutilized data that can be available to business]
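One way to see the difference is that a big dataset usually cannot be held in memory, so the computation has to make a single pass over a stream of records. The sketch below is an illustration under that assumption, not a method from the deck: it uses Welford's one-pass algorithm, and the `readings` generator is a made-up stand-in for a source too large to load at once.

```python
from typing import Iterable, Iterator, Tuple


def streaming_mean_variance(values: Iterable[float]) -> Tuple[int, float, float]:
    """One-pass (Welford) count, mean, and population variance of a stream."""
    count = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for value in values:
        count += 1
        delta = value - mean
        mean += delta / count
        m2 += delta * (value - mean)
    variance = m2 / count if count else 0.0
    return count, mean, variance


def readings(total: int = 100_000) -> Iterator[float]:
    """Hypothetical data source that yields one record at a time."""
    for i in range(total):
        yield float(i % 100)


# The whole stream is processed without sampling and without storing it.
count, mean, variance = streaming_mean_variance(readings())
print(count, mean, variance)
```

Memory use stays constant no matter how many records flow through, which is the property that matters once the dataset outgrows a single machine's memory.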
Some Existing SocialCare Beta Relations
patients: identity, created_at, updated_at, external_id_hash, idx_1, idx_2, data
practice_patients: practice_identity, patient_identity, created_at, updated_at, mrn
patient_soap_notes: identity, practice_identity, patient_identity
practices: identity, name, settings, address, phone, deleted, created_at, updated_at, roles_and_permissions, symptoms, practice_type, practice_sub_type, customization
patient_data_store: patient_identity, classifier, signature, created_by, updated_by, created_epoch, updated_epoch, data
Diagram notes:
JSON data stored in this field as an array. No Postgres queries possible:
• Name
• Address
• Etc.
JSON data stored in this field as an array. No Postgres queries possible:
• Allergies
• SOAP Notes
• Medications
• Etc.
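The practical cost of keeping these values inside an opaque JSON field can be sketched as follows. This is a hedged illustration, not the SocialCare implementation: the standard-library `sqlite3` module stands in for Postgres, only three of the `patient_data_store` columns are used, and the classifier value and allergy data are invented for the example.

```python
import json
import sqlite3

# In-memory stand-in for the database (assumption: sqlite3 instead of Postgres).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE patient_data_store ("
    "patient_identity INTEGER, classifier TEXT, data TEXT)"
)

# Invented sample rows: each allergy list is serialized into the data field.
rows = [
    (1, "allergies", json.dumps(["penicillin", "latex"])),
    (2, "allergies", json.dumps(["peanuts"])),
    (3, "allergies", json.dumps(["latex"])),
]
conn.executemany("INSERT INTO patient_data_store VALUES (?, ?, ?)", rows)


def patients_with_allergy(connection: sqlite3.Connection, allergy: str) -> list:
    """Fetch and decode every allergies row, because the database sees only text."""
    matches = []
    cursor = connection.execute(
        "SELECT patient_identity, data FROM patient_data_store "
        "WHERE classifier = ? ORDER BY patient_identity",
        ("allergies",),
    )
    for patient_identity, blob in cursor:
        # The filter runs in application code, one row at a time.
        if allergy in json.loads(blob):
            matches.append(patient_identity)
    return matches


latex_patients = patients_with_allergy(conn, "latex")
print(latex_patients)  # [1, 3]
```

Every lookup of this kind reads and parses the full set of rows, so no index can help and the cost grows with the number of patients.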
SocialCare Example Objects and Relationships
[Diagram: a graph of objects connected by named relationships]
Objects: Patient #3, Patient #6, Patient #9 (Remote), Physician #1, Physician #2, Physician #3, Physician #10 (Remote), Practice #1, Practice #2, Practice #3, Hospital #1, Lab #1, Lab Request #7, Lab Response #8, Observation #1, SOAP Note #1, Clinical Quality Measures #1, Continuity Of Care #1, Continuity Of Care #2, Xray #1 (Logical ID = 1; Version ID = 1, 2, and 3)
Relationships: Is Primary Care Physician For, Had Test, Works In, Has Sub-practice, Has Quality Measure, Associated With, Made Observation, Had Observation, Annotated Document, Requestor, Subject, Response For, Source, Made Referral, Received Referral
Other labels: Document Store, Patient Registry, Provider Registry, Incoming Referral, Outgoing Referral, Export CCD, Import CCD
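Objects and named relationships like these map naturally onto a labeled graph. The sketch below is a minimal illustration and makes several assumptions: the `Node` and `RelationshipGraph` classes are hypothetical, and the specific pairings (which physician relates to which patient, and so on) are guesses, because the extracted slide text keeps the labels but not which objects each relationship connects.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import DefaultDict, List, Tuple


@dataclass(frozen=True)
class Node:
    """An object in the graph, identified by kind, logical ID, and version ID."""
    kind: str
    logical_id: int
    version_id: int = 1


class RelationshipGraph:
    """Directed graph whose edges carry a relationship label."""

    def __init__(self) -> None:
        self._edges: DefaultDict[Node, List[Tuple[str, Node]]] = defaultdict(list)

    def relate(self, source: Node, label: str, target: Node) -> None:
        self._edges[source].append((label, target))

    def related(self, source: Node, label: str) -> List[Node]:
        """Return the targets reachable from source through edges with this label."""
        return [
            target
            for edge_label, target in self._edges.get(source, [])
            if edge_label == label
        ]


graph = RelationshipGraph()
physician = Node("Physician", 1)
patient = Node("Patient", 6)
# Three versions of the same logical X-ray, as in the diagram labels.
xray_versions = [Node("Xray", 1, version) for version in (1, 2, 3)]

# Assumed pairings, chosen only for illustration.
graph.relate(physician, "Is Primary Care Physician For", patient)
graph.relate(physician, "Works In", Node("Practice", 1))
for xray in xray_versions:
    graph.relate(patient, "Had Test", xray)

latest_xray = max(graph.related(patient, "Had Test"), key=lambda node: node.version_id)
print(latest_xray)
```

Keeping the version ID on the node lets older versions of a document remain in the graph while queries pick the latest one.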