2. BIG DATA
Some Overall Concepts
• The “Four V”s of Big Data (Satry Malladi – Chief Architect, Stubhub.com)
– Variety - Sources of data. Can be structured or unstructured
– Volume – Amount of data:
• The world economy produces 2.5 exabytes of data per day. Eighty percent is unstructured, largely untapped
with regards to potential. – Bob Picciano, SVP IBM Information and Analytics Group (1 EB = 1 Billion GB)
– Velocity – Rate at which data changes
– Veracity – Data Integrity
3. BIG DATA
Some Overall Concepts
• The “Four V”s of Big Data (Satry Malladi – Chief Architect, Stubhub.com)
– Variety - Sources of data. Can be structured or unstructured
– Volume – Amount of data:
• The world economy produces 2.5 exabytes of data per day. Eighty percent is unstructured, largely untapped
with regards to potential. – Bob Picciano, SVP IBM Information and Analytics Group (1 EB = 1 Billion GB)
– Velocity – Rate at which data changes
– Veracity – Data Integrity
• Evolution of Analytics (International Institute for Analytics {IIA})
– Analytics 1.0 - descriptive analytics that come from small sets of internal, structured data. Resulting
reports tend to stay within IT departments, away from decision makers, and look back, not ahead.
– Analytics 2.0 - complex, large and unstructured data sets; products (not reports) that make
information readily accessible
• (Analytics 2.0) represents the heart and soul of big data startup activity . . . but remains largely confined to
Silicon Valley – Thomas Davenport, Director of Research, IIA.
– Analytics 3.0 - bridges traditional analytics and big data, using "rapid, agile insight delivery" to put
analytics tools at the point of decision.
• Examples include LinkedIn's "People You May Know" and "Jobs You May Be Interested In" features.
4. BIG DATA
Opportunities
• Health Care Providers
– Kaiser using Big Data to Improve patient care: http://bit.ly/1lIHzdf
• Government Agencies
– Federal Healthcare Execs bullish on big data solutions: http://ubm.io/NUFLBw
• Big Pharma
– Big Pharma ~ Big Data Collaboration Models: http://onforb.es/1ea4kqv
• BioMed Research
– Big Data Analytics in Biomedical Research: http://bit.ly/1d2fLjq
5. BIG DATA
Challenges
• Security / Compliance
– 30 million data breaches in health care since 2009
– 96% of all health care institutions will have data breach
Sanjay Joshi – CTO , EMC “Big Data, Big Risk”, Data360 Conference , Mountain View, CA 4/02/14
• Data Complexity
– Data sources range from traditional structured inputs to capturing devices, sensors and
mobile applications (among others)
– Genomic information cheaper and easier to collect
– Common EHR data can include billing codes, lab results, prescription info and clinical
notes
– Data not standardized globally
“Big Data Analytics for Health Care”, SIAM Int’l Conference on Data Mining, Austin, TX 2013
6. BIG DATA
Other Resources
• IBM Big Data Library: http://bit.ly/1nHukvd
• McKinsley Big Data Report April ’13: http://bit.ly/1cSYy7i
• SIAM Big Data Analytics for Healthcare Presentation: http://bit.ly/1lqI09y
• Ted Talk on Big Data Collection: http://bit.ly/1gqrow5
• Webinar - Analytics 3.0: Opportunities for Healthcare (requires
registration): http://bit.ly/1g4cOyr