SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Big Data Analytics:
Facts and Feelings
Seth Grimes
Alta Plana Corporation
@sethgrimes
TDWI – Washington DC
June 21, 2013
Big Data Analytics: Facts and Feelings
Theses
• We gain knowledge when we make
connections.
• Data analysis is a process of connection
discovery.
• The more data, the greater the possibilities.
• The more data, the greater the need to filter,
reduce, and contextualize.
• Timeliness counts.
Big Data Analytics: Facts and Feelings
The World of Big Data
Machine data (e.g., logs, sensor outputs,
clickstreams).
Actions, interactions, and transactions:
geolocation and time.
Profiles: individual, demographic & behavioral.
Text, audio, images, and video.
Facts and feelings.
Big Data Analytics: Facts and Feelings
A 3-slide reprise...
Big Data Analytics: Facts and Feelings
Imperatives for the 2010s:
Do more with more.
“It’s Not Information Overload. It’s Filter
Failure”: Clay Shirky, 2008.
• More sources & types of data.
• Greater data volumes.
• New hardware and methods.
Automate more, more intelligently.
• Analytics.
• Semantics.
Engage. Socialize.
Big Data Analytics: Facts and Feelings
I see three categories of data:
1. Quantities, whether measured, observed, or
computed.
2. Content, which I’ll characterize as non-
quantitative information.
3. Metadata (semantic & structural) describing
quantities and content.
• Our concern is content, analytics & fusion.
• Structured/unstructured is a false
dichotomy.
• Where do relationships fit?
Big Data Analytics: Facts and Feelings
http://www.businessweek.com/magazine/content/04_19/b3882029_mz072.htm
En route.
Big Data Analytics: Facts and Feelings
End of reprise. So, is this Big Data?
Big Data Analytics: Facts and Feelings
Of course not. It’s a number, not data.
Size (Volume) is only one Big Data factor.
Other factors (standard definition) are
Velocity and Variety.
I reject reVisionist 3Vs extensions such as:
Variation/Variability.
Veracity.
Value.
These factors are the province of analytics.
Big Data Analytics: Facts and Feelings
Gary King, Harvard Univ. –
“Big Data isn’t about the data. It’s about
analytics.”
Me –
Analytics is a collection of tools and
techniques that extract insights from data.
I’d argue –
The Value in Big Data is in content, patterns,
and connections, derived via analytics.
Big Data Analytics: Facts and Feelings
Variability is an interpretive property. I say –
The sense of Big Data is in context and
intent…of both the data producer and the
data consumer, captured in metadata and
(also) derived via analytics.
Big Data Analytics: Facts and Feelings
As for Veracity, data is data. Consider:
“The Iraqi regime… possesses and produces
chemical and biological weapons.” –
George W. Bush, October 7, 2002.
Big Data Analytics: Facts and Feelings
Data… and more. Is this Big Data?
No, it’s a screen of aggregated query results.
Big Data Analytics: Facts and Feelings
The Big Data is behind it.
http://www.newyorker.com/online/blogs/culture/2012/05/google-knowledge-graph.html
Big Data Analytics: Facts and Feelings
And behind comparably-scaled/structured
systems.
Big Data Analytics: Facts and Feelings
Comparably-
scaled/structured
systems?
http://www.cambridgesemantics.com/semantic-
university/semantic-search-and-the-semantic-web
Big Data Analytics: Facts and Feelings
Graphs model language models relationships.
Big Data Analytics: Facts and Feelings
Another view, using an old GATE image.
http://gate.ac.uk/hamish/talks/ibot-slidy.html
Big Data Analytics: Facts and Feelings
… for instance, social language.
Big Data Analytics: Facts and Feelings
Text analytics applies natural-language
processing (NLP) techniques to discern –
Entities
Relationships
Context
Identity
– and get at the sense of “unstructured”
online, social, and enterprise information.
Semantic identity unites data of all types.
Big Data Analytics: Facts and Feelings
http://searchuserinterfaces.com/
Sensemaking:
“It is convenient to divide the entire
information access process into two
main components: information
retrieval through searching and
browsing, and analysis and
synthesis of results. This broader
process is often referred to in the
literature as sensemaking.
Sensemaking refers to an iterative
process of formulating a
conceptual representation from of
a large volume of information.”
– Marti Hearst, 2009
Big Data Analytics: Facts and Feelings
Intelligent computing – sensemaking –
involves:
Big (and little) Data.
• Quantities.
• Content.
• Metadata.
Analytics.
Semantics.
Integration.
Facts and feelings.
Big Data Analytics: Facts and Feelings
Feelings: Sentiment detection, classification.
Big Data Analytics: Facts and Feelings
http://techpresident.com/news/21618/pol
itico-facebook-sentiment-analysis-bogus
Big Data Analytics: Facts and Feelings
“Sentiment analysis is the task of identifying positive
and negative opinions, emotions, and evaluations.”
-- Wilson, Wiebe & Hoffman, 2005, “Recognizing
Contextual Polarity in Phrase-Level Sentiment
Analysis”
“Sentiment analysis or opinion mining is the
computational study of opinions, sentiments and
emotions expressed in text… An opinion on a feature
f is a positive or negative view, attitude, emotion or
appraisal on f from an opinion holder.”
-- Bing Liu, 2010, “Sentiment Analysis and Subjectivity,” in
Handbook of Natural Language Processing
Big Data Analytics: Facts and Feelings
Sentiment may be of interest at multiple
levels.
Corpus / data space, i.e., across multiple sources.
Document.
Statement / sentence.
Entity / topic / concept.
Human language is noisy and chaotic!
Jargon, slang, irony, ambiguity, anaphora,
polysemy, synonymy, etc.
Context is key. Discourse analysis comes into
play.
Big Data Analytics: Facts and Feelings
Emotion and effect.
Big Data Analytics: Facts and Feelings
Emotion and understanding.
Big Data Analytics: Facts and Feelings
Prediction/Feeling/Wish... and Intent.
http://www.aiaioo.com/whitepapers/intention_analysis_use_cases.pdf
http://sentibet.com/
Big Data Analytics: Facts and Feelings
Many options (text).
Big Data Analytics: Facts and Feelings
(Accessible) Data Everywhere
Big Data Analytics: Facts and Feelings
Beyond Text:
• Audio including speech.
• Images.
• Video.
http://www.geekosystem.com/
facebook-face-recognition/
http://www.sciencedirect.com/science
/article/pii/S0167639312000118
http://flylib.com/books/en/2.495.1.54/1/
Big Data Analytics: Facts and Feelings
Or just ask: Swipp mobile polling (example).
Big Data Analytics: Facts and Feelings
Surfaced.
Big Data Analytics: Facts and Feelings
A Big Data analytics architecture (example).
http://www.geeklawblog.com/2011/12/lexis-advance-platform-launch-two.html
Big Data Analytics: Facts and Feelings
Complementary.
Big Data Analytics:
Facts and Feelings
Seth Grimes
Alta Plana Corporation
@sethgrimes
TDWI – Washington DC
June 21, 2013

Weitere ähnliche Inhalte

Was ist angesagt?

Sentiment Analysis
Sentiment Analysis Sentiment Analysis
Sentiment Analysis Bytesview
 
Global Analytics: Text, Speech, Sentiment, and Sense
Global Analytics: Text, Speech, Sentiment, and SenseGlobal Analytics: Text, Speech, Sentiment, and Sense
Global Analytics: Text, Speech, Sentiment, and SenseSeth Grimes
 
Text Analysis in Research
Text Analysis in ResearchText Analysis in Research
Text Analysis in ResearchBytesview
 
Text Analysis in Retail
Text Analysis in RetailText Analysis in Retail
Text Analysis in RetailBytesview
 
Text Analysis for Competitive Intelligence
Text Analysis for Competitive IntelligenceText Analysis for Competitive Intelligence
Text Analysis for Competitive IntelligenceBytesview
 
Marketing Research and Competitive Intelligence
Marketing Research and Competitive IntelligenceMarketing Research and Competitive Intelligence
Marketing Research and Competitive IntelligenceAugust Jackson
 
Emotion analysis
Emotion analysisEmotion analysis
Emotion analysisBytesview
 
Analytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel IndustryAnalytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel IndustryAdam Lee
 
Text Analytics Past, Present & Future
Text Analytics Past, Present & FutureText Analytics Past, Present & Future
Text Analytics Past, Present & FutureSeth Grimes
 
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...John Makridis
 
Marketsoft and marketing cube data quality to cc-v3
Marketsoft and marketing cube   data quality to cc-v3Marketsoft and marketing cube   data quality to cc-v3
Marketsoft and marketing cube data quality to cc-v3Marketsoft
 
Diy research trends webinar(2) revised(2)
Diy research trends webinar(2) revised(2)Diy research trends webinar(2) revised(2)
Diy research trends webinar(2) revised(2)QuestionPro
 
Min online webinar-montizecontent-markohurst-2011
Min online webinar-montizecontent-markohurst-2011Min online webinar-montizecontent-markohurst-2011
Min online webinar-montizecontent-markohurst-2011WIKOLO
 
Site Search Analytics eMetrics Madrid 2009
Site Search Analytics eMetrics Madrid 2009Site Search Analytics eMetrics Madrid 2009
Site Search Analytics eMetrics Madrid 2009WIKOLO
 
Marketing in a social age travel updated (feb)
Marketing in a social age   travel updated (feb)Marketing in a social age   travel updated (feb)
Marketing in a social age travel updated (feb)Luke Hay
 
HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...
HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...
HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...Maeve Lyons
 
AI - The Next Frontier for Connected Pharma
AI - The Next Frontier for Connected PharmaAI - The Next Frontier for Connected Pharma
AI - The Next Frontier for Connected Pharmasambiswal
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1Joni Salminen
 

Was ist angesagt? (20)

Sentiment Analysis
Sentiment Analysis Sentiment Analysis
Sentiment Analysis
 
Global Analytics: Text, Speech, Sentiment, and Sense
Global Analytics: Text, Speech, Sentiment, and SenseGlobal Analytics: Text, Speech, Sentiment, and Sense
Global Analytics: Text, Speech, Sentiment, and Sense
 
Text Analysis in Research
Text Analysis in ResearchText Analysis in Research
Text Analysis in Research
 
Text Analysis in Retail
Text Analysis in RetailText Analysis in Retail
Text Analysis in Retail
 
Text Analysis for Competitive Intelligence
Text Analysis for Competitive IntelligenceText Analysis for Competitive Intelligence
Text Analysis for Competitive Intelligence
 
Marketing Research and Competitive Intelligence
Marketing Research and Competitive IntelligenceMarketing Research and Competitive Intelligence
Marketing Research and Competitive Intelligence
 
Data analytics
Data analyticsData analytics
Data analytics
 
Emotion analysis
Emotion analysisEmotion analysis
Emotion analysis
 
Big Data No Big Deal
Big Data No Big DealBig Data No Big Deal
Big Data No Big Deal
 
Analytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel IndustryAnalytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel Industry
 
Text Analytics Past, Present & Future
Text Analytics Past, Present & FutureText Analytics Past, Present & Future
Text Analytics Past, Present & Future
 
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
 
Marketsoft and marketing cube data quality to cc-v3
Marketsoft and marketing cube   data quality to cc-v3Marketsoft and marketing cube   data quality to cc-v3
Marketsoft and marketing cube data quality to cc-v3
 
Diy research trends webinar(2) revised(2)
Diy research trends webinar(2) revised(2)Diy research trends webinar(2) revised(2)
Diy research trends webinar(2) revised(2)
 
Min online webinar-montizecontent-markohurst-2011
Min online webinar-montizecontent-markohurst-2011Min online webinar-montizecontent-markohurst-2011
Min online webinar-montizecontent-markohurst-2011
 
Site Search Analytics eMetrics Madrid 2009
Site Search Analytics eMetrics Madrid 2009Site Search Analytics eMetrics Madrid 2009
Site Search Analytics eMetrics Madrid 2009
 
Marketing in a social age travel updated (feb)
Marketing in a social age   travel updated (feb)Marketing in a social age   travel updated (feb)
Marketing in a social age travel updated (feb)
 
HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...
HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...
HealthXL: How Artificial Intelligence (AI) Can Improve Research & Care Models...
 
AI - The Next Frontier for Connected Pharma
AI - The Next Frontier for Connected PharmaAI - The Next Frontier for Connected Pharma
AI - The Next Frontier for Connected Pharma
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1
 

Andere mochten auch

Big Data - Fact or Fiction ?
Big Data -  Fact or Fiction ?Big Data -  Fact or Fiction ?
Big Data - Fact or Fiction ?Tyrone Systems
 
5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentation5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentationInfobrandz
 
5. big data vs it stki - pini cohen
5. big data vs  it    stki - pini cohen5. big data vs  it    stki - pini cohen
5. big data vs it stki - pini cohenTaldor Group
 
Fun Facts about Big Data
Fun Facts about Big DataFun Facts about Big Data
Fun Facts about Big DataCrayon Data
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...CompTIA
 
The 5 key V's of Big Data
The 5 key V's of Big DataThe 5 key V's of Big Data
The 5 key V's of Big DataAnric Blatt
 
Melbourne Business School - mba talk october 14 - croll - 40m - lean analytics
Melbourne Business School - mba talk october 14 - croll - 40m - lean analyticsMelbourne Business School - mba talk october 14 - croll - 40m - lean analytics
Melbourne Business School - mba talk october 14 - croll - 40m - lean analyticsLean Analytics
 
Data scientist vccorp_aug_2016_vnu
Data scientist vccorp_aug_2016_vnuData scientist vccorp_aug_2016_vnu
Data scientist vccorp_aug_2016_vnuTuan Hoang
 
Emotions & moods organisation behavior
Emotions & moods organisation behaviorEmotions & moods organisation behavior
Emotions & moods organisation behaviorDhaval Gurnani
 
Big Data - The 5 Vs Everyone Must Know
Big Data - The 5 Vs Everyone Must KnowBig Data - The 5 Vs Everyone Must Know
Big Data - The 5 Vs Everyone Must KnowBernard Marr
 
Bài giảng của Philip Kotler ở Việt Nam
Bài giảng của Philip Kotler ở Việt NamBài giảng của Philip Kotler ở Việt Nam
Bài giảng của Philip Kotler ở Việt NamNguyen Ngoc
 
A Brief History of Big Data
A Brief History of Big DataA Brief History of Big Data
A Brief History of Big DataBernard Marr
 
Big Data in Retail - Examples in Action
Big Data in Retail - Examples in ActionBig Data in Retail - Examples in Action
Big Data in Retail - Examples in ActionDavid Pittman
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBernard Marr
 
10 Things your Audience Hates About your Presentation
10 Things your Audience Hates About your Presentation10 Things your Audience Hates About your Presentation
10 Things your Audience Hates About your PresentationStinson
 

Andere mochten auch (19)

Big Data - Fact or Fiction ?
Big Data -  Fact or Fiction ?Big Data -  Fact or Fiction ?
Big Data - Fact or Fiction ?
 
5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentation5 facts everyone should know about big data presentation
5 facts everyone should know about big data presentation
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Bigdata
BigdataBigdata
Bigdata
 
5. big data vs it stki - pini cohen
5. big data vs  it    stki - pini cohen5. big data vs  it    stki - pini cohen
5. big data vs it stki - pini cohen
 
Fun Facts about Big Data
Fun Facts about Big DataFun Facts about Big Data
Fun Facts about Big Data
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
 
The 5 key V's of Big Data
The 5 key V's of Big DataThe 5 key V's of Big Data
The 5 key V's of Big Data
 
Melbourne Business School - mba talk october 14 - croll - 40m - lean analytics
Melbourne Business School - mba talk october 14 - croll - 40m - lean analyticsMelbourne Business School - mba talk october 14 - croll - 40m - lean analytics
Melbourne Business School - mba talk october 14 - croll - 40m - lean analytics
 
Data scientist vccorp_aug_2016_vnu
Data scientist vccorp_aug_2016_vnuData scientist vccorp_aug_2016_vnu
Data scientist vccorp_aug_2016_vnu
 
Emotions & moods organisation behavior
Emotions & moods organisation behaviorEmotions & moods organisation behavior
Emotions & moods organisation behavior
 
Big Data - The 5 Vs Everyone Must Know
Big Data - The 5 Vs Everyone Must KnowBig Data - The 5 Vs Everyone Must Know
Big Data - The 5 Vs Everyone Must Know
 
Bài giảng của Philip Kotler ở Việt Nam
Bài giảng của Philip Kotler ở Việt NamBài giảng của Philip Kotler ở Việt Nam
Bài giảng của Philip Kotler ở Việt Nam
 
A Brief History of Big Data
A Brief History of Big DataA Brief History of Big Data
A Brief History of Big Data
 
12 Interesting Facts about Big Data
12 Interesting Facts about Big Data12 Interesting Facts about Big Data
12 Interesting Facts about Big Data
 
Big Data in Retail - Examples in Action
Big Data in Retail - Examples in ActionBig Data in Retail - Examples in Action
Big Data in Retail - Examples in Action
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
10 Things your Audience Hates About your Presentation
10 Things your Audience Hates About your Presentation10 Things your Audience Hates About your Presentation
10 Things your Audience Hates About your Presentation
 

Ähnlich wie Big Data Analytics: Facts and Feelings

1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptxRahulTr22
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESMicah Altman
 
How is Data Made? From Dataset Literacy to Data Infrastructure Literacy
How is Data Made? From Dataset Literacy to Data Infrastructure LiteracyHow is Data Made? From Dataset Literacy to Data Infrastructure Literacy
How is Data Made? From Dataset Literacy to Data Infrastructure LiteracyJonathan Gray
 
Lecture 01-1-IIS.pptx
Lecture 01-1-IIS.pptxLecture 01-1-IIS.pptx
Lecture 01-1-IIS.pptxAsadkhan47384
 
The Power of Big Data and Psychographics in the Electoral Process
The Power of Big Data and Psychographics in the Electoral ProcessThe Power of Big Data and Psychographics in the Electoral Process
The Power of Big Data and Psychographics in the Electoral ProcessGua Tabidze
 
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiBehavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiGalit Shmueli
 
Big Data, Republicans and 2016
Big Data, Republicans and 2016Big Data, Republicans and 2016
Big Data, Republicans and 2016steveparkhurst
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunishaShivlal Mewada
 
Information Innovation: Turning Insights into Opportunities
Information Innovation: Turning Insights into OpportunitiesInformation Innovation: Turning Insights into Opportunities
Information Innovation: Turning Insights into OpportunitiesHubbard One
 
AAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveysAAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveysCliff Lampe
 
Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Amanda noonan
 
Data science innovations
Data science innovations Data science innovations
Data science innovations suresh sood
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeJosh Cowls
 
Data fluency for the 21st century
Data fluency for the 21st centuryData fluency for the 21st century
Data fluency for the 21st centuryMartinFrigaard
 
Data Science-1 (1).ppt
Data Science-1 (1).pptData Science-1 (1).ppt
Data Science-1 (1).pptSanjayAcharaya
 
Big Data Ethics Cjbe july 2021
Big Data Ethics Cjbe july 2021Big Data Ethics Cjbe july 2021
Big Data Ethics Cjbe july 2021andygustafson
 

Ähnlich wie Big Data Analytics: Facts and Feelings (20)

1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx
 
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
 
How is Data Made? From Dataset Literacy to Data Infrastructure Literacy
How is Data Made? From Dataset Literacy to Data Infrastructure LiteracyHow is Data Made? From Dataset Literacy to Data Infrastructure Literacy
How is Data Made? From Dataset Literacy to Data Infrastructure Literacy
 
data, big data, open data
data, big data, open datadata, big data, open data
data, big data, open data
 
Big data Paper
Big data PaperBig data Paper
Big data Paper
 
Lecture 01-1-IIS.pptx
Lecture 01-1-IIS.pptxLecture 01-1-IIS.pptx
Lecture 01-1-IIS.pptx
 
The Power of Big Data and Psychographics in the Electoral Process
The Power of Big Data and Psychographics in the Electoral ProcessThe Power of Big Data and Psychographics in the Electoral Process
The Power of Big Data and Psychographics in the Electoral Process
 
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiBehavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
 
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
 
Big Data, Republicans and 2016
Big Data, Republicans and 2016Big Data, Republicans and 2016
Big Data, Republicans and 2016
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha
 
Information Innovation: Turning Insights into Opportunities
Information Innovation: Turning Insights into OpportunitiesInformation Innovation: Turning Insights into Opportunities
Information Innovation: Turning Insights into Opportunities
 
AAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveysAAPOR - comparing found data from social media and made data from surveys
AAPOR - comparing found data from social media and made data from surveys
 
Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015
 
Data science innovations
Data science innovations Data science innovations
Data science innovations
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Bigdata Hadoop introduction
Bigdata Hadoop introductionBigdata Hadoop introduction
Bigdata Hadoop introduction
 
Data fluency for the 21st century
Data fluency for the 21st centuryData fluency for the 21st century
Data fluency for the 21st century
 
Data Science-1 (1).ppt
Data Science-1 (1).pptData Science-1 (1).ppt
Data Science-1 (1).ppt
 
Big Data Ethics Cjbe july 2021
Big Data Ethics Cjbe july 2021Big Data Ethics Cjbe july 2021
Big Data Ethics Cjbe july 2021
 

Mehr von Seth Grimes

Recent Advances in Natural Language Processing
Recent Advances in Natural Language ProcessingRecent Advances in Natural Language Processing
Recent Advances in Natural Language ProcessingSeth Grimes
 
Creating an AI Startup: What You Need to Know
Creating an AI Startup: What You Need to KnowCreating an AI Startup: What You Need to Know
Creating an AI Startup: What You Need to KnowSeth Grimes
 
NLP 2020: What Works and What's Next
NLP 2020: What Works and What's NextNLP 2020: What Works and What's Next
NLP 2020: What Works and What's NextSeth Grimes
 
Efficient Deep Learning in Natural Language Processing Production, with Moshe...
Efficient Deep Learning in Natural Language Processing Production, with Moshe...Efficient Deep Learning in Natural Language Processing Production, with Moshe...
Efficient Deep Learning in Natural Language Processing Production, with Moshe...Seth Grimes
 
From Customer Emotions to Actionable Insights, with Peter Dorrington
From Customer Emotions to Actionable Insights, with Peter DorringtonFrom Customer Emotions to Actionable Insights, with Peter Dorrington
From Customer Emotions to Actionable Insights, with Peter DorringtonSeth Grimes
 
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AIIntro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AISeth Grimes
 
Text Analytics Market Trends
Text Analytics Market TrendsText Analytics Market Trends
Text Analytics Market TrendsSeth Grimes
 
Text Analytics for NLPers
Text Analytics for NLPersText Analytics for NLPers
Text Analytics for NLPersSeth Grimes
 
Our FinTech Future – AI’s Opportunities and Challenges?
Our FinTech Future – AI’s Opportunities and Challenges? Our FinTech Future – AI’s Opportunities and Challenges?
Our FinTech Future – AI’s Opportunities and Challenges? Seth Grimes
 
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Seth Grimes
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...Seth Grimes
 
Fairness in Machine Learning and AI
Fairness in Machine Learning and AIFairness in Machine Learning and AI
Fairness in Machine Learning and AISeth Grimes
 
Classification with Memes–Uber case study
Classification with Memes–Uber case studyClassification with Memes–Uber case study
Classification with Memes–Uber case studySeth Grimes
 
Aspect Detection for Sentiment / Emotion Analysis
Aspect Detection for Sentiment / Emotion AnalysisAspect Detection for Sentiment / Emotion Analysis
Aspect Detection for Sentiment / Emotion AnalysisSeth Grimes
 
Content AI: From Potential to Practice
Content AI: From Potential to PracticeContent AI: From Potential to Practice
Content AI: From Potential to PracticeSeth Grimes
 
Text Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's NextText Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's NextSeth Grimes
 
Text Analytics Today
Text Analytics TodayText Analytics Today
Text Analytics TodaySeth Grimes
 
Social Data Sentiment Analysis
Social Data Sentiment AnalysisSocial Data Sentiment Analysis
Social Data Sentiment AnalysisSeth Grimes
 
12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content Analytics12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content AnalyticsSeth Grimes
 

Mehr von Seth Grimes (20)

Recent Advances in Natural Language Processing
Recent Advances in Natural Language ProcessingRecent Advances in Natural Language Processing
Recent Advances in Natural Language Processing
 
Creating an AI Startup: What You Need to Know
Creating an AI Startup: What You Need to KnowCreating an AI Startup: What You Need to Know
Creating an AI Startup: What You Need to Know
 
NLP 2020: What Works and What's Next
NLP 2020: What Works and What's NextNLP 2020: What Works and What's Next
NLP 2020: What Works and What's Next
 
Efficient Deep Learning in Natural Language Processing Production, with Moshe...
Efficient Deep Learning in Natural Language Processing Production, with Moshe...Efficient Deep Learning in Natural Language Processing Production, with Moshe...
Efficient Deep Learning in Natural Language Processing Production, with Moshe...
 
From Customer Emotions to Actionable Insights, with Peter Dorrington
From Customer Emotions to Actionable Insights, with Peter DorringtonFrom Customer Emotions to Actionable Insights, with Peter Dorrington
From Customer Emotions to Actionable Insights, with Peter Dorrington
 
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AIIntro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
 
Emotion AI
Emotion AIEmotion AI
Emotion AI
 
Text Analytics Market Trends
Text Analytics Market TrendsText Analytics Market Trends
Text Analytics Market Trends
 
Text Analytics for NLPers
Text Analytics for NLPersText Analytics for NLPers
Text Analytics for NLPers
 
Our FinTech Future – AI’s Opportunities and Challenges?
Our FinTech Future – AI’s Opportunities and Challenges? Our FinTech Future – AI’s Opportunities and Challenges?
Our FinTech Future – AI’s Opportunities and Challenges?
 
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
 
Fairness in Machine Learning and AI
Fairness in Machine Learning and AIFairness in Machine Learning and AI
Fairness in Machine Learning and AI
 
Classification with Memes–Uber case study
Classification with Memes–Uber case studyClassification with Memes–Uber case study
Classification with Memes–Uber case study
 
Aspect Detection for Sentiment / Emotion Analysis
Aspect Detection for Sentiment / Emotion AnalysisAspect Detection for Sentiment / Emotion Analysis
Aspect Detection for Sentiment / Emotion Analysis
 
Content AI: From Potential to Practice
Content AI: From Potential to PracticeContent AI: From Potential to Practice
Content AI: From Potential to Practice
 
Text Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's NextText Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's Next
 
Text Analytics Today
Text Analytics TodayText Analytics Today
Text Analytics Today
 
Social Data Sentiment Analysis
Social Data Sentiment AnalysisSocial Data Sentiment Analysis
Social Data Sentiment Analysis
 
12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content Analytics12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content Analytics
 

Big Data Analytics: Facts and Feelings

  • 1. Big Data Analytics: Facts and Feelings Seth Grimes Alta Plana Corporation @sethgrimes TDWI – Washington DC June 21, 2013
  • 2. Big Data Analytics: Facts and Feelings Theses • We gain knowledge when we make connections. • Data analysis is a process of connection discovery. • The more data, the greater the possibilities. • The more data, the greater the need to filter, reduce, and contextualize. • Timeliness counts.
  • 3. Big Data Analytics: Facts and Feelings The World of Big Data Machine data (e.g., logs, sensor outputs, clickstreams). Actions, interactions, and transactions: geolocation and time. Profiles: individual, demographic & behavioral. Text, audio, images, and video. Facts and feelings.
  • 4. Big Data Analytics: Facts and Feelings A 3-slide reprise...
  • 5. Big Data Analytics: Facts and Feelings Imperatives for the 2010s: Do more with more. “It’s Not Information Overload. It’s Filter Failure”: Clay Shirky, 2008. • More sources & types of data. • Greater data volumes. • New hardware and methods. Automate more, more intelligently. • Analytics. • Semantics. Engage. Socialize.
  • 6. Big Data Analytics: Facts and Feelings I see three categories of data: 1. Quantities, whether measured, observed, or computed. 2. Content, which I’ll characterize as non- quantitative information. 3. Metadata (semantic & structural) describing quantities and content. • Our concern is content, analytics & fusion. • Structured/unstructured is a false dichotomy. • Where do relationships fit?
  • 7. Big Data Analytics: Facts and Feelings http://www.businessweek.com/magazine/content/04_19/b3882029_mz072.htm En route.
  • 8. Big Data Analytics: Facts and Feelings End of reprise. So, is this Big Data?
  • 9. Big Data Analytics: Facts and Feelings Of course not. It’s a number, not data. Size (Volume) is only one Big Data factor. Other factors (standard definition) are Velocity and Variety. I reject reVisionist 3Vs extensions such as: Variation/Variability. Veracity. Value. These factors are the province of analytics.
  • 10. Big Data Analytics: Facts and Feelings Gary King, Harvard Univ. – “Big Data isn’t about the data. It’s about analytics.” Me – Analytics is a collection of tools and techniques that extract insights from data. I’d argue – The Value in Big Data is in content, patterns, and connections, derived via analytics.
  • 11. Big Data Analytics: Facts and Feelings Variability is an interpretive property. I say – The sense of Big Data is in context and intent…of both the data producer and the data consumer, captured in metadata and (also) derived via analytics.
  • 12. Big Data Analytics: Facts and Feelings As for Veracity, data is data. Consider: “The Iraqi regime… possesses and produces chemical and biological weapons.” – George W. Bush, October 7, 2002.
  • 13. Big Data Analytics: Facts and Feelings Data… and more. Is this Big Data? No, it’s a screen of aggregated query results.
  • 14. Big Data Analytics: Facts and Feelings The Big Data is behind it. http://www.newyorker.com/online/blogs/culture/2012/05/google-knowledge-graph.html
  • 15. Big Data Analytics: Facts and Feelings And behind comparably-scaled/structured systems.
  • 16. Big Data Analytics: Facts and Feelings Comparably- scaled/structured systems? http://www.cambridgesemantics.com/semantic- university/semantic-search-and-the-semantic-web
  • 17. Big Data Analytics: Facts and Feelings Graphs model language models relationships.
  • 18. Big Data Analytics: Facts and Feelings Another view, using an old GATE image. http://gate.ac.uk/hamish/talks/ibot-slidy.html
  • 19. Big Data Analytics: Facts and Feelings … for instance, social language.
  • 20. Big Data Analytics: Facts and Feelings Text analytics applies natural-language processing (NLP) techniques to discern – Entities Relationships Context Identity – and get at the sense of “unstructured” online, social, and enterprise information. Semantic identity unites data of all types.
  • 21. Big Data Analytics: Facts and Feelings http://searchuserinterfaces.com/ Sensemaking: “It is convenient to divide the entire information access process into two main components: information retrieval through searching and browsing, and analysis and synthesis of results. This broader process is often referred to in the literature as sensemaking. Sensemaking refers to an iterative process of formulating a conceptual representation from of a large volume of information.” – Marti Hearst, 2009
  • 22. Big Data Analytics: Facts and Feelings Intelligent computing – sensemaking – involves: Big (and little) Data. • Quantities. • Content. • Metadata. Analytics. Semantics. Integration. Facts and feelings.
  • 23. Big Data Analytics: Facts and Feelings Feelings: Sentiment detection, classification.
  • 24. Big Data Analytics: Facts and Feelings http://techpresident.com/news/21618/pol itico-facebook-sentiment-analysis-bogus
  • 25. Big Data Analytics: Facts and Feelings “Sentiment analysis is the task of identifying positive and negative opinions, emotions, and evaluations.” -- Wilson, Wiebe & Hoffman, 2005, “Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis” “Sentiment analysis or opinion mining is the computational study of opinions, sentiments and emotions expressed in text… An opinion on a feature f is a positive or negative view, attitude, emotion or appraisal on f from an opinion holder.” -- Bing Liu, 2010, “Sentiment Analysis and Subjectivity,” in Handbook of Natural Language Processing
  • 26. Big Data Analytics: Facts and Feelings Sentiment may be of interest at multiple levels. Corpus / data space, i.e., across multiple sources. Document. Statement / sentence. Entity / topic / concept. Human language is noisy and chaotic! Jargon, slang, irony, ambiguity, anaphora, polysemy, synonymy, etc. Context is key. Discourse analysis comes into play.
  • 27. Big Data Analytics: Facts and Feelings Emotion and effect.
  • 28. Big Data Analytics: Facts and Feelings Emotion and understanding.
  • 29. Big Data Analytics: Facts and Feelings Prediction/Feeling/Wish... and Intent. http://www.aiaioo.com/whitepapers/intention_analysis_use_cases.pdf http://sentibet.com/
  • 30. Big Data Analytics: Facts and Feelings Many options (text).
  • 31. Big Data Analytics: Facts and Feelings (Accessible) Data Everywhere
  • 32. Big Data Analytics: Facts and Feelings Beyond Text: • Audio including speech. • Images. • Video. http://www.geekosystem.com/ facebook-face-recognition/ http://www.sciencedirect.com/science /article/pii/S0167639312000118 http://flylib.com/books/en/2.495.1.54/1/
  • 33. Big Data Analytics: Facts and Feelings Or just ask: Swipp mobile polling (example).
  • 34. Big Data Analytics: Facts and Feelings Surfaced.
  • 35. Big Data Analytics: Facts and Feelings A Big Data analytics architecture (example). http://www.geeklawblog.com/2011/12/lexis-advance-platform-launch-two.html
  • 36. Big Data Analytics: Facts and Feelings Complementary.
  • 37. Big Data Analytics: Facts and Feelings Seth Grimes Alta Plana Corporation @sethgrimes TDWI – Washington DC June 21, 2013