Adapting NLP tools to diverse data:
challenges and solutions
Dr. Diana Maynard
University of Sheffield, UK
October 2017, San Servolo, Italy
Twitter Fun Facts
• 500	million	tweets	sent	per	day
• 24% of all male internet users use Twitter (vs 21% of women)
• 37%	of	Twitter	users	are	18-29
• 25%	of	Twitter	users	are	30-49
Which country has the most Twitter
users?
Twitter Users per country
• US:	67	million
• Brazil:	27.7	million
• Japan:	25.9	million
• Mexico:	23.5	million
....
• UK:	13	million
Which	country	has	the	highest	
penetration	of	Twitter	users?
1/3 of all internet users there are on Twitter
Who do we follow on Twitter?
Top	10	most	followed	Twitter	users
2017               2015               2013
Katy Perry         Katy Perry         Katy Perry
Justin Bieber      Justin Bieber      Justin Bieber
Barack Obama       Taylor Swift       Lady Gaga
Taylor Swift       Barack Obama       Barack Obama
Rihanna            YouTube            Taylor Swift
Ellen DeGeneres    Lady Gaga          YouTube
Lady Gaga          Rihanna            Britney Spears
YouTube            Ellen DeGeneres    Rihanna
Justin Timberlake  Twitter            Instagram
Twitter            Justin Timberlake  Justin Timberlake
Social	media:	a	valuable	source	of	information
• business insights
• sharing and receiving important news
• campaigns
• all kinds of collective intelligence
• an alternative to traditional polls
• and much more
Uses	of	social	media	during	disasters
• Broadcasting	info	about	the	disaster
• Requesting	info	from	local	people	and	
eyewitnesses	
• Requesting	and	offering	help	and	
support
• Disaster	mapping
• Mobilising the	crowd	to	support	
initiatives
Ecuador, magnitude 7.8 earthquake, April 2016, ~700 people died
Droughts, affecting 60 million in 34 countries
Maxwell, California, Feb 2017
Portugal, forest fires, 64 confirmed deaths, Jun 2017
Manchester, May 2017, 22 dead
Haiti, Hurricane Matthew, Oct 2016,
~500 people died, farming devastated
Harnessing	the	Crowd
• Using	citizen	
reporters,	and	digital	
responders	for	
mapping	crises
• Ushahidi deployed	
over	50k	times
• Free	and	open	source
• Working	with	us	on	
the	COMRADES
project
Why	is	social	media	interesting	for	NLP?
• Fast-growing, highly dynamic and high-volume source of data
• Reflects language used in today's society
• Reflects current views of society
• It's a great source of material for opinion mining tools
• Challenging research area due to specialised use of language
Gartner	3V	definition	of	Big	Data
• Volume
• Velocity
• Variety
• High	volume &	velocity of	messages:
• 500	million	tweets	per	day
• Massive	variety:	
• Stock	markets
• Earthquakes
• Social	arrangements
Big	Data	is	not	new!
Staff sorting 4M used tickets from #London Underground
to analyse line use in 1939
Problems	and	solutions	for	NLP
• Volume and	velocity can	largely	be	dealt	with	using	
modern	processing	methods
• GATE	social	media	toolkit	allows	(almost)	real-time	
collection,	analysis	and	visualisation of	a	twitter	stream,	
e.g.	studies	of	Brexit	tweets,	UK	elections
• But	still	issues	with	data	collection	(e.g.	Twitter	rate	
limits)
• Variety and	veracity are	rather	more	complex.
Social	Media	sites	are	not	all	the	same
• Twitter has varied uptake per country:
  • Low in China (often censored, local competitor – Weibo)
  • Low in Denmark, Germany (Facebook is preferred)
  • Medium in UK, though often complementary to Facebook
  • High in USA
• Networks have common themes:
  • Individuals as nodes in a common graph
  • Relations between people
  • Sharing and privacy restrictions
  • No curation of content
  • Multimedia posting and re-posting
Challenges	of	social	media
• Strongly temporal and dynamic:
  • Temporal information (e.g. post timestamp) can be combined with opinion mining to examine the volatility of attitudes towards topics over time (e.g. gay marriage)
• Exploiting social context: who is the user connected to? How frequently do they interact?
  • Automatically derive semantic models of social networks, measure user authority, cluster similar users into groups, and model trust and strength of connection
• Implicit information about the user: research on recognising gender, location, and age of Twitter users
  • Helpful for generating opinion summaries by user demographics
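The idea of combining post timestamps with opinion mining can be sketched roughly as follows. This is only an illustration: the sentiment scores are invented placeholders (a real system would take them from an opinion mining component), and using the standard deviation of daily means as a "volatility" measure is an assumption made here, not something the talk specifies.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean, pstdev

# Hypothetical (timestamp, sentiment score in [-1, 1]) pairs for one topic.
posts = [
    ("2017-10-01T09:00:00", 0.8),
    ("2017-10-01T18:30:00", 0.4),
    ("2017-10-02T11:15:00", -0.2),
    ("2017-10-03T08:05:00", -0.6),
]

def daily_sentiment(posts):
    """Average the sentiment scores of all posts made on the same day."""
    by_day = defaultdict(list)
    for stamp, score in posts:
        by_day[datetime.fromisoformat(stamp).date()].append(score)
    return {day: mean(scores) for day, scores in sorted(by_day.items())}

daily = daily_sentiment(posts)
# Assumed volatility measure: spread of the daily averages.
volatility = pstdev(list(daily.values()))
print(daily, round(volatility, 3))
```

Bucketing by day is an arbitrary choice; hourly or weekly buckets would work the same way.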
Linguistic	challenges	of	social	media
• Language
• Problem:	typically	exhibits	very	different	language	style
• Solution:	train	specific	language	processing	components
• Relevance
• Problem:	topics	and	comments	can	rapidly	diverge.	
• Solution:	train	a	classifier	or	use	clustering	techniques
• Lack	of	context
• Problem:	hard	to	disambiguate	entities
• Solution:	data	aggregation,	metadata,	entity	linking
People	don’t	write	“properly”
• Grundman:politics makes #climatechange scientific issue,people don’t like knowitall rational voice tellin em wat 2do
• Want to solve the problem of #ClimateChange? Just #vote for a #politician! Poof! Problem gone! #sarcasm #TVP #99%
• Human Caused #ClimateChange is a Monumental Scam! http://www.youtube.com/watch?v=LiX792kNQeE … F**k yes!! Lying to us like MOFO's Tax The Air We Breath! F**k Them!
• The last people I will listen2 about guns r those that know nothing about them&politicians who live in states w/strictest gun laws #cali #ny
NLP	Pre-Processing	Pipeline	
Text → Language ID → Tokenisation → POS tagging → Named Entity Recognition and Linking
NER (Professor Plum)
Entity Linking: dbpedia.org/resource/..... (candidates: Michael_Jackson, Michael_Jackson_(writer))
Pipelines	for	tweets
Errors	have	a	cumulative	effect
Good	performance	is	important	at	each	stage
(Chart: per-stage accuracy vs overall accuracy)
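A back-of-the-envelope way to see the cumulative effect is to multiply the per-stage accuracies, assuming each stage's errors are independent of the others. The stage names below follow the pipeline in this talk, but the accuracy figures are made-up placeholders, not results reported here.

```python
from math import prod

# Hypothetical per-stage accuracies (placeholders, not measured values).
stage_accuracy = {
    "language_id": 0.95,
    "tokenisation": 0.90,
    "pos_tagging": 0.90,
    "ner": 0.80,
}

def overall_accuracy(stages):
    """Estimate end-to-end accuracy, assuming independent stage errors."""
    return prod(stages.values())

overall = overall_accuracy(stage_accuracy)
print(f"overall accuracy is roughly {overall:.2f}")
```

Even with every stage at 80% or better, the combined figure in this toy example falls to about 62%, which is why good performance matters at each stage.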
Language	Identification
LADY	GAGA	IS	BETTER	THE	5th	TIME	OH	BABY(:
je	bent	Jacques	cousteau niet die	een nieuwe soort heeft
ontdekt,	het	is	duidelijk,	ze bedekken hun gezicht.	Get	
over	it
I'm	at	地铁望京站 Subway	Wangjing (Beijing)	
http://t.co/KxHzYm00
The Jan. 21 show started with the unveiling of an
impressive three-story castle from which Gaga emerges.
The band members were in various portals, separated
from each other for most of the show. For the next 2
hours and 15 minutes, Lady Gaga repeatedly stormed
the moveable castle, turning it into her own gothic
Barbie Dreamhouse .
Newswire
Twitter
Tokenisation	is	only	80%	accurate	on	tweets
Improper	grammar,	e.g.	apostrophe	usage:
doesn't	 →	does n't
doesnt →	doesn’t
Smileys	and	emoticons:	loss	of	information	(e.g.	sentiment)
I	<3	you
This	piece	;,,(	so	emotional
Punctuation	for	emphasis
*HUGS	YOU**KISSES	YOU* →	* HUGS YOU**KISSES YOU *
Words	run	together	/	skip
I	wonde rif Tsubasa is	okay..
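One common way to reduce these tokenisation errors is a regular expression that recognises tweet-specific units before falling back to ordinary words. The sketch below is a minimal illustration written for this summary; it is not the tokeniser used in GATE, and its emoticon pattern covers only a few simple Western forms. Unlike the Penn Treebank-style split shown above (does n't), it keeps contractions whole.

```python
import re

TOKEN_RE = re.compile(
    r"""
    https?://\S+            # URLs
    | [@#]\w+               # @mentions and #hashtags
    | <3                    # heart
    | [:;=][\-']?[()DPp]    # simple emoticons such as :) ;-( :D
    | \w+(?:'\w+)?          # words, keeping an internal apostrophe
    | [^\w\s]               # any other single symbol
    """,
    re.VERBOSE,
)

def tokenise(text):
    """Split a tweet into tokens, keeping emoticons, hashtags and URLs intact."""
    return TOKEN_RE.findall(text)

print(tokenise("I <3 you"))
print(tokenise("so happy :) #fomo @bob http://t.co/KxHzYm00"))
```

Keeping `<3` and `:)` as single tokens preserves the sentiment cue that would be lost if they were split into punctuation.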
We	need	tools	for	hashtag	analysis
• Hashtags need unravelling and disambiguating:
  • #nowthatcherisdead
  • #powergenitalia
  • #lesbocages
  • #molestationnursery
  • #teacherstalking
  • #therapist
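The ambiguity in these hashtags can be made concrete with a small dictionary-based segmenter that lists every possible split. This is a sketch only: the vocabulary is a hand-picked toy set, and a real tool would need a large lexicon plus some way of ranking the candidate readings.

```python
def segmentations(tag, vocab):
    """Return every way to split a hashtag into words found in vocab."""
    text = tag.lstrip("#").lower()
    results = []

    def walk(start, words):
        if start == len(text):
            results.append(" ".join(words))
            return
        # Try every prefix of the remaining text that is a known word.
        for end in range(start + 1, len(text) + 1):
            piece = text[start:end]
            if piece in vocab:
                walk(end, words + [piece])

    walk(0, [])
    return results

# Toy vocabulary chosen for this one example (an assumption).
VOCAB = {"now", "that", "thatcher", "cher", "is", "dead"}
print(segmentations("#nowthatcherisdead", VOCAB))
```

Both readings come back as valid splits, so segmentation alone cannot decide between them; that is the disambiguation step.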
Test	your	social	media	skills
What	do	these	hashtags	mean?
• #kktny
• #fomo
• #jomo
• #ootd
• #wcw
Hashtag	Hijacking
Have	sex	to	save	the	planet!
Tweet	Normalisation
• “RT @Bthompson WRITEZ: @libbyabrego honored?! Everybody knows the libster is nice with it...lol...(thankkkks a bunch;))”
• OMG! I’m so guilty!!! Sprained biibii’s leg! ARGHHHHHH!!!!!!
• For some components to work well (POS tagger, parser), we need normalisation
• BUT uppercasing and repetition often convey strong sentiment
• Other forms of “misspelling” might indicate information about the author
• First challenge: separate out-of-vocabulary (OOV) words from in-vocabulary (IV) words
• Second challenge: fix misspellings of IV words (e.g. using Levenshtein edit distance)
• The ZOMG phenomenon
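The two normalisation challenges can be sketched as follows: first check whether a word is in the vocabulary, then map an out-of-vocabulary word to its nearest in-vocabulary neighbour by Levenshtein distance. The toy vocabulary, the letter-squeezing rule and the distance threshold of 2 are all assumptions made for this illustration.

```python
import re

def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]

def squeeze(word):
    """Collapse runs of three or more identical characters to one."""
    return re.sub(r"(.)\1{2,}", r"\1", word)

def normalise(word, vocab, max_dist=2):
    """Return the closest in-vocabulary word, or the original if none is close."""
    w = squeeze(word.lower())
    if w in vocab:
        return w
    best = min(vocab, key=lambda v: (levenshtein(w, v), v))
    return best if levenshtein(w, best) <= max_dist else word

vocab = {"tomorrow", "today", "thanks"}  # toy in-vocabulary list
print(normalise("thankkkks", vocab), normalise("tomorow", vocab), normalise("ZOMG", vocab))
```

Words with no close neighbour, such as ZOMG, are left untouched. Since repetition and uppercasing carry sentiment, a fuller version would probably record what was changed rather than discard it.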
Part-of-speech	tagging
• Similar issues as for normalisation – we don’t have big datasets to train on
• Label unlabelled data with multiple taggers and accept tweets where tagger votes never conflict
• Model words using Brown clustering and word representations (Turian et al. 2010)
2m,	2ma,	2mar,	2mara,	2maro,	2marrow,	2mor,	2mora,		2moro,	2morow,	
2morr,	2morro,	2morrow,	2moz,	2mr,	2mro,	2mrrw,	2mrw,	2mw,	tmmrw,	tmo,	
tmoro,	tmorrow,	tmoz,	tmr,	tmro,	tmrow,	tmrrow,	tmrrw,	tmrw,	tmrww,	tmw,	
tomaro,	tomarow,	tomarro,	tomarrow,	tomm,	tommarow,	tommarrow,	
tommoro,	tommorow,	tommorrow,	tommorw,	tommrow,	tomo,	tomolo,	
tomoro,	tomorow,	tomorro,	tomorrw,	tomoz,	tomrw,	tomz
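The voting idea for building extra training data can be sketched like this: run several taggers over unlabelled tweets and keep only those where every tagger produces the same tag sequence. The two taggers below are deliberately trivial stand-ins invented for the example; in practice they would be real POS taggers.

```python
def agreed_tweets(tweets, taggers):
    """Keep tweets whose tag sequences are identical across all taggers."""
    accepted = []
    for tokens in tweets:
        outputs = [tagger(tokens) for tagger in taggers]
        if all(out == outputs[0] for out in outputs[1:]):
            accepted.append(list(zip(tokens, outputs[0])))
    return accepted

# Two toy taggers standing in for real ones (purely hypothetical).
def tagger_a(tokens):
    return ["NUM" if t.isdigit() else "NOUN" for t in tokens]

def tagger_b(tokens):
    return ["NUM" if t[0].isdigit() else "NOUN" for t in tokens]

tweets = [["see", "you", "2moro"], ["room", "42"]]
silver = agreed_tweets(tweets, [tagger_a, tagger_b])
print(silver)
```

The tweet containing "2moro" is dropped because the taggers disagree on it, so only the uncontroversial tweet becomes training data.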
NER	on	Tweets
• NER on Tweets much harder than on longer text
  • Very short, so ambiguous terms hard to interpret
  • Poor grammar and spelling, use of abbreviations, shorthands
  • Twitter-specific features: hashtags, @mentions, etc.
• Tools designed for longer texts do very badly on Twitter
  • As low as 40% accuracy of traditional NER taggers
Lack	of	context	causes	ambiguity
Branching	out	from	Lincoln	park	after	dark	...	Hello	
Russian	Navy,	it's	like	the	same	thing	but	with	glitter!
??
It’s	all	about	the	Named	Entities
Branching	out	from	Lincoln	park	after	dark ...	Hello	
Russian	Navy,	it's	like	the	same	thing	but	with	glitter!
Some	cool	applications	with	GATE
Real-time	Opinion	Monitoring
vs replies
Climate	change,	ISIS	and	Trump
Querying	election	data	with	MIMIR
• Dataset: every
tweet by MP /
Candidate / Party,
plus all
replies/retweets
• Find all tweets
where a
Conservative MP
talked about the
economy
Parties	/	themes	co-occurrence
UK economy
Europe
Tax and revenue
NHS
Borders and Immigration
Scotland
Employment
Community and society
Public health
Media and communications
LabourPartyCandidate
LabourPartyMP
ConservativePartyCandidate
ConservativePartyMP
UKIPCandidate
OtherMP
SNPCandidate
GreenPartyCandidate
LiberalDemocratsCandidate
SNPOther
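Once tweets carry annotations for author type and theme, both the example query and the co-occurrence counts reduce to simple filtering and counting. The sketch below uses plain Python over invented records; it is not Mímir query syntax, and the field names `author_type` and `themes` are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical annotated tweets (field names are illustrative only).
tweets = [
    {"author_type": "ConservativePartyMP", "themes": ["UK economy", "Tax and revenue"]},
    {"author_type": "LabourPartyCandidate", "themes": ["NHS"]},
    {"author_type": "ConservativePartyMP", "themes": ["UK economy"]},
]

def cooccurrence(tweets):
    """Count how often each author type appears with each theme."""
    counts = Counter()
    for tweet in tweets:
        for theme in set(tweet["themes"]):
            counts[(tweet["author_type"], theme)] += 1
    return counts

counts = cooccurrence(tweets)

# All tweets where a Conservative MP talked about the economy.
economy_by_conservative_mps = [
    t for t in tweets
    if t["author_type"] == "ConservativePartyMP" and "UK economy" in t["themes"]
]
print(counts.most_common(3), len(economy_by_conservative_mps))
```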
Behaviour	Analysis
• Based on the assumption that users in different behavioural
stages communicate differently (different emotions,
directives, etc.)
Pajarito @lindopajarito . 2h
Our building needs 40% of all energy consumed in Switzerland! :(
Desirability: Negative sentiment (expressing personal frustration: anger/sadness)
DJPajarito @DJPajaritoGenial . 12h
I'm so proud when I remember to save energy and I know however small it's helping.
Buzz: Positive sentiment (happiness/joy). I/we + present tense
HotelPajarito @HotelPajarito . 18h
Join us today today to switch of a light for EH! :)
Invitation: Positive sentiment (happy) + use of vocatives
Hate	speech	towards	MPs	on	Twitter
2017 vs 2015
http://greenwoodma.servehttp.com/data/buzzfeed/sunburst.html
Some	useful	links
• GATE http://gate.ac.uk
• GateCloud https://cloud.gate.ac.uk
• Come on a GATE training course!
• Brexit study blog post from NESTA http://www.nesta.org.uk/blog/network-analysis-top-eu-referendum-tweeters
• Brexit study blog posts from Sheffield http://gate4ugc.blogspot.co.uk/search/label/Brexit
• UK elections monitor http://gate.ac.uk/projects/pft
• Blog post on abuse of MPs https://www.buzzfeed.com/tomphillips/twitter-abuse-of-mps-during-the-election-doubled-after-the
• COMRADES project on social media during disasters http://gate.ac.uk/projects/comrades
Publications
• Diana Maynard, Ian Roberts, Mark A. Greenwood, Dominic Rout and Kalina Bontcheva. A Framework for Real-time Semantic Social Media Analysis. Web Semantics: Science, Services and Agents on the World Wide Web, 2017.
• K. Bontcheva, L. Derczynski, A. Funk, M.A. Greenwood, D. Maynard, N. Aswani. TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text. Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2013).
• D. Maynard and K. Bontcheva. Understanding climate change tweets: an open source toolkit for social media analysis. In Proc. of EnviroInfo 2015, Copenhagen, Sep. 2015.
• Diana Maynard, Kalina Bontcheva, Isabelle Augenstein. Natural Language Processing for the Semantic Web. Morgan and Claypool, December 2016. ISBN: 9781627059091 (contains a chapter on social media analysis)
• M. Fernandez, L. Piccolo, D. Maynard, M. Wippoo, C. Meili, H. Alani. Pro-Environmental Campaigns via Social Media: Analysing Awareness and Behaviour Patterns. To appear in Journal of Web Science 2017.
• These (and more) are available at https://gate.ac.uk/gate/doc/papers.html
Acknowledgements
This work was supported by:
• the European Union under the Information and Communication Technologies (ICT) theme of the 7th Framework and H2020 Programmes for R&D
  • DecarboNet (610829) http://www.decarbonet.eu
  • SoBigData (654024) http://www.sobigdata.eu
  • COMRADES (687847) http://www.comrades-project.eu
• Nesta http://nesta.org.uk
