1. What is the problem? How can we deal with concept drift? Summary
Extensional Mapping-Chains for studying Concept
Drift in Political Ontologies
Shenghui Wang1 Stefan Schlobach2
Janet Takens3 Wouter van Atteveldt3
1
The Network Institute
2
Department of Computer Science
3
Department of Communication Science
Vrije Universiteit Amsterdam
ICA 2010
Singapore
2. What is the problem? How can we deal with concept drift? Summary
Content analysis in Communication Science
Communication scientists study all sorts of media content
related to human communication
Content analysis based on the NET method
concepts: political actors and issues
relations: associations, opinions, or actions.
Example
Het Openbaar Ministerie (OM) wil de komende vier jaar mensen-
handel uitroeien.
3. What is the problem? How can we deal with concept drift? Summary
Content analysis in Communication Science
Communication scientists study all sorts of media content
related to human communication
Content analysis based on the NET method
concepts: political actors and issues
relations: associations, opinions, or actions.
Example
Het Openbaar Ministerie (OM) wil de komende vier jaar mensen-
handel uitroeien.
4. What is the problem? How can we deal with concept drift? Summary
Content analysis in Communication Science
Communication scientists study all sorts of media content
related to human communication
Content analysis based on the NET method
concepts: political actors and issues
relations: associations, opinions, or actions.
Example
Het Openbaar Ministerie (OM) wil de komende vier jaar mensen-
handel uitroeien.
-1
om human trafficking
6. What is the problem? How can we deal with concept drift? Summary
Network-based communication science study
What information can we extract from these networks?
Politicians are networking
Politics is perceived by citizens via media
Media study by semantic network analysis
Who is determining the subjects?
Who is teaming up?
Who is more credible?
Who owns which topic?
7. What is the problem? How can we deal with concept drift? Summary
Before network analysis
We first need to build the networks!
Requires: large corpora with annotated textual content
Manual coding against coding books (ontologies)
Automated content analysis in progress
8. What is the problem? How can we deal with concept drift? Summary
Before network analysis
We first need to build the networks!
Requires: large corpora with annotated textual content
Manual coding against coding books (ontologies)
Automated content analysis in progress
9. What is the problem? How can we deal with concept drift? Summary
What is the problem?
Problems with constructing annotated content
Data from different time periods or genres
Coded by different teams at different moments
Manifesto Research Group: 25 countries, from 1945 to 2006
Comparative Policy Agendas project: media content,
manifestos, legislative texts, government press statements, etc.
Election campaign coverage from 1994 to 2006
10. What is the problem? How can we deal with concept drift? Summary
What are the challenges?
Interoperability problem while sharing information
Different teams use different code books
Example
illegal immigration
labour migrants
Different coding books should be merged or at least connected
Not the focus of this paper
11. What is the problem? How can we deal with concept drift? Summary
What are the challenges?
Interoperability problem while sharing information
Different teams use different code books
Example
illegal immigration
labour migrants
Different coding books should be merged or at least connected
Not the focus of this paper
12. What is the problem? How can we deal with concept drift? Summary
What are the challenges?
Interoperability problem while sharing information
Different teams use different code books
Example
illegal immigration
labour migrants
Different coding books should be merged or at least connected
Not the focus of this paper
13. What is the problem? How can we deal with concept drift? Summary
Follow the Fashion?
14. What is the problem? How can we deal with concept drift? Summary
Women’s role?
Suffragettes said that women’s role in society is unacceptable
Pope says that women’s role in society is unacceptable
15. What is the problem? How can we deal with concept drift? Summary
Concept drift
Our problem: Concept drift
Meaning of concepts changes over time
Analysis based on evolving concepts must consider temporal
locality
Study concept drift itself is useful
16. What is the problem? How can we deal with concept drift? Summary
Datasets
Five political ontologies which were used to annotate
newspaper articles
23 639 manually annotated newspaper articles during five
recent Dutch national election campaigns
There even exist manual mappings but most of them are
lexically very similar
17. What is the problem? How can we deal with concept drift? Summary
Detecting concept drift
We use extensional mapping techniques
Consider concepts at different time to be different concepts
Use extensional method to detect the links between concepts
at different time
Assumption: similar sentences should be coded with similar
concepts, therefore, similar concepts should have similar
extension.
18. What is the problem? How can we deal with concept drift? Summary
Representing concept drift using mapping chains
19. What is the problem? How can we deal with concept drift? Summary
Evaluating concept drift
What can we learn from those chains?
Do they agree with the political reality?
Do they tell us something we do not noticed before?
Are some concepts more stable/unstable than others?
Quantitative evaluation is interesting, but qualitative analysis
seems to tell us something too.
20. What is the problem? How can we deal with concept drift? Summary
Qualitative analysis of mapping chains
Association vs. similarity
Early erroneous associations can turn large parts of the
analysis practically useless.
21. What is the problem? How can we deal with concept drift? Summary
Qualitative analysis of mapping chains
Association vs. similarity
Early erroneous associations can turn large parts of the
analysis practically useless.
23. What is the problem? How can we deal with concept drift? Summary
If we know two end-point concepts have the same meaning
Kite-shaped chains
02_mensenrechten
98_avluchtelingen 02_jusititie
03_politie
02_criminaliteit
03_justitie
98_rcriminaliteit 02_drugkoeriers
02_cellentekort
03_criminaliteit
94_asielzoekers 98_kabinet kokmierlods 02_bedrijfsleven
02_democratie
98_asielzoekers
02_buitenlanders 03_asielzoekers 06_asielzoekers
98_okerken 02_instroom beperking 03_opvang illegalen
02_asielzoekers 03_vluchtelingen
03_illegalen
24. What is the problem? How can we deal with concept drift? Summary
“christelijken” (Christians)
98_oabortus 02_normen waarden
94_christelijken 03_multiculturele samenleving 06_christenen
98_ochristelijk christenen 02_multiculturele samenleving
“asielzoekers” (Asylum seeker)
02_mensenrechten
98_avluchtelingen 02_jusititie
03_politie
02_criminaliteit
03_justitie
98_rcriminaliteit 02_drugkoeriers
02_cellentekort
03_criminaliteit
94_asielzoekers 98_kabinet kokmierlods 02_bedrijfsleven
02_democratie
98_asielzoekers
02_buitenlanders 03_asielzoekers 06_asielzoekers
98_okerken 02_instroom beperking 03_opvang illegalen
02_asielzoekers 03_vluchtelingen
03_illegalen
25. What is the problem? How can we deal with concept drift? Summary
Summary
By looking at extensions of concepts, we can detect concept
drift
Domain experts found that the detected concept drift makes
sense
Automated matching techniques can help domain experts to
find hidden links between concepts
More work needs to be done