Learning Structured Knowledge
from Social Tagging Data:
a critical review of methods and techniques
Hang Dong, Wei Wang, Hai-Ning Liang
CSSE, Xi’an Jiaotong-Liverpool University
The 8th IEEE International Conference on Social Computing and
Networking (SocialCom 2015), Chengdu, China, 19-21 Dec 2015
Last edited on 12th Jan 2016 by Hang Dong.
Content
• Background: social tagging & Folksonomies
• Knowledge Structures / Ontologies
• Methods & techniques for learning KS
from social tagging data
• Issues & future studies
Background: Folksonomies
• Folksonomy = Folk + Taxonomy (term coined by Thomas Vander Wal; see his 2007
account at http://www.vanderwal.net/folksonomy.html)
• Users share a resource and describe it with short, free-text
personal tags
• Social tagging systems: Delicious, Flickr,
Douban, BibSonomy...
• Raw tags are noisy and ambiguous, which limits their usefulness.
Noisy & Complex Social Tags
• apple (polysemy)
• to-read, myThesis (personal use)
• sematnikweb (typo, multiword)
• data-mining, DataMining (multiword)
• $$$$Tsk (nonsense tag)
• Mac, Macintosh, Apple (synonyms)
• Programming, Javascript, perl (general-specific)
From Social Tagging Data to Ontologies
[Figure: researcher-generated data (user-tag-resource-date) → useful and evolving ontologies]
Acknowledgement to the figure from M. Triana. Ontology...
what? (Jan 2012)
http://www.micheltriana.com/blog/2012/01/20/ontology-what
http://www.bibsonomy.org/tag/knowledge
The structure of social tagging data
Acknowledgement to the figure from A. García-Silva, O. Corcho, H. Alani, and A. Gómez-Pérez,
“Review of the state of the art: Discovering and associating semantics to tags in folksonomies,”
The Knowledge Engineering Review, vol. 27, no. 01, pp. 57-85, Mar. 2012.
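As a minimal sketch of the structure above, a folksonomy can be held as a list of (user, tag, resource) assignments and projected onto one dimension at a time. The sample data and the function names `resources_by_tag` and `users_by_tag` are hypothetical and are not taken from the cited review.

```python
from collections import defaultdict

# Hypothetical tag assignments: (user, tag, resource) triples.
assignments = [
    ("alice", "programming", "r1"),
    ("alice", "javascript", "r1"),
    ("bob", "programming", "r2"),
    ("bob", "perl", "r2"),
    ("carol", "programming", "r1"),
]

def resources_by_tag(triples):
    """Project the triples onto tags: tag -> set of resources it annotates."""
    index = defaultdict(set)
    for _user, tag, resource in triples:
        index[tag].add(resource)
    return dict(index)

def users_by_tag(triples):
    """Project the triples onto tags: tag -> set of users who applied it."""
    index = defaultdict(set)
    for user, tag, _resource in triples:
        index[tag].add(user)
    return dict(index)

tag_resources = resources_by_tag(assignments)
tag_users = users_by_tag(assignments)
```

Each projection discards one dimension of the data, so which one to keep depends on the method applied afterwards.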
Applications
• General
– (Semantic-based) Recommender systems
– Ontology enrichment
– Information retrieval and navigation
• Specific
– E-learning discovery
– Enterprise search and knowledge management
– Academic communication
– Image annotation, recognition, representation
– Music mood mining
A spectrum of Knowledge Structures / ontologies
From low semantics to high semantics:
• Folksonomies
• Terms / Concepts list
• Concept hierarchies
• Taxonomies
• Ontologies
Key idea: Concept
Adapted from the figures in R. R. Souza, D. Tudhope, and M. B.
Almeida, “Towards a taxonomy of KOS: Dimensions for classifying
Knowledge Organization Systems,” 2012.
Ontology learning layer cake
Adapted from the Figure 1 in Paul Buitelaar, Philipp Cimiano, and Bernardo Magnini: ‘Ontology
Learning from Text: An Overview’, 2003
[Figure: the layer cake, with its layers marked as Part A and Part B]
Learning term (or concept) lists
Hang Dong, Wei Wang, Hai-Ning Liang. “Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods
and Techniques,” in The 8th IEEE International Conference on Social Computing and Networking (SocialCom 2015). (accepted)
Word Sense Disambiguation
Social tagging data → context definition → sense selection → term list / concept list
• Context definition (identify synonyms): tag co-occurrences
• External sense inventory: WordNet, Wikipedia, DBpedia, domain ontologies
• Sense selection: semantic distance measure, or supervised learning (classification)
Word Sense Induction
• No external sense inventory.
• Unsupervised.
• Social tagging data → term list / concept list
• Data representation: (a) vector space model, weighted with TF-IDF;
(b) co-occurrence graph
• Clustering algorithms: Hierarchical Agglomerative Clustering (a),
DBSCAN (a), Non-negative Matrix Factorization (a), graph-based (b)
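A minimal sketch of representation (a): each tag becomes a TF-IDF-weighted vector over the resources it annotates, and cosine similarity between vectors is what a clustering algorithm would then consume. The counts, the smoothed IDF variant `log(1 + N/df)` and the function names are assumptions made for illustration, not the formulation of any specific reviewed study.

```python
import math
from collections import Counter

# Hypothetical counts: how often each tag was applied to each resource.
tag_counts = {
    "mac": Counter({"r1": 3, "r2": 1}),
    "macintosh": Counter({"r1": 2, "r2": 2}),
    "perl": Counter({"r3": 4}),
}

def tfidf_vectors(counts):
    """Weight each tag's resource counts by a smoothed inverse document frequency."""
    n_tags = len(counts)
    # Document frequency: the number of tags under which each resource appears.
    df = Counter(resource for vec in counts.values() for resource in vec)
    return {
        tag: {r: tf * math.log(1 + n_tags / df[r]) for r, tf in vec.items()}
        for tag, vec in counts.items()
    }

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v[r] for r, w in u.items() if r in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

vectors = tfidf_vectors(tag_counts)
```

With this toy data, "mac" and "macintosh" share resources and score well above zero, while "mac" and "perl" share none and score zero.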
Learning (hierarchical) relations
“popularity-generality” assumption
• The more popular/influential/frequent a tag
is, the more general it is! [51]
if sim(tag1, tag2) >= threshold:
    if tag1 is more popular than tag2:
        add tag2 as a child of tag1 [37]
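The rule above can be sketched as follows. This is a simplification of the slide's rule rather than the full algorithm in [37]: Jaccard similarity over resource sets stands in for `sim`, the number of tagged resources stands in for popularity, and the data and the threshold of 0.2 are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets (assumed stand-in for sim)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def build_edges(tag_resources, threshold=0.2):
    """Return (parent, child) pairs: similar tags, more popular one on top."""
    edges = []
    tags = sorted(tag_resources)
    for i, t1 in enumerate(tags):
        for t2 in tags[i + 1:]:
            r1, r2 = tag_resources[t1], tag_resources[t2]
            if jaccard(r1, r2) < threshold:
                continue  # not similar enough to be related
            if len(r1) > len(r2):
                edges.append((t1, t2))
            elif len(r2) > len(r1):
                edges.append((t2, t1))
            # Equal popularity: no direction can be inferred, so skip.
    return edges

# Hypothetical tag -> resources mapping.
tag_resources = {
    "programming": {"r1", "r2", "r3", "r4"},
    "javascript": {"r1", "r2"},
    "perl": {"r3"},
    "cooking": {"r9"},
}
edges = build_edges(tag_resources)
```

Note that a tag can receive several parents under this rule, so the result is a directed graph rather than a tree unless a further selection step is added.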
Popularity metrics: Set Theory
• Compare tag sets, user sets and/or resource sets
Algorithm 1 [35]:
    if R(tagB) ⊆ R(tagA):
        add tagB as a child of tagA
Algorithm 2 [40]:
    inc(tagA, tagB) = |R(tagA) ∩ R(tagB)| / |R(tagA)| = gen(tagB, tagA)
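Both set-based rules can be sketched in a few lines. The sketch assumes that R(t) denotes the set of resources annotated with tag t, which the slide does not state explicitly; the function names and sample sets are hypothetical.

```python
def is_child(r_b, r_a):
    """Algorithm 1: tagB is a child of tagA if R(tagB) is a subset of R(tagA)."""
    return r_b <= r_a

def inclusion(r_a, r_b):
    """Algorithm 2: inc(tagA, tagB) = |R(tagA) ∩ R(tagB)| / |R(tagA)|."""
    return len(r_a & r_b) / len(r_a) if r_a else 0.0

def generalization(r_b, r_a):
    """gen(tagB, tagA) is defined as inc(tagA, tagB)."""
    return inclusion(r_a, r_b)

# Hypothetical resource sets.
R_programming = {"r1", "r2", "r3", "r4"}
R_perl = {"r3", "r4"}

perl_in_programming = inclusion(R_perl, R_programming)
programming_in_perl = inclusion(R_programming, R_perl)
```

Here every resource tagged "perl" is also tagged "programming", so inc(perl, programming) is 1.0 while the reverse is 0.5; the asymmetry is what suggests a direction for the relation.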
Popularity metrics: Centrality in SNA
Social Network Analysis (SNA)
• Build a graph: nodes are tags, edges are
their similarities [37]
• Measure how popular/influential a tag is:
– Degree centrality
– Betweenness centrality
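Degree centrality, the simpler of the two metrics, can be sketched without any graph library. The edge list below is hypothetical and assumes that tag pairs whose similarity passed some threshold have already been linked; normalising by n - 1 is the usual convention.

```python
from collections import defaultdict

def degree_centrality(edges):
    """Fraction of the other nodes that each node is directly connected to."""
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)
    n = len(neighbours)
    if n < 2:
        return {tag: 0.0 for tag in neighbours}
    return {tag: len(nb) / (n - 1) for tag, nb in neighbours.items()}

# Hypothetical undirected tag-similarity graph.
tag_edges = [
    ("programming", "javascript"),
    ("programming", "perl"),
    ("programming", "python"),
    ("javascript", "python"),
]
centrality = degree_centrality(tag_edges)
most_central = max(centrality, key=centrality.get)
```

Under the popularity-generality assumption, the most central tag would be treated as the most general one. Tags with no edges do not appear in the result of this sketch.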
Association rule mining
• Support and Confidence [45] [41]
• Key issue: the tripartite (user-tag-resource) structure must first be
aggregated into a two-dimensional space.
• Association analysis alone is not sufficient for extracting
hierarchical relations.
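A minimal sketch of support and confidence, assuming one possible aggregation: users are dropped and each resource's tag set is treated as a transaction. This is only one of several projections, and the data and function names are hypothetical.

```python
from collections import defaultdict

def transactions_from_triples(triples):
    """Aggregate (user, tag, resource) triples into one tag set per resource."""
    baskets = defaultdict(set)
    for _user, tag, resource in triples:
        baskets[resource].add(tag)
    return list(baskets.values())

def support(itemset, baskets):
    """Share of transactions that contain every tag in itemset."""
    if not baskets:
        return 0.0
    return sum(1 for b in baskets if itemset <= b) / len(baskets)

def confidence(antecedent, consequent, baskets):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    base = support(antecedent, baskets)
    return support(antecedent | consequent, baskets) / base if base else 0.0

# Hypothetical tag assignments.
triples = [
    ("u1", "javascript", "r1"), ("u1", "programming", "r1"),
    ("u2", "programming", "r2"), ("u2", "perl", "r2"),
    ("u3", "javascript", "r3"), ("u3", "programming", "r3"),
    ("u3", "cooking", "r4"),
]
baskets = transactions_from_triples(triples)
js_to_prog = confidence({"javascript"}, {"programming"}, baskets)
prog_to_js = confidence({"programming"}, {"javascript"}, baskets)
```

The rule javascript → programming holds with confidence 1.0 and the reverse with about 0.67. The asymmetry hints at a direction, but a high-confidence rule does not by itself establish a hierarchical relation.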
Association rule mining
Acknowledgement to the figure from C. Schmitz, A. Hotho, R. Jäschke, and G. Stumme, "Mining Association
Rules in Folksonomies," Data Science and Classification, Studies in Classification, Data Analysis, and
Knowledge Organization V. Batagelj, H.-H. Bock, A. Ferligoj and A. Žiberna, eds., pp. 261-270: Springer Berlin
Heidelberg, 2006.
Settings used in the figure: minsupp = 0.05%, minconf = 50%
Supervised learning
• Binary classification: hierarchical or not [41]
• Feature extraction:
– Support and confidence
– Cosine similarity
– Inclusion and generalization measures
– …
• Instance labeling using WordNet and ConceptNet
• Class Imbalance Problem
• Classifiers: C4.5, Naïve Bayes, Random Forest, AdaBoost,
Logistic Regression, SVM
Acknowledgement to the figure from A. S. C. Rêgo, L. B. Marinho, and C. E. S. Pires, “A supervised learning
approach to detect subsumption relations between tags in folksonomies,” in the Proc. 30th Annu. ACM Symp.
Applied Computing (SAC '15), Salamanca, Spain, 2015, pp. 409-415.
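The feature-extraction step can be sketched as a function that turns an ordered tag pair into a feature dictionary for a binary classifier. This is a hedged illustration computed over resource sets only; the feature names, the choice of projection and the sample data are assumptions, not the exact feature set of [41].

```python
import math

def pair_features(r_a, r_b, n_resources):
    """Features for the ordered pair (tagA, tagB) from their resource sets."""
    both = len(r_a & r_b)
    return {
        # Share of all resources that carry both tags.
        "support": both / n_resources if n_resources else 0.0,
        # Confidence of tagA -> tagB and of tagB -> tagA.
        "confidence_ab": both / len(r_a) if r_a else 0.0,
        "confidence_ba": both / len(r_b) if r_b else 0.0,
        # Cosine similarity of the two binary resource vectors.
        "cosine": both / math.sqrt(len(r_a) * len(r_b)) if r_a and r_b else 0.0,
    }

# Hypothetical resource sets for one candidate pair.
R_javascript = {"r1", "r2"}
R_programming = {"r1", "r2", "r3", "r4"}
features = pair_features(R_javascript, R_programming, n_resources=4)
```

Each labelled pair would contribute one such row to the training set. Because most tag pairs are not hierarchically related, the positive class is typically much smaller than the negative one, which is the class imbalance problem noted above.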
Crowdsourcing & semi-supervised methods
• Special functions of social tagging systems
– Allowing users to annotate with tag pairs or
hierarchies (TagTree, Delicious, Bibsonomy).
– Some implicit functions, e.g. Flickr’s photo
management hierarchy (photo, set, collection) [50].
• Using relational clustering algorithms to
aggregate the hierarchies [50].
Issues of existing studies
• External resources: not well suited to social tagging data.
• Co-occurrence features alone: not precise.
• Unsupervised learning: the number of clusters may need to be specified, and
odd clusters can result.
• The “popularity-generality” assumption: not accurate.
• Crowdsourcing methods: unnatural, more work for users, and little
data.
• The evolution of knowledge is not captured.
Future studies
• Use integrated or new methods to derive semantics from
social tagging data
• Investigate the evolution of knowledge in Folksonomies
– From User-Resource-Tag to User-Resource-Tag-Date
• Big Data + ontology learning
• Specific domains + applications
– Image
– Academic resources
– Music
Key References
• [2] J. Gemmell, A. Shepitsen, B. Mobasher, and R. Burke, “Personalization in folksonomies based on
tag clustering,” in the Proc. 6th Workshop Intelligent Techniques for Web Personalization and
Recommender Systems, Chicago, Illinois, USA, 2008.
• [15] P. Andrews and J. Pane, “Sense induction in folksonomies: a review,” Artificial Intelligence
Review, vol. 40, no. 2, pp. 147-174, Aug. 2013.
• [18] A. García-Silva, O. Corcho, H. Alani, and A. Gómez-Pérez, “Review of the state of the art:
Discovering and associating semantics to tags in folksonomies,” The Knowledge Engineering Review,
vol. 27, no. 01, pp. 57-85, Mar. 2012.
• [30] S. Angeletou, M. Sabou, and E. Motta, “Semantically enriching folksonomies with FLOR,” in Proc.
1st Int. Workshop Collective Semantics: Collective Intelligence & the Semantic Web (CISWeb 2008)
at The 5th Annu. European Semantic Web Conf. (ESWC 2008), Tenerife, Spain, 2008.
• [33] A. Joorabchi, M. English, and A. E. Mahdi, “Automatic mapping of user tags to Wikipedia
concepts: The case of a Q&A website – StackOverflow,” Journal of Information Science, published
online before print, May 2015.
• [35] P. Mika, “Ontologies are us: A unified model of social networks and semantics,” in The
Semantic Web–ISWC 2005. Springer, 2005, pp. 522-536.
• [37] P. Heymann and H. Garcia-Molina. (2006, Apr. 26). Collaborative creation of communal
hierarchical taxonomies in social tagging systems [online]. Technical Report, Stanford InfoLab.
Available: http://ilpubs.stanford.edu:8090/775/
• [40] P. De Meo, G. Quattrone, and D. Ursino, “Exploitation of semantic relationships and
hierarchical data structures to support a user in his annotation and browsing activities in
folksonomies,” Information Systems, vol. 34, no. 6, pp. 511-535, Sep. 2009.
• [41] A. S. C. Rêgo, L. B. Marinho, and C. E. S. Pires, “A supervised learning approach to detect
subsumption relations between tags in folksonomies,” in the Proc. 30th Annu. ACM Symp. Applied
Computing (SAC '15), Salamanca, Spain, 2015, pp. 409-415.
• [45] C. Schmitz, A. Hotho, R. Jäschke, and G. Stumme, “Mining association rules in folksonomies,” in
Data Science and Classification, V. Batagelj, H.-H. Bock, A. Ferligoj, & A. Žiberna, Eds. Springer Berlin
Heidelberg, 2006, pp. 261-270.
• [50] A. Plangprasopchok, K. Lerman, and L. Getoor, “Growing a tree in the forest: constructing
folksonomies by integrating structured metadata,” in the Proc. of the 16th ACM SIGKDD Int. Conf.
Knowledge discovery and data mining, Washington, DC, USA, 2010.
• [51] F. Almoqhim, D. Millard, and N. Shadbolt, “Improving on popularity as a proxy for generality
when building tag hierarchies from folksonomies,” in Social Informatics, Vol. 8851, L. Aiello and D.
McFarland, Eds. Springer International Publishing, 2014, pp. 95-111.
The full list of references can be found in the paper:
Hang Dong, Wei Wang, Hai-Ning Liang. “Learning Structured Knowledge from Social Tagging
Data: A Critical Review of Methods and Techniques,” in The 8th IEEE International Conference on
Social Computing and Networking (SocialCom 2015). (accepted)
Q & A
Thank you for your attention

Ready to get noticed? Partner with SociocosmosReady to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with SociocosmosSocioCosmos
 
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779Delhi Call girls
 
9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings Republik9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings RepublikGenuineGirls
 
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...Nitya salvi
 
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779Delhi Call girls
 

Kürzlich hochgeladen (20)

VIP Chandigarh Call Girls Service 7001035870 Enjoy Call Girls With Our Escorts
VIP Chandigarh Call Girls Service 7001035870 Enjoy Call Girls With Our EscortsVIP Chandigarh Call Girls Service 7001035870 Enjoy Call Girls With Our Escorts
VIP Chandigarh Call Girls Service 7001035870 Enjoy Call Girls With Our Escorts
 
SELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANYSELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANY
 
Factors-on-Authenticity-and-Validity-of-Evidences-and-Information.pptx
Factors-on-Authenticity-and-Validity-of-Evidences-and-Information.pptxFactors-on-Authenticity-and-Validity-of-Evidences-and-Information.pptx
Factors-on-Authenticity-and-Validity-of-Evidences-and-Information.pptx
 
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Masudpur
Delhi  99530 vip 56974  Genuine Escort Service Call Girls in MasudpurDelhi  99530 vip 56974  Genuine Escort Service Call Girls in Masudpur
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Masudpur
 
Your LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence PackageYour LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence Package
 
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
 
Vip Call Girls Tilak Nagar ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Tilak Nagar ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Tilak Nagar ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Tilak Nagar ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
 
Film show production powerpoint for site
Film show production powerpoint for siteFilm show production powerpoint for site
Film show production powerpoint for site
 
🔝9953056974 🔝Call Girls In Mehrauli Escort Service Delhi NCR
🔝9953056974 🔝Call Girls In Mehrauli  Escort Service Delhi NCR🔝9953056974 🔝Call Girls In Mehrauli  Escort Service Delhi NCR
🔝9953056974 🔝Call Girls In Mehrauli Escort Service Delhi NCR
 
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic HappensIgnite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
 
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
 
Film the city investagation powerpoint :)
Film the city investagation powerpoint :)Film the city investagation powerpoint :)
Film the city investagation powerpoint :)
 
Ready to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with SociocosmosReady to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with Sociocosmos
 
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
 
9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings Republik9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings Republik
 
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
 
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
 

Learning structured knowledge from social tagging data: a critical review of methods and techniques

  • 1. Learning Structured Knowledge from Social Tagging Data: a critical review of methods and techniques Hang Dong, Wei Wang, Hai-Ning Liang CSSE, Xi’an Jiaotong-Liverpool University The 8th IEEE International Conference on Social Computing and Networking (SocialCom 2015), Chengdu, China, 19-21 Dec 2015 Email: Last edited on 12th Jan 2016 by Hang Dong.
  • 2. Content • Background: social tagging & Folksonomies • Knowledge Structures / Ontologies • Methods & techniques for learning KS from social tagging data • Issues & future studies
  • 3. Background: Folksonomies • Folksonomy = Folk + Taxonomy (by Thomas Vander Wal in 2007, http://www.vanderwal.net/folksonomy.html) • Users share a resource and create personal, free-form short text descriptions (tags) • Social Tagging Systems: delicious, flickr, douban, bibsonomy... • Raw tags are noisy and ambiguous, and therefore not very useful as they are.
  • 4. Noisy & Complex Social Tags • apple (polysemy) • to-read, myThesis (personal use) • sematnikweb (typo, multiword) • data-mining, DataMining (multiword) • $$$$Tsk (nonsense tag) • Mac, Macintosh, Apple (synonyms) • Programming, Javascript, perl (general-specific)
  • 5. From Social Tagging Data to Ontologies Researcher generated data (user-tag-resource-date) Useful and evolving ontologies Acknowledgement to the figure from M. Triana. Ontology... what? (Jan 2012) http://www.micheltriana.com/blog/2012/01/20/ontology-what http://www.bibsonomy.org/tag/knowledge
  • 6. The structure of social tagging data Acknowledgement to the figure from A. García-Silva, O. Corcho, H. Alani, and A. Gómez-Pérez, “Review of the state of the art: Discovering and associating semantics to tags in folksonomies,” The Knowledge Engineering Review, vol. 27, no. 01, pp. 57-85, Mar. 2012.
  • 7. Applications • General – (Semantic-based) Recommender systems – Ontology enrichment – Information retrieval and navigation • Specific – E-learning discovery – Enterprise search and knowledge management – Academic communication – Image annotation, recognition, representation – Music mood mining
  • 8. A spectrum of Knowledge Structures / ontologies Low semantics High semantics Folksonomies Terms / Concepts list Concept hierarchies Taxonomies Ontologies Key idea: Concept Adapted from the figures in R. R. Souza, D. Tudhope, and M. B. Almeida, “Towards a taxonomy of KOS: Dimensions for classifying Knowledge Organization Systems,” 2012.
  • 9. Ontology learning layer cake Adapted from Figure 1 in Paul Buitelaar, Philipp Cimiano, and Bernardo Magnini, “Ontology Learning from Text: An Overview,” 2003. [Figure labels: Part A, Part B]
  • 10. Learning term (or concept) lists Hang Dong, Wei Wang, Hai-Ning Liang. “Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods and Techniques,” in The 8th IEEE International Conference on Social Computing and Networking (SocialCom 2015). (accepted)
  • 11. Word Sense Disambiguation • Input: social tagging data • Context definition (identify synonyms): tag co-occurrences • External sense inventory: WordNet, Wikipedia, DBpedia, domain ontologies • Sense selection: semantic distance measure; supervised learning (classification) • Output: term list / concept list
  • 12. Word Sense Induction • No external sense inventory. • Unsupervised. • Input: social tagging data; output: term list / concept list • Data representation: (a) vector space model (TF-IDF); (b) co-occurrence graph • Clustering algorithms: Hierarchical Agglomerative Clustering (a), DBSCAN (a), Non-negative Matrix Factorization (a), graph-based (b)
  • 13. Learning (hierarchical) relations Hang Dong, Wei Wang, Hai-Ning Liang. “Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods and Techniques,” in The 8th IEEE International Conference on Social Computing and Networking (SocialCom 2015). (accepted)
  • 14. “popularity-generality” assumption • The more popular/influential/frequent a tag is, the more general it is [51]. • Rule [37]: if sim(tag1, tag2) >= threshold and tag1 is more popular than tag2, then add tag2 as a child of tag1.
  • 15. Popularity metrics: Set Theory • Based on tag sets, user sets and/or resource sets • Algorithm 1 [35]: if R(tagB) ⊆ R(tagA), then add tagB as a child of tagA • Algorithm 2 [40]: inc(tagA, tagB) = |R(tagA) ∩ R(tagB)| / |R(tagA)| = gen(tagB, tagA)
  • 16. Popularity metrics: Centrality in SNA Social Network Analysis (SNA) • Build a graph: nodes are tags, edges are their similarities [37] • Measure how popular/influential a tag is: – Degree centrality – Betweenness centrality
  • 17. Association rule mining • Support and Confidence [45] [41] • Important issue: how to aggregate the tripartite structure into a two-dimensional space. • Association analysis alone is not sufficient for hierarchical relation extraction.
  • 18. Association rule mining Acknowledgement to the figure from C. Schmitz, A. Hotho, R. Jäschke, and G. Stumme, "Mining Association Rules in Folksonomies," Data Science and Classification, Studies in Classification, Data Analysis, and Knowledge Organization V. Batagelj, H.-H. Bock, A. Ferligoj and A. Žiberna, eds., pp. 261-270: Springer Berlin Heidelberg, 2006. Figure parameters: minsupp = 0.05%, minconf = 50%
  • 19. Supervised learning • Binary classification: hierarchical or not [41] • Feature extraction: – Support and confidence – Cosine similarity – Inclusion and generalization measures – … • Instance labeling using WordNet and ConceptNet • Class imbalance problem • Classifiers: C4.5, Naïve Bayes, Random Forest, AdaBoost, Logistic Regression, SVM
  • 20. Acknowledgement to the figure from A. S. C. Rêgo, L. B. Marinho, and C. E. S. Pires, “A supervised learning approach to detect subsumption relations between tags in folksonomies,” in the Proc. 30th Annu. ACM Symp. Applied Computing (SAC '15), Salamanca, Spain, 2015, pp. 409-415.
  • 21. Crowdsourcing & semi-supervised methods • Special functions of social tagging systems – Allowing users to annotate with tag pairs or hierarchies (TagTree, Delicious, Bibsonomy). – Some implicit functions: Flickr’s photo management: photo-set-collection [50]. • Using relational clustering algorithms to aggregate the hierarchies [50].
  • 22. Issues of existing studies • External resources – not suitable for social tagging data. • Using only co-occurrence features – not precise. • Unsupervised learning – needs the number of clusters to be specified, and can generate odd clusters. • The “popularity-generality” assumption – not accurate. • Crowdsourcing methods – not natural, more work for users, little data available. • Existing studies do not show the evolution of knowledge.
  • 23. Future studies • Using integrated or new methods to derive semantics from social tagging data • Investigating the evolution of knowledge in Folksonomies – From User-Resource-Tag to User-Resource-Tag-Date • Big Data + ontology learning • Specific domains + applications – Image – Academic resources – Music
  • 24. Key References • [2] J. Gemmell, A. Shepitsen, B. Mobasher, and R. Burke, “Personalization in folksonomies based on tag clustering,” in the Proc. 6th Workshop Intelligent Techniques for Web Personalization and Recommender Systems, Chicago, Illinois, USA, 2008. • [15] P. Andrews and J. Pane, “Sense induction in folksonomies: a review,” Artificial Intelligence Review, vol. 40, no. 2, pp. 147-174, Aug. 2013. • [18] A. García-Silva, O. Corcho, H. Alani, and A. Gómez-Pérez, “Review of the state of the art: Discovering and associating semantics to tags in folksonomies,” The Knowledge Engineering Review, vol. 27, no. 01, pp. 57-85, Mar. 2012. • [30] S. Angeletou, M. Sabou, and E. Motta, “Semantically enriching folksonomies with FLOR,” in Proc. 1st Int. Workshop Collective Semantics: Collective Intelligence & the Semantic Web (CISWeb 2008) at The 5th Annu. European Semantic Web Conf. (ESWC 2008), Tenerife, Spain, 2008. • [33] A. Joorabchi, M. English, and A. E. Mahdi, “Automatic mapping of user tags to Wikipedia concepts: The case of a Q&A website – StackOverflow,” Journal of Information Science, published online before print, May 2015. • [35] P. Mika, “Ontologies are us: A unified model of social networks and semantics,” in The Semantic Web–ISWC 2005. Springer, 2005, pp. 522-536. • [37] P. Heymann and H. Garcia-Molina. (2006, Apr. 26). Collaborative creation of communal hierarchical taxonomies in social tagging systems [online]. Technical Report, Stanford InfoLab. Available: http://ilpubs.stanford.edu:8090/775/
  • 25. Key References • [40] P. De Meo, G. Quattrone, and D. Ursino, “Exploitation of semantic relationships and hierarchical data structures to support a user in his annotation and browsing activities in folksonomies,” Information Systems, vol. 34, no. 6, pp. 511-535, Sep. 2009. • [41] A. S. C. Rêgo, L. B. Marinho, and C. E. S. Pires, “A supervised learning approach to detect subsumption relations between tags in folksonomies,” in the Proc. 30th Annu. ACM Symp. Applied Computing (SAC '15), Salamanca, Spain, 2015, pp. 409-415. • [45] C. Schmitz, A. Hotho, R. Jäschke, and G. Stumme, “Mining association rules in folksonomies,” in Data Science and Classification, V. Batagelj, H.-H. Bock, A. Ferligoj, & A. Žiberna, Eds. Springer Berlin Heidelberg, 2006, pp. 261-270. • [50] A. Plangprasopchok, K. Lerman, and L. Getoor, “Growing a tree in the forest: constructing folksonomies by integrating structured metadata,” in the Proc. of the 16th ACM SIGKDD Int. Conf. Knowledge discovery and data mining, Washington, DC, USA, 2010. • [51] F. Almoqhim, D. Millard, and N. Shadbolt, “Improving on popularity as a proxy for generality when building tag hierarchies from folksonomies,” in Social Informatics, Vol. 8851, L. Aiello and D. McFarland, Eds. Springer International Publishing, 2014, pp. 95-111. The full list of references can be found in the paper: Hang Dong, Wei Wang, Hai-Ning Liang. “Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods and Techniques,” in The 8th IEEE International Conference on Social Computing and Networking (SocialCom 2015). (accepted)
  • 26. Q & A
  • 27. Thank you for your attention
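
The rule on slide 14 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the algorithm of [37]: each tag is represented by the set of resources it annotates, popularity is taken to be the size of that set, and sim is cosine similarity over those sets. The names `cosine`, `build_hierarchy` and `EXAMPLE` are hypothetical.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two tags given as sets of resource ids."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))


def build_hierarchy(resources_by_tag, threshold):
    """Return {tag: parent}; parent is None for top-level tags.

    Assumption: popularity = number of resources annotated with the tag.
    Ties in popularity are broken alphabetically.
    """
    ordered = sorted(resources_by_tag,
                     key=lambda t: (-len(resources_by_tag[t]), t))
    parent = {}
    for tag in ordered:
        best, best_sim = None, threshold
        # Only tags placed earlier (at least as popular) can become parents.
        for candidate in parent:
            sim = cosine(resources_by_tag[tag], resources_by_tag[candidate])
            if sim >= best_sim:
                best, best_sim = candidate, sim
        parent[tag] = best
    return parent


# Made-up example data, for illustration only.
EXAMPLE = {
    "programming": {1, 2, 3, 4, 5, 6},
    "javascript": {1, 2, 3},
    "perl": {4, 5},
    "travel": {7, 8},
}

if __name__ == "__main__":
    print(build_hierarchy(EXAMPLE, threshold=0.5))
```

With this toy data and a threshold of 0.5, "javascript" and "perl" are placed under "programming", while "travel" stays at the top level. The same data also shows why slide 22 calls the assumption inaccurate: any frequent tag that happens to co-occur with a rarer one becomes its parent, whether or not it is semantically more general.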

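
Slide 16 measures how popular a tag is by its centrality in a tag graph. The sketch below computes degree centrality only, in plain Python. It rests on assumptions that are not in the slides: edges are created from raw co-occurrence counts (a stand-in for the similarity edges of [37]), and the names `cooccurrence_graph`, `degree_centrality` and `TAGS_BY_RESOURCE` are hypothetical.

```python
from itertools import combinations


def cooccurrence_graph(tags_by_resource, min_cooccurrence=1):
    """Undirected tag graph as {tag: set of neighbouring tags}.

    Assumption: two tags are linked when they annotate at least
    `min_cooccurrence` common resources.
    """
    counts = {}
    nodes = set()
    for tags in tags_by_resource.values():
        nodes.update(tags)
        for a, b in combinations(sorted(set(tags)), 2):
            counts[(a, b)] = counts.get((a, b), 0) + 1
    neighbours = {tag: set() for tag in nodes}
    for (a, b), n in counts.items():
        if n >= min_cooccurrence:
            neighbours[a].add(b)
            neighbours[b].add(a)
    return neighbours


def degree_centrality(neighbours):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(neighbours)
    if n < 2:
        return {tag: 0.0 for tag in neighbours}
    return {tag: len(adj) / (n - 1) for tag, adj in neighbours.items()}


# Made-up example data, for illustration only.
TAGS_BY_RESOURCE = {
    "r1": ["programming", "javascript"],
    "r2": ["programming", "perl"],
    "r3": ["programming", "javascript", "web"],
}

if __name__ == "__main__":
    graph = cooccurrence_graph(TAGS_BY_RESOURCE)
    for tag, score in sorted(degree_centrality(graph).items()):
        print(tag, round(score, 3))
```

In this toy graph "programming" is linked to every other tag and therefore gets the highest degree centrality, which under the popularity-generality assumption would make it the most general tag. Betweenness centrality, the other measure named on the slide, needs shortest-path counting and is left out of this sketch.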
Editor's notes

  1. As of July 2015, there were 3,794,882 (around 3.8 million) annotations on Bibsonomy.
  2. Define “annotation”, “co-occurrence”. Social tagging data are not only about tags.
  3. Add references later for each line.
  4. Part A and Part B. The reason to do this is that some research directly goes to Part B without a clearly defined set of concepts.
  5. Explain each type and its issues.
  6. Should I make each element in this graph clearer? Elements: tag co-occurrence patterns, semantic distance measures, external sense inventory, classification algorithms.
  7. SNA and set theory are statistical methods.
  8. inc(tagA, tagB) stands for “Inclusion degree of tagA in tagB”.
  9. Need further reading about this. [problematic slide!] Topics: formal concept analysis; tri-concept association rule mining.
  10. Needs further reading about the article! [problematic slide!]
  11. Other issues: cluster analysis normally requires the number of clusters to be specified first, and may generate odd clusters.
  12. It is not possible to list all references here, but we want to acknowledge all research in this area.
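
The set-based measures on slide 15 (see note 8) and the support and confidence measures on slide 17 can be illustrated together. The sketch assumes that R(tag) is the set of resources annotated with the tag and that the tripartite user-tag-resource data are projected onto resources by dropping the user dimension, which is only one of several possible aggregations. All function names and the example triples are hypothetical.

```python
from collections import defaultdict


def resources_by_tag(triples):
    """Project (user, tag, resource) triples onto tag -> set of resources."""
    index = defaultdict(set)
    for _user, tag, resource in triples:
        index[tag].add(resource)
    return dict(index)


def is_child(index, tag_b, tag_a):
    """Algorithm 1 on slide 15: tagB is a child of tagA if R(tagB) ⊆ R(tagA)."""
    return index[tag_b] <= index[tag_a]


def inclusion(index, tag_a, tag_b):
    """inc(tagA, tagB) = |R(tagA) ∩ R(tagB)| / |R(tagA)|."""
    r_a, r_b = index[tag_a], index[tag_b]
    return len(r_a & r_b) / len(r_a)


def support(index, tag_a, tag_b, n_resources):
    """Share of all resources annotated with both tags."""
    return len(index[tag_a] & index[tag_b]) / n_resources


def confidence(index, tag_a, tag_b):
    """Confidence of the rule tagA -> tagB when each resource is a transaction.

    Under this projection it coincides with inc(tagA, tagB).
    """
    return inclusion(index, tag_a, tag_b)


# Made-up example data, for illustration only.
TRIPLES = [
    ("u1", "programming", "r1"), ("u1", "javascript", "r1"),
    ("u2", "programming", "r2"), ("u2", "javascript", "r2"),
    ("u3", "programming", "r3"), ("u3", "perl", "r3"),
    ("u1", "programming", "r4"),
]

if __name__ == "__main__":
    index = resources_by_tag(TRIPLES)
    n = len(set().union(*index.values()))
    print(is_child(index, "javascript", "programming"))
    print(inclusion(index, "programming", "javascript"))
    print(support(index, "javascript", "programming", n))
    print(confidence(index, "javascript", "programming"))
```

Because confidence under the resource projection equals the inclusion degree, a high-confidence rule javascript -> programming says the same thing as a high inc(javascript, programming). This is one way to read the remark on slide 17 that association analysis alone is not enough: it measures co-occurrence asymmetry, not the type of relation.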