SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
© author(s) of these slides including research results from the KOM research network and TU Darmstadt; otherwise it is specified at the respective slide
9-Sep-12
Prof. Dr.-Ing. Ralf Steinmetz
KOM - Multimedia Communications Lab
RSWeb_Context_Content__v4.0 __20120908_MA
Context Determines Content
An Approach to Resource Recommendation in
Folksonomies
Thomas Rodenhausen
Mojisola Anjorin
Renato Domínguez García
Christoph Rensing
soccer
barca
!"#$"%"&
'(')'(*+
Tango
,-./0%#0"1
,022/
3"44"256-7!-82'('9
'(*'
'(*9
'(':
'('*
%6;"#$04-&"
KOM – Multimedia Communications Lab 2
§ Motivation
§ Context of an Entity in a Folksonomy
§ VSScore Framework
§ Evaluation Methodology and Metrics
§ Results
§ Conclusion & Future Work
Overview
KOM – Multimedia Communications Lab 3
Concept drift is a challenge for graph-based ranking algorithms
§ e.g. Ambiguous tags can cause concept drift as a single tag might represent
multiple semantic concepts
Challenge: Concept Drift
FC Barcelona Website
News about Messi
Dallas Cowboys‘ Website
football
?
?
KOM – Multimedia Communications Lab 4
Assumption on the Context of an Entity
§ The context of an entity in a folksonomy describes the entity well
Assumptions on a Folksonomy’s Content and Structure
§ Tags of a resource describe the resource’s content well
§ Tags of a user describe the user’s interests well
§ Resources of a user describe the user’s interests well
§ Resources of a tag describe the tag’s semantic well
§ Users of a tag describe the tag’s semantic well
§ Users of a resource describe the resource’s content well
Context of an Entity in a Folksonomy
[adapted from Abel 2011]
KOM – Multimedia Communications Lab 5
Context of an Entity in a Folksonomy
Assumption on the Context of an Entity e in a Folksonomy
§ The context of an entity e is given by the strength of relations between e and
other entities in the folksonomy
soccer
barca
!"#$"%"&
'(')'(*+
Tango
,-./0%#0"1
,022/
3"44"256-7!-82'('9
'(*'
'(*9
'(':
'('*
%6;"#$04-&"
KOM – Multimedia Communications Lab 6
Context of an Entity in a Folksonomy
Assumption on the Context of an Entity e in a Folksonomy
§ A vector se created with a ranking algorithm (e.g. FolkRank) for a single query
entity e, describes the relationship between e and other entities in the
folksonomy well
soccer
barca
!"#$"%"&
'(')'(*+
Tango
,-./0%#0"1
,022/
3"44"256-7!-82'('9
'(*'
'(*9
'(':
'('*
%6;"#$04-&" sFCBarcelona =
0.16
0.02
0.10
0.12
0.08
0.01
0.03
…
barca
movieFreak
messi
soccer
barcaFan
Tango
Dallas Cowboys
…
KOM – Multimedia Communications Lab 7
Context of an Entity in a Folksonomy
Assumption on the Context of an Entity e in a Folksonomy
§ A vector se created with a ranking algorithm (e.g. FolkRank) for a single query
entity e, describes the relationship between e and other entities in the
folksonomy well
soccer
barca
!"#$"%"&
'(')'(*+
Tango
,-./0%#0"1
,022/
3"44"256-7!-82'('9
'(*'
'(*9
'(':
'('*
%6;"#$04-&" sFCBarcelona =
0.16
0.02
0.10
0.12
0.08
0.01
0.03
…
barca
movieFreak
messi
soccer
barcaFan
Tango
Dallas Cowboys
…
§  Entities in a folksonomy represent
semantic concepts
§  Any entity (e.g. a query entity) can be
represented by it’s context
KOM – Multimedia Communications Lab 8
VSScore is a flexible framework which incorporates context-specific
information into the recommendation process
§ Based on the vector space model
Vector Space Score (VSScore)
KOM – Multimedia Communications Lab 9
VSScore is a flexible framework which incorporates context-specific
information into the recommendation process
§ Based on the vector space model
§ Creates a vector representation of semantic concepts for each entity in the
folksonomy using a ranking algorithm e.g. FolkRank
Vector Space Score (VSScore)
resources’
contexts
query-entity’s
context
0.8
…
0.1
0.3
…
0.7
0.4
…
0.6
[Hotho et al. 2006]
barca
…
Dallas Cowboys
KOM – Multimedia Communications Lab 10
VSScore is a flexible framework which incorporates context-specific
information into the recommendation process
§ Based on the vector space model
§ Creates a vector representation of semantic concepts for each entity in the
folksonomy using a ranking algorithm e.g. FolkRank
§ Applies the cosine similarity to calculate the distance between these vectors
Vector Space Score (VSScore)
[Hotho et al. 2006]
resources’
contexts
query-entity’s
context
δ
barca
Dallas Cowboys
0.8
…
0.1
0.3
…
0.7
0.4
…
0.6
barca
…
Dallas Cowboys
KOM – Multimedia Communications Lab 11
§ Motivation
§ Context of an Entity in a Folksonomy
§ VSScore Framework
§ Evaluation Methodology and Metrics
§ Results
§ Conclusion & Future Work
Overview
KOM – Multimedia Communications Lab 12
Interests Match
§ User as query node
Guided Search
§ Tag as query node
Evaluation Ranking Tasks
user
given as query node
Interests
Match
Find me a
resource
Related
Resources
resource
[adapted from Bogers 2009]
Guided
Search
tag
KOM – Multimedia Communications Lab 13
Tango
Buenos
Aires
Dancing
Festival
Tango
Buenos
Aires
Dancing
Festival
A post is a Pu,r= {(u,r,t)|(u,r,t) ∈ Y}
For LeavePostOut, the recommendation task
with user as input is harder as with tag as input
Evaluation Methodology: LeavePostOut
[Jäschke et al. 2007]
KOM – Multimedia Communications Lab 14
RTr,t= {(u,r,t)|(u,r,t) ∈ Y}
For LeaveRTOut, the recommendation task
with tag as input is harder as with user as input
Evaluation Methodology: LeaveRTOut
Tango
Buenos
Aires
Dancing
Festival
Tango
Buenos
Aires
Dancing
Festival
KOM – Multimedia Communications Lab 15
Evaluation Metrics
Mean Normalized Precision: The mean of the normalized
Precision at k (with respect to the
maximal achievable Precision at
k) over several queries Q
e.g. for LeavePostOut with
k = 10, Precisionmax(k) = 1/10
The mean of the Average
Precision over several queries Q
MNP(Q, k) =
1
|Q|
|Q|

j=1
Precisionj(k)
Precisionmax,j(k)
MAP(Q) =
1
|Q|
|Q|

j=1
1
mj
mj

k=1
Precision(Rjk)
Mean Average Precision:
[Manning et al 2008]
KOM – Multimedia Communications Lab 16
Bibsonomy corpus with p-core extraction at level 5 to reduce noise
and to focus on the dense portion of the corpus
Evaluation Corpus
Knowledge and Data Engineering Group, University of Kassel: Benchmark Folksonomy Data from Bibsonomy, version of July 7th 2011
Before After
Users 7243 69
Bookmark resources 281550 9
Bibtex resources 469654 134
Tags 216094 179
Tag assignments 2740834 3269
Bookmark posts 330192 51
Bibtex posts 526691 959
FReSET – Domínguez García et al 2012
http://www.kom.tu-darmstadt.de/research-results/downloads/software/freset/
KOM – Multimedia Communications Lab 17
A violin plot is a combination of a box plot and a density trace
Visualization of Results with Violin Plots
Median
3rd Quartile
1st Quartile
KOM – Multimedia Communications Lab 18
Evaluation results for recommendation task Interests Match
Evaluation Results for LeavePostOut
(Popularity of a resource is calculated as the sum of tags and users of a resource)
KOM – Multimedia Communications Lab 19
Evaluation Results for LeavePostOut
Approaches MAP
VSScore 0.1972
FolkRank 0.1809
Popularity 0.0943
Evaluation results for recommendation task Interests Match
KOM – Multimedia Communications Lab 20
Evaluation Results for LeaveRTOut
Evaluation results for recommendation task Guided Search
KOM – Multimedia Communications Lab 21
Evaluation results for recommendation task Guided Search
Evaluation Results for LeaveRTOut
Approaches MAP
Popularity 0.0834
VSScore 0.0592
FolkRank 0.0529
KOM – Multimedia Communications Lab 22
Results for Statistical Significance Tests
Pairwise comparisons based on Average Precision with significance
level of p = 0.05
§ Scenario A: User-based resource recommendation
§ Scenario B: Ranking of user‘s resources
§ Scenario C: Tag-based resource recommendation
Methodology Interests Match Guided Search
LeavePostOut VSScoreA VSScoreC
LeaveNPostsOut VSScoreA FolkRankC, VSScoreC
LeaveRTOut FolkRankB VSScoreC
LeaveNRTsOut FolkRankB VSScoreC
Wilcoxon signed-rank tests
KOM – Multimedia Communications Lab 23
VSScore is a Framework leveraging context-specific information
inherently found in a folksonomy for resource recommendation.
Limitations
§ VSScore is computationally complex, therefore evaluations were performed on
a limited corpus size
Future Work
§ Reduce high-dimensional vector space to reduce computational complexity
§ Evaluate on larger corpora from different domains
§ Investigate further recommendation scenarios e.g. tag or user recommendation
Conclusion and Future Work
soccer
barca
!#$%
'(')'(*+
Tango
,-./0%#01
,022/
344256-7!-82'('9
'(*'
'(*9
'(':
'('*
%6;#$04-
resources’
contexts
query-entity’s
context
δ
barca
Dallas Cowboys
0.8
…
0.1
0.3
…
0.7
0.4
…
0.6
KOM – Multimedia Communications Lab 24
Questions  Contact

Weitere ähnliche Inhalte

Andere mochten auch

Similarity Measurement Preliminary Results
Similarity  Measurement  Preliminary ResultsSimilarity  Measurement  Preliminary Results
Similarity Measurement Preliminary Results
xiaojuzheng
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
Chris Johnson
 

Andere mochten auch (11)

Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...
Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...
Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...
 
Exploiting Semantic Information for Graph-based Recommendations of Learning R...
Exploiting Semantic Information for Graph-based Recommendations of Learning R...Exploiting Semantic Information for Graph-based Recommendations of Learning R...
Exploiting Semantic Information for Graph-based Recommendations of Learning R...
 
Iknow ranking sem_info_v9.0__2012.09.07_anjorin
Iknow ranking sem_info_v9.0__2012.09.07_anjorinIknow ranking sem_info_v9.0__2012.09.07_anjorin
Iknow ranking sem_info_v9.0__2012.09.07_anjorin
 
Erster f vortrag_personalized_rec_sys_for_rbl__20110919_ma_v5.0
Erster f vortrag_personalized_rec_sys_for_rbl__20110919_ma_v5.0Erster f vortrag_personalized_rec_sys_for_rbl__20110919_ma_v5.0
Erster f vortrag_personalized_rec_sys_for_rbl__20110919_ma_v5.0
 
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
Hybrid Personalized Recommender System Using Modified Fuzzy C-Means Clusterin...
 
Recommender system
Recommender systemRecommender system
Recommender system
 
Similarity Measurement Preliminary Results
Similarity  Measurement  Preliminary ResultsSimilarity  Measurement  Preliminary Results
Similarity Measurement Preliminary Results
 
An evaluation of SimRank and Personalized PageRank to build a recommender sys...
An evaluation of SimRank and Personalized PageRank to build a recommender sys...An evaluation of SimRank and Personalized PageRank to build a recommender sys...
An evaluation of SimRank and Personalized PageRank to build a recommender sys...
 
Otot Manusia
Otot ManusiaOtot Manusia
Otot Manusia
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 

Ähnlich wie Rs web context_content__v4.0__20120908_ma

Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
Maribel Acosta Deibe
 
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Amit Sheth
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Bradley Allen
 
FOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter Boncz
FOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter BonczFOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter Boncz
FOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter Boncz
Ioan Toma
 
Coates p: the use of genetic programing in exploring 3 d design worlds
Coates p: the use of genetic programing in exploring 3 d design worldsCoates p: the use of genetic programing in exploring 3 d design worlds
Coates p: the use of genetic programing in exploring 3 d design worlds
ArchiLab 7
 

Ähnlich wie Rs web context_content__v4.0__20120908_ma (20)

Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?
 
Wimmics Overview 2021
Wimmics Overview 2021Wimmics Overview 2021
Wimmics Overview 2021
 
Using Information Scent to Model Users in Web1.0 and Web2.0
Using Information Scent to Model Users in Web1.0 and Web2.0Using Information Scent to Model Users in Web1.0 and Web2.0
Using Information Scent to Model Users in Web1.0 and Web2.0
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
SSSW 2013 - Feeding Recommender Systems with Linked Open Data
SSSW 2013 - Feeding Recommender Systems with Linked Open DataSSSW 2013 - Feeding Recommender Systems with Linked Open Data
SSSW 2013 - Feeding Recommender Systems with Linked Open Data
 
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data Cloud
 
Evaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented SearchEvaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented Search
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
 
Self adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofSelf adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation of
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
FOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter Boncz
FOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter BonczFOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter Boncz
FOSDEM2014 - Social Network Benchmark (SNB) Graph Generator - Peter Boncz
 
IEEE 2014 C# Projects
IEEE 2014 C# ProjectsIEEE 2014 C# Projects
IEEE 2014 C# Projects
 
IEEE 2014 C# Projects
IEEE 2014 C# ProjectsIEEE 2014 C# Projects
IEEE 2014 C# Projects
 
(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages
 
Knowledge Graphs and Milestone
Knowledge Graphs and MilestoneKnowledge Graphs and Milestone
Knowledge Graphs and Milestone
 
Mapping Domain Names to Categories
Mapping Domain Names to CategoriesMapping Domain Names to Categories
Mapping Domain Names to Categories
 
WMJ&GMBwosc08-Effective Learning & Production Via Modelling
WMJ&GMBwosc08-Effective Learning & Production Via ModellingWMJ&GMBwosc08-Effective Learning & Production Via Modelling
WMJ&GMBwosc08-Effective Learning & Production Via Modelling
 
Serious Games Analytics - Lecture at TU Darmstadt
Serious Games Analytics - Lecture at TU DarmstadtSerious Games Analytics - Lecture at TU Darmstadt
Serious Games Analytics - Lecture at TU Darmstadt
 
Coates p: the use of genetic programing in exploring 3 d design worlds
Coates p: the use of genetic programing in exploring 3 d design worldsCoates p: the use of genetic programing in exploring 3 d design worlds
Coates p: the use of genetic programing in exploring 3 d design worlds
 

Rs web context_content__v4.0__20120908_ma

  • 1. © author(s) of these slides including research results from the KOM research network and TU Darmstadt; otherwise it is specified at the respective slide 9-Sep-12 Prof. Dr.-Ing. Ralf Steinmetz KOM - Multimedia Communications Lab RSWeb_Context_Content__v4.0 __20120908_MA Context Determines Content An Approach to Resource Recommendation in Folksonomies Thomas Rodenhausen Mojisola Anjorin Renato Domínguez García Christoph Rensing soccer barca !"#$"%"& '(')'(*+ Tango ,-./0%#0"1 ,022/ 3"44"256-7!-82'('9 '(*' '(*9 '(': '('* %6;"#$04-&"
  • 2. KOM – Multimedia Communications Lab 2 § Motivation § Context of an Entity in a Folksonomy § VSScore Framework § Evaluation Methodology and Metrics § Results § Conclusion & Future Work Overview
  • 3. KOM – Multimedia Communications Lab 3 Concept drift is a challenge for graph-based ranking algorithms § e.g. Ambiguous tags can cause concept drift as a single tag might represent multiple semantic concepts Challenge: Concept Drift FC Barcelona Website News about Messi Dallas Cowboys‘ Website football ? ?
  • 4. KOM – Multimedia Communications Lab 4 Assumption on the Context of an Entity § The context of an entity in a folksonomy describes the entity well Assumptions on a Folksonomy’s Content and Structure § Tags of a resource describe the resource’s content well § Tags of a user describe the user’s interests well § Resources of a user describe the user’s interests well § Resources of a tag describe the tag’s semantic well § Users of a tag describe the tag’s semantic well § Users of a resource describe the resource’s content well Context of an Entity in a Folksonomy [adapted from Abel 2011]
  • 5. KOM – Multimedia Communications Lab 5 Context of an Entity in a Folksonomy Assumption on the Context of an Entity e in a Folksonomy § The context of an entity e is given by the strength of relations between e and other entities in the folksonomy soccer barca !"#$"%"& '(')'(*+ Tango ,-./0%#0"1 ,022/ 3"44"256-7!-82'('9 '(*' '(*9 '(': '('* %6;"#$04-&"
  • 6. KOM – Multimedia Communications Lab 6 Context of an Entity in a Folksonomy Assumption on the Context of an Entity e in a Folksonomy § A vector se created with a ranking algorithm (e.g. FolkRank) for a single query entity e, describes the relationship between e and other entities in the folksonomy well soccer barca !"#$"%"& '(')'(*+ Tango ,-./0%#0"1 ,022/ 3"44"256-7!-82'('9 '(*' '(*9 '(': '('* %6;"#$04-&" sFCBarcelona = 0.16 0.02 0.10 0.12 0.08 0.01 0.03 … barca movieFreak messi soccer barcaFan Tango Dallas Cowboys …
  • 7. KOM – Multimedia Communications Lab 7 Context of an Entity in a Folksonomy Assumption on the Context of an Entity e in a Folksonomy § A vector se created with a ranking algorithm (e.g. FolkRank) for a single query entity e, describes the relationship between e and other entities in the folksonomy well soccer barca !"#$"%"& '(')'(*+ Tango ,-./0%#0"1 ,022/ 3"44"256-7!-82'('9 '(*' '(*9 '(': '('* %6;"#$04-&" sFCBarcelona = 0.16 0.02 0.10 0.12 0.08 0.01 0.03 … barca movieFreak messi soccer barcaFan Tango Dallas Cowboys … §  Entities in a folksonomy represent semantic concepts §  Any entity (e.g. a query entity) can be represented by it’s context
  • 8. KOM – Multimedia Communications Lab 8 VSScore is a flexible framework which incorporates context-specific information into the recommendation process § Based on the vector space model Vector Space Score (VSScore)
  • 9. KOM – Multimedia Communications Lab 9 VSScore is a flexible framework which incorporates context-specific information into the recommendation process § Based on the vector space model § Creates a vector representation of semantic concepts for each entity in the folksonomy using a ranking algorithm e.g. FolkRank Vector Space Score (VSScore) resources’ contexts query-entity’s context 0.8 … 0.1 0.3 … 0.7 0.4 … 0.6 [Hotho et al. 2006] barca … Dallas Cowboys
  • 10. KOM – Multimedia Communications Lab 10 VSScore is a flexible framework which incorporates context-specific information into the recommendation process § Based on the vector space model § Creates a vector representation of semantic concepts for each entity in the folksonomy using a ranking algorithm e.g. FolkRank § Applies the cosine similarity to calculate the distance between these vectors Vector Space Score (VSScore) [Hotho et al. 2006] resources’ contexts query-entity’s context δ barca Dallas Cowboys 0.8 … 0.1 0.3 … 0.7 0.4 … 0.6 barca … Dallas Cowboys
  • 11. KOM – Multimedia Communications Lab 11 § Motivation § Context of an Entity in a Folksonomy § VSScore Framework § Evaluation Methodology and Metrics § Results § Conclusion & Future Work Overview
  • 12. KOM – Multimedia Communications Lab 12 Interests Match § User as query node Guided Search § Tag as query node Evaluation Ranking Tasks user given as query node Interests Match Find me a resource Related Resources resource [adapted from Bogers 2009] Guided Search tag
  • 13. KOM – Multimedia Communications Lab 13 Tango Buenos Aires Dancing Festival Tango Buenos Aires Dancing Festival A post is a Pu,r= {(u,r,t)|(u,r,t) ∈ Y} For LeavePostOut, the recommendation task with user as input is harder as with tag as input Evaluation Methodology: LeavePostOut [Jäschke et al. 2007]
  • 14. KOM – Multimedia Communications Lab 14 RTr,t= {(u,r,t)|(u,r,t) ∈ Y} For LeaveRTOut, the recommendation task with tag as input is harder as with user as input Evaluation Methodology: LeaveRTOut Tango Buenos Aires Dancing Festival Tango Buenos Aires Dancing Festival
  • 15. KOM – Multimedia Communications Lab 15 Evaluation Metrics Mean Normalized Precision: The mean of the normalized Precision at k (with respect to the maximal achievable Precision at k) over several queries Q e.g. for LeavePostOut with k = 10, Precisionmax(k) = 1/10 The mean of the Average Precision over several queries Q MNP(Q, k) = 1 |Q| |Q| j=1 Precisionj(k) Precisionmax,j(k) MAP(Q) = 1 |Q| |Q| j=1 1 mj mj k=1 Precision(Rjk) Mean Average Precision: [Manning et al 2008]
  • 16. KOM – Multimedia Communications Lab 16 Bibsonomy corpus with p-core extraction at level 5 to reduce noise and to focus on the dense portion of the corpus Evaluation Corpus Knowledge and Data Engineering Group, University of Kassel: Benchmark Folksonomy Data from Bibsonomy, version of July 7th 2011 Before After Users 7243 69 Bookmark resources 281550 9 Bibtex resources 469654 134 Tags 216094 179 Tag assignments 2740834 3269 Bookmark posts 330192 51 Bibtex posts 526691 959 FReSET – Domínguez García et al 2012 http://www.kom.tu-darmstadt.de/research-results/downloads/software/freset/
  • 17. KOM – Multimedia Communications Lab 17 A violin plot is a combination of a box plot and a density trace Visualization of Results with Violin Plots Median 3rd Quartile 1st Quartile
  • 18. KOM – Multimedia Communications Lab 18 Evaluation results for recommendation task Interests Match Evaluation Results for LeavePostOut (Popularity of a resource is calculated as the sum of tags and users of a resource)
  • 19. KOM – Multimedia Communications Lab 19 Evaluation Results for LeavePostOut Approaches MAP VSScore 0.1972 FolkRank 0.1809 Popularity 0.0943 Evaluation results for recommendation task Interests Match
  • 20. KOM – Multimedia Communications Lab 20 Evaluation Results for LeaveRTOut Evaluation results for recommendation task Guided Search
  • 21. KOM – Multimedia Communications Lab 21 Evaluation results for recommendation task Guided Search Evaluation Results for LeaveRTOut Approaches MAP Popularity 0.0834 VSScore 0.0592 FolkRank 0.0529
  • 22. KOM – Multimedia Communications Lab 22 Results for Statistical Significance Tests Pairwise comparisons based on Average Precision with significance level of p = 0.05 § Scenario A: User-based resource recommendation § Scenario B: Ranking of user‘s resources § Scenario C: Tag-based resource recommendation Methodology Interests Match Guided Search LeavePostOut VSScoreA VSScoreC LeaveNPostsOut VSScoreA FolkRankC, VSScoreC LeaveRTOut FolkRankB VSScoreC LeaveNRTsOut FolkRankB VSScoreC Wilcoxon signed-rank tests
  • 23. KOM – Multimedia Communications Lab 23 VSScore is a Framework leveraging context-specific information inherently found in a folksonomy for resource recommendation. Limitations § VSScore is computationally complex, therefore evaluations were performed on a limited corpus size Future Work § Reduce high-dimensional vector space to reduce computational complexity § Evaluate on larger corpora from different domains § Investigate further recommendation scenarios e.g. tag or user recommendation Conclusion and Future Work soccer barca !#$% '(')'(*+ Tango ,-./0%#01 ,022/ 344256-7!-82'('9 '(*' '(*9 '(': '('* %6;#$04- resources’ contexts query-entity’s context δ barca Dallas Cowboys 0.8 … 0.1 0.3 … 0.7 0.4 … 0.6
  • 24. KOM – Multimedia Communications Lab 24 Questions Contact