SlideShare a Scribd company logo
1 of 1
Download to read offline
Work Flow Acknowledgement This work was partly carried out during the tenure of an ERCIM “Alain Bensoussan” Fellowship Programme. The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no.246016 and no.610594, and the Spanish Ministry of Science and Innovation (TIN2013-47090-C3-2). Configuration Reproducibility, Reproduction, and Benchmarking RiVal – an open source recommender system evaluation toolkit written in Java -- allows for complete control of the evaluation dimensions that take place in any experimental evaluation of a recommender system: data splitting, definition of evaluation strategies, and computation of evaluation metrics. The toolkit is available as Maven dependencies and as a standalone program. It integrates three recommendation frameworks: Mahout, LensKit, MyMediaLite. http://rival.recommenders.net Alan Said, Alejandro Bellogín alansaid@acm.org, alejandro.bellogin@uam.es RiVal – A Toolkit to Foster Reproducibility in Recommender System Evaluation RecSys 2014 – Foster City, CA, USA Further Reading RiVal – A Toolkit to Foster Reproducibility in Recommender System Evaluation in RecSys 2014 A. Said, A. Bellogín Poster abstract Comparative Recommender System Evaluation: Benchmarking Recom- mendation Frameworks In RecSys 2014 A. Said, A. Bellogín Future Work 
•Integrate more RecSys libraries 
•Evaluate more than accuracy (novelty, diversity). 
•Set up a RESTful API 
•Full CLI support 
•And more See github.com/recommenders/rival/issues Example Comparing nDCG@10 for the same algorithms on the same dataset in Apache Mahout, MyMediaLite & Lenskit Select strategy 
•By time Cross validation 
•Random 
•Ratio Select framework 
•Apache Mahout 
•LensKit 
•MyMediaLite Select algorithm 
•Tune settings Recommend Define strategy 
•What is the ground truth 
•What users to evaluate 
•What items to evaluate Select error metrics 
•For rating prediction 
•RMSE, MAE Select ranking metrics 
•For top-k recommendation 
•nDCG, Precision/Recall, MAP Recommend Split Candidate items Evaluate We can use each of the modules independently or in cascade These are two examples of how the modules can be called from command line

More Related Content

What's hot

Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...
Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...
Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...
kjaisuo
 
Project Experience2
Project Experience2Project Experience2
Project Experience2
ajith k
 

What's hot (12)

Self Service BI for Healthcare
Self Service BI for HealthcareSelf Service BI for Healthcare
Self Service BI for Healthcare
 
Panel members v2_datajournals_repositories_repofringe3aug2015
Panel members v2_datajournals_repositories_repofringe3aug2015Panel members v2_datajournals_repositories_repofringe3aug2015
Panel members v2_datajournals_repositories_repofringe3aug2015
 
MS Sql Server:Reporting models
MS Sql Server:Reporting modelsMS Sql Server:Reporting models
MS Sql Server:Reporting models
 
Publish or Perish
Publish or Perish Publish or Perish
Publish or Perish
 
Hospital's publically reported indicators
Hospital's publically reported indicatorsHospital's publically reported indicators
Hospital's publically reported indicators
 
Sowmya Strands Inovation Live
Sowmya Strands Inovation LiveSowmya Strands Inovation Live
Sowmya Strands Inovation Live
 
Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI) Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI)
 
Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...
Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...
Cmgt 554 week 6 individual assingment paper patton fuller community hospital ...
 
eBIRT Update
eBIRT UpdateeBIRT Update
eBIRT Update
 
Feature Analysis of Research Metrics Systems
Feature Analysis of Research Metrics SystemsFeature Analysis of Research Metrics Systems
Feature Analysis of Research Metrics Systems
 
Project Experience2
Project Experience2Project Experience2
Project Experience2
 
An MPages Development Community
An MPages Development CommunityAn MPages Development Community
An MPages Development Community
 

Viewers also liked (7)

Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix Factorization
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013
 
Probabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyProbabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross Entropy
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender Systems
 

Similar to RiVal - A toolkit to foster reproducibility in Recommender System evaluation

Analysis, design and implementation of a Multi-Criteria Recommender System ba...
Analysis, design and implementation of a Multi-Criteria Recommender System ba...Analysis, design and implementation of a Multi-Criteria Recommender System ba...
Analysis, design and implementation of a Multi-Criteria Recommender System ba...
Davide Giannico
 

Similar to RiVal - A toolkit to foster reproducibility in Recommender System evaluation (20)

Hudup - A Framework of E-commercial Recommendation Algorithms
Hudup - A Framework of E-commercial Recommendation AlgorithmsHudup - A Framework of E-commercial Recommendation Algorithms
Hudup - A Framework of E-commercial Recommendation Algorithms
 
A QUANTITATIVE APPROACH IN HEURISTIC EVALUATION OF E-COMMERCE WEBSITES
A QUANTITATIVE APPROACH IN HEURISTIC EVALUATION OF E-COMMERCE WEBSITES A QUANTITATIVE APPROACH IN HEURISTIC EVALUATION OF E-COMMERCE WEBSITES
A QUANTITATIVE APPROACH IN HEURISTIC EVALUATION OF E-COMMERCE WEBSITES
 
IRJET-Smart Tourism Recommender System
IRJET-Smart Tourism Recommender SystemIRJET-Smart Tourism Recommender System
IRJET-Smart Tourism Recommender System
 
MOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMMOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEM
 
Scale
ScaleScale
Scale
 
Product Recommendation Systems based on Hybrid Approach Technology
Product Recommendation Systems based on Hybrid Approach TechnologyProduct Recommendation Systems based on Hybrid Approach Technology
Product Recommendation Systems based on Hybrid Approach Technology
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
SiddharthaSharma_Resume
SiddharthaSharma_ResumeSiddharthaSharma_Resume
SiddharthaSharma_Resume
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Analysis, design and implementation of a Multi-Criteria Recommender System ba...
Analysis, design and implementation of a Multi-Criteria Recommender System ba...Analysis, design and implementation of a Multi-Criteria Recommender System ba...
Analysis, design and implementation of a Multi-Criteria Recommender System ba...
 
A Hybrid Approach for Personalized Recommender System Using Weighted TFIDF on...
A Hybrid Approach for Personalized Recommender System Using Weighted TFIDF on...A Hybrid Approach for Personalized Recommender System Using Weighted TFIDF on...
A Hybrid Approach for Personalized Recommender System Using Weighted TFIDF on...
 
Solo Requisitos 2008 - 07 Upc
Solo Requisitos 2008 - 07 UpcSolo Requisitos 2008 - 07 Upc
Solo Requisitos 2008 - 07 Upc
 
Empirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion MiningEmpirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion Mining
 
Resume
ResumeResume
Resume
 
Usability Assessment of a Context-Aware and Personality-Based Mobile Recommen...
Usability Assessment of a Context-Aware and Personality-Based Mobile Recommen...Usability Assessment of a Context-Aware and Personality-Based Mobile Recommen...
Usability Assessment of a Context-Aware and Personality-Based Mobile Recommen...
 
Performance Evaluation of Open Source Data Mining Tools
Performance Evaluation of Open Source Data Mining ToolsPerformance Evaluation of Open Source Data Mining Tools
Performance Evaluation of Open Source Data Mining Tools
 
Cross Domain Recommender System using Machine Learning and Transferable Knowl...
Cross Domain Recommender System using Machine Learning and Transferable Knowl...Cross Domain Recommender System using Machine Learning and Transferable Knowl...
Cross Domain Recommender System using Machine Learning and Transferable Knowl...
 
L injection toward effective collaborative filtering using uninteresting items
L injection toward effective collaborative filtering using uninteresting itemsL injection toward effective collaborative filtering using uninteresting items
L injection toward effective collaborative filtering using uninteresting items
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
 
About pellustro - The cloud-based platform for assessments
About pellustro - The cloud-based platform for assessmentsAbout pellustro - The cloud-based platform for assessments
About pellustro - The cloud-based platform for assessments
 

More from Alejandro Bellogin

Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Alejandro Bellogin
 

More from Alejandro Bellogin (14)

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenarios
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systems
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender Systems
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 
Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - Slides
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slam
 
Predicting performance in Recommender Systems - Poster
Predicting performance in Recommender Systems - PosterPredicting performance in Recommender Systems - Poster
Predicting performance in Recommender Systems - Poster
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Recently uploaded (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 

RiVal - A toolkit to foster reproducibility in Recommender System evaluation

  • 1. Work Flow Acknowledgement This work was partly carried out during the tenure of an ERCIM “Alain Bensoussan” Fellowship Programme. The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no.246016 and no.610594, and the Spanish Ministry of Science and Innovation (TIN2013-47090-C3-2). Configuration Reproducibility, Reproduction, and Benchmarking RiVal – an open source recommender system evaluation toolkit written in Java -- allows for complete control of the evaluation dimensions that take place in any experimental evaluation of a recommender system: data splitting, definition of evaluation strategies, and computation of evaluation metrics. The toolkit is available as Maven dependencies and as a standalone program. It integrates three recommendation frameworks: Mahout, LensKit, MyMediaLite. http://rival.recommenders.net Alan Said, Alejandro Bellogín alansaid@acm.org, alejandro.bellogin@uam.es RiVal – A Toolkit to Foster Reproducibility in Recommender System Evaluation RecSys 2014 – Foster City, CA, USA Further Reading RiVal – A Toolkit to Foster Reproducibility in Recommender System Evaluation in RecSys 2014 A. Said, A. Bellogín Poster abstract Comparative Recommender System Evaluation: Benchmarking Recom- mendation Frameworks In RecSys 2014 A. Said, A. Bellogín Future Work •Integrate more RecSys libraries •Evaluate more than accuracy (novelty, diversity). •Set up a RESTful API •Full CLI support •And more See github.com/recommenders/rival/issues Example Comparing nDCG@10 for the same algorithms on the same dataset in Apache Mahout, MyMediaLite & Lenskit Select strategy •By time Cross validation •Random •Ratio Select framework •Apache Mahout •LensKit •MyMediaLite Select algorithm •Tune settings Recommend Define strategy •What is the ground truth •What users to evaluate •What items to evaluate Select error metrics •For rating prediction •RMSE, MAE Select ranking metrics •For top-k recommendation •nDCG, Precision/Recall, MAP Recommend Split Candidate items Evaluate We can use each of the modules independently or in cascade These are two examples of how the modules can be called from command line