SlideShare ist ein Scribd-Unternehmen logo
1 von 18
A Scalable Collaborative Filtering
Framework based on Co-clustering
Author: Thomas George, Srujana Merugu in ICDM’05.
Presenter: Rei-Zhe Liu. Date: 2010/10/26.
Outline
 Introduction
 System architecture
 Experiments and result
 Conclusion
2
Introduction
 We propose a dynamic collaborative filtering
approach that can support the entry of new users,
items and ratings using a hybrid of incremental
and batch versions of the co-clustering algorithm.
 Empirical comparison of our approach with SVD,
NNMF and correlation-based collaborative filtering
techniques indicates comparable accuracy at a
much lower computational effort.
3
System architecture
Problem definition(1/2)
 The approximate matrix for prediction is given by
5
Problem definition(2/2)
 We can now pose the prediction of unknown ratings
as a co-clustering problem where we seek to find the
optimal user and item clustering such that the
approximation error with respect to the known
ratings of A is minimized,
 where ensures that only the known ratings contribute
to the loss function.

6
Algorithm(1/3)
7
Algorithm(2/3)
8
Algorithm(3/3)
9
System description
 P1 handles the prediction and
incremental training.
 P2 is responsible for the static
training.
 During incremental training P1,
also updates the raw ratings.
 P2 performs co-clustering
repeatedly by reading A(the
current ratings matrix) and
updating S(summary statistics)
when done.
 Data Objects A and S are stored
at 2 parts: (a)stable part
(b)increment part.
 At the end of each co-clustering
run, the two parts are merged to
obtain a new set of stable values.
10
Experiments and results
Data sets and Exp. Settings(1/2)
 Data set
 MovieLens: 943-1882 user-by-movie matrix. Totally
100,000 ratings. Rated from 1 to 5.
 Evaluation methodology
 The prediction accuracy was measured using the mean
absolute error (MAE), which is the average of the
absolute values of the errors over all the predictions.
 The static training time was estimated in terms of the
CPU time taken for the core training routines (viz. co-
clustering and SVD).
 The prediction time was estimated by averaging over the
response time taken for all the predictions.
12
Data sets and Exp. Settings(2/2)
 For evaluating the prediction accuracy, we created ten
80-20% random train-test splits of the datasets and
averaged the results over the various splits.
 We considered two scenarios, —(i) static testing, where
the known ratings do not change, and (ii) dynamic
testing, where the ratings are updated incrementally.
 Algorithms
 We compared the performance of our co-clustering
based approach with SVD [13], NNMF [10] and classic
correlation-based collaborative filtering [12].
 An incremental SVD-based approach [14] using a
folding in technique was also implemented in order to
evaluate the prediction accuracy in dynamic scenarios
with changing ratings.13
Evaluation(1/3)
k = l = SVD rank =NNMF rank=3
k = l = SVD rank=3
14
Evaluation(2/3)
Dataset:
Mov1
Dataset:
MovieLens
15
CoC: (m+n+kl-k-l)
NNMF, SVD:
(m+n)(k+l)
Evaluation(3/3)
16
Dataset:
MovieLens
Conclusion
Conclusion
 In this paper, we presented a new dynamic
collaborative filtering approach based on simultaneous
clustering of users and items.
 Empirical results indicate that our approach can provide
high quality predictions at a much lower computational
cost compared to traditional correlation and SVD-based
approaches.
18

Weitere ähnliche Inhalte

Was ist angesagt?

SigOpt_Bayesian_Optimization_Primer
SigOpt_Bayesian_Optimization_PrimerSigOpt_Bayesian_Optimization_Primer
SigOpt_Bayesian_Optimization_PrimerIan Dewancker
 
fmelleHumanActivityRecognitionWithMobileSensors
fmelleHumanActivityRecognitionWithMobileSensorsfmelleHumanActivityRecognitionWithMobileSensors
fmelleHumanActivityRecognitionWithMobileSensorsFridtjof Melle
 
Meta learned Confidence for Few-shot Learning
Meta learned Confidence for Few-shot LearningMeta learned Confidence for Few-shot Learning
Meta learned Confidence for Few-shot LearningKIMMINHA3
 
Learning Methods in a Neural Network
Learning Methods in a Neural NetworkLearning Methods in a Neural Network
Learning Methods in a Neural NetworkSaransh Choudhary
 
RapidMiner: Learning Schemes In Rapid Miner
RapidMiner:  Learning Schemes In Rapid MinerRapidMiner:  Learning Schemes In Rapid Miner
RapidMiner: Learning Schemes In Rapid MinerDataminingTools Inc
 
Feature Selection in Machine Learning
Feature Selection in Machine LearningFeature Selection in Machine Learning
Feature Selection in Machine LearningUpekha Vandebona
 
Genetic algorithm for hyperparameter tuning
Genetic algorithm for hyperparameter tuningGenetic algorithm for hyperparameter tuning
Genetic algorithm for hyperparameter tuningDr. Jyoti Obia
 
RapidMiner: Data Mining And Rapid Miner
RapidMiner: Data Mining And Rapid MinerRapidMiner: Data Mining And Rapid Miner
RapidMiner: Data Mining And Rapid MinerDataminingTools Inc
 
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021Chris Ohk
 
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...journalBEEI
 
Machine learning Mind Map
Machine learning Mind MapMachine learning Mind Map
Machine learning Mind MapAshish Patel
 
Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)
Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)
Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)Mahmoud Alfarra
 
Deep learning MindMap
Deep learning MindMapDeep learning MindMap
Deep learning MindMapAshish Patel
 
Caravan insurance data mining prediction models
Caravan insurance data mining prediction modelsCaravan insurance data mining prediction models
Caravan insurance data mining prediction modelsMuthu Kumaar Thangavelu
 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsIRJET Journal
 
Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Chung-Il Kim
 

Was ist angesagt? (20)

mlsys_portrait
mlsys_portraitmlsys_portrait
mlsys_portrait
 
SigOpt_Bayesian_Optimization_Primer
SigOpt_Bayesian_Optimization_PrimerSigOpt_Bayesian_Optimization_Primer
SigOpt_Bayesian_Optimization_Primer
 
fmelleHumanActivityRecognitionWithMobileSensors
fmelleHumanActivityRecognitionWithMobileSensorsfmelleHumanActivityRecognitionWithMobileSensors
fmelleHumanActivityRecognitionWithMobileSensors
 
Meta learned Confidence for Few-shot Learning
Meta learned Confidence for Few-shot LearningMeta learned Confidence for Few-shot Learning
Meta learned Confidence for Few-shot Learning
 
Differential evolution optimization technique
Differential evolution optimization techniqueDifferential evolution optimization technique
Differential evolution optimization technique
 
Learning Methods in a Neural Network
Learning Methods in a Neural NetworkLearning Methods in a Neural Network
Learning Methods in a Neural Network
 
RapidMiner: Learning Schemes In Rapid Miner
RapidMiner:  Learning Schemes In Rapid MinerRapidMiner:  Learning Schemes In Rapid Miner
RapidMiner: Learning Schemes In Rapid Miner
 
Feature Selection in Machine Learning
Feature Selection in Machine LearningFeature Selection in Machine Learning
Feature Selection in Machine Learning
 
Fuzzy inference systems
Fuzzy inference systemsFuzzy inference systems
Fuzzy inference systems
 
Genetic algorithm for hyperparameter tuning
Genetic algorithm for hyperparameter tuningGenetic algorithm for hyperparameter tuning
Genetic algorithm for hyperparameter tuning
 
RapidMiner: Data Mining And Rapid Miner
RapidMiner: Data Mining And Rapid MinerRapidMiner: Data Mining And Rapid Miner
RapidMiner: Data Mining And Rapid Miner
 
Similarity learning
  Similarity learning  Similarity learning
Similarity learning
 
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
 
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
 
Machine learning Mind Map
Machine learning Mind MapMachine learning Mind Map
Machine learning Mind Map
 
Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)
Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)
Graph-Based Technique for Extracting Keyphrases In a Single-Document (GTEK)
 
Deep learning MindMap
Deep learning MindMapDeep learning MindMap
Deep learning MindMap
 
Caravan insurance data mining prediction models
Caravan insurance data mining prediction modelsCaravan insurance data mining prediction models
Caravan insurance data mining prediction models
 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy Algorithms
 
Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]
 

Andere mochten auch

Brokerage 2007 presentation distributed
Brokerage 2007 presentation distributedBrokerage 2007 presentation distributed
Brokerage 2007 presentation distributedimec.archive
 
2 deus leaflet wp2
2 deus leaflet wp22 deus leaflet wp2
2 deus leaflet wp2imec.archive
 
I Minds2009 Health Decision Support Prof Bart De Moor (Ibbt Esat Ku Leuven)
I Minds2009 Health Decision Support  Prof  Bart De Moor (Ibbt Esat Ku Leuven)I Minds2009 Health Decision Support  Prof  Bart De Moor (Ibbt Esat Ku Leuven)
I Minds2009 Health Decision Support Prof Bart De Moor (Ibbt Esat Ku Leuven)imec.archive
 
Acknowledge 08 Ontwikkeling Front End Benny Daems Ibbt Edm U Hasselt En Al...
Acknowledge 08 Ontwikkeling Front End  Benny Daems   Ibbt Edm U Hasselt En Al...Acknowledge 08 Ontwikkeling Front End  Benny Daems   Ibbt Edm U Hasselt En Al...
Acknowledge 08 Ontwikkeling Front End Benny Daems Ibbt Edm U Hasselt En Al...imec.archive
 
Maduf07 Expert Opinion And Potential Estimation Lieven De Marez
Maduf07 Expert Opinion And Potential Estimation   Lieven De MarezMaduf07 Expert Opinion And Potential Estimation   Lieven De Marez
Maduf07 Expert Opinion And Potential Estimation Lieven De Marezimec.archive
 
T map 로그에서 발생한 Java Locale 문제들
T map 로그에서 발생한 Java Locale 문제들T map 로그에서 발생한 Java Locale 문제들
T map 로그에서 발생한 Java Locale 문제들Chanil Park
 
Qo E E2 E6 Slotevent Programma
Qo E E2 E6 Slotevent ProgrammaQo E E2 E6 Slotevent Programma
Qo E E2 E6 Slotevent Programmaimec.archive
 
Ddo1 Bernd Langeheine 081017 Ghent
Ddo1 Bernd Langeheine   081017 GhentDdo1 Bernd Langeheine   081017 Ghent
Ddo1 Bernd Langeheine 081017 Ghentimec.archive
 
Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed Bart De Nil En Jero...
Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed   Bart De Nil En Jero...Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed   Bart De Nil En Jero...
Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed Bart De Nil En Jero...imec.archive
 
tviexpress
tviexpresstviexpress
tviexpressTVIRICH
 
The Library's "Place"
The Library's "Place"The Library's "Place"
The Library's "Place"LHPeaden
 
Mark Sterns : entrepreneurship and faithfulness
Mark Sterns : entrepreneurship and faithfulnessMark Sterns : entrepreneurship and faithfulness
Mark Sterns : entrepreneurship and faithfulnessmicahdavis
 
Brokerage2006 de logistieke keten
Brokerage2006 de logistieke ketenBrokerage2006 de logistieke keten
Brokerage2006 de logistieke ketenimec.archive
 
Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_
Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_
Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_imec.archive
 
Brokerage 2007presentation user
Brokerage 2007presentation userBrokerage 2007presentation user
Brokerage 2007presentation userimec.archive
 
Grid07 4 Tzannetakis
Grid07 4 TzannetakisGrid07 4 Tzannetakis
Grid07 4 Tzannetakisimec.archive
 
Analyse Gent M #11 & Launch Startup Garage
Analyse Gent M #11 & Launch Startup GarageAnalyse Gent M #11 & Launch Startup Garage
Analyse Gent M #11 & Launch Startup Garageimec.archive
 
Brokerage2006 beheer van volgende generatie telecom services
Brokerage2006 beheer van volgende generatie telecom servicesBrokerage2006 beheer van volgende generatie telecom services
Brokerage2006 beheer van volgende generatie telecom servicesimec.archive
 

Andere mochten auch (20)

Brokerage 2007 presentation distributed
Brokerage 2007 presentation distributedBrokerage 2007 presentation distributed
Brokerage 2007 presentation distributed
 
2 deus leaflet wp2
2 deus leaflet wp22 deus leaflet wp2
2 deus leaflet wp2
 
I Minds2009 Health Decision Support Prof Bart De Moor (Ibbt Esat Ku Leuven)
I Minds2009 Health Decision Support  Prof  Bart De Moor (Ibbt Esat Ku Leuven)I Minds2009 Health Decision Support  Prof  Bart De Moor (Ibbt Esat Ku Leuven)
I Minds2009 Health Decision Support Prof Bart De Moor (Ibbt Esat Ku Leuven)
 
Acknowledge 08 Ontwikkeling Front End Benny Daems Ibbt Edm U Hasselt En Al...
Acknowledge 08 Ontwikkeling Front End  Benny Daems   Ibbt Edm U Hasselt En Al...Acknowledge 08 Ontwikkeling Front End  Benny Daems   Ibbt Edm U Hasselt En Al...
Acknowledge 08 Ontwikkeling Front End Benny Daems Ibbt Edm U Hasselt En Al...
 
Maduf07 Expert Opinion And Potential Estimation Lieven De Marez
Maduf07 Expert Opinion And Potential Estimation   Lieven De MarezMaduf07 Expert Opinion And Potential Estimation   Lieven De Marez
Maduf07 Expert Opinion And Potential Estimation Lieven De Marez
 
I Lab4 Usecases
I Lab4 UsecasesI Lab4 Usecases
I Lab4 Usecases
 
T map 로그에서 발생한 Java Locale 문제들
T map 로그에서 발생한 Java Locale 문제들T map 로그에서 발생한 Java Locale 문제들
T map 로그에서 발생한 Java Locale 문제들
 
Qo E E2 E6 Slotevent Programma
Qo E E2 E6 Slotevent ProgrammaQo E E2 E6 Slotevent Programma
Qo E E2 E6 Slotevent Programma
 
Ddo1 Bernd Langeheine 081017 Ghent
Ddo1 Bernd Langeheine   081017 GhentDdo1 Bernd Langeheine   081017 Ghent
Ddo1 Bernd Langeheine 081017 Ghent
 
Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed Bart De Nil En Jero...
Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed   Bart De Nil En Jero...Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed   Bart De Nil En Jero...
Erfgoed2 0 6 Nieuwe Perspectieven Voor Digitaal Erfgoed Bart De Nil En Jero...
 
tviexpress
tviexpresstviexpress
tviexpress
 
Show Me the Outcomes - United States
Show Me the Outcomes - United StatesShow Me the Outcomes - United States
Show Me the Outcomes - United States
 
The Library's "Place"
The Library's "Place"The Library's "Place"
The Library's "Place"
 
Mark Sterns : entrepreneurship and faithfulness
Mark Sterns : entrepreneurship and faithfulnessMark Sterns : entrepreneurship and faithfulness
Mark Sterns : entrepreneurship and faithfulness
 
Brokerage2006 de logistieke keten
Brokerage2006 de logistieke ketenBrokerage2006 de logistieke keten
Brokerage2006 de logistieke keten
 
Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_
Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_
Zorg en technologie_IBBT_Brokerage_HS_Peter_Degadt0120416_
 
Brokerage 2007presentation user
Brokerage 2007presentation userBrokerage 2007presentation user
Brokerage 2007presentation user
 
Grid07 4 Tzannetakis
Grid07 4 TzannetakisGrid07 4 Tzannetakis
Grid07 4 Tzannetakis
 
Analyse Gent M #11 & Launch Startup Garage
Analyse Gent M #11 & Launch Startup GarageAnalyse Gent M #11 & Launch Startup Garage
Analyse Gent M #11 & Launch Startup Garage
 
Brokerage2006 beheer van volgende generatie telecom services
Brokerage2006 beheer van volgende generatie telecom servicesBrokerage2006 beheer van volgende generatie telecom services
Brokerage2006 beheer van volgende generatie telecom services
 

Ähnlich wie Scalable Collaborative Filtering Framework via Co-clustering

A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsAhmed Magdy Ezzeldin, MSc.
 
ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...
ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...
ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...ijnlc
 
Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...
Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...
Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...AIRCC Publishing Corporation
 
Threshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniquesThreshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniquesjournalBEEI
 
A Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFISA Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFISIJSRD
 
A Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFISA Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFISIJSRD
 
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...csandit
 
Configuration Navigation Analysis Model for Regression Test Case Prioritization
Configuration Navigation Analysis Model for Regression Test Case PrioritizationConfiguration Navigation Analysis Model for Regression Test Case Prioritization
Configuration Navigation Analysis Model for Regression Test Case Prioritizationijsrd.com
 
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET Journal
 
A Comparative Study on Identical Face Classification using Machine Learning
A Comparative Study on Identical Face Classification using Machine LearningA Comparative Study on Identical Face Classification using Machine Learning
A Comparative Study on Identical Face Classification using Machine LearningIRJET Journal
 
A parsimonious SVM model selection criterion for classification of real-world ...
A parsimonious SVM model selection criterion for classification of real-world ...A parsimonious SVM model selection criterion for classification of real-world ...
A parsimonious SVM model selection criterion for classification of real-world ...o_almasi
 
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTSESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTScsandit
 
A Review on Prediction of Compressive Strength and Slump by Using Different M...
A Review on Prediction of Compressive Strength and Slump by Using Different M...A Review on Prediction of Compressive Strength and Slump by Using Different M...
A Review on Prediction of Compressive Strength and Slump by Using Different M...IRJET Journal
 
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...IAEME Publication
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...IOSR Journals
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
 
Parameter Estimation of Software Reliability Growth Models Using Simulated An...
Parameter Estimation of Software Reliability Growth Models Using Simulated An...Parameter Estimation of Software Reliability Growth Models Using Simulated An...
Parameter Estimation of Software Reliability Growth Models Using Simulated An...Editor IJCATR
 

Ähnlich wie Scalable Collaborative Filtering Framework via Co-clustering (20)

Ajila (1)
Ajila (1)Ajila (1)
Ajila (1)
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
 
ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...
ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...
ENACTMENT RANKING OF SUPERVISED ALGORITHMS DEPENDENCE OF DATA SPLITTING ALGOR...
 
Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...
Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...
Enactment Ranking of Supervised Algorithms Dependence of Data Splitting Algor...
 
Threshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniquesThreshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniques
 
A Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFISA Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFIS
 
A Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFISA Defect Prediction Model for Software Product based on ANFIS
A Defect Prediction Model for Software Product based on ANFIS
 
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
EFFICIENT USE OF HYBRID ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM COMBINED WITH N...
 
Configuration Navigation Analysis Model for Regression Test Case Prioritization
Configuration Navigation Analysis Model for Regression Test Case PrioritizationConfiguration Navigation Analysis Model for Regression Test Case Prioritization
Configuration Navigation Analysis Model for Regression Test Case Prioritization
 
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
 
A Comparative Study on Identical Face Classification using Machine Learning
A Comparative Study on Identical Face Classification using Machine LearningA Comparative Study on Identical Face Classification using Machine Learning
A Comparative Study on Identical Face Classification using Machine Learning
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
 
A parsimonious SVM model selection criterion for classification of real-world ...
A parsimonious SVM model selection criterion for classification of real-world ...A parsimonious SVM model selection criterion for classification of real-world ...
A parsimonious SVM model selection criterion for classification of real-world ...
 
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTSESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
 
A Review on Prediction of Compressive Strength and Slump by Using Different M...
A Review on Prediction of Compressive Strength and Slump by Using Different M...A Review on Prediction of Compressive Strength and Slump by Using Different M...
A Review on Prediction of Compressive Strength and Slump by Using Different M...
 
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...
AN IMPROVED METHOD FOR IDENTIFYING WELL-TEST INTERPRETATION MODEL BASED ON AG...
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
 
Parameter Estimation of Software Reliability Growth Models Using Simulated An...
Parameter Estimation of Software Reliability Growth Models Using Simulated An...Parameter Estimation of Software Reliability Growth Models Using Simulated An...
Parameter Estimation of Software Reliability Growth Models Using Simulated An...
 

Kürzlich hochgeladen

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 

Kürzlich hochgeladen (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 

Scalable Collaborative Filtering Framework via Co-clustering

  • 1. A Scalable Collaborative Filtering Framework based on Co-clustering Author: Thomas George, Srujana Merugu in ICDM’05. Presenter: Rei-Zhe Liu. Date: 2010/10/26.
  • 2. Outline  Introduction  System architecture  Experiments and result  Conclusion 2
  • 3. Introduction  We propose a dynamic collaborative filtering approach that can support the entry of new users, items and ratings using a hybrid of incremental and batch versions of the co-clustering algorithm.  Empirical comparison of our approach with SVD, NNMF and correlation-based collaborative filtering techniques indicates comparable accuracy at a much lower computational effort. 3
  • 5. Problem definition(1/2)  The approximate matrix for prediction is given by 5
  • 6. Problem definition(2/2)  We can now pose the prediction of unknown ratings as a co-clustering problem where we seek to find the optimal user and item clustering such that the approximation error with respect to the known ratings of A is minimized,  where ensures that only the known ratings contribute to the loss function.  6
  • 10. System description  P1 handles the prediction and incremental training.  P2 is responsible for the static training.  During incremental training P1, also updates the raw ratings.  P2 performs co-clustering repeatedly by reading A(the current ratings matrix) and updating S(summary statistics) when done.  Data Objects A and S are stored at 2 parts: (a)stable part (b)increment part.  At the end of each co-clustering run, the two parts are merged to obtain a new set of stable values. 10
  • 12. Data sets and Exp. Settings(1/2)  Data set  MovieLens: 943-1882 user-by-movie matrix. Totally 100,000 ratings. Rated from 1 to 5.  Evaluation methodology  The prediction accuracy was measured using the mean absolute error (MAE), which is the average of the absolute values of the errors over all the predictions.  The static training time was estimated in terms of the CPU time taken for the core training routines (viz. co- clustering and SVD).  The prediction time was estimated by averaging over the response time taken for all the predictions. 12
  • 13. Data sets and Exp. Settings(2/2)  For evaluating the prediction accuracy, we created ten 80-20% random train-test splits of the datasets and averaged the results over the various splits.  We considered two scenarios, —(i) static testing, where the known ratings do not change, and (ii) dynamic testing, where the ratings are updated incrementally.  Algorithms  We compared the performance of our co-clustering based approach with SVD [13], NNMF [10] and classic correlation-based collaborative filtering [12].  An incremental SVD-based approach [14] using a folding in technique was also implemented in order to evaluate the prediction accuracy in dynamic scenarios with changing ratings.13
  • 14. Evaluation(1/3) k = l = SVD rank =NNMF rank=3 k = l = SVD rank=3 14
  • 18. Conclusion  In this paper, we presented a new dynamic collaborative filtering approach based on simultaneous clustering of users and items.  Empirical results indicate that our approach can provide high quality predictions at a much lower computational cost compared to traditional correlation and SVD-based approaches. 18