SlideShare ist ein Scribd-Unternehmen logo
1 von 21
http://Learning-Layers-euhttp://Learning-Layers-eu
Learning Layers
Scaling up Technologies for Informal Learning in SME Clusters
Towards a Scalable Social Recommender
Engine for Online Marketplaces
The Case of Apache Solr
Emanuel Lacic, Dominik Kowald, Denis Parra, Christoph Trattner
1
http://Learning-Layers-eu
Many thanks to
Emanuel Lacic
elacic@know-center.at
Graz University of Technology
Austria
2
Denis Parra
dparra@ing.puc.cl
Pontificia Universidad
Catolica
Chile
Christoph Trattner
ctrattner@know-center.at
Know-Center Graz
Austria
http://Learning-Layers-eu
What will this talk be about?
• (Real-time) product recommendations for
online marketplaces
• Scalability of recommender systems
• Utilizing social network data for the
recommendations of products to people
3
http://Learning-Layers-eu
How did this work start?
• Joint project with the Austrian start-up Blanc-Noir
• Personalized product recommender for online
marketplaces based on
– Actions in the marketplaces (e.g., Ebay, Amazon)
– Product information
– Social network data (e.g., Facebook, G+)
– Filter criteria
• Provided at (near) real-time!
… especially if there is a lot of data
… together with many data updates
4
http://Learning-Layers-eu
So now, how we have solved
that issue?
5
http://Learning-Layers-eu
What‘s available out there?
• Frameworks/approaches for scalable
recommendations
– Distributed data processing
• Apache Hadoop / Mahout (map/reduce paradigm)
– Relational databases
• MySQL, PostgreSQL (e.g., RecDB project)
– Collaborative Filtering improvements
• Matrix factorization
• Lack of a framework / approach that combines all
things we need
6
http://Learning-Layers-eu 7
http://Learning-Layers-eu
Why Solr?
• „High-performance, full-featured text search engine library“
… but more precise …
• „High-performance, fully-featured token matching and scoring
library“ [Grainger, 2012]
… which provides ….
– full-text searches (content-based)
– powerful queries (e.g., MoreLikeThis or Facets)
– (near) real-time data updates (no pre/re-calculations)
– easy schema updates (social data integration)
• Established open-source software (Apache license) with big
community
8
http://Learning-Layers-eu 9
Our framework
https://github.com/learning-layers/SocRec
http://Learning-Layers-eu
How does the thing perform?
• Dataset of virtual world SecondLife
– Marketplace and social data
10
http://Learning-Layers-eu
What‘s about the marketplace
and social data features?
11
0
0.05
0.1
0.15
0.2
Purchases Categories Title Description Interests Groups Likes Comments Interactions
nDCG
Data Features
0
0.1
0.2
0.3
0.4
0.5
Purchases Categories Title Description Interests Groups Likes Comments Interactions
UserCoverage
Data Features
• Both types of data are important for the recommender quality and
user coverage
http://Learning-Layers-eu
What‘s about the hybrids?
12
0
0.01
0.02
0.03
0.04
0.05
MP CCFm CFs ALL
nDCG
Recommendation Algorithms
0
0.2
0.4
0.6
0.8
1
MP CCFm CFs ALL
UserCoverage
Data Features
• The hybrid approach provides a good trade-off of recommender
quality and user coverage
http://Learning-Layers-eu
What‘s about the scalability?
13
• Recommendations can be provided in (near) real-time in both cases
(with and without data update)
http://Learning-Layers-eu
What we have shown!
• Apache Solr is more than a search engine!
• Actually it is a great framework to implement a scalable
recommender engine for online marketplaces
• Near real-time recommendations through build-in query-functions
• Near real-time data updates
• Easy integration of social data
+ a high-performance full-text search engine for free!
• Evaluation on dataset gathered from SecondLife
• Different marketplace and social data features are important
• Hybrid approaches produce more robust recommendations
• It scales!
14
http://Learning-Layers-eu
What do we want to do in the
future?
15
• Online study together with BlancNoir with “real” data
• Impact of geo-spatial data
• Impact of temporal data (see WebScience track)
• Comparative study with other backend solutions (e.g.,
ElasticSearch)
http://Learning-Layers-eu
Thank you for your attention!
Code and framework:
https://github.com/learning-layers/SocRec
Questions?
Dominik Kowald
dkowald@know-center.at
Know-Center
Graz University of Technology (Austria)
16
http://Learning-Layers-eu
Backup
17
http://Learning-Layers-eu
Short hands-on session
• Collaborative Filtering
• Content-Based
18
http://Learning-Layers-eu
SecondLife dataset
19
http://Learning-Layers-eu
How to Use the Engine?
• Implement and run a new recommender
20
http://Learning-Layers-eu
Recommendation Algorithms
implemented in the Engine
• MostPopular (MP)
– Recommends for any user the most purchased items
• Collaborative Filtering (CF)
– Find similar users (k nearest neighbors) and recommend novel items of
those users [Schafer et al., 2007]
– In Solr: select queries and facet counts
• Content-Based (C)
– Analyze item meta-data to find similar items [Pazzani et al., 2007]
– In Solr: MoreLikeThis function
• Hybrid (CCF)
– Combine different algorithms to overcome their individual limitations
[Burke et al., 2002]
– Each algorithm can be weighted / tuned according to its performance
21

Weitere ähnliche Inhalte

Was ist angesagt?

A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
LIBER Europe
 

Was ist angesagt? (20)

A Short Swim through the Personal Learning Pool
A Short Swim through the Personal Learning PoolA Short Swim through the Personal Learning Pool
A Short Swim through the Personal Learning Pool
 
Keynote Learning Layers Developer Camp 2013
Keynote Learning Layers Developer Camp 2013Keynote Learning Layers Developer Camp 2013
Keynote Learning Layers Developer Camp 2013
 
A Methodology and Tool Support for Widget-based Web Application Development
A Methodology and Tool Support for Widget-based Web Application DevelopmentA Methodology and Tool Support for Widget-based Web Application Development
A Methodology and Tool Support for Widget-based Web Application Development
 
View Based Near Real Time Collaborative Modeling for Information Systems Engi...
View Based Near Real Time Collaborative Modeling for Information Systems Engi...View Based Near Real Time Collaborative Modeling for Information Systems Engi...
View Based Near Real Time Collaborative Modeling for Information Systems Engi...
 
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
 
OGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and AuthenticationOGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and Authentication
 
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the WebSyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
 
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
 
Strategic Developments in Digital Initiatives at Academic Libraries
Strategic Developments in Digital Initiatives at Academic LibrariesStrategic Developments in Digital Initiatives at Academic Libraries
Strategic Developments in Digital Initiatives at Academic Libraries
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data Infrastructures
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
 
Digital asset management (dam) systems used in Libraries
Digital asset management (dam) systems used in LibrariesDigital asset management (dam) systems used in Libraries
Digital asset management (dam) systems used in Libraries
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
 
The  Selection Between An Open Source And Vended Software in Libraries: Oppor...
The  Selection Between An Open Source And Vended Software in Libraries:Oppor...The  Selection Between An Open Source And Vended Software in Libraries:Oppor...
The  Selection Between An Open Source And Vended Software in Libraries: Oppor...
 
Building and using ontologies (2015)
Building and using ontologies (2015)Building and using ontologies (2015)
Building and using ontologies (2015)
 
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
 
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data TypesYjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
 
E Infrastructure for OA
E Infrastructure for OAE Infrastructure for OA
E Infrastructure for OA
 
The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...
 
Demand Analysis
Demand AnalysisDemand Analysis
Demand Analysis
 

Andere mochten auch

DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
Dejan Kovachev
 
Requirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the WebRequirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the Web
IstvanKoren
 

Andere mochten auch (17)

HT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
HT2016: Influence of Frequency, Recency and Semantic Context on Tag ReuseHT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
HT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
 
WWW2015 PHD Symposium
WWW2015 PHD SymposiumWWW2015 PHD Symposium
WWW2015 PHD Symposium
 
Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media. Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media.
 
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
 
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
 
A Community Information System for Ubiquitous Informal Learning Support
A Community Information System for Ubiquitous Informal Learning SupportA Community Information System for Ubiquitous Informal Learning Support
A Community Information System for Ubiquitous Informal Learning Support
 
Analysis of Overlapping Communities in Signed Complex Networks
Analysis of Overlapping Communities in Signed Complex NetworksAnalysis of Overlapping Communities in Signed Complex Networks
Analysis of Overlapping Communities in Signed Complex Networks
 
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
 
SeViAnno 2.0: Web-Enabled Collaborative Semantic Video Annotation Beyond the ...
SeViAnno 2.0: Web-Enabled CollaborativeSemantic Video Annotation Beyond the ...SeViAnno 2.0: Web-Enabled CollaborativeSemantic Video Annotation Beyond the ...
SeViAnno 2.0: Web-Enabled Collaborative Semantic Video Annotation Beyond the ...
 
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
 
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
 
Informal Learning at the Workplace via Adaptive Video
Informal Learning at the Workplace via Adaptive VideoInformal Learning at the Workplace via Adaptive Video
Informal Learning at the Workplace via Adaptive Video
 
Prototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Prototyping Apps for Informal Learning: A Semantic Video Annotation ScenarioPrototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Prototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
 
Unpacking the Layers Box
Unpacking the Layers BoxUnpacking the Layers Box
Unpacking the Layers Box
 
Requirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the WebRequirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the Web
 
Maximising the opportunities of the online marketplace (South Australia)
Maximising the opportunities of the online marketplace (South Australia)Maximising the opportunities of the online marketplace (South Australia)
Maximising the opportunities of the online marketplace (South Australia)
 
Triptourage - online marketplace for tailored trips on-demand
Triptourage - online marketplace for tailored trips on-demandTriptourage - online marketplace for tailored trips on-demand
Triptourage - online marketplace for tailored trips on-demand
 

Ähnlich wie SRS2014: Towards a Scalable Recommender Engine for Online Marketplaces

Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLA
Paul Barsch
 
Recommendation engines : Matching items to users
Recommendation engines : Matching items to usersRecommendation engines : Matching items to users
Recommendation engines : Matching items to users
jobinwilson
 

Ähnlich wie SRS2014: Towards a Scalable Recommender Engine for Online Marketplaces (20)

C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1
 
Architectural Considerations for Startups
Architectural Considerations for StartupsArchitectural Considerations for Startups
Architectural Considerations for Startups
 
Practical model management in the age of Data science and ML
Practical model management in the age of Data science and MLPractical model management in the age of Data science and ML
Practical model management in the age of Data science and ML
 
ECIR Recommendation Challenges
ECIR Recommendation ChallengesECIR Recommendation Challenges
ECIR Recommendation Challenges
 
GraphTalks Hamburg - Einführung in Graphdatenbanken
GraphTalks Hamburg - Einführung in GraphdatenbankenGraphTalks Hamburg - Einführung in Graphdatenbanken
GraphTalks Hamburg - Einführung in Graphdatenbanken
 
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in Graphdatenbanken
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning Models
 
Invento Labs Web 2.0
Invento Labs Web 2.0Invento Labs Web 2.0
Invento Labs Web 2.0
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLA
 
Recommendation engines : Matching items to users
Recommendation engines : Matching items to usersRecommendation engines : Matching items to users
Recommendation engines : Matching items to users
 
Recommendation engines matching items to users
Recommendation engines matching items to usersRecommendation engines matching items to users
Recommendation engines matching items to users
 
Science Gateway Canvas
Science Gateway CanvasScience Gateway Canvas
Science Gateway Canvas
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
 
Neo4j GraphDay Tel Aviv - Graphs in Action
Neo4j GraphDay Tel Aviv - Graphs in ActionNeo4j GraphDay Tel Aviv - Graphs in Action
Neo4j GraphDay Tel Aviv - Graphs in Action
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Tech essentials for Product managers
Tech essentials for Product managersTech essentials for Product managers
Tech essentials for Product managers
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Product Lines and Ecosystems: from customization to configuration
Product Lines and Ecosystems: from customization to configurationProduct Lines and Ecosystems: from customization to configuration
Product Lines and Ecosystems: from customization to configuration
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"
 

Mehr von Dominik Kowald

Mehr von Dominik Kowald (6)

AFEL: TagRec Framework
AFEL: TagRec FrameworkAFEL: TagRec Framework
AFEL: TagRec Framework
 
AFEL: Online Study of Tag Recommendations
AFEL: Online Study of Tag RecommendationsAFEL: Online Study of Tag Recommendations
AFEL: Online Study of Tag Recommendations
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag Recommendations
 
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
 
Temporal Effects on Hashtag Reuse in Twitter
Temporal Effects on Hashtag Reuse in TwitterTemporal Effects on Hashtag Reuse in Twitter
Temporal Effects on Hashtag Reuse in Twitter
 

Kürzlich hochgeladen

Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
ZurliaSoop
 
Capstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdfCapstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdf
eliklein8
 

Kürzlich hochgeladen (20)

BVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIR
BVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIRBVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIR
BVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIR
 
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
 
Production diary Film the city powerpoint
Production diary Film the city powerpointProduction diary Film the city powerpoint
Production diary Film the city powerpoint
 
Vellore Call Girls Service ☎ ️82500–77686 ☎️ Enjoy 24/7 Escort Service
Vellore Call Girls Service ☎ ️82500–77686 ☎️ Enjoy 24/7 Escort ServiceVellore Call Girls Service ☎ ️82500–77686 ☎️ Enjoy 24/7 Escort Service
Vellore Call Girls Service ☎ ️82500–77686 ☎️ Enjoy 24/7 Escort Service
 
Film show pre-production powerpoint for site
Film show pre-production powerpoint for siteFilm show pre-production powerpoint for site
Film show pre-production powerpoint for site
 
Finance-and-Operations-in-the-Azure-Cloud.pdf
Finance-and-Operations-in-the-Azure-Cloud.pdfFinance-and-Operations-in-the-Azure-Cloud.pdf
Finance-and-Operations-in-the-Azure-Cloud.pdf
 
Film show production powerpoint for site
Film show production powerpoint for siteFilm show production powerpoint for site
Film show production powerpoint for site
 
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
 
Film show post-production powerpoint for site
Film show post-production powerpoint for siteFilm show post-production powerpoint for site
Film show post-production powerpoint for site
 
Unlock the power of Instagram with SocioCosmos. Start your journey towards so...
Unlock the power of Instagram with SocioCosmos. Start your journey towards so...Unlock the power of Instagram with SocioCosmos. Start your journey towards so...
Unlock the power of Instagram with SocioCosmos. Start your journey towards so...
 
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
 
International Airport Call Girls 🥰 8617370543 Service Offer VIP Hot Model
International Airport Call Girls 🥰 8617370543 Service Offer VIP Hot ModelInternational Airport Call Girls 🥰 8617370543 Service Offer VIP Hot Model
International Airport Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Film show investigation powerpoint for the site
Film show investigation powerpoint for the siteFilm show investigation powerpoint for the site
Film show investigation powerpoint for the site
 
VIP Call Girls Morena 9332606886 Free Home Delivery 5500 Only
VIP Call Girls Morena 9332606886 Free Home Delivery 5500 OnlyVIP Call Girls Morena 9332606886 Free Home Delivery 5500 Only
VIP Call Girls Morena 9332606886 Free Home Delivery 5500 Only
 
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic HappensIgnite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
 
Capstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdfCapstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdf
 
Karol Bagh, Delhi Call girls :8448380779 Model Escorts | 100% verified
Karol Bagh, Delhi Call girls :8448380779 Model Escorts | 100% verifiedKarol Bagh, Delhi Call girls :8448380779 Model Escorts | 100% verified
Karol Bagh, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
The Butterfly Effect
The Butterfly EffectThe Butterfly Effect
The Butterfly Effect
 
Marketing Plan - Social Media. The Sparks Foundation
Marketing Plan -  Social Media. The Sparks FoundationMarketing Plan -  Social Media. The Sparks Foundation
Marketing Plan - Social Media. The Sparks Foundation
 
Interpreting the brief for the media IDY
Interpreting the brief for the media IDYInterpreting the brief for the media IDY
Interpreting the brief for the media IDY
 

SRS2014: Towards a Scalable Recommender Engine for Online Marketplaces

  • 1. http://Learning-Layers-euhttp://Learning-Layers-eu Learning Layers Scaling up Technologies for Informal Learning in SME Clusters Towards a Scalable Social Recommender Engine for Online Marketplaces The Case of Apache Solr Emanuel Lacic, Dominik Kowald, Denis Parra, Christoph Trattner 1
  • 2. http://Learning-Layers-eu Many thanks to Emanuel Lacic elacic@know-center.at Graz University of Technology Austria 2 Denis Parra dparra@ing.puc.cl Pontificia Universidad Catolica Chile Christoph Trattner ctrattner@know-center.at Know-Center Graz Austria
  • 3. http://Learning-Layers-eu What will this talk be about? • (Real-time) product recommendations for online marketplaces • Scalability of recommender systems • Utilizing social network data for the recommendations of products to people 3
  • 4. http://Learning-Layers-eu How did this work start? • Joint project with the Austrian start-up Blanc-Noir • Personalized product recommender for online marketplaces based on – Actions in the marketplaces (e.g., Ebay, Amazon) – Product information – Social network data (e.g., Facebook, G+) – Filter criteria • Provided at (near) real-time! … especially if there is a lot of data … together with many data updates 4
  • 5. http://Learning-Layers-eu So now, how we have solved that issue? 5
  • 6. http://Learning-Layers-eu What‘s available out there? • Frameworks/approaches for scalable recommendations – Distributed data processing • Apache Hadoop / Mahout (map/reduce paradigm) – Relational databases • MySQL, PostgreSQL (e.g., RecDB project) – Collaborative Filtering improvements • Matrix factorization • Lack of a framework / approach that combines all things we need 6
  • 8. http://Learning-Layers-eu Why Solr? • „High-performance, full-featured text search engine library“ … but more precise … • „High-performance, fully-featured token matching and scoring library“ [Grainger, 2012] … which provides …. – full-text searches (content-based) – powerful queries (e.g., MoreLikeThis or Facets) – (near) real-time data updates (no pre/re-calculations) – easy schema updates (social data integration) • Established open-source software (Apache license) with big community 8
  • 10. http://Learning-Layers-eu How does the thing perform? • Dataset of virtual world SecondLife – Marketplace and social data 10
  • 11. http://Learning-Layers-eu What‘s about the marketplace and social data features? 11 0 0.05 0.1 0.15 0.2 Purchases Categories Title Description Interests Groups Likes Comments Interactions nDCG Data Features 0 0.1 0.2 0.3 0.4 0.5 Purchases Categories Title Description Interests Groups Likes Comments Interactions UserCoverage Data Features • Both types of data are important for the recommender quality and user coverage
  • 12. http://Learning-Layers-eu What‘s about the hybrids? 12 0 0.01 0.02 0.03 0.04 0.05 MP CCFm CFs ALL nDCG Recommendation Algorithms 0 0.2 0.4 0.6 0.8 1 MP CCFm CFs ALL UserCoverage Data Features • The hybrid approach provides a good trade-off of recommender quality and user coverage
  • 13. http://Learning-Layers-eu What‘s about the scalability? 13 • Recommendations can be provided in (near) real-time in both cases (with and without data update)
  • 14. http://Learning-Layers-eu What we have shown! • Apache Solr is more than a search engine! • Actually it is a great framework to implement a scalable recommender engine for online marketplaces • Near real-time recommendations through build-in query-functions • Near real-time data updates • Easy integration of social data + a high-performance full-text search engine for free! • Evaluation on dataset gathered from SecondLife • Different marketplace and social data features are important • Hybrid approaches produce more robust recommendations • It scales! 14
  • 15. http://Learning-Layers-eu What do we want to do in the future? 15 • Online study together with BlancNoir with “real” data • Impact of geo-spatial data • Impact of temporal data (see WebScience track) • Comparative study with other backend solutions (e.g., ElasticSearch)
  • 16. http://Learning-Layers-eu Thank you for your attention! Code and framework: https://github.com/learning-layers/SocRec Questions? Dominik Kowald dkowald@know-center.at Know-Center Graz University of Technology (Austria) 16
  • 18. http://Learning-Layers-eu Short hands-on session • Collaborative Filtering • Content-Based 18
  • 20. http://Learning-Layers-eu How to Use the Engine? • Implement and run a new recommender 20
  • 21. http://Learning-Layers-eu Recommendation Algorithms implemented in the Engine • MostPopular (MP) – Recommends for any user the most purchased items • Collaborative Filtering (CF) – Find similar users (k nearest neighbors) and recommend novel items of those users [Schafer et al., 2007] – In Solr: select queries and facet counts • Content-Based (C) – Analyze item meta-data to find similar items [Pazzani et al., 2007] – In Solr: MoreLikeThis function • Hybrid (CCF) – Combine different algorithms to overcome their individual limitations [Burke et al., 2002] – Each algorithm can be weighted / tuned according to its performance 21