SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Bastian Karweg, C. Hütter, Prof. K. Böhm




                                  “Evolving Social Search Based on Bookmarks
                                   and Status Messages from Social Networks “


Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm




       KIT – University of the State of Baden-Wuerttemberg                  © Bastian Karweg, 2010 - 2011
     and National Research Center of the Helmholtz Association                                              www.kit.edu
Introduction and motivation


                                                  Scenario
                                 Jane is planning to make some delicious
                             pancakes for her son’s birthday. A friend recently
                              recommended a link to a great recipe, but she
                                 doesn’t remember neither who it was nor
                                   on which social network he posted.

                                                      Goal:

                                        Personalize Jane‘s search result set
                                     in a way that to take all recommendations
                                        into account and idealy show her the
                                         pancake recipe she was looking for.



2   06.12.2011    © Bastian Karweg, 2010 - 2011           Institute for Program Structures and Data Organisation
                                                                                     (IPD), Chair Prof. K. Böhm
Influence factors:

                     The more you know about who is searching something,
                          the better are you able to target the results:


                 •   language                                      •   age, gender
                 •   location              personalization         •   interests, actions
                 •   search type                                   •   friends, contacts



                      1. Identification of each user has to be possible
                      2. His data has to be available, ideally in a
                         standardized form on a central location
                      3. The user needs sufficient control systems to
                         grant or deny access to all or part of his data


3   06.12.2011                     © Bastian Karweg, 2010 - 2011           Institute for Program Structures and Data Organisation
                                                                                                      (IPD), Chair Prof. K. Böhm
Example 1




                                                 http://www.onlineschools.org/blog/facebook-obsession/



4   06.12.2011   © Bastian Karweg, 2010 - 2011                        Institute for Program Structures and Data Organisation
                                                                                                 (IPD), Chair Prof. K. Böhm
Social Web




                                                      content                interaction


                                                           user profile +
                                                       contacts (social graph)



                                                   most of them do not yet use
                                                   their social data for search.


                                                                        http://www.theconversationprism.com/

5     06.12.2011   © Bastian Karweg, 2010 - 2011             Institute for Program Structures and Data Organisation
                                                                                        (IPD), Chair Prof. K. Böhm
Example 2




                                                 •        social bookmarking service
                                                 •        founded in 2006
                                                 •        1.5mm users
                                                 •        24mm bookmarks (links)


                 Also runs a standard fulltext based search engine.


                  Use available social data for search.
                  Aggregate data from different platforms.

6   06.12.2011            © Bastian Karweg, 2010 - 2011                  Institute for Program Structures and Data Organisation
                                                                                                    (IPD), Chair Prof. K. Böhm
Section 2/5

    THEORETICAL
    APPROACH

7   06.12.2011    © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                             (IPD), Chair Prof. K. Böhm
3 Types of social search                                 [ Evans2009 ]



           „Collective“                       „Collaborative“                „Friend-filtered“

             Using the                               Using the                     Using
    „wisdom of the crowds“                     „village paradigm“         „personalized results“
      => The more popular a                     => queries are all        => based on what friends
    content, the better it ranks.             answered by experts.        have shared in the past.

    As discussed in                      As discussed in                 Newest approach and
    [Hotho2006]                          [Horowitz2010]                  main topic of our work.

    Examples                             Examples                        Recent examples
    Digg.com, reddit,                    Aarkvark.com,                   Bing social, GooglePlus,
    Delicious Bookmarks                  Q&A Communities                 Blekko slashtag

    Problems                             Problem                         Problems
    Easy manipulation,                   No instant results;             Not sure if hypotheses
    „one-size-fits-all“                  Needs reliable experts on       work out as predicted.
                                         many topics

8      06.12.2011                   © Bastian Karweg, 2010 - 2011        Institute for Program Structures and Data Organisation
                                                                                                    (IPD), Chair Prof. K. Böhm
Theoretical approach


                              Where to start?

    A)      Measure the amount of social interactions for each search result.

                  Engagement intensity

    B)      How strongly should somebody’s recommendation influence the
            searchers results?

                  Trust levels



9   06.12.2011                © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                         (IPD), Chair Prof. K. Böhm
Engagement intensity

     The more effort the user has to go through, the higher the value:




10     06.12.2011              © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                          (IPD), Chair Prof. K. Böhm
Trust Levels

 • Trust is established as an asymetric relation between users
 • The user can adjust the trust levels for each contact
 • The system can assist in fine tuning within trust levels




11   06.12.2011         © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                   (IPD), Chair Prof. K. Böhm
Our model for SRS
                                                      Full-Text
                            1                         Relevancy
                                                      (Top 1000)



                                                                                               „bookmarked“
                                                                                               „liked“
                                                                   Adam                        „shared“
                                                                   Eve                         „+1‘d“, (…)
                                                                   Jane
                                                                   John
2

           Social Relevance Score (SRS):




12   06.12.2011                 © Bastian Karweg, 2010 - 2011         Institute for Program Structures and Data Organisation
                                                                                                 (IPD), Chair Prof. K. Böhm
Hypotheses


                  The number of links available for social search depends
                  on the number of friends a user has in his social graph.


                  There is a certain number of friends the user needs for
                  social search to work „properly“ for any query.


                  The result quality of classic full text search improves
                  when combining it with the Social Relevance Score.



                      We needed „enough data“ to back these up.


13   06.12.2011              © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                        (IPD), Chair Prof. K. Böhm
Section 3/5

     FIELD STUDY


14   06.12.2011    © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                              (IPD), Chair Prof. K. Böhm
still
Platform: Social-Search.com                                        available

                                                                                          one time only




                  pancake recipe




15   06.12.2011                    © Bastian Karweg, 2010 - 2011      Institute for Program Structures and Data Organisation
                                                                                                 (IPD), Chair Prof. K. Böhm
Field Study Size

                  Crawling social streams every 10 minutes for 58 days
                        (from 09th of Nov.10 until 05th of Jan.11)


                     2.385 testers                                       468.889 friends

                        430, 13%


                                                                         217010, 4
                                    1651, 51%                               6%                 251879, 5
                    1164, 36%
                                                                                                  4%




                         facebook   twitter                                    facebook        twitter




16   06.12.2011                          © Bastian Karweg, 2010 - 2011               Institute for Program Structures and Data Organisation
                                                                                                                (IPD), Chair Prof. K. Böhm
User-Graph




                                                                                       Folkd.com



                                                                 Created using gephi.org
                                                               excerpt of 60.000 Relations
                                                                  between 40.000 Nutzern

17   06.12.2011   © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                             (IPD), Chair Prof. K. Böhm
Resulting datasets:



        Stream Data                   Search Data        Search Simulation


        Extracted and         Analyzed a random            Run a comparison
           crawled                sample of                    test with
           428.522                         2.098        36 query terms
 link-recommendations. social search sessions.           on all test-accounts.




18   06.12.2011         © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                   (IPD), Chair Prof. K. Böhm
Section 4/5

     RESULTS AND
     EVALUATION

19   06.12.2011    © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                              (IPD), Chair Prof. K. Böhm
Impact of the user‘s friend count
Log(10)




                                                                                              Log(10)


20    06.12.2011   © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                              (IPD), Chair Prof. K. Böhm
Results and Evaluation



        So how many friends does one need
       for social search to “work properly“?

                         Defining „work proper“:

            minimum      1 social results for the average query

                  good   5 social results for the average query

                                  contains the search term and has friend engagement

21   06.12.2011            © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                      (IPD), Chair Prof. K. Böhm
It depends on the search term:                                                  non-linear scale




                                       50                               300


22   06.12.2011     © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                               (IPD), Chair Prof. K. Böhm
Influence of SRS


       1.                                   2.                           3.




                  average                     number of clicks                 average time
               click-position                needed per search                spent on search




                   5.70 => 2.92                    1.39 => 1.14          24.46 sec => 13.56 sec

       The user finds a suitable               The user needs 0.25             The user detects
          result on average                    clicks less to get to a          relevant results
        2.78 positions earlier!                    suitable result!           significantly faster.
                                                                                                                 

23    06.12.2011                   © Bastian Karweg, 2010 - 2011          Institute for Program Structures and Data Organisation
                                                                                                     (IPD), Chair Prof. K. Böhm
Section 5/5

     SUMMARY &
     QUESTIONS

24   06.12.2011    © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                              (IPD), Chair Prof. K. Böhm
Summary

       Using SRS can measurably improve social search



       Social Search will be a major part of all future search engines.
       This development is confirmed by the current market developments
       (Google Plus, Bing social, blekko slashtag …)



       The success of a social search depends on:
                  Connectivity of the searching user
                  Popularity of the search term
                  Time since when the user is using social media
                  Size of available social data

25    06.12.2011                    © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                                               (IPD), Chair Prof. K. Böhm
Thanks for your attention!


                         Questions?

                         Bastian Karweg
                         Mobile Advertising GmbH (CEO)

                         Twitter: @timetrax
                         E-Mail: Karweg@mobile-advertising.com
                         Web:     www.bastiankarweg.de




26   06.12.2011   © Bastian Karweg, 2010 - 2011   Institute for Program Structures and Data Organisation
                                                                             (IPD), Chair Prof. K. Böhm

Weitere ähnliche Inhalte

Andere mochten auch

Growing Patchouli PREPARATION
Growing Patchouli PREPARATIONGrowing Patchouli PREPARATION
Growing Patchouli PREPARATIONRichard Beresford
 
Bbc three, young people and faith
Bbc three, young people and faithBbc three, young people and faith
Bbc three, young people and faithRuth Deller
 
Str8ts Weekly Extreme #39 - Solution
Str8ts Weekly Extreme #39 - SolutionStr8ts Weekly Extreme #39 - Solution
Str8ts Weekly Extreme #39 - SolutionSlowThinker
 
ソーシャルメディアリスクマネジメントー炎上対策ー
ソーシャルメディアリスクマネジメントー炎上対策ーソーシャルメディアリスクマネジメントー炎上対策ー
ソーシャルメディアリスクマネジメントー炎上対策ーGoro Kosaka
 
Beauty bread as face scrubber2
Beauty bread as face scrubber2Beauty bread as face scrubber2
Beauty bread as face scrubber2Viroo Mirji
 
Diabolic Str8ts #8
Diabolic Str8ts #8Diabolic Str8ts #8
Diabolic Str8ts #8SlowThinker
 
Sse strategy dice_2011
Sse strategy dice_2011Sse strategy dice_2011
Sse strategy dice_2011LudvigSSE
 
Trabajo de microsf excel
Trabajo de microsf excelTrabajo de microsf excel
Trabajo de microsf exceladri20eli
 
Microsoft power point automation-opensourcetestingtools_matrix-1
Microsoft power point   automation-opensourcetestingtools_matrix-1Microsoft power point   automation-opensourcetestingtools_matrix-1
Microsoft power point automation-opensourcetestingtools_matrix-1tactqa
 
Human resource analytics
Human resource analyticsHuman resource analytics
Human resource analyticsAnika Jindal
 
1ID and Fort Riley Weekly News Update for March 9 2012
1ID and Fort Riley Weekly News Update for March 9 20121ID and Fort Riley Weekly News Update for March 9 2012
1ID and Fort Riley Weekly News Update for March 9 2012Noel Waterman
 
9 May 2012 1HBCT Weekly News Update
9 May 2012 1HBCT Weekly News Update9 May 2012 1HBCT Weekly News Update
9 May 2012 1HBCT Weekly News UpdateNoel Waterman
 
Karya tulis ilmiah populer
Karya tulis ilmiah populerKarya tulis ilmiah populer
Karya tulis ilmiah populerMOH. SHOFI'I
 
Vancouver real estate stats package, january 2013
Vancouver real estate stats package, january 2013Vancouver real estate stats package, january 2013
Vancouver real estate stats package, january 2013Matt Collinge
 
Vancouver housing starts april 2011
Vancouver housing starts april 2011Vancouver housing starts april 2011
Vancouver housing starts april 2011Matt Collinge
 
Pedro García-Noblejas, en Forbes (octubre 2013)
Pedro García-Noblejas, en Forbes (octubre 2013)Pedro García-Noblejas, en Forbes (octubre 2013)
Pedro García-Noblejas, en Forbes (octubre 2013)Interbrand Madrid
 

Andere mochten auch (20)

Growing Patchouli PREPARATION
Growing Patchouli PREPARATIONGrowing Patchouli PREPARATION
Growing Patchouli PREPARATION
 
Bbc three, young people and faith
Bbc three, young people and faithBbc three, young people and faith
Bbc three, young people and faith
 
Str8ts Weekly Extreme #39 - Solution
Str8ts Weekly Extreme #39 - SolutionStr8ts Weekly Extreme #39 - Solution
Str8ts Weekly Extreme #39 - Solution
 
ソーシャルメディアリスクマネジメントー炎上対策ー
ソーシャルメディアリスクマネジメントー炎上対策ーソーシャルメディアリスクマネジメントー炎上対策ー
ソーシャルメディアリスクマネジメントー炎上対策ー
 
Beauty bread as face scrubber2
Beauty bread as face scrubber2Beauty bread as face scrubber2
Beauty bread as face scrubber2
 
Diabolic Str8ts #8
Diabolic Str8ts #8Diabolic Str8ts #8
Diabolic Str8ts #8
 
Movie maker 002
Movie maker 002Movie maker 002
Movie maker 002
 
Sse strategy dice_2011
Sse strategy dice_2011Sse strategy dice_2011
Sse strategy dice_2011
 
Trabajo de microsf excel
Trabajo de microsf excelTrabajo de microsf excel
Trabajo de microsf excel
 
Microsoft power point automation-opensourcetestingtools_matrix-1
Microsoft power point   automation-opensourcetestingtools_matrix-1Microsoft power point   automation-opensourcetestingtools_matrix-1
Microsoft power point automation-opensourcetestingtools_matrix-1
 
Human resource analytics
Human resource analyticsHuman resource analytics
Human resource analytics
 
1ID and Fort Riley Weekly News Update for March 9 2012
1ID and Fort Riley Weekly News Update for March 9 20121ID and Fort Riley Weekly News Update for March 9 2012
1ID and Fort Riley Weekly News Update for March 9 2012
 
9 May 2012 1HBCT Weekly News Update
9 May 2012 1HBCT Weekly News Update9 May 2012 1HBCT Weekly News Update
9 May 2012 1HBCT Weekly News Update
 
WOMMU Big Data Preso 05.22.13
WOMMU Big Data Preso 05.22.13WOMMU Big Data Preso 05.22.13
WOMMU Big Data Preso 05.22.13
 
writing_research_paper_slides(goerzen_2011)
writing_research_paper_slides(goerzen_2011)writing_research_paper_slides(goerzen_2011)
writing_research_paper_slides(goerzen_2011)
 
Astrology & HR
Astrology & HRAstrology & HR
Astrology & HR
 
Karya tulis ilmiah populer
Karya tulis ilmiah populerKarya tulis ilmiah populer
Karya tulis ilmiah populer
 
Vancouver real estate stats package, january 2013
Vancouver real estate stats package, january 2013Vancouver real estate stats package, january 2013
Vancouver real estate stats package, january 2013
 
Vancouver housing starts april 2011
Vancouver housing starts april 2011Vancouver housing starts april 2011
Vancouver housing starts april 2011
 
Pedro García-Noblejas, en Forbes (octubre 2013)
Pedro García-Noblejas, en Forbes (octubre 2013)Pedro García-Noblejas, en Forbes (octubre 2013)
Pedro García-Noblejas, en Forbes (octubre 2013)
 

Ähnlich wie Evolving Social Search - Presentation CIKM 2011

The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGPhilip Bourne
 
Extreme-scale Identity Management for Scientific Collaborations
Extreme-scale Identity Management for Scientific CollaborationsExtreme-scale Identity Management for Scientific Collaborations
Extreme-scale Identity Management for Scientific CollaborationsVon Welch
 
The Changing Role of Information Professionals: New Opportunities Created by ...
The Changing Role of Information Professionals: New Opportunities Created by ...The Changing Role of Information Professionals: New Opportunities Created by ...
The Changing Role of Information Professionals: New Opportunities Created by ...Larry Hawes
 
Research Communities Infographic
Research Communities InfographicResearch Communities Infographic
Research Communities InfographicTom De Ruyck
 
The Australian Ad Observatory - ICA 2023.pptx
The Australian Ad Observatory - ICA 2023.pptxThe Australian Ad Observatory - ICA 2023.pptx
The Australian Ad Observatory - ICA 2023.pptxssuser606ca2
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkAdina Chuang Howe
 
Big Data, Watson & The Future of Sourcing
Big Data, Watson & The Future of SourcingBig Data, Watson & The Future of Sourcing
Big Data, Watson & The Future of SourcingKevin Wheeler
 
Estudios Clinicos Epidemiologia Odontologia
Estudios Clinicos Epidemiologia OdontologiaEstudios Clinicos Epidemiologia Odontologia
Estudios Clinicos Epidemiologia OdontologiaSergio Uribe
 
Blogs and Podcasting
Blogs and PodcastingBlogs and Podcasting
Blogs and PodcastingJohn Breslin
 
Research Process
Research ProcessResearch Process
Research ProcessJedi Labs
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsPhilip Bourne
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public versionjhudms
 

Ähnlich wie Evolving Social Search - Presentation CIKM 2011 (20)

AMIA 2014
AMIA 2014AMIA 2014
AMIA 2014
 
Holmes "Institutional Infrastructure for Data Sharing"
Holmes "Institutional Infrastructure for Data Sharing"Holmes "Institutional Infrastructure for Data Sharing"
Holmes "Institutional Infrastructure for Data Sharing"
 
UX Resume
UX ResumeUX Resume
UX Resume
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAG
 
Public Engagement and Education
Public Engagement and EducationPublic Engagement and Education
Public Engagement and Education
 
Extreme-scale Identity Management for Scientific Collaborations
Extreme-scale Identity Management for Scientific CollaborationsExtreme-scale Identity Management for Scientific Collaborations
Extreme-scale Identity Management for Scientific Collaborations
 
The Changing Role of Information Professionals: New Opportunities Created by ...
The Changing Role of Information Professionals: New Opportunities Created by ...The Changing Role of Information Professionals: New Opportunities Created by ...
The Changing Role of Information Professionals: New Opportunities Created by ...
 
EDF2012 - CODE
EDF2012 - CODEEDF2012 - CODE
EDF2012 - CODE
 
Research Communities Infographic
Research Communities InfographicResearch Communities Infographic
Research Communities Infographic
 
The Australian Ad Observatory - ICA 2023.pptx
The Australian Ad Observatory - ICA 2023.pptxThe Australian Ad Observatory - ICA 2023.pptx
The Australian Ad Observatory - ICA 2023.pptx
 
EPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data TalkEPA 2013 Air Sensors Meeting Big Data Talk
EPA 2013 Air Sensors Meeting Big Data Talk
 
Big Data, Watson & The Future of Sourcing
Big Data, Watson & The Future of SourcingBig Data, Watson & The Future of Sourcing
Big Data, Watson & The Future of Sourcing
 
Estudios Clinicos Epidemiologia Odontologia
Estudios Clinicos Epidemiologia OdontologiaEstudios Clinicos Epidemiologia Odontologia
Estudios Clinicos Epidemiologia Odontologia
 
Data at the NIH
Data at the NIHData at the NIH
Data at the NIH
 
Blogs and Podcasting
Blogs and PodcastingBlogs and Podcasting
Blogs and Podcasting
 
Payel ux portfolio
Payel ux portfolioPayel ux portfolio
Payel ux portfolio
 
visual sorting
visual sortingvisual sorting
visual sorting
 
Research Process
Research ProcessResearch Process
Research Process
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early Thoughts
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public version
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Evolving Social Search - Presentation CIKM 2011

  • 1. Bastian Karweg, C. Hütter, Prof. K. Böhm “Evolving Social Search Based on Bookmarks and Status Messages from Social Networks “ Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm KIT – University of the State of Baden-Wuerttemberg © Bastian Karweg, 2010 - 2011 and National Research Center of the Helmholtz Association www.kit.edu
  • 2. Introduction and motivation Scenario Jane is planning to make some delicious pancakes for her son’s birthday. A friend recently recommended a link to a great recipe, but she doesn’t remember neither who it was nor on which social network he posted. Goal: Personalize Jane‘s search result set in a way that to take all recommendations into account and idealy show her the pancake recipe she was looking for. 2 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 3. Influence factors: The more you know about who is searching something, the better are you able to target the results: • language • age, gender • location personalization • interests, actions • search type • friends, contacts 1. Identification of each user has to be possible 2. His data has to be available, ideally in a standardized form on a central location 3. The user needs sufficient control systems to grant or deny access to all or part of his data 3 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 4. Example 1 http://www.onlineschools.org/blog/facebook-obsession/ 4 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 5. Social Web content interaction user profile + contacts (social graph) most of them do not yet use their social data for search. http://www.theconversationprism.com/ 5 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 6. Example 2 • social bookmarking service • founded in 2006 • 1.5mm users • 24mm bookmarks (links) Also runs a standard fulltext based search engine.  Use available social data for search.  Aggregate data from different platforms. 6 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 7. Section 2/5 THEORETICAL APPROACH 7 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 8. 3 Types of social search [ Evans2009 ] „Collective“ „Collaborative“ „Friend-filtered“ Using the Using the Using „wisdom of the crowds“ „village paradigm“ „personalized results“ => The more popular a => queries are all => based on what friends content, the better it ranks. answered by experts. have shared in the past. As discussed in As discussed in Newest approach and [Hotho2006] [Horowitz2010] main topic of our work. Examples Examples Recent examples Digg.com, reddit, Aarkvark.com, Bing social, GooglePlus, Delicious Bookmarks Q&A Communities Blekko slashtag Problems Problem Problems Easy manipulation, No instant results; Not sure if hypotheses „one-size-fits-all“ Needs reliable experts on work out as predicted. many topics 8 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 9. Theoretical approach Where to start? A) Measure the amount of social interactions for each search result. Engagement intensity B) How strongly should somebody’s recommendation influence the searchers results? Trust levels 9 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 10. Engagement intensity The more effort the user has to go through, the higher the value: 10 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 11. Trust Levels • Trust is established as an asymetric relation between users • The user can adjust the trust levels for each contact • The system can assist in fine tuning within trust levels 11 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 12. Our model for SRS Full-Text 1 Relevancy (Top 1000) „bookmarked“ „liked“ Adam „shared“ Eve „+1‘d“, (…) Jane John 2 Social Relevance Score (SRS): 12 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 13. Hypotheses The number of links available for social search depends on the number of friends a user has in his social graph. There is a certain number of friends the user needs for social search to work „properly“ for any query. The result quality of classic full text search improves when combining it with the Social Relevance Score. We needed „enough data“ to back these up. 13 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 14. Section 3/5 FIELD STUDY 14 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 15. still Platform: Social-Search.com available one time only pancake recipe 15 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 16. Field Study Size Crawling social streams every 10 minutes for 58 days (from 09th of Nov.10 until 05th of Jan.11) 2.385 testers 468.889 friends 430, 13% 217010, 4 1651, 51% 6% 251879, 5 1164, 36% 4% facebook twitter facebook twitter 16 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 17. User-Graph Folkd.com Created using gephi.org excerpt of 60.000 Relations between 40.000 Nutzern 17 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 18. Resulting datasets: Stream Data Search Data Search Simulation Extracted and Analyzed a random Run a comparison crawled sample of test with 428.522 2.098 36 query terms link-recommendations. social search sessions. on all test-accounts. 18 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 19. Section 4/5 RESULTS AND EVALUATION 19 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 20. Impact of the user‘s friend count Log(10) Log(10) 20 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 21. Results and Evaluation So how many friends does one need for social search to “work properly“? Defining „work proper“: minimum 1 social results for the average query good 5 social results for the average query contains the search term and has friend engagement 21 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 22. It depends on the search term: non-linear scale 50 300 22 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 23. Influence of SRS 1. 2. 3. average number of clicks average time click-position needed per search spent on search 5.70 => 2.92 1.39 => 1.14 24.46 sec => 13.56 sec The user finds a suitable The user needs 0.25 The user detects result on average clicks less to get to a relevant results 2.78 positions earlier! suitable result! significantly faster.    23 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 24. Section 5/5 SUMMARY & QUESTIONS 24 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 25. Summary Using SRS can measurably improve social search Social Search will be a major part of all future search engines. This development is confirmed by the current market developments (Google Plus, Bing social, blekko slashtag …) The success of a social search depends on:  Connectivity of the searching user  Popularity of the search term  Time since when the user is using social media  Size of available social data 25 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm
  • 26. Thanks for your attention! Questions? Bastian Karweg Mobile Advertising GmbH (CEO) Twitter: @timetrax E-Mail: Karweg@mobile-advertising.com Web: www.bastiankarweg.de 26 06.12.2011 © Bastian Karweg, 2010 - 2011 Institute for Program Structures and Data Organisation (IPD), Chair Prof. K. Böhm