SlideShare ist ein Scribd-Unternehmen logo
1 von 50
Managing Online Business
     Communities
       The ROBUST Project
       Steffen Staab, Thomas Gottron
          & ROBUST Project Team




                                       EC Project 257859
Business Communities

• Information ecosystems
   – Employees
   – Business Partners, Customers
   – General Public                 Valuable asset




     Risks                              Opportunities
Use Cases



 Lotus Connections             SAP Community Network (SCN)   MeaningMine
 Communities                   Communities                   Communities
 • Employees                   • Customers                   • Social media
 • Working groups              • Partners                    • News
 • Interest Groups             • Suppliers                   • Web fora
 • Projects                    • Developers                  • Public communities
 Business value                Business value                Business value
 • Task relevant information   • Products support            • Topics
 • Collaboration               • Services                    • Opinions
 • Innovation                  • Find business partners      • Service for partners
 Volume                        Volume                        Volume
 • 4,000 posts/day             • 6,000 posts/day             • 1,400,000 posts/day
 • 386,000 employees           • 1,700,000 subscribers       • 708,000 web sources
 • 1.5GB content/day           • 16GB log/day                • 45GB content/day

        Employees                 Business Partners               Public Domain
         Intranet                     Extranet                      Internet
High Level Architecture

                              Risk Dashboard




           Community                                   Community Risk


                                   Communication Bus
           Simulation                                  Management


           Cloud Based Data                            User Roles
           Management                                  and Models


                                                       Community
                                                       Analysis
         Community Data
Requirements

                                     Community Risk Management

                                                  • Risk Formalization
 Community Analysis                               • Risk Detection
                                                  • Risk Forecasting
             • Feature Selection                  • Risk Management
             • Structural Analysis                • Risk Visualization
             • Behavior Analysis
             • Content Analysis
                                     User Roles and Models
             • Cross Community
                                                  • User Needs
 Community Simulation                             • Motivation
                                                  • Roles
             • Community Models                   • Groups
             • Policy Models                      • Community Value
             • Influence
             • Simulation            Cloud Based Data Management
             • Prediction
                                                  • Operations on Data
                                                  • Scalability
                                                  • Real Time
                                                  • Parallel Execution
                                                  • Stream Based
The ROBUST platform


Risk Dashboard and
   Visualizations
A new Role: the Community Manager


                  •   Definition of risks/opportunities
                  •   Monitoring
                  •   Interaction with community
                  •   Reaction and countermeasures
                   Need for a control center
Risk matrix visualization


          Risks ordered by probability   Negative impacts   Positive impacts




                                                                               A single risk can have
                                                                                     multiple impacts
Community Health: Graphic Equalizer
Community Analysis,
    User Roles & Risk Management


      Determine Risk:
Users Leaving the Community
“Churn”

 • Users churn = Users leave the community, become
   inactive

 • Questions:
   – Why do users leave the community?
   – Who is leaving the community?
   – Impact?


 • Tasks:
   – Detect “Churn”
   – Predict “Churn”
   – Evaluate “Churn”
                                         Angeletou et al. ISWC 2011
What is Churn? Activity?

                         No churn                        No churn
       Activity bursts              Holiday




                         No churn                         Churn!
       Inconsistent                 Cease to be active
Why Churn?
Network Effects

 • Churn: churning users influence other users they
   communicate with
                                           First Churner
                   First
            influenced
                Churner
                               • Temporal ordering: Churners that
                                 churned subsequent to each other
                               • Frequently, after a slow start,
  Cascade                        resulting in a cascade of churning




                           Churn of a single user          Influences his neighbours
                                                                Karnstedt et al. WebSci 2011
Role Analysis: who churns?
 • A role represents the standing, or part, that a user has
   within a given community
   Role  Dimension    Reciprocity       Initialisation   Persistence   Popularity

   Elitist             high: Threads                                    low
                       low: Neighbours
   Joining                               low              high
   Conversationalist
   Popular Initiator                     high                           high


   Supporter           medium            low                            medium


   Taciturn            low                                low


   Ignored                                                              low: No replies
Role Analysis: Approach


        Reciprocity, initialisation,   Feature levels change with the
        persistence, popularity        dynamics of the community

Behaviour
Ontology




 Run rules over each user’s features   Based on related work, we associate
 and derive the community role         roles with a collection of feature-to-level
 composition                           mappings
                                       e.g. in-degree -> high, out-degree -> high


                                                            Angeletou et al. ISWC 2011
Churn Analysis on Roles
 • Role composition in community




 • Development towards an unhealthy role
   composition!
Community & Content Analysis


     Opportunity Detection:
Discovering Interesting Contents
IBM Connections




         Analogies to
           Twitter!
Retweets
                       RT @janedoe: My
                                    Follower
                        dear @johndoe
                        had troubles to
                         wake up this
  @janedoe                #morning




        My dear
     @johndoe had
    troubles to wake
    up this #morning
Retweets
           • Retweet indicates
             quality
             – „of interest for others“



           • Idea:
             – Learn to predict retweets!




             Likelihood of retweet as
             metric for Interestingness
                             Naveed et al. WebSci 2011
Retweets: Prediction model

  Aim: Prediction of probability of retweet

  Logistic regression:                         1
                                  f (z) =            -z
                                            1+e
                                 z = w0 + w1x1 ++w2 x2 +¼+ wn xn

  Model parameters wi learned on training data

                  Dataset     Users         Tweets        Retweets
       Choudhury               118,506       9,998,756        7.89%
       Choudhury (extended)    277,666      29,000,000        8.64%
       Petrovic               4,050,944     21,477,484        8.46%
Logistic Regression: Weights

                      Feature       Dimensions   Weight
        Constant                (intercept)          -5.45
                                Direct message    -147.89
                                Username           146.82
        Message feature
                                Hashtag             42.27
                                URL                249.09
                                Valence            -26.88
        Sentiment               Arousal             33.97
                                Dominance           19.56
                                Positive             -21.8
        Emoticons
                                Negative              9.94
                                Positive            13.66
        Exclamation
                                Negative              8.72
                                !                  -16.85
        Punctuation
                                ?                   23.67
        Terms                   Odds                19.79
Logistic Regression: Topic Weights


                                   Topic                          Weight
      social media market post site web tool traffic network        27.54
      follow thank twitter welcome hello check nice cool people     16.08
      credit money market business rate economy home                15.25
      christmas shop tree xmas present today wrap finish              2.87
      home work hour long wait airport week flight head             -14.43
      twitter update facebook account page set squidoo check        -14.43
      cold snow warm today degree weather winter morning            -26.56
      night sleep work morning time bed feel tired home             -75.19
Re-Ranking using Interestingness

 • Top-k interesting tweets for „beer“
  Rang Username   Tweet
   1 BeeracrossTX UK beer mag declares "the end of beer writing." @StanHieronymus says not so in the US.
                  http://bit.ly/424HRQ #beer
   2    narmmusic      beer summit @bspward @jhinderaker no one had billy beer? heehee #narm - beer summit
                       @bspward @jhinde http://tinyurl.com/n29oxj
   3    beeriety       Go green and turn those empty beer bottles into recycled beer glasses! | http://bit.ly/2src7F
                       #beer #recycle (via: @td333)
   4    hblackmon      Great Divide beer dinner @ Porter Beer Bar on 8/19 - $45 for 3 courses + beer pairings.
                       http://trunc.it/172wt
   5    nycraftbeer    Interesting Concept-Beer Petitions.com launches&hopes 2help craft beer drinkers enjoy beer
                       they want @their fave pubs. http://bit.ly/11gJQN
   6    carichardson   Beer Cheddar Soup: Dish number two in my famed beer dinner series is Beer Cheddar
                       Soup. I hadn’t had too.. http://bit.ly/1diDdF
   7    BeerBrewing    New York City Beer Events - Beer Tasting - New York Beer Festivals - New York Craft Beer
                       http://is.gd/39kXj #beer
   8    delphiforums   Love beer? Our member is trying to build up a new beer drinker's forum. Grab a #beer and
                       join us: http://tr.im/pD1n
   9    Jamie_Mason #Baltimore Beer Week continues w/ a beer brkfst, beer pioneers luncheon, drink & donate
                    event, beer tastings & more. http://ping.fm/VyTwg
   10   carichardson   Seattle and Beer: I went to Seattle last weekend. It was my friend’s stag - he likes
                       beer - we drank beer.. http://tinyurl.com/cpb4n9
                                                                                              Naveed et al. CIKM 2011
LiveTweet




            http://livetweet.west.uni-koblenz.de/

                                       Che Alhadi et al. TREC 2011, ECIR 2012
Compare Interestingness to Detect Trends
 Tomorrow I am going to …



                            … play tennis

                            … play golf

                            … do some gardening

                            … spread wisdom

                            … go to a Justin Bieber concert

                            … win the lottery
Community Simulation


Risk Mitigation: Simulating
Effects of Policy Changes
Policies

 • Governance of Communities
    – Def.: Steering and coordinating actions of
      community members


 • Implementation:
    – Direct intervention of community owner
    – Functionality of the community platform


 • Mitigation of risks:
    – Change platform functionality
    – Impact?


                                                   Schwagereit WebSci 2011
Governance by Policy Change
                                      What if using
                                       Policy 2 ?

        Policy 1



                                Community
                                 Manager


                              Prediction


                                Policy 2
Scenario: Policy Controlled Content Generation

 •   Users generate content
 •   Users search content
 •   Users consume content
 •   Users interact with content
     – Rate content
     – Reply to content


 • Where can a (platform) policy have effects?
Policy impacts on user behaviour
                               Policy

   User Need for
     Content
   Consumption

               Content       Content
                                        Rating   Replying
               Selection   Assessment



   User Need for
     Content
     Creation

               Content      Content
               Creation    Evaluation


                               Policy
Attention Management

 • Part of Community: forum activity
 • Goal: Steer user activity to specific forums
 • User model parameters:
    – Activity rate for creating threads
    – Activity rate for creating replies
    – Preferences for activity in specific forums
 • Community Model
    – Varied restrictions for thread creation
 • Observed Metrics
    – Response time on threads
Simulation based on observed parameters
                                                    Online Community

    Metrics of     Community                Community
    Community       Artifacts                Activity



                                   Community       Community
                                    Members         Platform
                                   (Persons)       (Software)

                            Analysis
           Comparison                                    Abstraction

                                   User Model      Community
                                        +           Platform
                                   Parameters        Model



    Metrics of      Simulated               Simulated
    Simulated       Community               Community
    Community        Artifacts               Activity
                                                          Simulation
Variable Parameters indicated by different policy

 • Users search content
    – Presentation  Ranking threads in content views
        •   Recency
        •   Social Closeness
        •   Topical closeness
        •   Popularity
    – Observation: Influence which questions are answered
 • Users generate content
    – Restrict number of questions asked per forum
        • Users turn to other questions
    – Observation: Response time in some fora reduced
Backend Technologies


Indexing Distributed
  Semantic Graphs
Community Information

 • Examples
   – Male persons who have a public profile document
   – Computing science papers authored by social scientists
   – American actors who are also politicians and are married
     to a model.


 • Maybe specific databases available:
   – Person search engines
   – Bibliographic databases
   – Movie database
Linked Data
     Semantic Web Technology to
     1. Provide structured data on the web
     2. Link data across data sources



      Thing            Thing            Thing            Thing            Thing


      Thing            Thing            Thing            Thing            Thing


              typed            typed            typed            typed
               links            links            links            links



        A               B                C                 D               E
The LOD Cloud
Querying linked data

 SELECT ?x
 WHERE {
    ?x rdfs:type foaf:Person .
    ?x rdfs:type pim:Male .
    ?x foaf:maker ?y .
    ?y rdfs:type
       foaf:PersonalProfileDocument .
 }
Querying linked data – using an index

 SELECT ?x
 WHERE {
    ?x rdfs:type foaf:Person .
    ?x rdfs:type pim:Male .
    ?x foaf:maker ?y .
    ?y rdfs:type
       foaf:PersonalProfileDocument .
 }
Schema Index Overview




                        Konrath et al. BTC 2011, JWS 2012
How it works ...

SELECT ?x
FROM …
WHERE {
   ?x rdfs:type foaf:Person .
   ?x rdfs:type pim:Male .
   ?x foaf:maker ?y .
   ?y rdfs:type
       foaf:PersonalProfileDocument .
}




                                    pim:                     foaf:         foaf:
                                    Male                    Person         PPD


                                        rdfs:type        rdfs:type       rdfs:type


                                                    ?x                      ?y
                                                            foaf:maker
How it works ...

    pim:Male      foaf:Person      foaf:PPD




     TC_18                         TC_76


                        5_76
                        maker



     EQC_5            foaf:maker




      DS 1     DS 2
Building the Index from a Stream
  Stream of data (coming from a LD crawler)

    … D16, D15, D14, D13, D12, D11, D10, D9, D8, D7, D6, D5, D4, D3, D2, D1



                                                     FiFo
                                                                    1
                                         C3            4
                                                                    6
                                         C2            3
                                                                    4
                                                       2
                                         C2                         2
                                                       1                3
                                         C1                         5
Does it work good?
  Comparison of stream based vs. Gold standard Schema on 11 M triple data set
Managing Business Communities


   Lessons learned
Conclusions

 • Business communities vary with regard to
   – Interaction
   – Interests
   – Type of conversation


 • Novel analysis techniques needed:
   – Integration of different data sources
   – Simulation of policy changes
   – Value of users


 • Concrete needs confirmed by project external
   companies looking for such technology
References
•   S. Angeletou, M. Rowe, H. Alani. Modelling and analysis of user behaviour in online communities.
    In Proceedings of the 10th international conference on The semantic web - Volume Part I,
•   A. Che Alhadi, T. Gottron, J. Kunegis, N. Naveed. Livetweet: Microblog retrieval based on
    interestingness. In TREC’11: Proceedings of the Text Retrieval Conference 2011, 2011.
•   A. Che Alhadi, T. Gottron, J. Kunegis, N. Naveed. Livetweet: Monitoring and predicting interesting
    microblog posts. In ECIR’12: Proceedings of the 34th European Conference on Information Retrieval,
    2012.
•   M. Karnstedt, M. Rowe, J. Chan, H. Alani, C. Hayes. The effect of user features on churn in social
    networks. In WebSci ’11: Proceedings of the 3rd International Conference on Web Science, 2011.
•   M. Konrath, T. Gottron, A. Scherp. Schemex – web-scale indexed schema extraction of linked open
    data. In Semantic Web Challenge, Submission to the Billion Triple Track, 2011.
•   M. Konrath, T. Gottron, S. Staab, A. Scherp. Schemex—efficient construction of a data catalogue by
    stream-based indexing of linked data. Journal of Web Semantics, 2012. to appear.
•   N. Naveed, T. Gottron, J. Kunegis, A. Che Alhadi. Bad news travel fast: A content-based analysis of
    interestingness on twitter. In WebSci ’11: Proceedings of the 3rd International Conference on Web
    Science, 2011.
•   N. Naveed, T. Gottron, J. Kunegis, A. Che Alhadi. Searching microblogs: Coping with sparsity and
    document quality. In CIKM’11: Proceedings of 20th ACM Conference on Information and Knowledge
    Management, 2011.
•   F. Schwagereit, A. Scherp, S. Staab. Survey on governance of user-generated content in web
    communities. In WebSci ’11: Proceedings of the 3rd International Conference on Web Science, 2011.
Thank You!




             EC Project 257859

Weitere ähnliche Inhalte

Was ist angesagt?

강장묵 차세대소셜네트워크 Social Network Service
강장묵 차세대소셜네트워크 Social Network Service강장묵 차세대소셜네트워크 Social Network Service
강장묵 차세대소셜네트워크 Social Network Service
JM code group
 

Was ist angesagt? (13)

Crs piloting-idce-0113
Crs piloting-idce-0113Crs piloting-idce-0113
Crs piloting-idce-0113
 
Slides
SlidesSlides
Slides
 
From Communication to vibrant communities
From Communication to vibrant communitiesFrom Communication to vibrant communities
From Communication to vibrant communities
 
Atlassian Webinar: SunSpace
Atlassian Webinar: SunSpaceAtlassian Webinar: SunSpace
Atlassian Webinar: SunSpace
 
A Twitter for your Intranet
A Twitter for your IntranetA Twitter for your Intranet
A Twitter for your Intranet
 
Social Media Workshop AFSM Switzerland
Social Media Workshop AFSM SwitzerlandSocial Media Workshop AFSM Switzerland
Social Media Workshop AFSM Switzerland
 
This isn't what I thought it was: community in the network age
This isn't what I thought it was: community in the network ageThis isn't what I thought it was: community in the network age
This isn't what I thought it was: community in the network age
 
Public v -_ibm_social_software_story_-_soc_med_for_ce_os_sept2011
Public v -_ibm_social_software_story_-_soc_med_for_ce_os_sept2011Public v -_ibm_social_software_story_-_soc_med_for_ce_os_sept2011
Public v -_ibm_social_software_story_-_soc_med_for_ce_os_sept2011
 
Social media mining and multimedia analysis research and applications
Social media mining and multimedia analysis research and applicationsSocial media mining and multimedia analysis research and applications
Social media mining and multimedia analysis research and applications
 
강장묵 차세대소셜네트워크 Social Network Service
강장묵 차세대소셜네트워크 Social Network Service강장묵 차세대소셜네트워크 Social Network Service
강장묵 차세대소셜네트워크 Social Network Service
 
20111123 mwa2011-marc smith
20111123 mwa2011-marc smith20111123 mwa2011-marc smith
20111123 mwa2011-marc smith
 
Extending Lotus Connections with Domino using TDI
Extending Lotus Connections with Domino using TDIExtending Lotus Connections with Domino using TDI
Extending Lotus Connections with Domino using TDI
 
Top web 2 0 tools
Top web 2 0 toolsTop web 2 0 tools
Top web 2 0 tools
 

Ähnlich wie Managing Online Business Communities

Engage 2013 - Integrated Campaign Reporting + MTA
Engage 2013 - Integrated Campaign Reporting + MTAEngage 2013 - Integrated Campaign Reporting + MTA
Engage 2013 - Integrated Campaign Reporting + MTA
Webtrends
 
AICPA Leadership Retreat - Technology
AICPA Leadership Retreat - TechnologyAICPA Leadership Retreat - Technology
AICPA Leadership Retreat - Technology
ASAE
 

Ähnlich wie Managing Online Business Communities (20)

Engage 2013 - Integrated Campaign Reporting + MTA
Engage 2013 - Integrated Campaign Reporting + MTAEngage 2013 - Integrated Campaign Reporting + MTA
Engage 2013 - Integrated Campaign Reporting + MTA
 
Чираг Шах «Коллективный поиск, взаимодействие пользователей: подходы к изучен...
Чираг Шах «Коллективный поиск, взаимодействие пользователей: подходы к изучен...Чираг Шах «Коллективный поиск, взаимодействие пользователей: подходы к изучен...
Чираг Шах «Коллективный поиск, взаимодействие пользователей: подходы к изучен...
 
Human Interactions in Mixed Service-Oriented Systems
Human Interactions in Mixed Service-Oriented SystemsHuman Interactions in Mixed Service-Oriented Systems
Human Interactions in Mixed Service-Oriented Systems
 
Generating Context-aware Recommendations using Banking Data in a Mobile Recom...
Generating Context-aware Recommendations using Banking Data in a Mobile Recom...Generating Context-aware Recommendations using Banking Data in a Mobile Recom...
Generating Context-aware Recommendations using Banking Data in a Mobile Recom...
 
Introduction to Information Architecture & Design - SVA Workshop 03/22/14
Introduction to Information Architecture & Design - SVA Workshop 03/22/14Introduction to Information Architecture & Design - SVA Workshop 03/22/14
Introduction to Information Architecture & Design - SVA Workshop 03/22/14
 
Social networking text mining - analytics in km 13.dec.2011
Social networking   text mining - analytics in km 13.dec.2011Social networking   text mining - analytics in km 13.dec.2011
Social networking text mining - analytics in km 13.dec.2011
 
SVA Workshop - Fall 121011
SVA Workshop - Fall 121011SVA Workshop - Fall 121011
SVA Workshop - Fall 121011
 
SVA Workshop 021112
SVA Workshop 021112SVA Workshop 021112
SVA Workshop 021112
 
Introduction to Information Architecture and Design - SVA Workshop 03/23/13
Introduction to Information Architecture and Design - SVA Workshop 03/23/13Introduction to Information Architecture and Design - SVA Workshop 03/23/13
Introduction to Information Architecture and Design - SVA Workshop 03/23/13
 
AICPA Leadership Retreat - Technology
AICPA Leadership Retreat - TechnologyAICPA Leadership Retreat - Technology
AICPA Leadership Retreat - Technology
 
Introduction to Information Architecture and Design - SVA Workshop 0929
Introduction to Information Architecture and Design - SVA Workshop 0929Introduction to Information Architecture and Design - SVA Workshop 0929
Introduction to Information Architecture and Design - SVA Workshop 0929
 
Introduction to Information Architecture and Design - SVA Workshop 0712
Introduction to Information Architecture and Design - SVA Workshop 0712Introduction to Information Architecture and Design - SVA Workshop 0712
Introduction to Information Architecture and Design - SVA Workshop 0712
 
Introduction to Information Architecture and Design - SVA Workshop 062312
Introduction to Information Architecture and Design - SVA Workshop 062312Introduction to Information Architecture and Design - SVA Workshop 062312
Introduction to Information Architecture and Design - SVA Workshop 062312
 
Introduction to Information Architecture and Design - SVA Workshop 09/28/13
Introduction to Information Architecture and Design - SVA Workshop 09/28/13Introduction to Information Architecture and Design - SVA Workshop 09/28/13
Introduction to Information Architecture and Design - SVA Workshop 09/28/13
 
Introduction to Information Architecture & Design - SVA Workshop 06/21/14
Introduction to Information Architecture & Design - SVA Workshop 06/21/14Introduction to Information Architecture & Design - SVA Workshop 06/21/14
Introduction to Information Architecture & Design - SVA Workshop 06/21/14
 
Introduction to Information Architecture and Design - SVA Workshop 021613
Introduction to Information Architecture and Design - SVA Workshop 021613Introduction to Information Architecture and Design - SVA Workshop 021613
Introduction to Information Architecture and Design - SVA Workshop 021613
 
Introduction to Information Architecture and Design - SVA Workshop 120812
Introduction to Information Architecture and Design - SVA Workshop 120812Introduction to Information Architecture and Design - SVA Workshop 120812
Introduction to Information Architecture and Design - SVA Workshop 120812
 
SVA Workshop 032512
SVA Workshop 032512SVA Workshop 032512
SVA Workshop 032512
 
Social media! are you ready?
Social media! are you ready?Social media! are you ready?
Social media! are you ready?
 
Introduction to Information Architecture and Design - SVA Workshop 06/22/13
Introduction to Information Architecture and Design - SVA Workshop 06/22/13Introduction to Information Architecture and Design - SVA Workshop 06/22/13
Introduction to Information Architecture and Design - SVA Workshop 06/22/13
 

Mehr von Steffen Staab

Mehr von Steffen Staab (20)

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Knowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sureKnowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sure
 
Symbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSymbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine Learning
 
Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...
Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...
Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...
 
Web Futures: Inclusive, Intelligent, Sustainable
Web Futures: Inclusive, Intelligent, SustainableWeb Futures: Inclusive, Intelligent, Sustainable
Web Futures: Inclusive, Intelligent, Sustainable
 
Eyeing the Web
Eyeing the WebEyeing the Web
Eyeing the Web
 
Concepts in Application Context ( How we may think conceptually )
Concepts in Application Context ( How we may think conceptually )Concepts in Application Context ( How we may think conceptually )
Concepts in Application Context ( How we may think conceptually )
 
Storing and Querying Semantic Data in the Cloud
Storing and Querying Semantic Data in the CloudStoring and Querying Semantic Data in the Cloud
Storing and Querying Semantic Data in the Cloud
 
Semantics reloaded
Semantics reloadedSemantics reloaded
Semantics reloaded
 
Ontologien und Semantic Web - Impulsvortrag Terminologietag
Ontologien und Semantic Web - Impulsvortrag TerminologietagOntologien und Semantic Web - Impulsvortrag Terminologietag
Ontologien und Semantic Web - Impulsvortrag Terminologietag
 
Opinion Formation and Spreading
Opinion Formation and SpreadingOpinion Formation and Spreading
Opinion Formation and Spreading
 
The Web We Want
The Web We WantThe Web We Want
The Web We Want
 
10 Jahre Web Science
10 Jahre Web Science10 Jahre Web Science
10 Jahre Web Science
 
(Semi-)Automatic analysis of online contents
(Semi-)Automatic analysis of online contents(Semi-)Automatic analysis of online contents
(Semi-)Automatic analysis of online contents
 
Programming with Semantic Broad Data
Programming with Semantic Broad DataProgramming with Semantic Broad Data
Programming with Semantic Broad Data
 
Text Mining using LDA with Context
Text Mining using LDA with ContextText Mining using LDA with Context
Text Mining using LDA with Context
 
Wwsss intro2016-final
Wwsss intro2016-finalWwsss intro2016-final
Wwsss intro2016-final
 
10 Years Web Science
10 Years Web Science10 Years Web Science
10 Years Web Science
 
Semantic Web Technologies: Principles and Practices
Semantic Web Technologies: Principles and PracticesSemantic Web Technologies: Principles and Practices
Semantic Web Technologies: Principles and Practices
 
Closing Session ISWC 2015
Closing Session ISWC 2015Closing Session ISWC 2015
Closing Session ISWC 2015
 

Kürzlich hochgeladen

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Managing Online Business Communities

  • 1. Managing Online Business Communities The ROBUST Project Steffen Staab, Thomas Gottron & ROBUST Project Team EC Project 257859
  • 2. Business Communities • Information ecosystems – Employees – Business Partners, Customers – General Public Valuable asset Risks Opportunities
  • 3. Use Cases Lotus Connections SAP Community Network (SCN) MeaningMine Communities Communities Communities • Employees • Customers • Social media • Working groups • Partners • News • Interest Groups • Suppliers • Web fora • Projects • Developers • Public communities Business value Business value Business value • Task relevant information • Products support • Topics • Collaboration • Services • Opinions • Innovation • Find business partners • Service for partners Volume Volume Volume • 4,000 posts/day • 6,000 posts/day • 1,400,000 posts/day • 386,000 employees • 1,700,000 subscribers • 708,000 web sources • 1.5GB content/day • 16GB log/day • 45GB content/day Employees Business Partners Public Domain Intranet Extranet Internet
  • 4. High Level Architecture Risk Dashboard Community Community Risk Communication Bus Simulation Management Cloud Based Data User Roles Management and Models Community Analysis Community Data
  • 5. Requirements Community Risk Management • Risk Formalization Community Analysis • Risk Detection • Risk Forecasting • Feature Selection • Risk Management • Structural Analysis • Risk Visualization • Behavior Analysis • Content Analysis User Roles and Models • Cross Community • User Needs Community Simulation • Motivation • Roles • Community Models • Groups • Policy Models • Community Value • Influence • Simulation Cloud Based Data Management • Prediction • Operations on Data • Scalability • Real Time • Parallel Execution • Stream Based
  • 6. The ROBUST platform Risk Dashboard and Visualizations
  • 7. A new Role: the Community Manager • Definition of risks/opportunities • Monitoring • Interaction with community • Reaction and countermeasures  Need for a control center
  • 8. Risk matrix visualization Risks ordered by probability Negative impacts Positive impacts A single risk can have multiple impacts
  • 10. Community Analysis, User Roles & Risk Management Determine Risk: Users Leaving the Community
  • 11. “Churn” • Users churn = Users leave the community, become inactive • Questions: – Why do users leave the community? – Who is leaving the community? – Impact? • Tasks: – Detect “Churn” – Predict “Churn” – Evaluate “Churn” Angeletou et al. ISWC 2011
  • 12. What is Churn? Activity? No churn No churn Activity bursts Holiday No churn Churn! Inconsistent Cease to be active
  • 14. Network Effects • Churn: churning users influence other users they communicate with First Churner First influenced Churner • Temporal ordering: Churners that churned subsequent to each other • Frequently, after a slow start, Cascade resulting in a cascade of churning Churn of a single user Influences his neighbours Karnstedt et al. WebSci 2011
  • 15. Role Analysis: who churns? • A role represents the standing, or part, that a user has within a given community Role Dimension Reciprocity Initialisation Persistence Popularity Elitist high: Threads low low: Neighbours Joining low high Conversationalist Popular Initiator high high Supporter medium low medium Taciturn low low Ignored low: No replies
  • 16. Role Analysis: Approach Reciprocity, initialisation, Feature levels change with the persistence, popularity dynamics of the community Behaviour Ontology Run rules over each user’s features Based on related work, we associate and derive the community role roles with a collection of feature-to-level composition mappings e.g. in-degree -> high, out-degree -> high Angeletou et al. ISWC 2011
  • 17. Churn Analysis on Roles • Role composition in community • Development towards an unhealthy role composition!
  • 18. Community & Content Analysis Opportunity Detection: Discovering Interesting Contents
  • 19. IBM Connections Analogies to Twitter!
  • 20. Retweets RT @janedoe: My Follower dear @johndoe had troubles to wake up this @janedoe #morning My dear @johndoe had troubles to wake up this #morning
  • 21. Retweets • Retweet indicates quality – „of interest for others“ • Idea: – Learn to predict retweets! Likelihood of retweet as metric for Interestingness Naveed et al. WebSci 2011
  • 22. Retweets: Prediction model Aim: Prediction of probability of retweet Logistic regression: 1 f (z) = -z 1+e z = w0 + w1x1 ++w2 x2 +¼+ wn xn Model parameters wi learned on training data Dataset Users Tweets Retweets Choudhury 118,506 9,998,756 7.89% Choudhury (extended) 277,666 29,000,000 8.64% Petrovic 4,050,944 21,477,484 8.46%
  • 23. Logistic Regression: Weights Feature Dimensions Weight Constant (intercept) -5.45 Direct message -147.89 Username 146.82 Message feature Hashtag 42.27 URL 249.09 Valence -26.88 Sentiment Arousal 33.97 Dominance 19.56 Positive -21.8 Emoticons Negative 9.94 Positive 13.66 Exclamation Negative 8.72 ! -16.85 Punctuation ? 23.67 Terms Odds 19.79
  • 24. Logistic Regression: Topic Weights Topic Weight social media market post site web tool traffic network 27.54 follow thank twitter welcome hello check nice cool people 16.08 credit money market business rate economy home 15.25 christmas shop tree xmas present today wrap finish 2.87 home work hour long wait airport week flight head -14.43 twitter update facebook account page set squidoo check -14.43 cold snow warm today degree weather winter morning -26.56 night sleep work morning time bed feel tired home -75.19
  • 25. Re-Ranking using Interestingness • Top-k interesting tweets for „beer“ Rang Username Tweet 1 BeeracrossTX UK beer mag declares "the end of beer writing." @StanHieronymus says not so in the US. http://bit.ly/424HRQ #beer 2 narmmusic beer summit @bspward @jhinderaker no one had billy beer? heehee #narm - beer summit @bspward @jhinde http://tinyurl.com/n29oxj 3 beeriety Go green and turn those empty beer bottles into recycled beer glasses! | http://bit.ly/2src7F #beer #recycle (via: @td333) 4 hblackmon Great Divide beer dinner @ Porter Beer Bar on 8/19 - $45 for 3 courses + beer pairings. http://trunc.it/172wt 5 nycraftbeer Interesting Concept-Beer Petitions.com launches&hopes 2help craft beer drinkers enjoy beer they want @their fave pubs. http://bit.ly/11gJQN 6 carichardson Beer Cheddar Soup: Dish number two in my famed beer dinner series is Beer Cheddar Soup. I hadn’t had too.. http://bit.ly/1diDdF 7 BeerBrewing New York City Beer Events - Beer Tasting - New York Beer Festivals - New York Craft Beer http://is.gd/39kXj #beer 8 delphiforums Love beer? Our member is trying to build up a new beer drinker's forum. Grab a #beer and join us: http://tr.im/pD1n 9 Jamie_Mason #Baltimore Beer Week continues w/ a beer brkfst, beer pioneers luncheon, drink & donate event, beer tastings & more. http://ping.fm/VyTwg 10 carichardson Seattle and Beer: I went to Seattle last weekend. It was my friend’s stag - he likes beer - we drank beer.. http://tinyurl.com/cpb4n9 Naveed et al. CIKM 2011
  • 26. LiveTweet http://livetweet.west.uni-koblenz.de/ Che Alhadi et al. TREC 2011, ECIR 2012
  • 27. Compare Interestingness to Detect Trends Tomorrow I am going to … … play tennis … play golf … do some gardening … spread wisdom … go to a Justin Bieber concert … win the lottery
  • 28. Community Simulation Risk Mitigation: Simulating Effects of Policy Changes
  • 29. Policies • Governance of Communities – Def.: Steering and coordinating actions of community members • Implementation: – Direct intervention of community owner – Functionality of the community platform • Mitigation of risks: – Change platform functionality – Impact? Schwagereit WebSci 2011
  • 30. Governance by Policy Change What if using Policy 2 ? Policy 1 Community Manager Prediction Policy 2
  • 31. Scenario: Policy Controlled Content Generation • Users generate content • Users search content • Users consume content • Users interact with content – Rate content – Reply to content • Where can a (platform) policy have effects?
  • 32. Policy impacts on user behaviour Policy User Need for Content Consumption Content Content Rating Replying Selection Assessment User Need for Content Creation Content Content Creation Evaluation Policy
  • 33. Attention Management • Part of Community: forum activity • Goal: Steer user activity to specific forums • User model parameters: – Activity rate for creating threads – Activity rate for creating replies – Preferences for activity in specific forums • Community Model – Varied restrictions for thread creation • Observed Metrics – Response time on threads
  • 34. Simulation based on observed parameters Online Community Metrics of Community Community Community Artifacts Activity Community Community Members Platform (Persons) (Software) Analysis Comparison Abstraction User Model Community + Platform Parameters Model Metrics of Simulated Simulated Simulated Community Community Community Artifacts Activity Simulation
  • 35. Variable Parameters indicated by different policy • Users search content – Presentation  Ranking threads in content views • Recency • Social Closeness • Topical closeness • Popularity – Observation: Influence which questions are answered • Users generate content – Restrict number of questions asked per forum • Users turn to other questions – Observation: Response time in some fora reduced
  • 37. Community Information • Examples – Male persons who have a public profile document – Computing science papers authored by social scientists – American actors who are also politicians and are married to a model. • Maybe specific databases available: – Person search engines – Bibliographic databases – Movie database
  • 38. Linked Data Semantic Web Technology to 1. Provide structured data on the web 2. Link data across data sources Thing Thing Thing Thing Thing Thing Thing Thing Thing Thing typed typed typed typed links links links links A B C D E
  • 40. Querying linked data SELECT ?x WHERE { ?x rdfs:type foaf:Person . ?x rdfs:type pim:Male . ?x foaf:maker ?y . ?y rdfs:type foaf:PersonalProfileDocument . }
  • 41. Querying linked data – using an index SELECT ?x WHERE { ?x rdfs:type foaf:Person . ?x rdfs:type pim:Male . ?x foaf:maker ?y . ?y rdfs:type foaf:PersonalProfileDocument . }
  • 42. Schema Index Overview Konrath et al. BTC 2011, JWS 2012
  • 43. How it works ... SELECT ?x FROM … WHERE { ?x rdfs:type foaf:Person . ?x rdfs:type pim:Male . ?x foaf:maker ?y . ?y rdfs:type foaf:PersonalProfileDocument . } pim: foaf: foaf: Male Person PPD rdfs:type rdfs:type rdfs:type ?x ?y foaf:maker
  • 44. How it works ... pim:Male foaf:Person foaf:PPD TC_18 TC_76 5_76 maker EQC_5 foaf:maker DS 1 DS 2
  • 45. Building the Index from a Stream  Stream of data (coming from a LD crawler) … D16, D15, D14, D13, D12, D11, D10, D9, D8, D7, D6, D5, D4, D3, D2, D1 FiFo 1 C3 4 6 C2 3 4 2 C2 2 1 3 C1 5
  • 46. Does it work good? Comparison of stream based vs. Gold standard Schema on 11 M triple data set
  • 47. Managing Business Communities Lessons learned
  • 48. Conclusions • Business communities vary with regard to – Interaction – Interests – Type of conversation • Novel analysis techniques needed: – Integration of different data sources – Simulation of policy changes – Value of users • Concrete needs confirmed by project external companies looking for such technology
  • 49. References • S. Angeletou, M. Rowe, H. Alani. Modelling and analysis of user behaviour in online communities. In Proceedings of the 10th international conference on The semantic web - Volume Part I, • A. Che Alhadi, T. Gottron, J. Kunegis, N. Naveed. Livetweet: Microblog retrieval based on interestingness. In TREC’11: Proceedings of the Text Retrieval Conference 2011, 2011. • A. Che Alhadi, T. Gottron, J. Kunegis, N. Naveed. Livetweet: Monitoring and predicting interesting microblog posts. In ECIR’12: Proceedings of the 34th European Conference on Information Retrieval, 2012. • M. Karnstedt, M. Rowe, J. Chan, H. Alani, C. Hayes. The effect of user features on churn in social networks. In WebSci ’11: Proceedings of the 3rd International Conference on Web Science, 2011. • M. Konrath, T. Gottron, A. Scherp. Schemex – web-scale indexed schema extraction of linked open data. In Semantic Web Challenge, Submission to the Billion Triple Track, 2011. • M. Konrath, T. Gottron, S. Staab, A. Scherp. Schemex—efficient construction of a data catalogue by stream-based indexing of linked data. Journal of Web Semantics, 2012. to appear. • N. Naveed, T. Gottron, J. Kunegis, A. Che Alhadi. Bad news travel fast: A content-based analysis of interestingness on twitter. In WebSci ’11: Proceedings of the 3rd International Conference on Web Science, 2011. • N. Naveed, T. Gottron, J. Kunegis, A. Che Alhadi. Searching microblogs: Coping with sparsity and document quality. In CIKM’11: Proceedings of 20th ACM Conference on Information and Knowledge Management, 2011. • F. Schwagereit, A. Scherp, S. Staab. Survey on governance of user-generated content in web communities. In WebSci ’11: Proceedings of the 3rd International Conference on Web Science, 2011.
  • 50. Thank You! EC Project 257859

Hinweis der Redaktion

  1. Business communities slightly different:Grouped around a business/enterpriseAim: add value to business (Knowledge management, public relations, customer aquisition, support, etc.)Conclusion: communities have a value, that needs to be taken care ofValue is endangered by risks (e.g. experts leaving) or might be increased by seizing opportunities (e.g. connect people working on the same topic)
  2. Intrinsic: features of the service or product.Extrinsic: External factors. E.g. Peer pressure.Expectation of eventual payback in terms of information, Agreeable social relations. Enhancement of reputation, Spreading of their ideas , In this way there exists an Implicit Social Contract