SlideShare ist ein Scribd-Unternehmen logo
1 von 60
Using Behaviour Analysis to
Detect Cultural Aspects in Social
         Web Systems
                     Dr Matthew Rowe
         Knowledge Media Institute, The Open University,
                Milton Keynes, United Kingdom

http://people.kmi.open.ac.uk/rowe | http://www.matthew-rowe.com
Web 1.0



                   •   Web of documents
                   •   Web presence constrained to HTML ‘experts’
                   •   Fixed categories
                   •   Static content




                                             http://www.flickr.com/photos/complexify/97303317/

Using Behaviour Analysis to Detect Cultural Aspects in                               1
Social Web Systems
Web 2.0



                   •   Data access through APIs
                   •   Collective Intelligence
                   •   User generated content
                   •   Web presence for all
                   •   Tagging




                                         http://www.flickr.com/photos/9119028@N05/591163479

Using Behaviour Analysis to Detect Cultural Aspects in                             2
Social Web Systems
A Social Web




               A Social Web System is an online platform that
               offers a useful service, normally for free, to
               users, through which they can interact and network




                                 http://mmt.me.uk/slides/deri20110401/images/walledgardens.jpg

Using Behaviour Analysis to Detect Cultural Aspects in                               3
Social Web Systems
Example 1




Using Behaviour Analysis to Detect Cultural Aspects in   4
Social Web Systems
Example 2




Using Behaviour Analysis to Detect Cultural Aspects in   5
Social Web Systems
Δs of Social Web
                                     Systems
• Social Web Systems differ in their:
     – Domain
          • Flickr = photos
          • Facebook = social networking
          • Twitter = microblogging
     – Audience
          • SAP Community Network = programmers
          • Slashdot = technology enthusiasts


• How else do they differ?
• What are the Δs?

Using Behaviour Analysis to Detect Cultural Aspects in   6
Social Web Systems
The Utility of
                                     Behaviour Analysis
•   WeGov
     – Investigating the role of social networks in eGovernment
     – Enabling:
          • Tracking of political discussions and topics
          • Injection of policy content to maximise exposure


•   ROBUST
     – Risk and opportunity management in online communities
     – Enabling
          • Assessment of user churn in online communities
          • Community evolution prediction
          • Monitoring of community health


• Behaviour analysis is required to understand:
     – What behaviour drives content creation
     – How behaviour is associated with community evolution


Using Behaviour Analysis to Detect Cultural Aspects in            7
Social Web Systems
Thesis: Microcultures




        Social Web Systems contain micro-cultures
        that differ in terms of
           a) user behaviour
           b) how attention is generated
           c) role compositions in such systems




Using Behaviour Analysis to Detect Cultural Aspects in   8
Social Web Systems
Outline
• Analysis 1: Generating Attention
     –   Understanding Attention Factors
     –   Approach
     –   Experiments
     –   Findings
• Analysis 2: Behaviour Role Compositions
     –   Analysing Community Evolution
     –   Approach
     –   Experiments
     –   Findings
• Microcultures: Evidence
Using Behaviour Analysis to Detect Cultural Aspects in   9
Social Web Systems
Analysis I

Generating Attention
Using Behaviour Analysis to Detect Cultural Aspects in   10
Social Web Systems
Shared Content
• Social Web Systems are now used to:
     – Ask questions
     – Post opinions and ideas
     – Discuss events and current issues


• Content analysis in online communities is attractive for:
     – Market analysis
     – Brand consensus and product opinion


• Social network analytics in the US is predicted to reach
  $1 billion by 2014 (Forrester 2009)

• Masses of data is now being published in social web systems:
     – Facebook has more than 60 million status updates per day   (Facebook statistics
        2010)




Using Behaviour Analysis to Detect Cultural Aspects in                              11
Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in   12
Social Web Systems
The Need for
                                     Analysis
• Analysts need to know which piece of content will generate
  the most activity
   – i.e. the most auspicious or influential
   – Helps focus the attention of human and computerised
     analysts
          • What to track?


• Need to understand the effect features (community and
  content) have on attention to content

            Which features are key to stimulating activity?
           How do these features influence activity length?

How do Social Web Systems differ in how attention is generated?


Using Behaviour Analysis to Detect Cultural Aspects in         13
Social Web Systems
Approach:
                                     Attention Prediction
• Two-stage approach to predict attention to content:

     1. Identify seed posts
          • E.g. thread starters on a message board
          • Will a given post start a discussion?
          • What are the properties that seed posts exhibit?
               – What parameters tend to trigger a discussion?


     2. Predict discussion activity levels
          • From the identified seed posts
          • What is the level of activity that a seed post will
            generate?
          • What features correlate with heightened activity?


Using Behaviour Analysis to Detect Cultural Aspects in            14
Social Web Systems
Features

         Which features are key to stimulating activity?

• For each post, model: a) the author, b) the content and
  c) the topical concentration of the author

• F1: User Features
     – In-degree, out-degree: social network properties of the author
     – Post count, age, post rate: participation information of the author
• F2: Content Features
     – Post length, referral count, time in day: surface features of the
       post
     – Complexity: cumulative entropy of terms in the post
     – Readability: Gunning Fog index of the post
     – Informativeness: TF-IDF measure of terms within the post
     – Polarity: average sentiment of terms in the post
Using Behaviour Analysis to Detect Cultural Aspects in                     15
Social Web Systems
Features (2)
• F3: Focus Features
     – Topic entropy: the concentration of the author across
       community forums
          • Higher entropy indicates a wider spread of forum activity
          • More random distribution, less concentrated
     – Topic Likelihood: the likelihood that a user posts in a
       specific forum given his post history
          • Measures the affinity that a user has with a given forum
          • Lower likelihood indicates a user posting on an unfamiliar
            topic




Using Behaviour Analysis to Detect Cultural Aspects in                   16
Social Web Systems
Social Web Systems:
                                     Datasets



• Microblogging Platform: Twitter
   – Collected a random subset over 24-hour period
   – Attention measure: length of @reply chain
• Community Message Board: Boards.ie
   – Analysed all posts and forums in 2006
   – Attention measure: number of posts in a thread
• Support Forum: SAP Community Network
   – Attention measure: number of replies to a question
• News-sharing Platform: Digg
     – Used previous dataset of ‘popular’ stories
     – Attention measure: number of comments (and replies) to a story

Using Behaviour Analysis to Detect Cultural Aspects in              17
Social Web Systems
Experiments
• Experiment 1: Identifying Seed Posts
     – Will this post yield a reply?
     – Experiment 1(a): Model Selection
          • Which model performs best?
     – Experiment 1(b): Feature Assessment
          • How do features correlate with seed posts?
     – Datasets: Twitter and Boards.ie

• Experiment 2: Activity Level Prediction
     – What is the level of activity that seed posts yield?
     – Experiment 2(a): Model Selection
     – Experiment 2(b): Feature Assessment
          • How do features correlate with heightened attention?
     – Datasets: Twitter, Boards.ie, SCN and Digg

Using Behaviour Analysis to Detect Cultural Aspects in             18
Social Web Systems
Experiments
• Experiment 1: Identifying Seed Posts
     – Will this post yield a reply
     – Experiment 1(a): Model Selection
          • Which model performs best?
     – Experiment 1(b): Feature Assessment
          • How do features correlate with seed posts?
     – Datasets: Twitter and Boards.ie

• Experiment 2: Activity Level Prediction
     – What is the level of activity that seed posts yield?
     – Experiment 2(a): Model Selection
     – Experiment 2(b): Feature Assessment
          • How do features correlate with heightened attention?
     – Datasets: Twitter, Boards.ie, SCN and Digg

Using Behaviour Analysis to Detect Cultural Aspects in             19
Social Web Systems
Results: 1(a) Model
                                     Selection
• Which model performs best?

           Twitter                                       Boards.ie




Using Behaviour Analysis to Detect Cultural Aspects in               20
Social Web Systems
Results: 1(b) Feature
                                     Assessment
• How do features correlate with seed posts?




Using Behaviour Analysis to Detect Cultural Aspects in   21
Social Web Systems
Results: 1(b) Feature
                                      Assessment


 Twitter




Boards.ie




 Using Behaviour Analysis to Detect Cultural Aspects in   22
 Social Web Systems
Experiments
• Experiment 1: Identifying Seed Posts
     – Will this post yield a reply
     – Experiment 1(a): Model Selection
          • Which model performs best?
     – Experiment 1(b): Feature Assessment
          • How do features correlate with seed posts?
     – Datasets: Twitter and Boards.ie

• Experiment 2: Activity Level Prediction
     – What is the level of activity that seed posts yield?
     – Experiment 2(a): Model Selection
     – Experiment 2(b): Feature Assessment
          • How do features correlate with heightened attention?
     – Datasets: Twitter, Boards.ie, SCN and Digg

Using Behaviour Analysis to Detect Cultural Aspects in             23
Social Web Systems
Activity Distribution


 Twitter                                                            Boards.ie


1. Predict a ranking
2. Compare ranking against ground truth
3. Measure using Normalised Discounted Cumulative Gain @ varying ranks (k)
       • k={1,5,10,20,50,100}
4. Best model: highest nDCG averaged over k


  SCN                                                                 Digg




Using Behaviour Analysis to Detect Cultural Aspects in                       24
Social Web Systems
Results: 2(a) Model
                                     Selection
• Which model performs best?




Using Behaviour Analysis to Detect Cultural Aspects in   25
Social Web Systems
Results: 2(b) Feature
                                     Assessment
• How do features correlate with heightened attention?




Using Behaviour Analysis to Detect Cultural Aspects in   26
Social Web Systems
Results: 2(b) Feature
                                     Assessment
• How do features correlate with heightened attention?



                                         •   Heightened Activity on Twitter=
                                                 • Shorter posts
                                                 • Denser vocabulary
                                                 • Fewer hyperlinks
                                                 • Earlier in the day!




Using Behaviour Analysis to Detect Cultural Aspects in                         27
Social Web Systems
Results: 2(b) Feature
                                     Assessment
• How do features correlate with heightened attention?



                                                   •     Heightened Activity on Boards.ie=
                                                             • Concentrated topics
                                                             • Longer posts
                                                             • Wider vocabulary
                                                             • Fewer referrals
                                                             • Negative sentiment




Using Behaviour Analysis to Detect Cultural Aspects in                             28
Social Web Systems
Results: 2(b) Feature
                                     Assessment
• How do features correlate with heightened attention?



  •   Heightened Activity on SCN=
          • Less author participation
          • Contacted fewer people
          • User contacted by many people
          • Longer posts
          • Wider vocabulary
          • More hyperlinks




Using Behaviour Analysis to Detect Cultural Aspects in   29
Social Web Systems
Results: 2(b) Feature
                                     Assessment
• How do features correlate with heightened attention?



                      •   Heightened Activity on Digg=
                              • Concentrated topics
                              • Longer posts
                              • Later in the day
                              • Familiar community terms




Using Behaviour Analysis to Detect Cultural Aspects in     30
Social Web Systems
Generating Attention:
                                         Findings
     How do Social Web Systems differ in how attention is generated?

    •   Commonalities
         – Fewer hyperlinks for Microblogging platforms and discussion message
           boards
         – Use familiar language to the community
         – Negative content yields more activity
         – Activity distribution
            What drives attention in one system is not the
    •   Idiosyncrasies    same as another
         – More hyperlinks on support forums
         – Lower topic affinity on news-sharing system
         – Models differ: a) best performing, b) coefficients:
              • Content: Twitter
              • User: Boards.ie, SCN
              • Focus: Digg

Anticipating Discussion Activity on Community Forums. M Rowe, S Angeletou and H Alani. The
Third IEEE International Conference on Social Computing. Boston, USA. (2011)

    Using Behaviour Analysis to Detect Cultural Aspects in                         31
    Social Web Systems
Analysis II

Behaviour Role
Compositions
Using Behaviour Analysis to Detect Cultural Aspects in   32
Social Web Systems
Online Communities in
                                     Social Web Systems
• Social Web Systems support online communities to
  function and grow, enabling:
     – Idea generation
     – Customer support
     – Problem solving


• Managing and hosting communities can be
     – Expensive
     – Time-consuming


• Social Web Systems have large investments, therefore
  they must:
     – flourish and remain active
     – remain… ‘healthy’

Using Behaviour Analysis to Detect Cultural Aspects in   33
Social Web Systems
Increased Community
                                     Activity



  What did the community look like at the point?




Using Behaviour Analysis to Detect Cultural Aspects in   34
Social Web Systems
Decreased
                                     Community Activity


                  What were the conditions
                  at this point?




Using Behaviour Analysis to Detect Cultural Aspects in   35
Social Web Systems
The Need to Assess
                                     Behaviour
• How can we gauge community health?
     – Post Count?
     – Communication/Interaction?
     – Behaviour?


• Domination of one behaviour could lead to churn
     – Preece, 2000
• Behaviour in online community is influenced by the roles that
  users assume
     – Preece, 2001


• To provide health insights we need to monitor behaviour over
  time
     – Combined with basic health metrics (e.g. post count)
• Enabling detection of how behaviour differs between systems

Using Behaviour Analysis to Detect Cultural Aspects in        36
Social Web Systems
Modelling, Representing and
                                     Tracking Behaviour: How?

• Users exhibit different behaviour in different contexts:
     – How can we model user behaviour and represent its change over
       time?


• According to [Chan et al, 2010] users can be classified by
  their community role:
     – What behaviour correlates with community roles?
     – How can we label users as the system changes?


• Communities evolve and change over time:
     – Is there a correlation between community composition and
       health?
     – Can we predict community changes based on composition data?


 How do Social Web Systems differ in terms of behaviour?

Using Behaviour Analysis to Detect Cultural Aspects in               37
Social Web Systems
Behaviour Ontology
•   How can we model user behaviour and represent its change over
    time?




                                                   http://purl.org/net/oubo/0.3
Using Behaviour Analysis to Detect Cultural Aspects in                    38
Social Web Systems
Behaviour Features
• In-degree Ratio
     – Proportion of users that reply to user      ui
• Posts Replied Ratio
     – Proportion of posts by     ui that yield a reply
• Thread Initiation Ratio
     – Proportion of threads started by      ui
• Bi-directional Threads Ratio
     – Proportion of threads where      ui is involved in a reciprocal action
• Bi-directional Neighbours Ratio
     – Proportion of    ui‘s neighbours with whom a reciprocal action has
       taken place
• Average Posts per Thread
     – Mean number of posts in the threads that           ui has participated in
• Standard Deviation of Posts per Thread
     – Standard deviation of posts in the threads that         ui has posted in
Using Behaviour Analysis to Detect Cultural Aspects in                             39
Social Web Systems
Behaviour Roles


       Elitist
       Grunt
       Joining Conversationalist
       Popular Initiator
       Popular Participant
       Supporter
       Taciturn        Jeffrey Chan, Conor Hayes, and Elizabeth Daly. Decomposing
                       discussion forums using common user roles. In Proc. Web Science
       Ignored         Conf. (WebSci10), Raleigh, NC: US, 2010.




Using Behaviour Analysis to Detect Cultural Aspects in                        40
Social Web Systems
Behaviour Roles (2)
•   What behaviour correlates with community roles?
                    T abl e 1. Roles and t he feat ure-t o-level mappings

             R ol e                              Feat ur e                                      L evel
             E l i t i st                        I n-D egr ee R at i o                           l ow
                                                 B i -di r ect i onal T hr eads R at i o        hi gh
                                                 B i -di r ect i onal N ei ghb our s R at i o    l ow
             G r unt                             B i -di r ect i onal T hr eads R at i o        m ed
                                                 B i -di r ect i onal N ei ghb our s R at i o    m ed
                                                 A ver age Post s p er T hr ead                  l ow
                                                 ST D of Post s p er T hr ead                    l ow
             Joi ni ng Conver sat i onal i st    T hr ead I ni t i at i on R at i o              l ow
                                                 A ver age Post s p er T hr ead                 hi gh
                                                 ST D of Post s p er T hr ead                   hi gh
             Popul ar I ni t i at or             I n-D egr ee R at i o                          hi gh
                                                 T hr ead I ni t i at i on R at i o             hi gh
             Popul ar Par t i ci pant s          I n-D egr ee R at i o                          hi gh
                                                 T hr ead I ni t i at i on R at i o              l ow
                                                 A ver age Post s p er T hr ead                 m ed
                                                 ST D of Post s p er T hr ead                   m ed
             Supp or t er                        I n-D egr ee R at i o                           m ed
                                                 B i -di r ect i onal T hr eads R at i o        m ed
                                                 B i -di r ect i onal N ei ghb our s R at i o    m ed
             T aci t ur n                        B i -di r ect i onal T hr eads R at i o         l ow
                                                 B i -di r ect i onal N ei ghb our s R at i o    l ow
                                                 A ver age Post s p er T hr ead                  l ow
                                                 ST D of Post s p er T hr ead                    l ow
             I gnor ed                           Post s R epl i ed R at i o                      l ow




Using Behaviour Analysis to Detect Cultural Aspects in                                                   41
Social Web Systems
Constructing and
                                          Applying Rules
•   How can we label users as the system changes?

Structural, social network,                   Feature levels change with the
reciprocity, persistence, participation       dynamics of the community




Run rules over each user’s features           Based on related work, we associate
and derive the community role composition     roles with a collection of feature-to-level
                                              mappings
                                              e.g. in-degree -> high, out-degree -> high

Using Behaviour Analysis to Detect Cultural Aspects in                                  42
Social Web Systems
Composition vs
                                     Activity
• Is there a correlation between community composition and
  health?

• Community Message board: Boards.ie
     – All posts used from 2004 – 2006
     – Selected 3 forums for analysis
          • F246: Commuting and Transport
          • F388: Rugby
          • F411: Mobile Phones and PDAs
• Support Forum: Tiddlywiki
     – Software development forum used by BT’s development team


• Measured at 12-week increments:
     – Forum composition (% of roles)
          • E.g. 20% elitists, 10% grunts, etc
     – Number of posts
Using Behaviour Analysis to Detect Cultural Aspects in            43
Social Web Systems
Correlation Results
                                     (1): Boards.ie




                      Forum 246 – Commuting and Transport


Using Behaviour Analysis to Detect Cultural Aspects in      44
Social Web Systems
Correlation Results
                                      (2): Boards.ie




Forum 246 – Commuting             Forum 388 – Rugby       Forum 411 – Mobile Phones
     and Transport                                                and PDAs




 Using Behaviour Analysis to Detect Cultural Aspects in                     45
 Social Web Systems
Correlation Results:
                                     Tiddlywiki




Using Behaviour Analysis to Detect Cultural Aspects in   46
Social Web Systems
Evolution Results (1):
                                     Boards.ie




                      Forum 246 – Commuting and Transport


Using Behaviour Analysis to Detect Cultural Aspects in      47
Social Web Systems
Evolution Results (2):
                                      Boards.ie




Forum 246 – Commuting             Forum 388 – Rugby       Forum 411 – Mobile Phones
     and Transport                                                and PDAs




 Using Behaviour Analysis to Detect Cultural Aspects in                     48
 Social Web Systems
Evolution Results:
                                     Tiddlywiki




Using Behaviour Analysis to Detect Cultural Aspects in    49
Social Web Systems
Predicting Community
                                     Health
• Can we predict community changes based on
  composition data?

1. Activity Change Detection:
    –    Predict either an increase or decrease in activity
    –    Features: roles and percentages
    –    Class label: increase/decrease
    –    Performed 10-fold cross validation with J48 decision tree


2. Post Count Prediction:
    –    Predict post count from role composition
    –    Independent variables: roles and percentages
    –    Dependent variable: post count
    –    Induced linear regression model and assessed the model
Using Behaviour Analysis to Detect Cultural Aspects in           50
Social Web Systems
Activity Change
                                     Detection

                                     Boards.ie




                                    Tiddlywiki




Using Behaviour Analysis to Detect Cultural Aspects in   51
Social Web Systems
Post Count
                                     Prediction
                                    Boards.ie




                                   Tiddlywiki




Using Behaviour Analysis to Detect Cultural Aspects in   52
Social Web Systems
Post Count
                                     Prediction
                                    Boards.ie




                                   Tiddlywiki
              •   Increased Community Activity on Boards.ie =
                       • More initiators
                       • More participants
                       • Less supporters
                       • Fewer ignored



Using Behaviour Analysis to Detect Cultural Aspects in          53
Social Web Systems
Post Count
                                     Prediction
                                    Boards.ie

              •   Increased Community Activity on Tiddlywiki =
                       • More conversationalists
                       • More initiators
                       • Fewer supporters
                       • Fewer ignored




                                   Tiddlywiki




Using Behaviour Analysis to Detect Cultural Aspects in           54
Social Web Systems
Clustering Communities
                                     by Composition




Using Behaviour Analysis to Detect Cultural Aspects in   55
Social Web Systems
Behaviour Role
                                         Compositions: Findings
    • How do Social Web Systems differ in terms of
      behaviour?

    • Commonalities
         – No grunts in either system
         – Increase in ignored users and supporters decreases health
         – Increase in initiators increases activity


    • Idiosyncrasies
         – No elitists found on support-forum
         – Conversationalists improve activity in certain cases
         – Optimum behaviour compositions differ

Modelling and Analysis of User Behaviour in Online Communities. S Angeletou, M Rowe and H
Alani. International Semantic Web Conference. Bonn, Germany. (2011)

    Using Behaviour Analysis to Detect Cultural Aspects in                         56
    Social Web Systems
Thesis: Microcultures

Recap

        Social Web Systems contain micro-cultures
        that differ in terms of
           a) user behaviour
           b) how attention is generated
           c) role compositions in such systems




Using Behaviour Analysis to Detect Cultural Aspects in   57
Social Web Systems
Microcultures:
                                     Evidence
• Social Web Systems contain micro-cultures that differ
  in terms of
     – a) User behaviour
          • Non-existence of roles in certain communities
          • Conversation behaviour important in certain communities
     – b) How attention is generated
          • Differences in optimum prediction models
          • Factors differ in driving activity
               – E.g. referrals, topic affinity
     – c) Role compositions in such systems
          • Intra and inter composition differences


Using Behaviour Analysis to Detect Cultural Aspects in           58
Social Web Systems
Questions?
Web:     http://people.kmi.open.ac.uk/rowe
         http://www.matthew-rowe.com
Email: m.c.rowe@open.ac.uk
Twitter: @mattroweshow




Using Behaviour Analysis to Detect Cultural Aspects in   59
Social Web Systems

Weitere ähnliche Inhalte

Was ist angesagt?

Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery:  Learning from a #selfie #humor Image Set from InstagramCoding Social Imagery:  Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
Shalin Hai-Jew
 

Was ist angesagt? (19)

The evolution of research on social media
The evolution of research on social mediaThe evolution of research on social media
The evolution of research on social media
 
#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks
#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks
#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks
 
Netnography: Overview and How to (Schulich School of Business, MBA class, Soc...
Netnography: Overview and How to (Schulich School of Business, MBA class, Soc...Netnography: Overview and How to (Schulich School of Business, MBA class, Soc...
Netnography: Overview and How to (Schulich School of Business, MBA class, Soc...
 
Ethnography in the virtual world: Methodological opportunities and challenges
Ethnography in the virtual world: Methodological opportunities and challengesEthnography in the virtual world: Methodological opportunities and challenges
Ethnography in the virtual world: Methodological opportunities and challenges
 
Eavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteEavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging Site
 
Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)Social Web 2014: Final Presentations (Part II)
Social Web 2014: Final Presentations (Part II)
 
Lecture 5: Personalization on the Social Web (2014)
Lecture 5: Personalization on the Social Web (2014)Lecture 5: Personalization on the Social Web (2014)
Lecture 5: Personalization on the Social Web (2014)
 
Netnography and Research Ethics: From ACR 2015 Doctoral Symposium
Netnography and Research Ethics: From ACR 2015 Doctoral SymposiumNetnography and Research Ethics: From ACR 2015 Doctoral Symposium
Netnography and Research Ethics: From ACR 2015 Doctoral Symposium
 
Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery:  Learning from a #selfie #humor Image Set from InstagramCoding Social Imagery:  Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
 
Studying young people’s online social practices
Studying young people’s online social practicesStudying young people’s online social practices
Studying young people’s online social practices
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social Semantics
 
Social Data and Multimedia Analytics for News and Events Applications
Social Data and Multimedia Analytics for News and Events ApplicationsSocial Data and Multimedia Analytics for News and Events Applications
Social Data and Multimedia Analytics for News and Events Applications
 
Virtual Ethnography: Bridging the Gap between Market Research and Social Media
Virtual Ethnography: Bridging the Gap between Market Research and Social MediaVirtual Ethnography: Bridging the Gap between Market Research and Social Media
Virtual Ethnography: Bridging the Gap between Market Research and Social Media
 
Netnography webinar
Netnography webinarNetnography webinar
Netnography webinar
 
Linking In-Game Events and Entities to Social Data on the Web
Linking In-Game Events and Entities to Social Data on the WebLinking In-Game Events and Entities to Social Data on the Web
Linking In-Game Events and Entities to Social Data on the Web
 
Mapping big data science
Mapping big data scienceMapping big data science
Mapping big data science
 
NodeXL Research
NodeXL ResearchNodeXL Research
NodeXL Research
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
 
interacting with social media content about events
interacting with social media content about eventsinteracting with social media content about events
interacting with social media content about events
 

Ähnlich wie Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems

eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid network
 
Socialmediametrics Papercliq 101109175746 Phpapp01 1
Socialmediametrics Papercliq 101109175746 Phpapp01 1Socialmediametrics Papercliq 101109175746 Phpapp01 1
Socialmediametrics Papercliq 101109175746 Phpapp01 1
kurt_hulett
 

Ähnlich wie Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems (20)

Socialcom2011 discussionactivityprediction
Socialcom2011 discussionactivitypredictionSocialcom2011 discussionactivityprediction
Socialcom2011 discussionactivityprediction
 
Monitoring and Analysis of Online Communities
Monitoring and Analysis of Online CommunitiesMonitoring and Analysis of Online Communities
Monitoring and Analysis of Online Communities
 
Social Media Analytics with a pinch of semantics
Social Media Analytics with a pinch of semanticsSocial Media Analytics with a pinch of semantics
Social Media Analytics with a pinch of semantics
 
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the MapNew Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
 
Visualising activity in learning networks using open data and educational ...
Visualising activity in learning networks   using open data and educational  ...Visualising activity in learning networks   using open data and educational  ...
Visualising activity in learning networks using open data and educational ...
 
SRS presentation
SRS presentationSRS presentation
SRS presentation
 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
 
Making More Sense Out of Social Data
Making More Sense Out of Social DataMaking More Sense Out of Social Data
Making More Sense Out of Social Data
 
Seams2016 presentation calikli_et_al
Seams2016 presentation calikli_et_alSeams2016 presentation calikli_et_al
Seams2016 presentation calikli_et_al
 
Ifip wg-galway-
Ifip wg-galway-Ifip wg-galway-
Ifip wg-galway-
 
Lecture4 Social Web
Lecture4 Social Web Lecture4 Social Web
Lecture4 Social Web
 
Webometrics and Studies of Cultural Diffusion -Psy Gangnam Style on YouTube
Webometrics and Studies of Cultural Diffusion-Psy Gangnam Style on YouTubeWebometrics and Studies of Cultural Diffusion-Psy Gangnam Style on YouTube
Webometrics and Studies of Cultural Diffusion -Psy Gangnam Style on YouTube
 
Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...
 
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and Visualisation
 
Web 2.0 Meets Standards
Web 2.0 Meets StandardsWeb 2.0 Meets Standards
Web 2.0 Meets Standards
 
Social Media Metrics
Social Media MetricsSocial Media Metrics
Social Media Metrics
 
Socialmediametrics Papercliq 101109175746 Phpapp01 1
Socialmediametrics Papercliq 101109175746 Phpapp01 1Socialmediametrics Papercliq 101109175746 Phpapp01 1
Socialmediametrics Papercliq 101109175746 Phpapp01 1
 
Social Media Metrics
Social Media MetricsSocial Media Metrics
Social Media Metrics
 
ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4
 

Mehr von Matthew Rowe

Existing Research and Future Research Agenda
Existing Research and Future Research AgendaExisting Research and Future Research Agenda
Existing Research and Future Research Agenda
Matthew Rowe
 
Modelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online CommunitiesModelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online Communities
Matthew Rowe
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
Matthew Rowe
 
Forecasting Audience Increase on Youtube
Forecasting Audience Increase on YoutubeForecasting Audience Increase on Youtube
Forecasting Audience Increase on Youtube
Matthew Rowe
 
Predicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic WebPredicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic Web
Matthew Rowe
 

Mehr von Matthew Rowe (20)

Social Computing Research with Apache Spark
Social Computing Research with Apache SparkSocial Computing Research with Apache Spark
Social Computing Research with Apache Spark
 
Predicting Online Community Churners using Gaussian Sequences
Predicting Online Community Churners using Gaussian SequencesPredicting Online Community Churners using Gaussian Sequences
Predicting Online Community Churners using Gaussian Sequences
 
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
 
SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings
SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting RatingsSemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings
SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings
 
The Semantic Evolution of Online Communities
The Semantic Evolution of Online CommunitiesThe Semantic Evolution of Online Communities
The Semantic Evolution of Online Communities
 
From Mining to Understanding: The Evolution of Social Web Users
From Mining to Understanding: The Evolution of Social Web UsersFrom Mining to Understanding: The Evolution of Social Web Users
From Mining to Understanding: The Evolution of Social Web Users
 
Mining User Lifecycles from Online Community Platforms and their Application ...
Mining User Lifecycles from Online Community Platforms and their Application ...Mining User Lifecycles from Online Community Platforms and their Application ...
Mining User Lifecycles from Online Community Platforms and their Application ...
 
From User Needs to Community Health: Mining User Behaviour to Analyse Online ...
From User Needs to Community Health: Mining User Behaviour to Analyse Online ...From User Needs to Community Health: Mining User Behaviour to Analyse Online ...
From User Needs to Community Health: Mining User Behaviour to Analyse Online ...
 
Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...
Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...
Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...
 
Identity: Physical, Cyber, Future
Identity: Physical, Cyber, FutureIdentity: Physical, Cyber, Future
Identity: Physical, Cyber, Future
 
Measuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMeasuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online Communities
 
Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...
Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...
Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...
 
Attention Economics in Social Web Systems
Attention Economics in Social Web SystemsAttention Economics in Social Web Systems
Attention Economics in Social Web Systems
 
What makes communities tick? Community health analysis using role compositions
What makes communities tick? Community health analysis using role compositionsWhat makes communities tick? Community health analysis using role compositions
What makes communities tick? Community health analysis using role compositions
 
Existing Research and Future Research Agenda
Existing Research and Future Research AgendaExisting Research and Future Research Agenda
Existing Research and Future Research Agenda
 
Tutorial: Social Semantics
Tutorial: Social SemanticsTutorial: Social Semantics
Tutorial: Social Semantics
 
Modelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online CommunitiesModelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online Communities
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
 
Forecasting Audience Increase on Youtube
Forecasting Audience Increase on YoutubeForecasting Audience Increase on Youtube
Forecasting Audience Increase on Youtube
 
Predicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic WebPredicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic Web
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems

  • 1. Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems Dr Matthew Rowe Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom http://people.kmi.open.ac.uk/rowe | http://www.matthew-rowe.com
  • 2. Web 1.0 • Web of documents • Web presence constrained to HTML ‘experts’ • Fixed categories • Static content http://www.flickr.com/photos/complexify/97303317/ Using Behaviour Analysis to Detect Cultural Aspects in 1 Social Web Systems
  • 3. Web 2.0 • Data access through APIs • Collective Intelligence • User generated content • Web presence for all • Tagging http://www.flickr.com/photos/9119028@N05/591163479 Using Behaviour Analysis to Detect Cultural Aspects in 2 Social Web Systems
  • 4. A Social Web A Social Web System is an online platform that offers a useful service, normally for free, to users, through which they can interact and network http://mmt.me.uk/slides/deri20110401/images/walledgardens.jpg Using Behaviour Analysis to Detect Cultural Aspects in 3 Social Web Systems
  • 5. Example 1 Using Behaviour Analysis to Detect Cultural Aspects in 4 Social Web Systems
  • 6. Example 2 Using Behaviour Analysis to Detect Cultural Aspects in 5 Social Web Systems
  • 7. Δs of Social Web Systems • Social Web Systems differ in their: – Domain • Flickr = photos • Facebook = social networking • Twitter = microblogging – Audience • SAP Community Network = programmers • Slashdot = technology enthusiasts • How else do they differ? • What are the Δs? Using Behaviour Analysis to Detect Cultural Aspects in 6 Social Web Systems
  • 8. The Utility of Behaviour Analysis • WeGov – Investigating the role of social networks in eGovernment – Enabling: • Tracking of political discussions and topics • Injection of policy content to maximise exposure • ROBUST – Risk and opportunity management in online communities – Enabling • Assessment of user churn in online communities • Community evolution prediction • Monitoring of community health • Behaviour analysis is required to understand: – What behaviour drives content creation – How behaviour is associated with community evolution Using Behaviour Analysis to Detect Cultural Aspects in 7 Social Web Systems
  • 9. Thesis: Microcultures Social Web Systems contain micro-cultures that differ in terms of a) user behaviour b) how attention is generated c) role compositions in such systems Using Behaviour Analysis to Detect Cultural Aspects in 8 Social Web Systems
  • 10. Outline • Analysis 1: Generating Attention – Understanding Attention Factors – Approach – Experiments – Findings • Analysis 2: Behaviour Role Compositions – Analysing Community Evolution – Approach – Experiments – Findings • Microcultures: Evidence Using Behaviour Analysis to Detect Cultural Aspects in 9 Social Web Systems
  • 11. Analysis I Generating Attention Using Behaviour Analysis to Detect Cultural Aspects in 10 Social Web Systems
  • 12. Shared Content • Social Web Systems are now used to: – Ask questions – Post opinions and ideas – Discuss events and current issues • Content analysis in online communities is attractive for: – Market analysis – Brand consensus and product opinion • Social network analytics in the US is predicted to reach $1 billion by 2014 (Forrester 2009) • Masses of data is now being published in social web systems: – Facebook has more than 60 million status updates per day (Facebook statistics 2010) Using Behaviour Analysis to Detect Cultural Aspects in 11 Social Web Systems
  • 13. Using Behaviour Analysis to Detect Cultural Aspects in 12 Social Web Systems
  • 14. The Need for Analysis • Analysts need to know which piece of content will generate the most activity – i.e. the most auspicious or influential – Helps focus the attention of human and computerised analysts • What to track? • Need to understand the effect features (community and content) have on attention to content Which features are key to stimulating activity? How do these features influence activity length? How do Social Web Systems differ in how attention is generated? Using Behaviour Analysis to Detect Cultural Aspects in 13 Social Web Systems
  • 15. Approach: Attention Prediction • Two-stage approach to predict attention to content: 1. Identify seed posts • E.g. thread starters on a message board • Will a given post start a discussion? • What are the properties that seed posts exhibit? – What parameters tend to trigger a discussion? 2. Predict discussion activity levels • From the identified seed posts • What is the level of activity that a seed post will generate? • What features correlate with heightened activity? Using Behaviour Analysis to Detect Cultural Aspects in 14 Social Web Systems
  • 16. Features Which features are key to stimulating activity? • For each post, model: a) the author, b) the content and c) the topical concentration of the author • F1: User Features – In-degree, out-degree: social network properties of the author – Post count, age, post rate: participation information of the author • F2: Content Features – Post length, referral count, time in day: surface features of the post – Complexity: cumulative entropy of terms in the post – Readability: Gunning Fog index of the post – Informativeness: TF-IDF measure of terms within the post – Polarity: average sentiment of terms in the post Using Behaviour Analysis to Detect Cultural Aspects in 15 Social Web Systems
  • 17. Features (2) • F3: Focus Features – Topic entropy: the concentration of the author across community forums • Higher entropy indicates a wider spread of forum activity • More random distribution, less concentrated – Topic Likelihood: the likelihood that a user posts in a specific forum given his post history • Measures the affinity that a user has with a given forum • Lower likelihood indicates a user posting on an unfamiliar topic Using Behaviour Analysis to Detect Cultural Aspects in 16 Social Web Systems
  • 18. Social Web Systems: Datasets • Microblogging Platform: Twitter – Collected a random subset over 24-hour period – Attention measure: length of @reply chain • Community Message Board: Boards.ie – Analysed all posts and forums in 2006 – Attention measure: number of posts in a thread • Support Forum: SAP Community Network – Attention measure: number of replies to a question • News-sharing Platform: Digg – Used previous dataset of ‘popular’ stories – Attention measure: number of comments (and replies) to a story Using Behaviour Analysis to Detect Cultural Aspects in 17 Social Web Systems
  • 19. Experiments • Experiment 1: Identifying Seed Posts – Will this post yield a reply? – Experiment 1(a): Model Selection • Which model performs best? – Experiment 1(b): Feature Assessment • How do features correlate with seed posts? – Datasets: Twitter and Boards.ie • Experiment 2: Activity Level Prediction – What is the level of activity that seed posts yield? – Experiment 2(a): Model Selection – Experiment 2(b): Feature Assessment • How do features correlate with heightened attention? – Datasets: Twitter, Boards.ie, SCN and Digg Using Behaviour Analysis to Detect Cultural Aspects in 18 Social Web Systems
  • 20. Experiments • Experiment 1: Identifying Seed Posts – Will this post yield a reply – Experiment 1(a): Model Selection • Which model performs best? – Experiment 1(b): Feature Assessment • How do features correlate with seed posts? – Datasets: Twitter and Boards.ie • Experiment 2: Activity Level Prediction – What is the level of activity that seed posts yield? – Experiment 2(a): Model Selection – Experiment 2(b): Feature Assessment • How do features correlate with heightened attention? – Datasets: Twitter, Boards.ie, SCN and Digg Using Behaviour Analysis to Detect Cultural Aspects in 19 Social Web Systems
  • 21. Results: 1(a) Model Selection • Which model performs best? Twitter Boards.ie Using Behaviour Analysis to Detect Cultural Aspects in 20 Social Web Systems
  • 22. Results: 1(b) Feature Assessment • How do features correlate with seed posts? Using Behaviour Analysis to Detect Cultural Aspects in 21 Social Web Systems
  • 23. Results: 1(b) Feature Assessment Twitter Boards.ie Using Behaviour Analysis to Detect Cultural Aspects in 22 Social Web Systems
  • 24. Experiments • Experiment 1: Identifying Seed Posts – Will this post yield a reply – Experiment 1(a): Model Selection • Which model performs best? – Experiment 1(b): Feature Assessment • How do features correlate with seed posts? – Datasets: Twitter and Boards.ie • Experiment 2: Activity Level Prediction – What is the level of activity that seed posts yield? – Experiment 2(a): Model Selection – Experiment 2(b): Feature Assessment • How do features correlate with heightened attention? – Datasets: Twitter, Boards.ie, SCN and Digg Using Behaviour Analysis to Detect Cultural Aspects in 23 Social Web Systems
  • 25. Activity Distribution Twitter Boards.ie 1. Predict a ranking 2. Compare ranking against ground truth 3. Measure using Normalised Discounted Cumulative Gain @ varying ranks (k) • k={1,5,10,20,50,100} 4. Best model: highest nDCG averaged over k SCN Digg Using Behaviour Analysis to Detect Cultural Aspects in 24 Social Web Systems
  • 26. Results: 2(a) Model Selection • Which model performs best? Using Behaviour Analysis to Detect Cultural Aspects in 25 Social Web Systems
  • 27. Results: 2(b) Feature Assessment • How do features correlate with heightened attention? Using Behaviour Analysis to Detect Cultural Aspects in 26 Social Web Systems
  • 28. Results: 2(b) Feature Assessment • How do features correlate with heightened attention? • Heightened Activity on Twitter= • Shorter posts • Denser vocabulary • Fewer hyperlinks • Earlier in the day! Using Behaviour Analysis to Detect Cultural Aspects in 27 Social Web Systems
  • 29. Results: 2(b) Feature Assessment • How do features correlate with heightened attention? • Heightened Activity on Boards.ie= • Concentrated topics • Longer posts • Wider vocabulary • Fewer referrals • Negative sentiment Using Behaviour Analysis to Detect Cultural Aspects in 28 Social Web Systems
  • 30. Results: 2(b) Feature Assessment • How do features correlate with heightened attention? • Heightened Activity on SCN= • Less author participation • Contacted fewer people • User contacted by many people • Longer posts • Wider vocabulary • More hyperlinks Using Behaviour Analysis to Detect Cultural Aspects in 29 Social Web Systems
  • 31. Results: 2(b) Feature Assessment • How do features correlate with heightened attention? • Heightened Activity on Digg= • Concentrated topics • Longer posts • Later in the day • Familiar community terms Using Behaviour Analysis to Detect Cultural Aspects in 30 Social Web Systems
  • 32. Generating Attention: Findings How do Social Web Systems differ in how attention is generated? • Commonalities – Fewer hyperlinks for Microblogging platforms and discussion message boards – Use familiar language to the community – Negative content yields more activity – Activity distribution What drives attention in one system is not the • Idiosyncrasies same as another – More hyperlinks on support forums – Lower topic affinity on news-sharing system – Models differ: a) best performing, b) coefficients: • Content: Twitter • User: Boards.ie, SCN • Focus: Digg Anticipating Discussion Activity on Community Forums. M Rowe, S Angeletou and H Alani. The Third IEEE International Conference on Social Computing. Boston, USA. (2011) Using Behaviour Analysis to Detect Cultural Aspects in 31 Social Web Systems
  • 33. Analysis II Behaviour Role Compositions Using Behaviour Analysis to Detect Cultural Aspects in 32 Social Web Systems
  • 34. Online Communities in Social Web Systems • Social Web Systems support online communities to function and grow, enabling: – Idea generation – Customer support – Problem solving • Managing and hosting communities can be – Expensive – Time-consuming • Social Web Systems have large investments, therefore they must: – flourish and remain active – remain… ‘healthy’ Using Behaviour Analysis to Detect Cultural Aspects in 33 Social Web Systems
  • 35. Increased Community Activity What did the community look like at the point? Using Behaviour Analysis to Detect Cultural Aspects in 34 Social Web Systems
  • 36. Decreased Community Activity What were the conditions at this point? Using Behaviour Analysis to Detect Cultural Aspects in 35 Social Web Systems
  • 37. The Need to Assess Behaviour • How can we gauge community health? – Post Count? – Communication/Interaction? – Behaviour? • Domination of one behaviour could lead to churn – Preece, 2000 • Behaviour in online community is influenced by the roles that users assume – Preece, 2001 • To provide health insights we need to monitor behaviour over time – Combined with basic health metrics (e.g. post count) • Enabling detection of how behaviour differs between systems Using Behaviour Analysis to Detect Cultural Aspects in 36 Social Web Systems
  • 38. Modelling, Representing and Tracking Behaviour: How? • Users exhibit different behaviour in different contexts: – How can we model user behaviour and represent its change over time? • According to [Chan et al, 2010] users can be classified by their community role: – What behaviour correlates with community roles? – How can we label users as the system changes? • Communities evolve and change over time: – Is there a correlation between community composition and health? – Can we predict community changes based on composition data? How do Social Web Systems differ in terms of behaviour? Using Behaviour Analysis to Detect Cultural Aspects in 37 Social Web Systems
  • 39. Behaviour Ontology • How can we model user behaviour and represent its change over time? http://purl.org/net/oubo/0.3 Using Behaviour Analysis to Detect Cultural Aspects in 38 Social Web Systems
  • 40. Behaviour Features • In-degree Ratio – Proportion of users that reply to user ui • Posts Replied Ratio – Proportion of posts by ui that yield a reply • Thread Initiation Ratio – Proportion of threads started by ui • Bi-directional Threads Ratio – Proportion of threads where ui is involved in a reciprocal action • Bi-directional Neighbours Ratio – Proportion of ui‘s neighbours with whom a reciprocal action has taken place • Average Posts per Thread – Mean number of posts in the threads that ui has participated in • Standard Deviation of Posts per Thread – Standard deviation of posts in the threads that ui has posted in Using Behaviour Analysis to Detect Cultural Aspects in 39 Social Web Systems
  • 41. Behaviour Roles Elitist Grunt Joining Conversationalist Popular Initiator Popular Participant Supporter Taciturn Jeffrey Chan, Conor Hayes, and Elizabeth Daly. Decomposing discussion forums using common user roles. In Proc. Web Science Ignored Conf. (WebSci10), Raleigh, NC: US, 2010. Using Behaviour Analysis to Detect Cultural Aspects in 40 Social Web Systems
  • 42. Behaviour Roles (2) • What behaviour correlates with community roles? T abl e 1. Roles and t he feat ure-t o-level mappings R ol e Feat ur e L evel E l i t i st I n-D egr ee R at i o l ow B i -di r ect i onal T hr eads R at i o hi gh B i -di r ect i onal N ei ghb our s R at i o l ow G r unt B i -di r ect i onal T hr eads R at i o m ed B i -di r ect i onal N ei ghb our s R at i o m ed A ver age Post s p er T hr ead l ow ST D of Post s p er T hr ead l ow Joi ni ng Conver sat i onal i st T hr ead I ni t i at i on R at i o l ow A ver age Post s p er T hr ead hi gh ST D of Post s p er T hr ead hi gh Popul ar I ni t i at or I n-D egr ee R at i o hi gh T hr ead I ni t i at i on R at i o hi gh Popul ar Par t i ci pant s I n-D egr ee R at i o hi gh T hr ead I ni t i at i on R at i o l ow A ver age Post s p er T hr ead m ed ST D of Post s p er T hr ead m ed Supp or t er I n-D egr ee R at i o m ed B i -di r ect i onal T hr eads R at i o m ed B i -di r ect i onal N ei ghb our s R at i o m ed T aci t ur n B i -di r ect i onal T hr eads R at i o l ow B i -di r ect i onal N ei ghb our s R at i o l ow A ver age Post s p er T hr ead l ow ST D of Post s p er T hr ead l ow I gnor ed Post s R epl i ed R at i o l ow Using Behaviour Analysis to Detect Cultural Aspects in 41 Social Web Systems
  • 43. Constructing and Applying Rules • How can we label users as the system changes? Structural, social network, Feature levels change with the reciprocity, persistence, participation dynamics of the community Run rules over each user’s features Based on related work, we associate and derive the community role composition roles with a collection of feature-to-level mappings e.g. in-degree -> high, out-degree -> high Using Behaviour Analysis to Detect Cultural Aspects in 42 Social Web Systems
  • 44. Composition vs Activity • Is there a correlation between community composition and health? • Community Message board: Boards.ie – All posts used from 2004 – 2006 – Selected 3 forums for analysis • F246: Commuting and Transport • F388: Rugby • F411: Mobile Phones and PDAs • Support Forum: Tiddlywiki – Software development forum used by BT’s development team • Measured at 12-week increments: – Forum composition (% of roles) • E.g. 20% elitists, 10% grunts, etc – Number of posts Using Behaviour Analysis to Detect Cultural Aspects in 43 Social Web Systems
  • 45. Correlation Results (1): Boards.ie Forum 246 – Commuting and Transport Using Behaviour Analysis to Detect Cultural Aspects in 44 Social Web Systems
  • 46. Correlation Results (2): Boards.ie Forum 246 – Commuting Forum 388 – Rugby Forum 411 – Mobile Phones and Transport and PDAs Using Behaviour Analysis to Detect Cultural Aspects in 45 Social Web Systems
  • 47. Correlation Results: Tiddlywiki Using Behaviour Analysis to Detect Cultural Aspects in 46 Social Web Systems
  • 48. Evolution Results (1): Boards.ie Forum 246 – Commuting and Transport Using Behaviour Analysis to Detect Cultural Aspects in 47 Social Web Systems
  • 49. Evolution Results (2): Boards.ie Forum 246 – Commuting Forum 388 – Rugby Forum 411 – Mobile Phones and Transport and PDAs Using Behaviour Analysis to Detect Cultural Aspects in 48 Social Web Systems
  • 50. Evolution Results: Tiddlywiki Using Behaviour Analysis to Detect Cultural Aspects in 49 Social Web Systems
  • 51. Predicting Community Health • Can we predict community changes based on composition data? 1. Activity Change Detection: – Predict either an increase or decrease in activity – Features: roles and percentages – Class label: increase/decrease – Performed 10-fold cross validation with J48 decision tree 2. Post Count Prediction: – Predict post count from role composition – Independent variables: roles and percentages – Dependent variable: post count – Induced linear regression model and assessed the model Using Behaviour Analysis to Detect Cultural Aspects in 50 Social Web Systems
  • 52. Activity Change Detection Boards.ie Tiddlywiki Using Behaviour Analysis to Detect Cultural Aspects in 51 Social Web Systems
  • 53. Post Count Prediction Boards.ie Tiddlywiki Using Behaviour Analysis to Detect Cultural Aspects in 52 Social Web Systems
  • 54. Post Count Prediction Boards.ie Tiddlywiki • Increased Community Activity on Boards.ie = • More initiators • More participants • Less supporters • Fewer ignored Using Behaviour Analysis to Detect Cultural Aspects in 53 Social Web Systems
  • 55. Post Count Prediction Boards.ie • Increased Community Activity on Tiddlywiki = • More conversationalists • More initiators • Fewer supporters • Fewer ignored Tiddlywiki Using Behaviour Analysis to Detect Cultural Aspects in 54 Social Web Systems
  • 56. Clustering Communities by Composition Using Behaviour Analysis to Detect Cultural Aspects in 55 Social Web Systems
  • 57. Behaviour Role Compositions: Findings • How do Social Web Systems differ in terms of behaviour? • Commonalities – No grunts in either system – Increase in ignored users and supporters decreases health – Increase in initiators increases activity • Idiosyncrasies – No elitists found on support-forum – Conversationalists improve activity in certain cases – Optimum behaviour compositions differ Modelling and Analysis of User Behaviour in Online Communities. S Angeletou, M Rowe and H Alani. International Semantic Web Conference. Bonn, Germany. (2011) Using Behaviour Analysis to Detect Cultural Aspects in 56 Social Web Systems
  • 58. Thesis: Microcultures Recap Social Web Systems contain micro-cultures that differ in terms of a) user behaviour b) how attention is generated c) role compositions in such systems Using Behaviour Analysis to Detect Cultural Aspects in 57 Social Web Systems
  • 59. Microcultures: Evidence • Social Web Systems contain micro-cultures that differ in terms of – a) User behaviour • Non-existence of roles in certain communities • Conversation behaviour important in certain communities – b) How attention is generated • Differences in optimum prediction models • Factors differ in driving activity – E.g. referrals, topic affinity – c) Role compositions in such systems • Intra and inter composition differences Using Behaviour Analysis to Detect Cultural Aspects in 58 Social Web Systems
  • 60. Questions? Web: http://people.kmi.open.ac.uk/rowe http://www.matthew-rowe.com Email: m.c.rowe@open.ac.uk Twitter: @mattroweshow Using Behaviour Analysis to Detect Cultural Aspects in 59 Social Web Systems

Hinweis der Redaktion

  1. Myriad social web systems exists, at the heart of such systems is the user: who drives action and contentThese systems differ
  2. Solitary feature sets: Content features produce the best predictive performance on both systems!All features: produces best performance
  3. Twitter:Time in day: no-reply zoneHigher out-degree = more likely to get a replyBoards.ieHigherreferral counts correlated with non-seedsBoth:Lower informativeness correlated with seedsUse language that is familiar to the platform’s usersReadability lower for seeds although harder to see
  4. Twitter: Content quality once again importantBoards.ie: User features more important this time, unlike content beforeSCN: Similar to Boards, user features most importantDigg: focus information of the user most important in this caseTwitter: best model=ContentBoards.ie: best model = content and focusSCN: best model = user and contentDigg: best model = content and focus. Like Boards.ie
  5. In-degree ratio = concentrationPosts Replied ratio = popularityThread initiation ratio = propensity to initiate discussionsBi-directional threads ratio = reciprocity and interactionBi-directional neighbours ratio = reciprocityAverage posts per thread = level of discussionSD of posts per thread = captures variance of discussions
  6. Maintain a mapping between feature and a level (low, mid, high)Enables dynamic derivation of the feature levels
  7. Increase in Elitists and Participants is associated with increased activityUsers who communicate often with other usersIncrease in Taciturns and ignored is associated with decreased activityTaciturns contribute little
  8. Common patterns across all three forums analysedCertain roles more important that others in differing communities:Conversationalists important in commuting and transport and rugby, not in mobile phones and PDAs – conversation not a driving factor in the forumsSupporters found to negatively impact upon activity in forum 411 – again because conversation is not a common action in the community: more interested in support
  9. Increase in Joining Conversationalists and Popular Particiants correlates with increased activityDecrease in supporters and Ignored users correlates with increased activityNo Elitists or Grunts!Lower correlations, behaviour roles that fit well in one system are not the same as another! Different behaviour
  10. Activity increases as the composition reaches a relatively stable settingi.e. little variation and fluctuation in the roles
  11. Composition stability is associated with increased activity in 246 and 411Fluctuation in activity in rugby forum correlated with variation in roles
  12. No Elitists or Grunts!
  13. Best results for 246 – steady increase in activity over timeWorst results for 388 – fluctuation in composition and activity making it hard to perform predictionsCross community patterns are not reliable – idiosyncratic behaviour in each community
  14. 895 (Celebrity & Showbiz) and 452 (For Sale: Computer Hardware)Outliers:7: After hours 47: Motors151: Soccer908: Beer Guts and Receding Hair