SlideShare a Scribd company logo
1 of 24
<is web>                         Information Systems & Semantic Web
                              University of Koblenz ▪ Landau, Germany




 An Epistemic Dynamic Model for Tagging Systems

             Klaas Dellschaft and Steffen Staab
              {klaasd, staab}@uni-koblenz.de
Delicious Tagging Interface                                             <is web>




 How do users influence each other in tagging systems?

  Hypothesis: Tagging involves imitation of other users AND
  selection of tags from background knowledge of users.

 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   2 of 24
Understanding Dynamics in Tagging Systems                              <is web>
                 User interface              Something else?


                                                                       Tagging
                                Conceptualization                      Behavior




                                                                          Comparison
                                                    Shared




                                                                          of Statistics
              Own Background
                Knowledge                         terminology


        Model of User Interface Influence

                                                                        Simulated
                               Joint Stochastic Model                    Tagging
                                                                        Behavior


            Model of Own                        Model of Sharing
        Background Knowledge
ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
& Semantic Web                klaasd@uni-koblenz.de   3 of 24
Folksonomies                                                            <is web>
 Vertexes: Users, tags, resources
 Hyperedges: Tag assignments (user X tag X resource)
 Postings:
    Tag assignments of a user to a single resource
    Can be ordered according to their time-stamp




 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   4 of 24
Co-occurrence Streams                                                       <is web>
 Co-occurrence Streams:
    All tags co-occurring with a given tag in a posting
    Ordered by posting time
 Example tag assignments for ‘ajax':
    {mackz, r1, {ajax, javascript}, 13:25}
    {klaasd, r2, {ajax, rss, web2.0}, 13:26}
    {mackz, r2, {ajax, php, javascript}, 13:27}
 Resulting co-occurrence stream:
           javascript rss web2.0 php javascript
            time

              Tag              |Y|            |U|            |T|           |R|
              ajax      2.949.614          88.526         41.898         71.525
              blog      6.098.471          158.578       186.043         557.017
              xml        974.866           44.326         31.998         61.843
 ISWeb - Information Systems    Klaas Dellschaft        Hypertext 2008
 & Semantic Web                 klaasd@uni-koblenz.de   5 of 24
Co-occurrence Streams – Tag Growth                                      <is web>

                         Linear scaling of axes




                                 Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   6 of 24
Co-occurrence Streams – Tag Frequencies                                 <is web>




                                 Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   7 of 24
Resource Streams                                                        <is web>
 Resource Streams:
    All tags assigned to a resource
    Ordered by posting time
 Example tag assignments:
    {mackz, r1, {ajax, javascript}, 13:25}
    {klaasd, r2, {ajax, rss, web2.0}, 13:26}
    {mackz, r2, {ajax, php, javascript}, 13:27}
 Resulting resource stream:

             ajax rss web2.0                    ajax php javascript
            time




 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   8 of 24
Resource Streams – Tag Frequencies                                      <is web>




                                 Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   9 of 24
Resource Streams – Tag Frequencies                                      <is web>




                    Logarithmic scaling of axes



                Taken from Halpin, Robu and Shepard, WWW 2007
 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   10 of 24
Existing Models                                                             <is web>


                                              Frequency Rank
                                Co-occur. Streams       Resource Streams     Tag Growth
 Polya Urn Model                          O                       O           Fixed size
 Simon Model                              O                       O            Linear
 YS Model w/ Memory                       +                       O            Linear

 Halpin et al. Model                      O                       O            Linear
 Observed Properties           Long tail w/ cut-off     Drop around tag 7     Sublinear




 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   11 of 24
Stochastic Models of Influence                                          <is web>
                  User interface              Something else?


                                                                        Tagging
                                 Conceptualization                      Behavior




                                                                           Comparison
                                                     Shared




                                                                           of Statistics
               Own Background
                 Knowledge                         terminology


         Model of User Interface Influence

                                                                         Simulated
                                Joint Stochastic Model                    Tagging
                                                                         Behavior


             Model of Own                        Model of Sharing
         Background Knowledge
 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   12 of 24
Modeling Tag Imitation                                                  <is web>




 Delicious Tagging Interface
    Imitation of previous tag assignments
    Free input of new tags (taken from background knowledge of a user)

 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   13 of 24
Simulating a Tag Stream                                                   <is web>
 Start with empty tag stream
 Each simulation step appends a new tag assignment:


                                 p(w|t): Probability of selecting word w for topic t.
                                 Modeled by word distributions in a topic centered
                                 text corpus.
                  K
              PB



               1-
                  P
                   BK


                                 n: Number of visible previous tags.
                                 h: Maximal number of previous tag assignments
                                 used for determining ranking of the n distinct tags.



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   14 of 24
Modeling Background Knowledge                                           <is web>




     Text Corpora                                       Del.icio.us
                               Text Corpora


 PBK: Probability of selecting from background knowledge
    p(w|t): Probability of selecting word w for topic t. Modeled by
     word distributions in a topic centered text corpus.
    p(w|r): Probability of selecting word w for resource r.
 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   15 of 24
Modeling Tag Imitation                                                              <is web>



                     PBK
                                         t      t-1    t-2    t-3   t-4   t-5   …   t-h   …



                                             1-PBK

                                     1   2     3 … n



 PI = 1 - PBK: Probability of imitating a previous tag assignment
    n: Number of visible top-ranked tags
    h: Maximal number of previous tag assignments used for determining
      ranking of the n distinct tags


 ISWeb - Information Systems   Klaas Dellschaft              Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de         16 of 24
Simulation Results                                                      <is web>
                  User interface              Something else?


                                                                        Tagging
                                 Conceptualization                      Behavior




                                                                           Comparison
                                                     Shared




                                                                           of Statistics
               Own Background
                 Knowledge                         terminology


         Model of User Interface Influence

                                                                         Simulated
                                Joint Stochastic Model                    Tagging
                                                                         Behavior


             Model of Own                        Model of Sharing
         Background Knowledge
 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   17 of 24
Simulating Co-occurrence Streams                                        <is web>
 Tag growth:
    Influenced by PBK and p(w|t)

 Tag Frequencies:
    Influenced by PBK, p(w|t), n, h
    n: Semantic breadth of a topic (blog: 100 tags,
     ajax: 50 tags, xml: 50 tags; Cattuto et al. 2007)
    h: No hint for realistic values. Good guess may be a value
     around 1000.




 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   18 of 24
Co-occ. Streams – Simulated Tag Growth                                        <is web>

PI: Imitation                                PI           PBK

PBK: Background Knowl.




                                                       Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft           Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de      19 of 24
Co-occ. Streams – Simulated Tag Frequencies <is                                    web>

PI: Imitation
                                                                        PI   PBK
PBK: Background Knowl.                                                  PI   PBK
                                                                        PI   PBK

n: Visible Tags

h: Available History




                                      Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   20 of 24
Simulating Ajax, Blog and XML                                                 <is web>

PI: Imitation                                                                  (PI

                                                                              (PI
PBK: Background Knowl.
                                                                               (PI
n: Visible Tags

h: Available History




                                                                                        blog


                                                                                     ajax
Observation
                                                                        xml
Simulation
                                           Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   21 of 24
Res. Streams – Simulated Tag Frequencies                                     <is web>

PI: Imitation
                                                                        PI   PBK
                                                                        PI   PBK
PBK: Background Knowl.

n: Visible Tags

h: Available History




                                      Logarithmic scaling of axes



 ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
 & Semantic Web                klaasd@uni-koblenz.de   22 of 24
Conclusions                                                                <is web>
• Imitation AND background knowledge are needed for
  explaining properties of tag streams
• Probability of imitating previous tag assignments:
  ~60-90%
                                             Frequency Rank
                               Co-occur. Streams       Resource Streams     Tag Growth
 Polya Urn Model                         O                       O           Fixed size
 Simon Model                             O                       O            Linear
 YS Model w/ Memory                      +                       O            Linear

 Halpin et al. Model                     O                       O            Linear

 Epistemic Model                         +                       +           Sublinear
 Observed Properties          Long tail w/ cut-off     Drop around tag 7     Sublinear

ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
& Semantic Web                klaasd@uni-koblenz.de   23 of 24
<is web>




                     Data sets and simulation software:
            http://isweb.uni-koblenz.de/Research/Tagdataset




                              http://www.tagora-project.eu


ISWeb - Information Systems   Klaas Dellschaft        Hypertext 2008
& Semantic Web                klaasd@uni-koblenz.de   24 of 24

More Related Content

Viewers also liked

Streaming architecture zx_dec2015
Streaming architecture zx_dec2015Streaming architecture zx_dec2015
Streaming architecture zx_dec2015Zhenzhong Xu
 
04. TCI Future Prosperity 2050 (2)
04. TCI Future Prosperity 2050 (2)04. TCI Future Prosperity 2050 (2)
04. TCI Future Prosperity 2050 (2)Richard Plumpton
 
Jury decision-making
Jury decision-makingJury decision-making
Jury decision-makingAmanda Moore
 
TCI_DangerousDegrees_print
TCI_DangerousDegrees_printTCI_DangerousDegrees_print
TCI_DangerousDegrees_printRichard Plumpton
 
Noa Noa Journal Autumn 2012
Noa Noa Journal Autumn 2012Noa Noa Journal Autumn 2012
Noa Noa Journal Autumn 2012NoaNoa1
 
League Of Nations Revision
League Of Nations RevisionLeague Of Nations Revision
League Of Nations RevisionLungi Holl
 
Vitrue deck sales_case_studynestle
Vitrue deck sales_case_studynestleVitrue deck sales_case_studynestle
Vitrue deck sales_case_studynestlepbrady459
 
Course delivery
Course deliveryCourse delivery
Course deliverygodbeyb
 
The model of perfect competition
The model of perfect competitionThe model of perfect competition
The model of perfect competitionLin Zaw
 
Peluang Biznis HiGOAT
Peluang Biznis HiGOATPeluang Biznis HiGOAT
Peluang Biznis HiGOATImpian Hari
 
Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...
Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...
Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...Klaas Dellschaft
 
Berbicara dialektik
Berbicara dialektikBerbicara dialektik
Berbicara dialektikindhria
 

Viewers also liked (18)

Streaming architecture zx_dec2015
Streaming architecture zx_dec2015Streaming architecture zx_dec2015
Streaming architecture zx_dec2015
 
04. TCI Future Prosperity 2050 (2)
04. TCI Future Prosperity 2050 (2)04. TCI Future Prosperity 2050 (2)
04. TCI Future Prosperity 2050 (2)
 
Jury decision-making
Jury decision-makingJury decision-making
Jury decision-making
 
Asaco pla salut 2013
Asaco pla salut 2013Asaco pla salut 2013
Asaco pla salut 2013
 
TCI_DangerousDegrees_print
TCI_DangerousDegrees_printTCI_DangerousDegrees_print
TCI_DangerousDegrees_print
 
Networking Basics
Networking BasicsNetworking Basics
Networking Basics
 
Noa Noa Journal Autumn 2012
Noa Noa Journal Autumn 2012Noa Noa Journal Autumn 2012
Noa Noa Journal Autumn 2012
 
Implants
ImplantsImplants
Implants
 
League Of Nations Revision
League Of Nations RevisionLeague Of Nations Revision
League Of Nations Revision
 
Vitrue deck sales_case_studynestle
Vitrue deck sales_case_studynestleVitrue deck sales_case_studynestle
Vitrue deck sales_case_studynestle
 
Course delivery
Course deliveryCourse delivery
Course delivery
 
Idea 1
Idea 1Idea 1
Idea 1
 
The model of perfect competition
The model of perfect competitionThe model of perfect competition
The model of perfect competition
 
THS
THSTHS
THS
 
Peluang Biznis HiGOAT
Peluang Biznis HiGOATPeluang Biznis HiGOAT
Peluang Biznis HiGOAT
 
Presentation1
Presentation1Presentation1
Presentation1
 
Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...
Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...
Measuring the Influence of Tag Recommenders on the Indexing Quality in Taggin...
 
Berbicara dialektik
Berbicara dialektikBerbicara dialektik
Berbicara dialektik
 

Similar to An Epistemic Dynamic Model for Tagging Systems

Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Artificial Intelligence Institute at UofSC
 
Swt infontology and ambient intelligence
Swt infontology and ambient intelligenceSwt infontology and ambient intelligence
Swt infontology and ambient intelligencekeith scharding
 
SAF 2008 - Analysis and Architecture
SAF 2008 - Analysis  and ArchitectureSAF 2008 - Analysis  and Architecture
SAF 2008 - Analysis and Architecturemhessinger
 
Web standards, why care?
Web standards, why care?Web standards, why care?
Web standards, why care?Thomas Roessler
 
Data APIs as a Foundation for Systems of Engagement
Data APIs as a Foundation for Systems of EngagementData APIs as a Foundation for Systems of Engagement
Data APIs as a Foundation for Systems of EngagementVictor Olex
 
Swt Infontology
Swt InfontologySwt Infontology
Swt Infontologyguest95d86
 
[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...
[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...
[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...Shreeraj Shah
 
Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...
Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...
Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...Kai Wähner
 
What’s New in the Berkeley Data Analytics Stack
What’s New in the Berkeley Data Analytics StackWhat’s New in the Berkeley Data Analytics Stack
What’s New in the Berkeley Data Analytics StackTuri, Inc.
 
Scalable Social Architectures by Biren Gandhi
Scalable Social Architectures by Biren GandhiScalable Social Architectures by Biren Gandhi
Scalable Social Architectures by Biren GandhiBiren Gandhi
 
Cross platform mobile web apps
Cross platform mobile web appsCross platform mobile web apps
Cross platform mobile web appsJames Pearce
 
Graph Data: a New Data Management Frontier
Graph Data: a New Data Management FrontierGraph Data: a New Data Management Frontier
Graph Data: a New Data Management FrontierDemai Ni
 
Get Started with JavaScript Frameworks
Get Started with JavaScript FrameworksGet Started with JavaScript Frameworks
Get Started with JavaScript FrameworksChristian Gaetano
 
Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)
Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)
Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)Shreeraj Shah
 
Web2.0: Integration issues
Web2.0: Integration issuesWeb2.0: Integration issues
Web2.0: Integration issueshnzz pronk
 
Ria Meets Enteprise SOA
Ria Meets Enteprise SOARia Meets Enteprise SOA
Ria Meets Enteprise SOAschennamaraja
 
How to Build Modern Data Architectures Both On Premises and in the Cloud
How to Build Modern Data Architectures Both On Premises and in the CloudHow to Build Modern Data Architectures Both On Premises and in the Cloud
How to Build Modern Data Architectures Both On Premises and in the CloudVMware Tanzu
 
[DSBW Spring 2009] Unit 01: Introducing Web Engineering
[DSBW Spring 2009] Unit 01: Introducing Web Engineering[DSBW Spring 2009] Unit 01: Introducing Web Engineering
[DSBW Spring 2009] Unit 01: Introducing Web EngineeringCarles Farré
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016StampedeCon
 

Similar to An Epistemic Dynamic Model for Tagging Systems (20)

Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
Swt infontology and ambient intelligence
Swt infontology and ambient intelligenceSwt infontology and ambient intelligence
Swt infontology and ambient intelligence
 
SAF 2008 - Analysis and Architecture
SAF 2008 - Analysis  and ArchitectureSAF 2008 - Analysis  and Architecture
SAF 2008 - Analysis and Architecture
 
Web standards, why care?
Web standards, why care?Web standards, why care?
Web standards, why care?
 
Data APIs as a Foundation for Systems of Engagement
Data APIs as a Foundation for Systems of EngagementData APIs as a Foundation for Systems of Engagement
Data APIs as a Foundation for Systems of Engagement
 
Swt Infontology
Swt InfontologySwt Infontology
Swt Infontology
 
[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...
[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...
[Infosecworld 08 Orlando] New Defenses for .NET Web Apps: IHttpModule in Prac...
 
Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...
Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...
Enterprise Integration Patterns Revisited (again) for the Era of Big Data, In...
 
What’s New in the Berkeley Data Analytics Stack
What’s New in the Berkeley Data Analytics StackWhat’s New in the Berkeley Data Analytics Stack
What’s New in the Berkeley Data Analytics Stack
 
Scalable Social Architectures by Biren Gandhi
Scalable Social Architectures by Biren GandhiScalable Social Architectures by Biren Gandhi
Scalable Social Architectures by Biren Gandhi
 
Cross platform mobile web apps
Cross platform mobile web appsCross platform mobile web apps
Cross platform mobile web apps
 
Graph Data: a New Data Management Frontier
Graph Data: a New Data Management FrontierGraph Data: a New Data Management Frontier
Graph Data: a New Data Management Frontier
 
Get Started with JavaScript Frameworks
Get Started with JavaScript FrameworksGet Started with JavaScript Frameworks
Get Started with JavaScript Frameworks
 
Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)
Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)
Next Generation Web Attacks – HTML 5, DOM(L3) and XHR(L2)
 
Web2.0: Integration issues
Web2.0: Integration issuesWeb2.0: Integration issues
Web2.0: Integration issues
 
Ria Meets Enteprise SOA
Ria Meets Enteprise SOARia Meets Enteprise SOA
Ria Meets Enteprise SOA
 
How to Build Modern Data Architectures Both On Premises and in the Cloud
How to Build Modern Data Architectures Both On Premises and in the CloudHow to Build Modern Data Architectures Both On Premises and in the Cloud
How to Build Modern Data Architectures Both On Premises and in the Cloud
 
[DSBW Spring 2009] Unit 01: Introducing Web Engineering
[DSBW Spring 2009] Unit 01: Introducing Web Engineering[DSBW Spring 2009] Unit 01: Introducing Web Engineering
[DSBW Spring 2009] Unit 01: Introducing Web Engineering
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
Unit 01 - Introduction
Unit 01 - IntroductionUnit 01 - Introduction
Unit 01 - Introduction
 

Recently uploaded

Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 

Recently uploaded (20)

Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 

An Epistemic Dynamic Model for Tagging Systems

  • 1. <is web> Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany An Epistemic Dynamic Model for Tagging Systems Klaas Dellschaft and Steffen Staab {klaasd, staab}@uni-koblenz.de
  • 2. Delicious Tagging Interface <is web>  How do users influence each other in tagging systems?  Hypothesis: Tagging involves imitation of other users AND selection of tags from background knowledge of users. ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 2 of 24
  • 3. Understanding Dynamics in Tagging Systems <is web> User interface Something else? Tagging Conceptualization Behavior Comparison Shared of Statistics Own Background Knowledge terminology Model of User Interface Influence Simulated Joint Stochastic Model Tagging Behavior Model of Own Model of Sharing Background Knowledge ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 3 of 24
  • 4. Folksonomies <is web>  Vertexes: Users, tags, resources  Hyperedges: Tag assignments (user X tag X resource)  Postings:  Tag assignments of a user to a single resource  Can be ordered according to their time-stamp ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 4 of 24
  • 5. Co-occurrence Streams <is web>  Co-occurrence Streams:  All tags co-occurring with a given tag in a posting  Ordered by posting time  Example tag assignments for ‘ajax':  {mackz, r1, {ajax, javascript}, 13:25}  {klaasd, r2, {ajax, rss, web2.0}, 13:26}  {mackz, r2, {ajax, php, javascript}, 13:27}  Resulting co-occurrence stream: javascript rss web2.0 php javascript time Tag |Y| |U| |T| |R| ajax 2.949.614 88.526 41.898 71.525 blog 6.098.471 158.578 186.043 557.017 xml 974.866 44.326 31.998 61.843 ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 5 of 24
  • 6. Co-occurrence Streams – Tag Growth <is web> Linear scaling of axes Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 6 of 24
  • 7. Co-occurrence Streams – Tag Frequencies <is web> Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 7 of 24
  • 8. Resource Streams <is web>  Resource Streams:  All tags assigned to a resource  Ordered by posting time  Example tag assignments:  {mackz, r1, {ajax, javascript}, 13:25}  {klaasd, r2, {ajax, rss, web2.0}, 13:26}  {mackz, r2, {ajax, php, javascript}, 13:27}  Resulting resource stream: ajax rss web2.0 ajax php javascript time ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 8 of 24
  • 9. Resource Streams – Tag Frequencies <is web> Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 9 of 24
  • 10. Resource Streams – Tag Frequencies <is web> Logarithmic scaling of axes Taken from Halpin, Robu and Shepard, WWW 2007 ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 10 of 24
  • 11. Existing Models <is web> Frequency Rank Co-occur. Streams Resource Streams Tag Growth Polya Urn Model O O Fixed size Simon Model O O Linear YS Model w/ Memory + O Linear Halpin et al. Model O O Linear Observed Properties Long tail w/ cut-off Drop around tag 7 Sublinear ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 11 of 24
  • 12. Stochastic Models of Influence <is web> User interface Something else? Tagging Conceptualization Behavior Comparison Shared of Statistics Own Background Knowledge terminology Model of User Interface Influence Simulated Joint Stochastic Model Tagging Behavior Model of Own Model of Sharing Background Knowledge ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 12 of 24
  • 13. Modeling Tag Imitation <is web>  Delicious Tagging Interface  Imitation of previous tag assignments  Free input of new tags (taken from background knowledge of a user) ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 13 of 24
  • 14. Simulating a Tag Stream <is web>  Start with empty tag stream  Each simulation step appends a new tag assignment: p(w|t): Probability of selecting word w for topic t. Modeled by word distributions in a topic centered text corpus. K PB 1- P BK n: Number of visible previous tags. h: Maximal number of previous tag assignments used for determining ranking of the n distinct tags. ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 14 of 24
  • 15. Modeling Background Knowledge <is web> Text Corpora Del.icio.us Text Corpora  PBK: Probability of selecting from background knowledge  p(w|t): Probability of selecting word w for topic t. Modeled by word distributions in a topic centered text corpus.  p(w|r): Probability of selecting word w for resource r. ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 15 of 24
  • 16. Modeling Tag Imitation <is web> PBK t t-1 t-2 t-3 t-4 t-5 … t-h … 1-PBK 1 2 3 … n  PI = 1 - PBK: Probability of imitating a previous tag assignment  n: Number of visible top-ranked tags  h: Maximal number of previous tag assignments used for determining ranking of the n distinct tags ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 16 of 24
  • 17. Simulation Results <is web> User interface Something else? Tagging Conceptualization Behavior Comparison Shared of Statistics Own Background Knowledge terminology Model of User Interface Influence Simulated Joint Stochastic Model Tagging Behavior Model of Own Model of Sharing Background Knowledge ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 17 of 24
  • 18. Simulating Co-occurrence Streams <is web>  Tag growth:  Influenced by PBK and p(w|t)  Tag Frequencies:  Influenced by PBK, p(w|t), n, h  n: Semantic breadth of a topic (blog: 100 tags, ajax: 50 tags, xml: 50 tags; Cattuto et al. 2007)  h: No hint for realistic values. Good guess may be a value around 1000. ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 18 of 24
  • 19. Co-occ. Streams – Simulated Tag Growth <is web> PI: Imitation PI PBK PBK: Background Knowl. Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 19 of 24
  • 20. Co-occ. Streams – Simulated Tag Frequencies <is web> PI: Imitation PI PBK PBK: Background Knowl. PI PBK PI PBK n: Visible Tags h: Available History Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 20 of 24
  • 21. Simulating Ajax, Blog and XML <is web> PI: Imitation (PI (PI PBK: Background Knowl. (PI n: Visible Tags h: Available History blog ajax Observation xml Simulation Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 21 of 24
  • 22. Res. Streams – Simulated Tag Frequencies <is web> PI: Imitation PI PBK PI PBK PBK: Background Knowl. n: Visible Tags h: Available History Logarithmic scaling of axes ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 22 of 24
  • 23. Conclusions <is web> • Imitation AND background knowledge are needed for explaining properties of tag streams • Probability of imitating previous tag assignments: ~60-90% Frequency Rank Co-occur. Streams Resource Streams Tag Growth Polya Urn Model O O Fixed size Simon Model O O Linear YS Model w/ Memory + O Linear Halpin et al. Model O O Linear Epistemic Model + + Sublinear Observed Properties Long tail w/ cut-off Drop around tag 7 Sublinear ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 23 of 24
  • 24. <is web> Data sets and simulation software: http://isweb.uni-koblenz.de/Research/Tagdataset http://www.tagora-project.eu ISWeb - Information Systems Klaas Dellschaft Hypertext 2008 & Semantic Web klaasd@uni-koblenz.de 24 of 24

Editor's Notes

  1. Hier durch Bild aus Steffens Folie ersetzen?