From libre software to Wikipedia:
 A tour of open collaboration




Felipe Ortega
Libresoft, Universidad Rey Juan Carlos
e-mail: jfelipe@libresoft.es
Twitter | Identi.ca: @jfelipe

Xerox PARC
June 14, 2011
                                         By Diego Grez, CC-BY-SA 3.0, Wikimedia Commons
© 2011 Felipe Ortega.
                                          Some rights reserved.
                              This document is licensed under a
Creative Commons Attribution-ShareAlike 3.0 Unported License
 (Logos on first slide are (TM) of their respective organizations)
Open collaboration
“Think of how Wikipedia works, how Amazon harnesses
user annotation on its site, the way photo-sharing sites
like Flickr are bleeding out into other applications...
We're entering an era in which software learns from
its users and all of the users are connected”.

Tim O'Reilly.
TIME Magazine, 24 October 2005.




                                                By Felipe Ortega, CC-BY-SA 3.0
In the beginning...


●   ...all started with “real programmers” and FLOSS.
    ●   FSF, GNU, free licenses.
    ●   Open source goes into industry.
    ●   Libre software becomes ubiquitous.
●   However
    ●   Crowdsourced != Open source
    ●   Much better if results encourage reuse and
        distribution of derivative works.
The “paradox” of open collaboration



“Wikipedia is the best thing ever. Anyone in the world can
write anything they want about any subject, so you know
you are getting the best possible information.”

Michael Scott (played by Steve Carell)
The Office, "The Negotiation" [3.18], 5 April 2007
3 lessons from libre software



●   Onion model.
●   Generational relay.
●   Lasting participation.
                                By El_T, Public Domain, from Wikimedia Commons
Onion model

The Social Structure of Free and Open Source Software Development
Crowston & Howison, 2005
Generational relay




      Robles, González-Barahona.
      Contributor Turnover in Libre Software Projects.
      OSS 2006.
Lasting participation


●   Robles, González-Barahona and Michlmayr.
    Evolution of Volunteer Participation in Libre Software
    Projects: Evidence from Debian. OSS 2005.



    Half-life ratio = 7.5 years!


More than 50% of the maintainers in Debian 2.0 were still present in Debian 3.1
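As a rough illustration of what a half-life figure means, the sketch below derives one from a single observation: the fraction of a cohort still active after a given number of years. It assumes simple exponential decay, which is an assumption made here for illustration and not necessarily the method used in the cited paper. The function name and the sample inputs are hypothetical.

```python
import math


def half_life(elapsed_years, surviving_fraction):
    """Estimate the half-life of a cohort, ASSUMING exponential decay.

    elapsed_years: time between the two observations of the cohort.
    surviving_fraction: share of the original cohort still active
        at the second observation (strictly between 0 and 1).
    """
    if not 0.0 < surviving_fraction < 1.0:
        raise ValueError("surviving_fraction must be strictly between 0 and 1")
    # N(t) = N0 * 2 ** (-t / T)  =>  T = t * ln(2) / ln(1 / fraction)
    return elapsed_years * math.log(2) / -math.log(surviving_fraction)


if __name__ == "__main__":
    # Hypothetical input: a quarter of the cohort is left after 10 years.
    print(half_life(10.0, 0.25))
```

Under this model, a cohort in which exactly half of the members remain after a given period has a half-life equal to that period.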
Thesis. Wikipedia: A quantitative
analysis.

●   Apply lessons from libre software to understand
    the open collaborative process in Wikipedia.
    ●   Content production.
    ●   Effort distribution.
    ●   Implications for quality.
    ●   Participation and sustainability.
Tool: WikiXRay

Automated analysis of Wikipedia dumps.
http://git.libresoft.es/WikiXRay




[Diagram: WikiXRay workflow. Stages shown: Wikimedia Download Center;
compressed DB dumps; download dumps (WikiXRay); local MySQL server;
preparation for data mining; analysis (scripts + GNU R); results evaluation.]
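A minimal sketch of the kind of first step such a workflow involves: streaming through a MediaWiki XML dump and counting revisions per contributor. This is not WikiXRay's code. The function names are hypothetical, the sample document is invented, and the sketch skips the MySQL loading stage entirely. It relies only on the standard-library `xml.etree.ElementTree.iterparse`.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Drop the XML namespace prefix, e.g. '{uri}revision' -> 'revision'."""
    return tag.rsplit("}", 1)[-1]


def revisions_per_contributor(xml_stream):
    """Count revisions per username (or IP for anonymous edits)."""
    counts = Counter()
    for _event, elem in ET.iterparse(xml_stream, events=("end",)):
        if local_name(elem.tag) != "revision":
            continue
        for child in elem.iter():
            if local_name(child.tag) in ("username", "ip"):
                counts[child.text] += 1
                break
        # Free the revision's subtree so large dumps stay manageable.
        elem.clear()
    return counts


# Invented sample in the general shape of a MediaWiki export (the
# namespace URI is an assumption; the code ignores it anyway).
SAMPLE = """<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision><contributor><username>Alice</username></contributor></revision>
    <revision><contributor><ip>192.0.2.1</ip></contributor></revision>
    <revision><contributor><username>Alice</username></contributor></revision>
  </page>
</mediawiki>"""

if __name__ == "__main__":
    print(revisions_per_contributor(io.BytesIO(SAMPLE.encode("utf-8"))))
```

For a real compressed dump, a file object from `bz2.open(path, "rb")` could be passed in place of the in-memory sample.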
New articles created in Wikipedia




                Entered steady state in 2006,
                before the graph of monthly edits
                    became stable (2007)
Interaction: talk pages

[Figure: stacked bar chart, 0% to 100%, talk vs. no-talk, for the
EN, DE, FR, PL, JA, NL, IT, PT, ES and SV language editions.
Annotation: 0.0086% (old talk pages deleted).]
Contributions per editor

                    ●   Upper truncated Pareto
                        distribution.
                    ●   Limit in max. number of
                        revisions by human
                        editors.
                    ●   Better to have more
                        editors rather than
                        increasing contributions
                        per editor.
Effort distribution: Gini coefficient
Monthly effort distribution Wikipedia




                   Constant over the whole history!
              Ortega, F., González-Barahona, J., Robles, G.
              On the inequality of contributions to Wikipedia.
              HICSS 2008.
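For readers unfamiliar with the measure, the Gini coefficient of monthly contributions can be computed as sketched below. This is a generic textbook formulation, not code from the cited paper. The function names and the (month, editor) input format are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def gini(contributions):
    """Gini coefficient of non-negative contribution counts.

    0.0 means every editor contributed equally; values close to 1.0
    mean a few editors made almost all contributions.
    """
    values = sorted(contributions)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted form: G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n


def monthly_gini(revisions):
    """Map each month to the Gini coefficient of revisions per editor.

    revisions: iterable of (month, editor) pairs, one pair per revision.
    """
    per_month = defaultdict(Counter)
    for month, editor in revisions:
        per_month[month][editor] += 1
    return {m: gini(list(c.values())) for m, c in sorted(per_month.items())}


if __name__ == "__main__":
    # Hypothetical revision log.
    log = [("2006-01", "a"), ("2006-01", "a"), ("2006-01", "a"),
           ("2006-01", "b"), ("2006-02", "a"), ("2006-02", "b")]
    print(monthly_gini(log))
```

Computing the coefficient per month, instead of once over the whole history, is what allows checking whether inequality stays constant over time.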
Profile of editors in Featured Articles

●   Most Featured Articles are at least 1,000 days old.
●   10 times more editors in FAs than in non-FAs,
    almost 200 times in EN (!!).
●   FAs are reviewed by significantly more experienced authors
    (more than 3 years actively contributing to Wikipedia).


[Figure panels: FAs vs. non-FAs]
The Digital Potlatch


●   Book with J. Rodríguez (in Spanish).
    ●   Ed. Cátedra, expected September 2011.
●   Interdisciplinary.
    ●   Anthropology + Engineering.
●   Meritocracy in Wikipedia.
●   Effort recognition.
●   Motivations.
●   Implications for quality.
                                        Public Domain, from Wikimedia Commons
Future lines of work


●   Study causes of change in
    evolution patterns and reverts.
    ●   “The singularity is not near”
        ASC @PARC, WikiSym 2009.
                          By Bios, CC-BY-SA 3.0, from Wikimedia Commons
●   Edit diffs to study contribution patterns.
●   Different types of content.
●   Cross-relation with traffic patterns.
