SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Intensive Metrics for the Study of the Evolution
of Open Source Projects: Case Studies from the
ASF
Santiago Gala-Pérez (ASF), Gregorio Robles (URJC),
Jesús M. González-Barahona (URJC), Israel Herraiz (UPM)

10th Working Conference on Mining Software Repositories
SF, California, May 18th, 2013
Preprint available at http://oa.upm.es/14698/
Slides at http://slideshare.net/herraiz/intensive-metrics-software-evolution

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

1/13
Metrics for Software Evolution

Common metrics are extensive
Difficult to compare projects of different size
Successful projects undergo large size changes over their lifetime
Intensive metrics in natural sciences
Metrics not depending on the size of system
Scale invariant

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

2/13
Metrics for Software Evolution

Common metrics are extensive
Difficult to compare projects of different size
Successful projects undergo large size changes over their lifetime
Intensive metrics in natural sciences
Metrics not depending on the size of system
Scale invariant
Are there any intensive metric for software?
Can we find intensive metrics to study software evolution?

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

2/13
The case of the Apache Software Foundation
ASF members mailing list, November 29 2008
Joe Schaeffer says
something IMO interesting about the ASF: the fact that the number of
commits and the number of mailing list posts have grown in linear
relationship [...] over the years.

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

3/13
Goal of the paper
Ratio Communication flow / development activity
Hypothesis: the ratio is an intensive metric for software evolution
It varies with
Maturity, technology, community composition

But not with project source code size

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

4/13
Goal of the paper
Ratio Communication flow / development activity
Hypothesis: the ratio is an intensive metric for software evolution
It varies with
Maturity, technology, community composition

But not with project source code size
Case study: the ASF
Broad and diverse range of projects
Size, scope, technology, maturity

If it didn’t happen on-list, it didn’t happen
Communications between developers (decisions)
Issue trackers
Code review tools, automated builds, wiki page edits
Commits
,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

4/13
ASF projects under study
Project
HTTPD
APR
Lucene
Turbine
Tomcat
Jackrabbit
Hadoop
Geronimo
SpamAssassin
Portals
Beehive

,

kSLOC
156
66
414
41
213
344
1270
370
54
202
88

Technology
Web server
Library
Index & search
Java web fwork
Servlet API
JSR-170 ref. impl.
Big Data
JavaEE app. srv.
Spam filter
Web fwork
J2EE Struts

Maturity
Active, long-lived
Active, long-lived
Active, long-lived
Stagnated
Active, long-lived
Active
Very active
Active, long-lived
Mature
Nearly dead
Attic

Intensive metrics for open source evolution – http://oa.upm.es/14698/

Scope
Users
Devs
Users
Devs
Devs
Devs
Devs
Devs
End users
Devs
Devs

5/13
ASF projects under study
Project
HTTPD
APR
Lucene
Turbine
Tomcat
Jackrabbit
Hadoop
Geronimo
SpamAssassin
Portals
Beehive

kSLOC
156
66
414
41
213
344
1270
370
54
202
88

Technology
Web server
Library
Index & search
Java web fwork
Servlet API
JSR-170 ref. impl.
Big Data
JavaEE app. srv.
Spam filter
Web fwork
J2EE Struts

Maturity
Active, long-lived
Active, long-lived
Active, long-lived
Stagnated
Active, long-lived
Active
Very active
Active, long-lived
Mature
Nearly dead
Attic

Scope
Users
Devs
Users
Devs
Devs
Devs
Devs
Devs
End users
Devs
Devs

Ratio
What’s the ratio evolution for these projects?

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

5/13
Apache httpd
156 kSLOC, active and long lived web server

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

6/13
Apache Portable Runtime (APR)
66 kSLOC, active and long lived library used by httpd and Subversion

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

7/13
Apache Hadoop
1270 kSLOC, very active development and community, higher presence of
non-human emails

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

8/13
Apache SpamAssassin
54 kSLOC, spam filter, intended for end users, maturing project

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

9/13
Apache Beehive
88 kSLOC, project in the Attic (no longer under development)

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

10/13
Overall comparison
Allows for comparison of projects with large differences in size, scope,
technology, maturity

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

11/13
Overall comparison
Lessons learned
Healthy Apache projects have smooth ratios
Projects with little activity, or small core group, are noisier
Peaks to infinity are evidence of stagnation

,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

12/13
Overall comparison
Lessons learned
Healthy Apache projects have smooth ratios
Projects with little activity, or small core group, are noisier
Peaks to infinity are evidence of stagnation
User-oriented projects
Evolution:
Starts with high values
Stabilize and matures with 3 <ratio< 8
Developer-oriented projects
Evolution:
Smaller community, no peaks
Always within 3 <ratio< 8
,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

12/13
Conclusions and further work
Metric
Intensive and expressive metric.
Not depending on size, maturity,
scope or technology.
End-users
More suitable for users-oriented
projects. Ratio works better with
large and active communities.

,

Stagnation

Other ratios, other cases
Devel-only messages, issues,
commits complexity.
Study beyond the ASF.

Can identify stagnated projects.
Can signal potential stagnation
threats.

Intensive metrics for open source evolution – http://oa.upm.es/14698/

13/13
Conclusions and further work
Metric
Intensive and expressive metric.
Not depending on size, maturity,
scope or technology.

Stagnation

End-users
More suitable for users-oriented
projects. Ratio works better with
large and active communities.

Other ratios, other cases
Devel-only messages, issues,
commits complexity.
Study beyond the ASF.

Can identify stagnated projects.
Can signal potential stagnation
threats.

Get a preprint of the paper at http://oa.upm.es/14698
Replication package
http://gsyc.es/∼grex/repro/2013-apache-intensive/
,

Intensive metrics for open source evolution – http://oa.upm.es/14698/

13/13

Weitere ähnliche Inhalte

Ähnlich wie Intensive metrics software evolution

GoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaGoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaFriprogsenteret
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringTao Xie
 
Software Security Assurance for DevOps
Software Security Assurance for DevOpsSoftware Security Assurance for DevOps
Software Security Assurance for DevOpsBlack Duck by Synopsys
 
Software Security Assurance for Devops
Software Security Assurance for DevopsSoftware Security Assurance for Devops
Software Security Assurance for DevopsJerika Phelps
 
Implementing policy @ WSSSPE
Implementing policy @ WSSSPEImplementing policy @ WSSSPE
Implementing policy @ WSSSPEDaisie Huang
 
DaveParizekResumeJune2015
DaveParizekResumeJune2015DaveParizekResumeJune2015
DaveParizekResumeJune2015Dave Parizek
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
Frankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectFrankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectGoethe Univeristy
 
Using Opensource Software For Public Health
Using Opensource Software For Public HealthUsing Opensource Software For Public Health
Using Opensource Software For Public HealthInSTEDD
 
Using Open Source Software For Public Health Kass-Hout Di Tada
Using Open Source Software For Public Health Kass-Hout Di TadaUsing Open Source Software For Public Health Kass-Hout Di Tada
Using Open Source Software For Public Health Kass-Hout Di TadaTaha Kass-Hout, MD, MS
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsAndrea Wiggins
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021Gérard Dupont
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitSlim Baltagi
 
LaranEvansResume
LaranEvansResumeLaranEvansResume
LaranEvansResumebutest
 
Six Easy Pieces of Quantitatively Analyzing Open Source
Six Easy Pieces of Quantitatively Analyzing Open SourceSix Easy Pieces of Quantitatively Analyzing Open Source
Six Easy Pieces of Quantitatively Analyzing Open SourceDirk Riehle
 
Project On-Science
Project On-ScienceProject On-Science
Project On-ScienceAmrit Ravi
 

Ähnlich wie Intensive metrics software evolution (20)

GoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaGoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'Elia
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
 
Software Security Assurance for DevOps
Software Security Assurance for DevOpsSoftware Security Assurance for DevOps
Software Security Assurance for DevOps
 
Software Security Assurance for Devops
Software Security Assurance for DevopsSoftware Security Assurance for Devops
Software Security Assurance for Devops
 
2016 nov-ieee-sdn-wiki
2016 nov-ieee-sdn-wiki2016 nov-ieee-sdn-wiki
2016 nov-ieee-sdn-wiki
 
Implementing policy @ WSSSPE
Implementing policy @ WSSSPEImplementing policy @ WSSSPE
Implementing policy @ WSSSPE
 
DaveParizekResumeJune2015
DaveParizekResumeJune2015DaveParizekResumeJune2015
DaveParizekResumeJune2015
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Frankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectFrankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee Projeect
 
Open Source
Open SourceOpen Source
Open Source
 
Using Opensource Software For Public Health
Using Opensource Software For Public HealthUsing Opensource Software For Public Health
Using Opensource Software For Public Health
 
Using Open Source Software For Public Health Kass-Hout Di Tada
Using Open Source Software For Public Health Kass-Hout Di TadaUsing Open Source Software For Public Health Kass-Hout Di Tada
Using Open Source Software For Public Health Kass-Hout Di Tada
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna Workflows
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
 
LaranEvansResume
LaranEvansResumeLaranEvansResume
LaranEvansResume
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Six Easy Pieces of Quantitatively Analyzing Open Source
Six Easy Pieces of Quantitatively Analyzing Open SourceSix Easy Pieces of Quantitatively Analyzing Open Source
Six Easy Pieces of Quantitatively Analyzing Open Source
 
Project On-Science
Project On-ScienceProject On-Science
Project On-Science
 

Kürzlich hochgeladen

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Kürzlich hochgeladen (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

Intensive metrics software evolution

  • 1. Intensive Metrics for the Study of the Evolution of Open Source Projects: Case Studies from the ASF Santiago Gala-Pérez (ASF), Gregorio Robles (URJC), Jesús M. González-Barahona (URJC), Israel Herraiz (UPM) 10th Working Conference on Mining Software Repositories SF, California, May 18th, 2013 Preprint available at http://oa.upm.es/14698/ Slides at http://slideshare.net/herraiz/intensive-metrics-software-evolution , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 1/13
  • 2. Metrics for Software Evolution Common metrics are extensive Difficult to compare projects of different size Successful projects undergo large size changes over their lifetime Intensive metrics in natural sciences Metrics not depending on the size of system Scale invariant , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 2/13
  • 3. Metrics for Software Evolution Common metrics are extensive Difficult to compare projects of different size Successful projects undergo large size changes over their lifetime Intensive metrics in natural sciences Metrics not depending on the size of system Scale invariant Are there any intensive metric for software? Can we find intensive metrics to study software evolution? , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 2/13
  • 4. The case of the Apache Software Foundation ASF members mailing list, November 29 2008 Joe Schaeffer says something IMO interesting about the ASF: the fact that the number of commits and the number of mailing list posts have grown in linear relationship [...] over the years. , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 3/13
  • 5. Goal of the paper Ratio Communication flow / development activity Hypothesis: the ratio is an intensive metric for software evolution It varies with Maturity, technology, community composition But not with project source code size , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 4/13
  • 6. Goal of the paper Ratio Communication flow / development activity Hypothesis: the ratio is an intensive metric for software evolution It varies with Maturity, technology, community composition But not with project source code size Case study: the ASF Broad and diverse range of projects Size, scope, technology, maturity If it didn’t happen on-list, it didn’t happen Communications between developers (decisions) Issue trackers Code review tools, automated builds, wiki page edits Commits , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 4/13
  • 7. ASF projects under study Project HTTPD APR Lucene Turbine Tomcat Jackrabbit Hadoop Geronimo SpamAssassin Portals Beehive , kSLOC 156 66 414 41 213 344 1270 370 54 202 88 Technology Web server Library Index & search Java web fwork Servlet API JSR-170 ref. impl. Big Data JavaEE app. srv. Spam filter Web fwork J2EE Struts Maturity Active, long-lived Active, long-lived Active, long-lived Stagnated Active, long-lived Active Very active Active, long-lived Mature Nearly dead Attic Intensive metrics for open source evolution – http://oa.upm.es/14698/ Scope Users Devs Users Devs Devs Devs Devs Devs End users Devs Devs 5/13
  • 8. ASF projects under study Project HTTPD APR Lucene Turbine Tomcat Jackrabbit Hadoop Geronimo SpamAssassin Portals Beehive kSLOC 156 66 414 41 213 344 1270 370 54 202 88 Technology Web server Library Index & search Java web fwork Servlet API JSR-170 ref. impl. Big Data JavaEE app. srv. Spam filter Web fwork J2EE Struts Maturity Active, long-lived Active, long-lived Active, long-lived Stagnated Active, long-lived Active Very active Active, long-lived Mature Nearly dead Attic Scope Users Devs Users Devs Devs Devs Devs Devs End users Devs Devs Ratio What’s the ratio evolution for these projects? , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 5/13
  • 9. Apache httpd 156 kSLOC, active and long lived web server , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 6/13
  • 10. Apache Portable Runtime (APR) 66 kSLOC, active and long lived library used by httpd and Subversion , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 7/13
  • 11. Apache Hadoop 1270 kSLOC, very active development and community, higher presence of non-human emails , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 8/13
  • 12. Apache SpamAssassin 54 kSLOC, spam filter, intended for end users, maturing project , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 9/13
  • 13. Apache Beehive 88 kSLOC, project in the Attic (no longer under development) , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 10/13
  • 14. Overall comparison Allows for comparison of projects with large differences in size, scope, technology, maturity , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 11/13
  • 15. Overall comparison Lessons learned Healthy Apache projects have smooth ratios Projects with little activity, or small core group, are noisier Peaks to infinity are evidence of stagnation , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 12/13
  • 16. Overall comparison Lessons learned Healthy Apache projects have smooth ratios Projects with little activity, or small core group, are noisier Peaks to infinity are evidence of stagnation User-oriented projects Evolution: Starts with high values Stabilize and matures with 3 <ratio< 8 Developer-oriented projects Evolution: Smaller community, no peaks Always within 3 <ratio< 8 , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 12/13
  • 17. Conclusions and further work Metric Intensive and expressive metric. Not depending on size, maturity, scope or technology. End-users More suitable for users-oriented projects. Ratio works better with large and active communities. , Stagnation Other ratios, other cases Devel-only messages, issues, commits complexity. Study beyond the ASF. Can identify stagnated projects. Can signal potential stagnation threats. Intensive metrics for open source evolution – http://oa.upm.es/14698/ 13/13
  • 18. Conclusions and further work Metric Intensive and expressive metric. Not depending on size, maturity, scope or technology. Stagnation End-users More suitable for users-oriented projects. Ratio works better with large and active communities. Other ratios, other cases Devel-only messages, issues, commits complexity. Study beyond the ASF. Can identify stagnated projects. Can signal potential stagnation threats. Get a preprint of the paper at http://oa.upm.es/14698 Replication package http://gsyc.es/∼grex/repro/2013-apache-intensive/ , Intensive metrics for open source evolution – http://oa.upm.es/14698/ 13/13