SlideShare a Scribd company logo
1 of 57
Download to read offline
The dynamics of software evolution
                      EVOLUMONS 2011
             Research Seminar on Software Evolution

                           Universitรฉ de Mons, Belgium
                               January 26th 2011

                                   Israel Herraiz
                           Universidad Alfonso X el Sabio
                                <isra@herraiz.org>
                                 <herraiz@uax.es>

                                                            1

http://www.uax.es http://herraiz.org
(c) 2011 Israel Herraiz
                                              This work is licensed under the
                                  Creative Commons Attribution-Share Alike 3.0

                                              To view a copy of this license, visit
                                 http://creativecommons.org/licenses/by-sa/3.0/

                                                                          or send a letter to

                                                                 Creative Commons,
                                                        171 Second Street, Suite 300,
                                                            San Francisco, California,
                                                                         94105, USA.

                                  Get the full bibliographic references listed in these slides at
                                  http://herraiz.org/stuff/evolumons_references_20110126.txt


http://www.uax.es http://herraiz.org
Outline
     โ—   The laws of software evolution
     โ—   The nature of software evolution (for libre
         software)
     โ—   How to accurately forecast software evolution.
         And why it works.
     โ—   What's next?
     โ—   And what did I learn during all these years of
         work?

                                                          3

http://www.uax.es http://herraiz.org
The laws of software evolution




                                             4

http://www.uax.es http://herraiz.org
My background
     โ—   Educated as a chemical and mechanical
         engineer
     โ—   Wasted my time in the chemical industry. But I
         did (and do) love doing software!
               โ€“   http://caflur.sf.net http://gpinch.sf.net
     โ—   Involved in the open source community since
         around 2001, started a PhD in 2004 in the
         Libresoft research group
               โ€“   http://libresoft.es

                                                               5

http://www.uax.es http://herraiz.org
How it all started
     โ—   Godfrey and Tu                 โ—   My supervisors and I
         [GT00] [GT01]                      wrote a paper on the
         studied the evolution              topic [RAGBH05]
         of the Linux kernel            โ—   At the time, I thought
     โ—   They said that the                 it was just one more
         laws of software                   paper
         evolution were not             โ—   It turned out to be our
         valid for Linux                    most cited paper
               โ€“   Laws of software
                   evolution. What is
                                            โ—   Completely puzzled
                   that?                        me
                                                                      6

http://www.uax.es http://herraiz.org
The topic background:
                        Software evolution
     โ—   How and why does
         software evolve?
     โ—   Meir M. Lehman
         Laws of software
         evolution
     โ—   โ€œProgram evolution.
         Processes of
         software changeโ€
         published in 1985

                                               7

http://www.uax.es http://herraiz.org
The laws in the seventies
     โ—   Laws of Program Evolution Dynamics (1974)




                                                           8
                                        [Leh74] [Leh85b]
http://www.uax.es http://herraiz.org
The evolution of the laws of
                    software evolution [Leh96] [LRW+97]
                                          [MFRP06]

                               [Leh78]    [Leh80]
                               [Leh85c]   [LB85]
    [Leh74]
    [Leh85b]




                                                    9

http://www.uax.es http://herraiz.org
The laws in the present day
                         (I โ€“ IV)




                                             10

http://www.uax.es http://herraiz.org
The laws in the present day
                        (V โ€“ VIII)




                                             11

http://www.uax.es http://herraiz.org
Empirical studies of software
                        evolution




                  See โ€œEmpirical Studies of Open Source Evolutionโ€ by
                      Juan Fernandez-Ramil, Angela Lozano, Michel Wermelinger, Andrea Capiluppi   12
                      in Tom Mens, Serge Demeyer (eds.) Software Evolution

http://www.uax.es http://herraiz.org
Why the controversy about the laws
           of software evolution?
     โ—   Fernandez-Ramil et al. found in the literature
         empirical validation for the I, VI, VII (partially)
         and VIII (partially)
     โ—   The most interesting part (for me)
               โ€“   Statistical analysis of software projects and their
                   evolution, using time series analysis among other
                   techniques (suggested in ยก1974!) [Leh74] [Leh85b]
               โ€“   โ€œFor maximum cost-effectiveness, management
                   consideration and judgement should include the entire
                   history of the project with the current state having the
                   strongest, but not exclusive, in๏ฌ‚uenceโ€
                   [Leh78] [Leh85c]
          โ—
                                                                              13

http://www.uax.es http://herraiz.org
The nature of (libre) software
                       evolution




                                               14

http://www.uax.es http://herraiz.org
The nature of (libre) software
                       evolution
     โ—   The goal is to develop a theoretical model for
         software evolution
     โ—   Long pursued goal
          โ—   Lehman and Belady in 1971 [BL71] [LB85]
          โ—   Woodside progressive and anti-regressive work
              [Woo80] (included in [LB85])
          โ—   Turski models [Tur96] [Tur02]
               โ€“   Growth is inversely proportional to complexity
               โ€“   Complexity is proportional to the square of size

                                                                      15

http://www.uax.es http://herraiz.org
More recent models
     โ—   Self-Organized criticality [Wu06] [WHH07]
          โ—   Power laws for the size of the system
          โ—   Long range correlations in the time series of
              changes
     โ—   Maintenance Guidance Model [CFR07]
          โ—   Those functions that have suffered more changes in
              the past are more likely to be changed in the future
          โ—   Assumptions:
               โ€“   Distribution of accumulated changes is asymmetrical
               โ€“   Developers prioritize changes using past number of
                   changes and complexity                                16

http://www.uax.es http://herraiz.org
Determinism and evolution
     โ—   Self Organized Criticality
          โ—   This means that current events are influenced by
              very old events
          โ—   Against Lehman suggestions [Leh78] [Leh85c]
     โ—   In my opinion, counter intuitive




                                                                 17

http://www.uax.es http://herraiz.org
Long range correlated processes




http://www.uax.es http://herraiz.org
Long range correlated processes




http://www.uax.es http://herraiz.org
Long range correlated processes




                                       Unreachable
http://www.uax.es http://herraiz.org
Short range correlated




http://www.uax.es http://herraiz.org
Short range correlated




http://www.uax.es http://herraiz.org
Short range correlated




http://www.uax.es http://herraiz.org
Short range correlated




http://www.uax.es http://herraiz.org
How is software evolution?




                                       or     ?




http://www.uax.es http://herraiz.org
Autocorrelation coefficients

                                                   ...
     1             2              3    4   5

                                                         r(1)
                                                   ...
                   1              2    3   4                    r(2)



                                                   ...
                                  1    2   3

                                               .
                                               .
                                               .




http://www.uax.es http://herraiz.org
r(k)           Autocorrelation coefficients
       1




       0

              1    2    3    4    5    6   7   8   9   10 11 12 13   14 15
                                                                             k

http://www.uax.es http://herraiz.org
r(k)           Autocorrelation coefficients
       1
                                                           Long range
                                                           correlated
                                                            r ๎‚žk ๎‚Ÿ~k 2dโˆ’1
                                                            0๎‚„d ๎‚„0.5



              Short range
               correlated
            (ARIMA process)
              r ๎‚žk ๎‚Ÿ~C ๎‚ž1โˆ’k ๎‚ป๎‚Ÿ
       0

              1    2    3    4    5    6   7   8   9   10 11 12 13   14 15
                                                                             k

http://www.uax.es http://herraiz.org
r(k)         Autocorrelation coefficients
       1
                                       Long range
                                       correlated
                                        r ๎‚žk ๎‚Ÿ~k 2dโˆ’1
                                         0๎‚„d ๎‚„0.5

                 Short range
                  correlated
               (ARIMA process)
                 r ๎‚žk ๎‚Ÿ~ Ai ๎‚ž1โˆ’k ๎‚ป๎‚Ÿ         Logarithmic
                                               scale

       0


                                                        k

http://www.uax.es http://herraiz.org
Empirical study
     โ—   3,821 software projects
               โ€“   More than 3 developers
               โ€“   More than 1 year of active history
               โ€“   9,234,104 commits / 2,357,438 modification requests
               โ€“   Projects registered between Nov. 1999 and Dec. 2004
               โ€“   Datasets publicly available
     โ—   See Determinism and evolution
               โ€“   5th International Working Conference on
                   Mining Software Repositories (MSR 2008)
                                                                FLOSSMole
                                                                    +
                                                               CVSAnalY-SF


http://www.uax.es http://herraiz.org
Methodology
     โ—   Liner correlation to calculate linearity
     โ—   Distribution of the Pearson coefficients
     โ—   Smoothing applied to the series before
         calculating ACF




http://www.uax.es http://herraiz.org
Results




http://www.uax.es http://herraiz.org
Results




http://www.uax.es http://herraiz.org
Results



                          Long
                          memory
                          processes              Short
                                                 memory
                                                 processes




http://www.uax.es http://herraiz.org
Looking at the numbers
                    Quantile Commits                    MRs
                           0 0.3235                     0.2886
                          20 0.7394                     0.7248
                          40 0.8178                     0.8036
                          60 0.8906                     0.8705
                          80 0.9783                     0.9464
                        100 0.9998                      0.9998

                                       Long memory process


                                       Short memory process
                                                                 35

http://www.uax.es http://herraiz.org
Implications for evolution
     โ—   Short memory -> Yesterday's weather
         http://doi.ieeecomputersociety.org/10.1109/ICSM.2004.1357788
     โ—   When deciding, current situation should have
         more influence
          โ—   As Lehman said in 1978




http://www.uax.es http://herraiz.org
How to forecast software evolution




                                            37

http://www.uax.es http://herraiz.org
Background
     โ—   Forecasting traditionally done using very simple
         statistical models
          โ—   Regression
     โ—   Lehman suggested in 1974 that Time Series
         Analysis was the best approach to study
         software evolution
     โ—   Let's compare time series analysis against
         regression models


                                                        38

http://www.uax.es http://herraiz.org
Case studies

                                       Training set                  Test set



                                                                                PostgreSQL


                                                                                FreeBSD


                                                                                NetBSD

          1993      1995      1997     1999     2001   2003   2005     2007
                                              Time



                                                                                          39

http://www.uax.es http://herraiz.org
Case studies




                                              Training set   Test set




                                                                        40

http://www.uax.es http://herraiz.org
Time Series Analysis
                      Original                                  Yes
                                       ACF           Clear
                     time series
                                       PACF         pattern?
                        data


                                                          No


                                                     Kernel
                                                    smoothing




                                         ARIMA              p, d, q
                        Predictions       model            based on
                                          fitting         ACF / PACF



http://www.uax.es http://herraiz.org
Parameters of the model




http://www.uax.es http://herraiz.org
Autocorrelation coefficients.
                     No smoothing




http://www.uax.es http://herraiz.org
Autocorrelation coefficients.
                    After smoothing




http://www.uax.es http://herraiz.org
Parameters of all the models
     โ—   Time series ARIMA model
          โ—   d=1 q=0                  p = 6, 7 or 9
     โ—   Regression model
          โ—   r > 0.99




http://www.uax.es http://herraiz.org
How does the model look like?


                     ๎‚ž                            ๎‚Ÿ ๎‚ž                ๎‚Ÿ
                                 q                         p
             d                                j                  i
        โˆ‡ x t 1โˆ’โˆ‘ ๎‚ด j B =๎‚ป๎‚žt๎‚Ÿ 1โˆ’โˆ‘ ๎‚ฎ i B
                               j=1                         i=1

                                          i       i
                                         B =B =x tโˆ’i
                                                  xt

                         โˆ‡ x t =x t โˆ’x tโˆ’1=๎‚ž1โˆ’B๎‚Ÿ x t
                                     d                 d
                               โˆ‡ x t =๎‚ž1โˆ’B๎‚Ÿ x t

http://www.uax.es http://herraiz.org
How does the model look like?

     Predicted / Actual values                             Estimation
                                       Coefficients                         Linear component
                                                             errors




                     ๎‚ž                           ๎‚Ÿ ๎‚ž                                   ๎‚Ÿ
                                 q                                      p
             d                               j                                     i
        โˆ‡ x t 1โˆ’โˆ‘ ๎‚ด j B =๎‚ป๎‚žt๎‚Ÿ 1โˆ’โˆ‘ ๎‚ฎ i B
                               j=1                                  i=1

      Parameters of
        the model                                Linear component




http://www.uax.es http://herraiz.org
Results
     Time series (ARIMA) vs. regression

                     ARIMA Regression
            FreeBSD 3.93        16.89
             NetBSD   1.80      15.94
           PostgreSQL 1.48       6.86

                        Mean Squared Relative Error




http://www.uax.es http://herraiz.org
Conclusions
     โ—   Time Series more accurate than Regression
         Analysis for macroscopic predictions
     โ—   Basic model. More components can be added.
     โ—   Seasonality
     โ—   Multi-variable, combining different factors




http://www.uax.es http://herraiz.org
More results
     โ—   Ok, so you predicted last year...which is past...
     โ—   What about predicting real future?
                          MSR Challenge 2007 winners

                          Goal:
                          predicting the number of changes
                          in Eclipse in the next three months
                          http://dx.doi.org/10.1109/MSR.2007.10




http://www.uax.es http://herraiz.org
Why this works?
     โ—   Isn't it too accurate?
     โ—   Why do you think this works?




http://www.uax.es http://herraiz.org
What's next?




                                                  52

http://www.uax.es http://herraiz.org
Further work
     โ—   Write a paper about the controversy around the
         validation of the laws of software evolution
          โ—   In progress
     โ—   Write a paper about the short memory nature of
         evolution
          โ—   Using Time Series Analysis to show it
          โ—   And ARIMA as a forecasting tool
          โ—   Extracting principles and guidelines for software
              projects management

                                                                  53

http://www.uax.es http://herraiz.org
And what I did learn during all these
                   years?




                                         54

http://www.uax.es http://herraiz.org
Things I appreciate my advisors did
     โ—   Freedom of movements
     โ—   Pressure to get my own funding
     โ—   Unconditional support
     โ—   Demanding and challenging environment
     โ—   Opportunity to coordinate projects
     โ—   And to participate in many meetings alone



                                                     55

http://www.uax.es http://herraiz.org
Things that I did not know and I do
                         now
     โ—   Know-how about conferences and journals
     โ—   English skills
     โ—   Writing skills (papers and proposals)
     โ—   Presentation skills
     โ—   Self-motivation
               โ€“   Brick walls are there for the rest of people
               โ€“   Experience is what you get when you don't get what
                   you want
               โ€“   Never give up
               โ€“   http://www.youtube.com/watch?v=ji5_MqicxSo           56

http://www.uax.es http://herraiz.org
Take away
            Laws of                                Statistical
       Software Evolution                          approach

              Controversy                     Replicable study

            Short memory                      Brick walls are
              dynamics                         a good thing

             ARIMA                             Keep working.
        accurate forecast                      Don't give up
                                                                 57

http://www.uax.es http://herraiz.org

More Related Content

Similar to The dynamics of software evolution - EVOLUMONS 2011

Open Source, Sourceforge Projects, & Apache Foundation
Open Source, Sourceforge Projects, & Apache FoundationOpen Source, Sourceforge Projects, & Apache Foundation
Open Source, Sourceforge Projects, & Apache FoundationMohammad Kotb
ย 
Building the Future Together: AtoM3, Governance, and the Sustainability of Op...
Building the Future Together: AtoM3, Governance, and the Sustainability of Op...Building the Future Together: AtoM3, Governance, and the Sustainability of Op...
Building the Future Together: AtoM3, Governance, and the Sustainability of Op...Artefactual Systems - AtoM
ย 
W3 c semantic web activity
W3 c semantic web activityW3 c semantic web activity
W3 c semantic web activityabhishek_manish2005
ย 
Introduction to License Compliance and My research (D. German)
Introduction to License Compliance and My research (D. German)Introduction to License Compliance and My research (D. German)
Introduction to License Compliance and My research (D. German)dmgerman
ย 
Prospero: A Web-based Document Delivery System
Prospero: A Web-based Document Delivery SystemProspero: A Web-based Document Delivery System
Prospero: A Web-based Document Delivery SystemEric Schnell
ย 
GoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaGoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaFriprogsenteret
ย 
XOOPS 2.5.x Installation Guide
XOOPS 2.5.x Installation GuideXOOPS 2.5.x Installation Guide
XOOPS 2.5.x Installation Guidexoopsproject
ย 
Workshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and ArchivematicaWorkshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and ArchivematicaArtefactual Systems - Archivematica
ย 
National Archives of Norway - AtoM and Archivematica intro workshop
National Archives of Norway - AtoM and Archivematica intro workshopNational Archives of Norway - AtoM and Archivematica intro workshop
National Archives of Norway - AtoM and Archivematica intro workshopArtefactual Systems - AtoM
ย 
Overview of oss(open source software library) and its pros and cons
Overview of oss(open source software library) and its pros and consOverview of oss(open source software library) and its pros and cons
Overview of oss(open source software library) and its pros and consYuga Priya Satheesh
ย 
Community catalysts value of open source
Community catalysts   value of open sourceCommunity catalysts   value of open source
Community catalysts value of open sourceDave Neary
ย 
Open source software
Open source softwareOpen source software
Open source softwarejaimeacurry
ย 
Artefactual and Open Source Development
Artefactual and Open Source DevelopmentArtefactual and Open Source Development
Artefactual and Open Source DevelopmentArtefactual Systems - AtoM
ย 
Opensource Development
Opensource DevelopmentOpensource Development
Opensource Developmentpetr_havel
ย 
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...WSO2
ย 
Day 2-presentation
Day 2-presentationDay 2-presentation
Day 2-presentationDeb Forsten
ย 
Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...
Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...
Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...cresco
ย 
Nonsoftwareoss
NonsoftwareossNonsoftwareoss
NonsoftwareossCarlo Daffara
ย 
open source technology
open source technologyopen source technology
open source technologyLila Ram Yadav
ย 
Fundamentals of Free and Open Source Software
Fundamentals of Free and Open Source SoftwareFundamentals of Free and Open Source Software
Fundamentals of Free and Open Source SoftwareRoss Gardler
ย 

Similar to The dynamics of software evolution - EVOLUMONS 2011 (20)

Open Source, Sourceforge Projects, & Apache Foundation
Open Source, Sourceforge Projects, & Apache FoundationOpen Source, Sourceforge Projects, & Apache Foundation
Open Source, Sourceforge Projects, & Apache Foundation
ย 
Building the Future Together: AtoM3, Governance, and the Sustainability of Op...
Building the Future Together: AtoM3, Governance, and the Sustainability of Op...Building the Future Together: AtoM3, Governance, and the Sustainability of Op...
Building the Future Together: AtoM3, Governance, and the Sustainability of Op...
ย 
W3 c semantic web activity
W3 c semantic web activityW3 c semantic web activity
W3 c semantic web activity
ย 
Introduction to License Compliance and My research (D. German)
Introduction to License Compliance and My research (D. German)Introduction to License Compliance and My research (D. German)
Introduction to License Compliance and My research (D. German)
ย 
Prospero: A Web-based Document Delivery System
Prospero: A Web-based Document Delivery SystemProspero: A Web-based Document Delivery System
Prospero: A Web-based Document Delivery System
ย 
GoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaGoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'Elia
ย 
XOOPS 2.5.x Installation Guide
XOOPS 2.5.x Installation GuideXOOPS 2.5.x Installation Guide
XOOPS 2.5.x Installation Guide
ย 
Workshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and ArchivematicaWorkshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and Archivematica
ย 
National Archives of Norway - AtoM and Archivematica intro workshop
National Archives of Norway - AtoM and Archivematica intro workshopNational Archives of Norway - AtoM and Archivematica intro workshop
National Archives of Norway - AtoM and Archivematica intro workshop
ย 
Overview of oss(open source software library) and its pros and cons
Overview of oss(open source software library) and its pros and consOverview of oss(open source software library) and its pros and cons
Overview of oss(open source software library) and its pros and cons
ย 
Community catalysts value of open source
Community catalysts   value of open sourceCommunity catalysts   value of open source
Community catalysts value of open source
ย 
Open source software
Open source softwareOpen source software
Open source software
ย 
Artefactual and Open Source Development
Artefactual and Open Source DevelopmentArtefactual and Open Source Development
Artefactual and Open Source Development
ย 
Opensource Development
Opensource DevelopmentOpensource Development
Opensource Development
ย 
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...
ย 
Day 2-presentation
Day 2-presentationDay 2-presentation
Day 2-presentation
ย 
Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...
Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...
Students of Navgujarat College of Computer Applications, Ahmedabad felt excit...
ย 
Nonsoftwareoss
NonsoftwareossNonsoftwareoss
Nonsoftwareoss
ย 
open source technology
open source technologyopen source technology
open source technology
ย 
Fundamentals of Free and Open Source Software
Fundamentals of Free and Open Source SoftwareFundamentals of Free and Open Source Software
Fundamentals of Free and Open Source Software
ย 

More from Israel Herraiz

intensive metrics software evolution
intensive metrics software evolutionintensive metrics software evolution
intensive metrics software evolutionIsrael Herraiz
ย 
Public Key Cryptography
Public Key CryptographyPublic Key Cryptography
Public Key CryptographyIsrael Herraiz
ย 
Statistical Distribution of Metrics
Statistical Distribution of MetricsStatistical Distribution of Metrics
Statistical Distribution of MetricsIsrael Herraiz
ย 
ยฟMATLAB? Yo uso Octave UPM
ยฟMATLAB? Yo uso Octave UPMยฟMATLAB? Yo uso Octave UPM
ยฟMATLAB? Yo uso Octave UPMIsrael Herraiz
ย 
The Ultimate Debian Database
The Ultimate Debian DatabaseThe Ultimate Debian Database
The Ultimate Debian DatabaseIsrael Herraiz
ย 
Evaluating the presence and impact of bias in bug-fix datasets
Evaluating the presence and impact of bias in bug-fix datasetsEvaluating the presence and impact of bias in bug-fix datasets
Evaluating the presence and impact of bias in bug-fix datasetsIsrael Herraiz
ย 
Software size distribution - Why we always underestimate software cost
Software size distribution - Why we always underestimate software costSoftware size distribution - Why we always underestimate software cost
Software size distribution - Why we always underestimate software costIsrael Herraiz
ย 
Public key cryptography
Public key cryptographyPublic key cryptography
Public key cryptographyIsrael Herraiz
ย 
Mining Software Repositories
Mining Software RepositoriesMining Software Repositories
Mining Software RepositoriesIsrael Herraiz
ย 

More from Israel Herraiz (9)

intensive metrics software evolution
intensive metrics software evolutionintensive metrics software evolution
intensive metrics software evolution
ย 
Public Key Cryptography
Public Key CryptographyPublic Key Cryptography
Public Key Cryptography
ย 
Statistical Distribution of Metrics
Statistical Distribution of MetricsStatistical Distribution of Metrics
Statistical Distribution of Metrics
ย 
ยฟMATLAB? Yo uso Octave UPM
ยฟMATLAB? Yo uso Octave UPMยฟMATLAB? Yo uso Octave UPM
ยฟMATLAB? Yo uso Octave UPM
ย 
The Ultimate Debian Database
The Ultimate Debian DatabaseThe Ultimate Debian Database
The Ultimate Debian Database
ย 
Evaluating the presence and impact of bias in bug-fix datasets
Evaluating the presence and impact of bias in bug-fix datasetsEvaluating the presence and impact of bias in bug-fix datasets
Evaluating the presence and impact of bias in bug-fix datasets
ย 
Software size distribution - Why we always underestimate software cost
Software size distribution - Why we always underestimate software costSoftware size distribution - Why we always underestimate software cost
Software size distribution - Why we always underestimate software cost
ย 
Public key cryptography
Public key cryptographyPublic key cryptography
Public key cryptography
ย 
Mining Software Repositories
Mining Software RepositoriesMining Software Repositories
Mining Software Repositories
ย 

Recently uploaded

GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
ย 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
ย 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
ย 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
ย 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
ย 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
ย 
Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)
Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)
Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)lakshayb543
ย 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
ย 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
ย 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
ย 
HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...
HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...
HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...Nguyen Thanh Tu Collection
ย 
call girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธ
call girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธcall girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธ
call girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธ9953056974 Low Rate Call Girls In Saket, Delhi NCR
ย 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
ย 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
ย 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
ย 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
ย 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
ย 

Recently uploaded (20)

GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
ย 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
ย 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
ย 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
ย 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
ย 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
ย 
Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)
Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)
Visit to a blind student's school๐Ÿง‘โ€๐Ÿฆฏ๐Ÿง‘โ€๐Ÿฆฏ(community medicine)
ย 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
ย 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
ย 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
ย 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ย 
HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...
HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...
HแปŒC TแปT TIแบพNG ANH 11 THEO CHฦฏฦ NG TRรŒNH GLOBAL SUCCESS ฤรP รN CHI TIแบพT - Cแบข Nฤ‚...
ย 
call girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธ
call girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธcall girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธ
call girls in Kamla Market (DELHI) ๐Ÿ” >เผ’9953330565๐Ÿ” genuine Escort Service ๐Ÿ”โœ”๏ธโœ”๏ธ
ย 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
ย 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
ย 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
ย 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
ย 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
ย 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
ย 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
ย 

The dynamics of software evolution - EVOLUMONS 2011

  • 1. The dynamics of software evolution EVOLUMONS 2011 Research Seminar on Software Evolution Universitรฉ de Mons, Belgium January 26th 2011 Israel Herraiz Universidad Alfonso X el Sabio <isra@herraiz.org> <herraiz@uax.es> 1 http://www.uax.es http://herraiz.org
  • 2. (c) 2011 Israel Herraiz This work is licensed under the Creative Commons Attribution-Share Alike 3.0 To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/3.0/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA. Get the full bibliographic references listed in these slides at http://herraiz.org/stuff/evolumons_references_20110126.txt http://www.uax.es http://herraiz.org
  • 3. Outline โ— The laws of software evolution โ— The nature of software evolution (for libre software) โ— How to accurately forecast software evolution. And why it works. โ— What's next? โ— And what did I learn during all these years of work? 3 http://www.uax.es http://herraiz.org
  • 4. The laws of software evolution 4 http://www.uax.es http://herraiz.org
  • 5. My background โ— Educated as a chemical and mechanical engineer โ— Wasted my time in the chemical industry. But I did (and do) love doing software! โ€“ http://caflur.sf.net http://gpinch.sf.net โ— Involved in the open source community since around 2001, started a PhD in 2004 in the Libresoft research group โ€“ http://libresoft.es 5 http://www.uax.es http://herraiz.org
  • 6. How it all started โ— Godfrey and Tu โ— My supervisors and I [GT00] [GT01] wrote a paper on the studied the evolution topic [RAGBH05] of the Linux kernel โ— At the time, I thought โ— They said that the it was just one more laws of software paper evolution were not โ— It turned out to be our valid for Linux most cited paper โ€“ Laws of software evolution. What is โ— Completely puzzled that? me 6 http://www.uax.es http://herraiz.org
  • 7. The topic background: Software evolution โ— How and why does software evolve? โ— Meir M. Lehman Laws of software evolution โ— โ€œProgram evolution. Processes of software changeโ€ published in 1985 7 http://www.uax.es http://herraiz.org
  • 8. The laws in the seventies โ— Laws of Program Evolution Dynamics (1974) 8 [Leh74] [Leh85b] http://www.uax.es http://herraiz.org
  • 9. The evolution of the laws of software evolution [Leh96] [LRW+97] [MFRP06] [Leh78] [Leh80] [Leh85c] [LB85] [Leh74] [Leh85b] 9 http://www.uax.es http://herraiz.org
  • 10. The laws in the present day (I โ€“ IV) 10 http://www.uax.es http://herraiz.org
  • 11. The laws in the present day (V โ€“ VIII) 11 http://www.uax.es http://herraiz.org
  • 12. Empirical studies of software evolution See โ€œEmpirical Studies of Open Source Evolutionโ€ by Juan Fernandez-Ramil, Angela Lozano, Michel Wermelinger, Andrea Capiluppi 12 in Tom Mens, Serge Demeyer (eds.) Software Evolution http://www.uax.es http://herraiz.org
  • 13. Why the controversy about the laws of software evolution? โ— Fernandez-Ramil et al. found in the literature empirical validation for the I, VI, VII (partially) and VIII (partially) โ— The most interesting part (for me) โ€“ Statistical analysis of software projects and their evolution, using time series analysis among other techniques (suggested in ยก1974!) [Leh74] [Leh85b] โ€“ โ€œFor maximum cost-effectiveness, management consideration and judgement should include the entire history of the project with the current state having the strongest, but not exclusive, in๏ฌ‚uenceโ€ [Leh78] [Leh85c] โ— 13 http://www.uax.es http://herraiz.org
  • 14. The nature of (libre) software evolution 14 http://www.uax.es http://herraiz.org
  • 15. The nature of (libre) software evolution โ— The goal is to develop a theoretical model for software evolution โ— Long pursued goal โ— Lehman and Belady in 1971 [BL71] [LB85] โ— Woodside progressive and anti-regressive work [Woo80] (included in [LB85]) โ— Turski models [Tur96] [Tur02] โ€“ Growth is inversely proportional to complexity โ€“ Complexity is proportional to the square of size 15 http://www.uax.es http://herraiz.org
  • 16. More recent models โ— Self-Organized criticality [Wu06] [WHH07] โ— Power laws for the size of the system โ— Long range correlations in the time series of changes โ— Maintenance Guidance Model [CFR07] โ— Those functions that have suffered more changes in the past are more likely to be changed in the future โ— Assumptions: โ€“ Distribution of accumulated changes is asymmetrical โ€“ Developers prioritize changes using past number of changes and complexity 16 http://www.uax.es http://herraiz.org
  • 17. Determinism and evolution โ— Self Organized Criticality โ— This means that current events are influenced by very old events โ— Against Lehman suggestions [Leh78] [Leh85c] โ— In my opinion, counter intuitive 17 http://www.uax.es http://herraiz.org
  • 18. Long range correlated processes http://www.uax.es http://herraiz.org
  • 19. Long range correlated processes http://www.uax.es http://herraiz.org
  • 20. Long range correlated processes Unreachable http://www.uax.es http://herraiz.org
  • 25. How is software evolution? or ? http://www.uax.es http://herraiz.org
  • 26. Autocorrelation coefficients ... 1 2 3 4 5 r(1) ... 1 2 3 4 r(2) ... 1 2 3 . . . http://www.uax.es http://herraiz.org
  • 27. r(k) Autocorrelation coefficients 1 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 k http://www.uax.es http://herraiz.org
  • 28. r(k) Autocorrelation coefficients 1 Long range correlated r ๎‚žk ๎‚Ÿ~k 2dโˆ’1 0๎‚„d ๎‚„0.5 Short range correlated (ARIMA process) r ๎‚žk ๎‚Ÿ~C ๎‚ž1โˆ’k ๎‚ป๎‚Ÿ 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 k http://www.uax.es http://herraiz.org
  • 29. r(k) Autocorrelation coefficients 1 Long range correlated r ๎‚žk ๎‚Ÿ~k 2dโˆ’1 0๎‚„d ๎‚„0.5 Short range correlated (ARIMA process) r ๎‚žk ๎‚Ÿ~ Ai ๎‚ž1โˆ’k ๎‚ป๎‚Ÿ Logarithmic scale 0 k http://www.uax.es http://herraiz.org
  • 30. Empirical study โ— 3,821 software projects โ€“ More than 3 developers โ€“ More than 1 year of active history โ€“ 9,234,104 commits / 2,357,438 modification requests โ€“ Projects registered between Nov. 1999 and Dec. 2004 โ€“ Datasets publicly available โ— See Determinism and evolution โ€“ 5th International Working Conference on Mining Software Repositories (MSR 2008) FLOSSMole + CVSAnalY-SF http://www.uax.es http://herraiz.org
  • 31. Methodology โ— Liner correlation to calculate linearity โ— Distribution of the Pearson coefficients โ— Smoothing applied to the series before calculating ACF http://www.uax.es http://herraiz.org
  • 34. Results Long memory processes Short memory processes http://www.uax.es http://herraiz.org
  • 35. Looking at the numbers Quantile Commits MRs 0 0.3235 0.2886 20 0.7394 0.7248 40 0.8178 0.8036 60 0.8906 0.8705 80 0.9783 0.9464 100 0.9998 0.9998 Long memory process Short memory process 35 http://www.uax.es http://herraiz.org
  • 36. Implications for evolution โ— Short memory -> Yesterday's weather http://doi.ieeecomputersociety.org/10.1109/ICSM.2004.1357788 โ— When deciding, current situation should have more influence โ— As Lehman said in 1978 http://www.uax.es http://herraiz.org
  • 37. How to forecast software evolution 37 http://www.uax.es http://herraiz.org
  • 38. Background โ— Forecasting traditionally done using very simple statistical models โ— Regression โ— Lehman suggested in 1974 that Time Series Analysis was the best approach to study software evolution โ— Let's compare time series analysis against regression models 38 http://www.uax.es http://herraiz.org
  • 39. Case studies Training set Test set PostgreSQL FreeBSD NetBSD 1993 1995 1997 1999 2001 2003 2005 2007 Time 39 http://www.uax.es http://herraiz.org
  • 40. Case studies Training set Test set 40 http://www.uax.es http://herraiz.org
  • 41. Time Series Analysis Original Yes ACF Clear time series PACF pattern? data No Kernel smoothing ARIMA p, d, q Predictions model based on fitting ACF / PACF http://www.uax.es http://herraiz.org
  • 42. Parameters of the model http://www.uax.es http://herraiz.org
  • 43. Autocorrelation coefficients. No smoothing http://www.uax.es http://herraiz.org
  • 44. Autocorrelation coefficients. After smoothing http://www.uax.es http://herraiz.org
  • 45. Parameters of all the models โ— Time series ARIMA model โ— d=1 q=0 p = 6, 7 or 9 โ— Regression model โ— r > 0.99 http://www.uax.es http://herraiz.org
  • 46. How does the model look like? ๎‚ž ๎‚Ÿ ๎‚ž ๎‚Ÿ q p d j i โˆ‡ x t 1โˆ’โˆ‘ ๎‚ด j B =๎‚ป๎‚žt๎‚Ÿ 1โˆ’โˆ‘ ๎‚ฎ i B j=1 i=1 i i B =B =x tโˆ’i xt โˆ‡ x t =x t โˆ’x tโˆ’1=๎‚ž1โˆ’B๎‚Ÿ x t d d โˆ‡ x t =๎‚ž1โˆ’B๎‚Ÿ x t http://www.uax.es http://herraiz.org
  • 47. How does the model look like? Predicted / Actual values Estimation Coefficients Linear component errors ๎‚ž ๎‚Ÿ ๎‚ž ๎‚Ÿ q p d j i โˆ‡ x t 1โˆ’โˆ‘ ๎‚ด j B =๎‚ป๎‚žt๎‚Ÿ 1โˆ’โˆ‘ ๎‚ฎ i B j=1 i=1 Parameters of the model Linear component http://www.uax.es http://herraiz.org
  • 48. Results Time series (ARIMA) vs. regression ARIMA Regression FreeBSD 3.93 16.89 NetBSD 1.80 15.94 PostgreSQL 1.48 6.86 Mean Squared Relative Error http://www.uax.es http://herraiz.org
  • 49. Conclusions โ— Time Series more accurate than Regression Analysis for macroscopic predictions โ— Basic model. More components can be added. โ— Seasonality โ— Multi-variable, combining different factors http://www.uax.es http://herraiz.org
  • 50. More results โ— Ok, so you predicted last year...which is past... โ— What about predicting real future? MSR Challenge 2007 winners Goal: predicting the number of changes in Eclipse in the next three months http://dx.doi.org/10.1109/MSR.2007.10 http://www.uax.es http://herraiz.org
  • 51. Why this works? โ— Isn't it too accurate? โ— Why do you think this works? http://www.uax.es http://herraiz.org
  • 52. What's next? 52 http://www.uax.es http://herraiz.org
  • 53. Further work โ— Write a paper about the controversy around the validation of the laws of software evolution โ— In progress โ— Write a paper about the short memory nature of evolution โ— Using Time Series Analysis to show it โ— And ARIMA as a forecasting tool โ— Extracting principles and guidelines for software projects management 53 http://www.uax.es http://herraiz.org
  • 54. And what I did learn during all these years? 54 http://www.uax.es http://herraiz.org
  • 55. Things I appreciate my advisors did โ— Freedom of movements โ— Pressure to get my own funding โ— Unconditional support โ— Demanding and challenging environment โ— Opportunity to coordinate projects โ— And to participate in many meetings alone 55 http://www.uax.es http://herraiz.org
  • 56. Things that I did not know and I do now โ— Know-how about conferences and journals โ— English skills โ— Writing skills (papers and proposals) โ— Presentation skills โ— Self-motivation โ€“ Brick walls are there for the rest of people โ€“ Experience is what you get when you don't get what you want โ€“ Never give up โ€“ http://www.youtube.com/watch?v=ji5_MqicxSo 56 http://www.uax.es http://herraiz.org
  • 57. Take away Laws of Statistical Software Evolution approach Controversy Replicable study Short memory Brick walls are dynamics a good thing ARIMA Keep working. accurate forecast Don't give up 57 http://www.uax.es http://herraiz.org