SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Can Organizations Really Use Predictions?
           Discussion paper: PROMISE ’08

                        Tim Menzies
                      tim@menzies.us
Lane Department of Computer Science and Electrical Engineering
                   West Virginia University
                    http://menzies.us

                         May 5, 2008




                                                                 1 / 16
We’re Learning. Who’s Listenning?


We’re Learning. Who’s
Listenning?
                        ■   This talk:
Examples                    ◆   Data Mining for SE really works
Conclusion
                            ◆   5 success stories with NASA data
Questions? Comments?
                        ■   Still, main problem is organizational, not technological
                                                         4
                            ◆   Despite clear success,   5
                                                             of those data sources have vanished
                        ■   What to do?
                        ■   How to insure that all our good work is not wasted?




                                                                                                   2 / 16
We’re Learning. Who’s
Listenning?

Examples
Eg1: Text Mining NASA
Eg1 (more)
Eg2: Effort estimation
Eg3: SILAP
Eg3 (more)
Eg4: static code
Eg5: defect prediction

Conclusion
                         Examples
Questions? Comments?




                                    3 / 16
Eg1: Text Mining NASA


We’re Learning. Who’s
Listenning?
                         ■     901 NASA records, PITS issue      ■     Severity 2 predictors:
Examples
                               tracker: {severity, free text}          10*{(train,test) = (90,10)%}
Eg1: Text Mining NASA
                                                                       N    a=recall    b=precision    F = 2∗a∗b
                                                                                                            a+b
Eg1 (more)
Eg2: Effort estimation                                                100       0.81           0.93     0.87
Eg3: SILAP                                                            50       0.80           0.90     0.85
                             severity    frequency                    25       0.79           0.93     0.85
Eg3 (more)
                             1 (panic)           0                    12       0.74           0.92     0.82
Eg4: static code
                             2                 311                     6       0.71           0.94     0.81
Eg5: defect prediction
                             3                 356                     3       0.74           0.82     0.78
Conclusion                   4                 208
Questions? Comments?         5 (yawn)           26
                                                                 Rules (from N=3 words):
                         ■     Top 100 unique words (selected             if (rvm ≤ 0) & (srs = 3) → sev=4
                                                                     else if (srs ≥ 2)             → sev=2
                               by Tf*IDF); sorted by InfoGain        else                          → sev=3
                         ■     Rules learned from N best (RIP-   ■     Diamonds in the dust
                               PER Cohen [1995a]
                                                                       ◆     Not 9414 words total
                                                                       ◆     or 1662 unique words
                                                                       ◆     but 3 highly predictive
                                                                             words




                                                                                                                   4 / 16
Top 3 words


We’re Learning. Who’s
Listenning?
                         ■   In issue reports from four other projects:
Examples                     ◆         10*{(train,test) = (90,10)%}
Eg1: Text Mining NASA
Eg1 (more)                   ◆         yields probability of detection of highest severity class of 93% to 99.8%.
Eg2: Effort estimation
Eg3: SILAP               ■   (Note: ignoring real rare classes.)
Eg3 (more)
Eg4: static code
Eg5: defect prediction   Project “b”: 984 records                        Project “d”: 180 records
Conclusion

Questions? Comments?     a   b     c     d       <--   classified as       a   b       c   <-- classified as
                         1 380     0     0   |     a   = _4              157 23        0 |   a = _4
                         1 520     0     0   |     b   = _3 pd=99.8%       9 121       0 |   b = _3 pd=99.4%
                         0 59      0     0   |     c   = _5                6   1       0 |   c = _5
                         0 23      0     0   |     d   = 2



                         Project “c”: 317 records                        Project “e”: 832 records

                           a   b       c   <-- classified as             a   b     c    d       <--   classified as
                           9 121       0 |   b = _3 pd=93.1%             0 23      0    0   |     a   = _2
                         157 23        0 |   a = _4                      0 498     0   18   |     b   = _3 pd=96.5%
                           6   1       0 |   c = _5                      0 34      0    7   |     c   = _5
                                                                         0 178     0   65   |     d   = _4




                                                                                                                      5 / 16
Eg2: Effort estimation


We’re Learning. Who’s
Listenning?
                         ■   NASA COCOMO data (Boehm et al. [2000])
Examples                 ■   Results from IEEE TSE (Menzies et al. [2006]).
Eg1: Text Mining NASA    ■   learners for continuous classes
Eg1 (more)
Eg2: Effort estimation    ■   A study of 160 effort estimation methods
Eg3: SILAP               ■   20 * { pick any 10, train on remaining, test on 10 }
Eg3 (more)
Eg4: static code
Eg5: defect prediction                                              100 ∗ (pred − actual)/actual
Conclusion                                                  50% percentile  65% percentile  75% percentile
Questions? Comments?
                                      mode= embedded                   -9              26              60
                                              project= X               -6              16              46
                                                      all              -4              12              31
                                             year= 1975                -3              19              39
                                   mode= semi-detached                 -3              10              22
                                         ground systems                -3              11              29
                                               center= 5               -3              20              50
                                        mission planning               -1              25              50
                                            project= gro               -1                9             19
                                               center= 2                 0             11              21
                                             year= 1980                  4             29              58
                                     avionics monitoring                 6             32              56
                                                  median               -3              19              39


                         ■   i.e. usually, very accurate estimates


                                                                                                             6 / 16
Eg3: SILAP: Early Life cycle Severity Detection


We’re Learning. Who’s
Listenning?
                         ■   NASA defect data: 5 projects, (Menzies et.al 2007)
Examples                 ■   SILAP: predict error {potential, consequence} from project description
Eg1: Text Mining NASA
Eg1 (more)
Eg2: Effort estimation
                                                           Derived                               Raw features
Eg3: SILAP
                                           co   =   Consequence                 am     =Artifact Maturity
Eg3 (more)
                                           dv   =   Development                  as    =Asset Safety
                                           ep   =   Error Potential               cl   =CMM Level
Eg4: static code
                                           pr   =   Process                      cx    =Complexity
Eg5: defect prediction
                                           sc   =   Software Characteristic       di   =Degree of Innovation
Conclusion                                                                       do    =Development Organization
                                                                                 dt    =Use of Defect Tracking System
Questions? Comments?
                                                                                 ex    =Experience
                                                                                  fr   =Use of Formal Reviews
                                                                                 hs    =Human Safety
                                                                                 pf    =Performance
                                                                                 ra    =Re-use Approach
                                                                                rm     =Use of Risk Management System
                                                                                 ss    =Size of System
                                                                                 uc    =Use of Configuration Management
                                                                                 us    =Use of Standards



                         function   CO( tmp)        {   tmp=0.35*AS + 0.65 *PF; return (round((HS) < tmp) ? HS : tmp)
                         function   EP()            {   return round(0.579*DV() + 0.249*PR() + 0.172*SC())}
                         function   SC()            {   return 0.547*CX + 0.351*DI + 0.102*SS }
                         function   DV()            {   return 0.828*EX + 0.172*DO }
                         function   PR()            {   return 0.226*RA + 0.242*AM + formality() }
                         function   formality()     {   return 0.0955*US+ 0.0962*UC+ 0.0764*CL + 0.1119*FR +0.0873*DT + 0.0647*RM}


                                                                                                                                 7 / 16
Eg3: 2F FSS + RIPPER + SILAP data (211 components on 5
                         projects)

We’re Learning. Who’s                                                        severity predictions
Listenning?                                                                    a=pd, b=prec
Examples                                                                     x = 2ab/(a + b)
Eg1: Text Mining NASA             features                            |F |    12      3      4      5    X =    x /4
Eg1 (more)
Eg2: Effort estimation
                             A    all - L1 - L2 - group(6)              8    0.97   0.95   0.97   0.99   0.97
Eg3: SILAP
                             B    all - L1 - L2 - group(5 + 6)          7    0.95   0.94   0.97   0.96   0.96
Eg3 (more)
Eg4: static code
                             C    all - L1 - L2 - group(4 + 5 + 6)      6    0.93   0.95   0.98   0.93   0.95
Eg5: defect prediction       D    all - L1 - L2                        16    0.94   0.94   0.93   0.96   0.94
                             E    all - L1 - L2 - group(3 + 4 + 5 + 6) 4     0.93   0.97   0.90   0.87   0.92
Conclusion
                             F    {co*ep, co, ep}                       3    0.94   0.84   0.55   0.70   0.76
Questions? Comments?         G    L1                                    1    0.67   0.69   0.00   0.46   0.45
                             H    just ”us”                             1    0.64   0.60   0.00   0.00   0.31
                              I   L2                                    1    0.57   0.00   0.32   0.00   0.22



                                              rule   1          if   uc ≥ 2 ∧ us = 1          then severity = 5
                                              rule   2   else   if   am = 3                   then severity = 5
                                              rule   3   else   if   uc ≥ 2 ∧ am = 1 ∧ us ≤ 2 then severity = 5
                                              rule   4   else   if   am = 1 ∧ us = 2          then severity = 4
                         Rules from “E”:      rule   5   else   if   us = 3 ∧ ra ≥ 4          then severity = 4
                                              rule   6   else   if   us = 1                   then severity = 3
                                              rule   7   else   if   ra = 3                   then severity = 3
                                              rule   8   else   if   true                     then severity = 1 or 2




                                                                                                                       8 / 16
Eg4: Defect predictors from Static code measures


We’re Learning. Who’s
Listenning?
                         ■   IEEE TSE (Menzies et al. [2006])
Examples                 ■   Modules from eight NASA projects, MDP, described using LOC, McCabe,
Eg1: Text Mining NASA        Halstead metrics
Eg1 (more)
Eg2: Effort estimation    ■   New methods
Eg3: SILAP
Eg3 (more)                   ◆   Shoot-out between :
Eg4: static code
Eg5: defect prediction           ■   Bayesian;
Conclusion                       ■   simple rule learners; e.g. v(g) ≥ 10
Questions? Comments?             ■   complex tree learners; C4.5
                             ◆   Simple pre-processor on the exponential numerics
                                 ■   num = log(num < 0.000001?0.000001 : num)

                         ■   Prior state(s)-of-the-art, percentage of defects found:
                             ◆   IEEE Metrics 2002 panel: manual software reviews finds ≈ 60%
                             ◆   Raffo: industrial reviews finds TR(min,mod,max) = TR(35, 50, 65)%
                             ◆   My old data mining experiments: prob {detection,false alarm}={36,17}%




                                                                                                    9 / 16
Eg5: Yet more detect prediction


We’re Learning. Who’s
Listenning?
                         ■   (Song et al. [2006])
Examples                 ■   NASA SEL defect data: than 200 projects over 15 years.
Eg1: Text Mining NASA    ■   Predicting defects accuracy is very high (over 95%),
Eg1 (more)
Eg2: Effort estimation    ■   false-negative rate is very low (under 3%).
Eg3: SILAP               ■   Wow.
Eg3 (more)
Eg4: static code
Eg5: defect prediction

Conclusion

Questions? Comments?




                                                                                      10 / 16
We’re Learning. Who’s
Listenning?

Examples

Conclusion
In summary...
Why?
What to do?

Questions? Comments?



                        Conclusion




                                     11 / 16
In summary...


We’re Learning. Who’s
Listenning?
                        ■   Five NASA data sources
Examples                    ◆   Eg   #1:   text mining a NASA issue database (PITS)
Conclusion
                            ◆   Eg   #2:   effort estimation from NASA data (COCOMO)
In summary...
Why?                        ◆   Eg   #3:   early life cycle severity prediction (SILAP)
What to do?
                            ◆   Eg   #4:   defect prediction from NASA static code data (MDP)
Questions? Comments?
                            ◆   Eg   #5:   defect prediction (NASA SEL)
                        ■   All of which yield strong predictors for quality (effort, defects)
                        ■   Only one of which is still active (PITS)
                        ■   What went wrong?
                        ■   What to do?




                                                                                                12 / 16
Why is this Data Being Ignored?


We’re Learning. Who’s   ■   Group 1 : easy to explain
Listenning?

Examples                    ◆   NASA SEL : Technology used in case study #5 very new
Conclusion                  ◆   PITS :
In summary...
Why?                            ■   Accessing PITS data was hard- required much civil servant support
What to do?                     ■   No one was crazy enough to try text mining on unstructured PITS issue
Questions? Comments?                reports.
                            ◆   SILAP :
                                ■   Newest data set of all the above
                                ■   Never explored before since not available before
                                ■   Data collection stopped since IV&V business model changed (now focused on
                                    model-based early lifecycle validation).

                        ■   Group 2 : harder to explain
                            ◆   MDP : Much interest across the agency (at GRC, JSC) in MDP (and associated
                                tools).
                            ◆   COCOMO: well-documented, cheap to collect, many tools available
                            ◆   Maybe the answer lies in NASA culture:
                                ■   NASA’s centers compete for resources.
                                ■   Reluctance to critically evaluate and share process information.



                                                                                                            13 / 16
What to do?


We’re Learning. Who’s
                        ■   Stop debating what data to collect
Listenning?
                            ◆   Many loosely-defined sources will do: COCOMO, SILAP, defect reports
Examples

Conclusion              ■   Stop debating how to store data
In summary...
Why?                        ◆   Comma-separated or ARFF format or XML , one per component, is fine
What to do?

Questions? Comments?    ■   Stop hiding data
                            ◆   Create a central register for all NASA’s software components
                            ◆   Register = component name and “part-of” (super-component)
                            ◆   Features extracted from all components, stored at a central location
                            ◆   All reports have anonymous join key to the central register
                            ◆   Make the anonymous data open source (lever the data mining community)
                        ■   Stop ignoring institutional data
                            ◆   Active repository, not data tomb
                            ◆   Success measure: not data in, but conclusions out
                        ■   Stop publishing vague generalities
                            ◆   Rather, publish general methods for building specific models
                            ◆   Open research question: how much data is enough to learn local model?


                                                                                                   14 / 16
We’re Learning. Who’s
Listenning?

Examples

Conclusion

Questions? Comments?
References




                        Questions? Comments?




                                               15 / 16
References


We’re Learning. Who’s   ■ Eg1: text mining
Listenning?

Examples
                           ◆    SEVERIS: T. Menzies and A. Marcus. Automated severity assessment of software defect reports. In
                                Submitted to ICMS’08, 2008. Available from http://menzies.us/pdf/08severis.pdf
Conclusion                 ◆    RIPPER: W.W. Cohen. Fast effective rule induction. In ICML’95, pages 115–123, 1995b. Available on-line
Questions? Comments?
                                from http://www.cs.cmu.edu/~wcohen/postscript/ml-95-ripper.ps
References              ■ Eg2: effort estimation
                           ◆    Barry Boehm, Ellis Horowitz, Ray Madachy, Donald Reifer, Bradford K. Clark, Bert Steece, A. Winsor Brown,
                                Sunita Chulani, and Chris Abts. Software Cost Estimation with Cocomo II. Prentice Hall, 2000
                           ◆    Tim Menzies, Zhihao Chen, Jairus Hihn, and Karen Lum. Selecting best practices for effort estimation. IEEE
                                Transactions on Software Engineering, November 2006. Available from
                                http://menzies.us/pdf/06coseekmo.pdf
                        ■ Eg3: issue prediction
                           ◆    SILAP: T. Menzies, M. Benson, K. Costello, C. Moats, M. Northey, and J. Richarson. Learning better ivv
                                practices. Innovations in Systems and Software Engineering, March 2008. Available from
                                http://menzies.us/pdf/07ivv.pdf
                        ■ Eg4: static code
                           ◆    Tim Menzies, Jeremy Greenwald, and Art Frank. Data mining static code attributes to learn defect
                                predictors. IEEE Transactions on Software Engineering, January 2007. Available from
                                http://menzies.us/pdf/06learnPredict.pdf
                        ■ Eg5: SEL
                           ◆    Qinbao Song, Martin Shepperd, Michelle Cartwright, and Carolyn Mair. Software defect association mining
                                and defect correction effort prediction. IEEE Trans. Softw. Eng., 32(2):69–82, 2006. ISSN 0098-5589




                                                                                                                                     16 / 16

Weitere ähnliche Inhalte

Ähnlich wie Organizations Use Data

Data Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification SystemData Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification SystemJamie (Taka) Wang
 
strassen matrix multiplication algorithm
strassen matrix multiplication algorithmstrassen matrix multiplication algorithm
strassen matrix multiplication algorithmevil eye
 
Lecture 8: Decision Trees & k-Nearest Neighbors
Lecture 8: Decision Trees & k-Nearest NeighborsLecture 8: Decision Trees & k-Nearest Neighbors
Lecture 8: Decision Trees & k-Nearest NeighborsMarina Santini
 
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...Alexander Litvinenko
 
Computer Aided Assessment (CAA) for mathematics
Computer Aided Assessment (CAA) for mathematicsComputer Aided Assessment (CAA) for mathematics
Computer Aided Assessment (CAA) for mathematicstelss09
 
MLSD18. Unsupervised Learning
MLSD18. Unsupervised LearningMLSD18. Unsupervised Learning
MLSD18. Unsupervised LearningBigML, Inc
 
My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...Alexander Litvinenko
 
Spring 2003
Spring 2003Spring 2003
Spring 2003butest
 
Arbonne's Results Presentation
Arbonne's Results PresentationArbonne's Results Presentation
Arbonne's Results Presentationguest06b488
 
Active Learning for Multi-relational Data Construction
Active Learning for Multi-relational Data ConstructionActive Learning for Multi-relational Data Construction
Active Learning for Multi-relational Data ConstructionHiroshi Kajino
 
Qiskit advocate demo qsvm
Qiskit advocate demo qsvmQiskit advocate demo qsvm
Qiskit advocate demo qsvmYuma Nakamura
 
An Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUMAn Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUMDominik Dahlem
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data AnalysisNBER
 
Complex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutionsComplex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutionsPeter Solymos
 
Molecular models, threads and you
Molecular models, threads and youMolecular models, threads and you
Molecular models, threads and youJiahao Chen
 

Ähnlich wie Organizations Use Data (20)

Data Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification SystemData Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification System
 
strassen matrix multiplication algorithm
strassen matrix multiplication algorithmstrassen matrix multiplication algorithm
strassen matrix multiplication algorithm
 
Lecture 8: Decision Trees & k-Nearest Neighbors
Lecture 8: Decision Trees & k-Nearest NeighborsLecture 8: Decision Trees & k-Nearest Neighbors
Lecture 8: Decision Trees & k-Nearest Neighbors
 
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...Developing fast  low-rank tensor methods for solving PDEs with uncertain coef...
Developing fast low-rank tensor methods for solving PDEs with uncertain coef...
 
Computer Aided Assessment (CAA) for mathematics
Computer Aided Assessment (CAA) for mathematicsComputer Aided Assessment (CAA) for mathematics
Computer Aided Assessment (CAA) for mathematics
 
Chapter 04
Chapter 04Chapter 04
Chapter 04
 
Input analysis
Input analysisInput analysis
Input analysis
 
MLSD18. Unsupervised Learning
MLSD18. Unsupervised LearningMLSD18. Unsupervised Learning
MLSD18. Unsupervised Learning
 
My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...My presentation at University of Nottingham "Fast low-rank methods for solvin...
My presentation at University of Nottingham "Fast low-rank methods for solvin...
 
Spring 2003
Spring 2003Spring 2003
Spring 2003
 
Arbonne's Results Presentation
Arbonne's Results PresentationArbonne's Results Presentation
Arbonne's Results Presentation
 
Active Learning for Multi-relational Data Construction
Active Learning for Multi-relational Data ConstructionActive Learning for Multi-relational Data Construction
Active Learning for Multi-relational Data Construction
 
Qiskit advocate demo qsvm
Qiskit advocate demo qsvmQiskit advocate demo qsvm
Qiskit advocate demo qsvm
 
Borderline Smote
Borderline SmoteBorderline Smote
Borderline Smote
 
An Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUMAn Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUM
 
T test statistics
T test statisticsT test statistics
T test statistics
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
Complex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutionsComplex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutions
 
Molecular models, threads and you
Molecular models, threads and youMolecular models, threads and you
Molecular models, threads and you
 

Mehr von gregoryg

Community-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision MakingCommunity-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision Makinggregoryg
 
The Robust Optimization of Non-Linear Requirements Models
The Robust Optimization of Non-Linear Requirements ModelsThe Robust Optimization of Non-Linear Requirements Models
The Robust Optimization of Non-Linear Requirements Modelsgregoryg
 
Finding Robust Solutions to Requirements Models
Finding Robust Solutions to Requirements ModelsFinding Robust Solutions to Requirements Models
Finding Robust Solutions to Requirements Modelsgregoryg
 
Distributed Decision Tree Induction
Distributed Decision Tree InductionDistributed Decision Tree Induction
Distributed Decision Tree Inductiongregoryg
 
Irrf Presentation
Irrf PresentationIrrf Presentation
Irrf Presentationgregoryg
 
Optimizing Requirements Decisions with KEYS
Optimizing Requirements Decisions with KEYSOptimizing Requirements Decisions with KEYS
Optimizing Requirements Decisions with KEYSgregoryg
 
Confidence in Software Cost Estimation Results based on MMRE and PRED
Confidence in Software Cost Estimation Results based on MMRE and PREDConfidence in Software Cost Estimation Results based on MMRE and PRED
Confidence in Software Cost Estimation Results based on MMRE and PREDgregoryg
 
Promise08 Wrapup
Promise08 WrapupPromise08 Wrapup
Promise08 Wrapupgregoryg
 
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...gregoryg
 
Software Defect Repair Times: A Multiplicative Model
Software Defect Repair Times: A Multiplicative ModelSoftware Defect Repair Times: A Multiplicative Model
Software Defect Repair Times: A Multiplicative Modelgregoryg
 
Complementing Approaches in ERP Effort Estimation Practice: an Industrial Study
Complementing Approaches in ERP Effort Estimation Practice: an Industrial StudyComplementing Approaches in ERP Effort Estimation Practice: an Industrial Study
Complementing Approaches in ERP Effort Estimation Practice: an Industrial Studygregoryg
 
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...gregoryg
 
Implications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect PredictorsImplications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect Predictorsgregoryg
 
Practical use of defect detection and prediction
Practical use of defect detection and predictionPractical use of defect detection and prediction
Practical use of defect detection and predictiongregoryg
 

Mehr von gregoryg (14)

Community-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision MakingCommunity-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision Making
 
The Robust Optimization of Non-Linear Requirements Models
The Robust Optimization of Non-Linear Requirements ModelsThe Robust Optimization of Non-Linear Requirements Models
The Robust Optimization of Non-Linear Requirements Models
 
Finding Robust Solutions to Requirements Models
Finding Robust Solutions to Requirements ModelsFinding Robust Solutions to Requirements Models
Finding Robust Solutions to Requirements Models
 
Distributed Decision Tree Induction
Distributed Decision Tree InductionDistributed Decision Tree Induction
Distributed Decision Tree Induction
 
Irrf Presentation
Irrf PresentationIrrf Presentation
Irrf Presentation
 
Optimizing Requirements Decisions with KEYS
Optimizing Requirements Decisions with KEYSOptimizing Requirements Decisions with KEYS
Optimizing Requirements Decisions with KEYS
 
Confidence in Software Cost Estimation Results based on MMRE and PRED
Confidence in Software Cost Estimation Results based on MMRE and PREDConfidence in Software Cost Estimation Results based on MMRE and PRED
Confidence in Software Cost Estimation Results based on MMRE and PRED
 
Promise08 Wrapup
Promise08 WrapupPromise08 Wrapup
Promise08 Wrapup
 
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
 
Software Defect Repair Times: A Multiplicative Model
Software Defect Repair Times: A Multiplicative ModelSoftware Defect Repair Times: A Multiplicative Model
Software Defect Repair Times: A Multiplicative Model
 
Complementing Approaches in ERP Effort Estimation Practice: an Industrial Study
Complementing Approaches in ERP Effort Estimation Practice: an Industrial StudyComplementing Approaches in ERP Effort Estimation Practice: an Industrial Study
Complementing Approaches in ERP Effort Estimation Practice: an Industrial Study
 
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
 
Implications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect PredictorsImplications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect Predictors
 
Practical use of defect detection and prediction
Practical use of defect detection and predictionPractical use of defect detection and prediction
Practical use of defect detection and prediction
 

Kürzlich hochgeladen

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Kürzlich hochgeladen (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Organizations Use Data

  • 1. Can Organizations Really Use Predictions? Discussion paper: PROMISE ’08 Tim Menzies tim@menzies.us Lane Department of Computer Science and Electrical Engineering West Virginia University http://menzies.us May 5, 2008 1 / 16
  • 2. We’re Learning. Who’s Listenning? We’re Learning. Who’s Listenning? ■ This talk: Examples ◆ Data Mining for SE really works Conclusion ◆ 5 success stories with NASA data Questions? Comments? ■ Still, main problem is organizational, not technological 4 ◆ Despite clear success, 5 of those data sources have vanished ■ What to do? ■ How to insure that all our good work is not wasted? 2 / 16
  • 3. We’re Learning. Who’s Listenning? Examples Eg1: Text Mining NASA Eg1 (more) Eg2: Effort estimation Eg3: SILAP Eg3 (more) Eg4: static code Eg5: defect prediction Conclusion Examples Questions? Comments? 3 / 16
  • 4. Eg1: Text Mining NASA We’re Learning. Who’s Listenning? ■ 901 NASA records, PITS issue ■ Severity 2 predictors: Examples tracker: {severity, free text} 10*{(train,test) = (90,10)%} Eg1: Text Mining NASA N a=recall b=precision F = 2∗a∗b a+b Eg1 (more) Eg2: Effort estimation 100 0.81 0.93 0.87 Eg3: SILAP 50 0.80 0.90 0.85 severity frequency 25 0.79 0.93 0.85 Eg3 (more) 1 (panic) 0 12 0.74 0.92 0.82 Eg4: static code 2 311 6 0.71 0.94 0.81 Eg5: defect prediction 3 356 3 0.74 0.82 0.78 Conclusion 4 208 Questions? Comments? 5 (yawn) 26 Rules (from N=3 words): ■ Top 100 unique words (selected if (rvm ≤ 0) & (srs = 3) → sev=4 else if (srs ≥ 2) → sev=2 by Tf*IDF); sorted by InfoGain else → sev=3 ■ Rules learned from N best (RIP- ■ Diamonds in the dust PER Cohen [1995a] ◆ Not 9414 words total ◆ or 1662 unique words ◆ but 3 highly predictive words 4 / 16
  • 5. Top 3 words We’re Learning. Who’s Listenning? ■ In issue reports from four other projects: Examples ◆ 10*{(train,test) = (90,10)%} Eg1: Text Mining NASA Eg1 (more) ◆ yields probability of detection of highest severity class of 93% to 99.8%. Eg2: Effort estimation Eg3: SILAP ■ (Note: ignoring real rare classes.) Eg3 (more) Eg4: static code Eg5: defect prediction Project “b”: 984 records Project “d”: 180 records Conclusion Questions? Comments? a b c d <-- classified as a b c <-- classified as 1 380 0 0 | a = _4 157 23 0 | a = _4 1 520 0 0 | b = _3 pd=99.8% 9 121 0 | b = _3 pd=99.4% 0 59 0 0 | c = _5 6 1 0 | c = _5 0 23 0 0 | d = 2 Project “c”: 317 records Project “e”: 832 records a b c <-- classified as a b c d <-- classified as 9 121 0 | b = _3 pd=93.1% 0 23 0 0 | a = _2 157 23 0 | a = _4 0 498 0 18 | b = _3 pd=96.5% 6 1 0 | c = _5 0 34 0 7 | c = _5 0 178 0 65 | d = _4 5 / 16
  • 6. Eg2: Effort estimation We’re Learning. Who’s Listenning? ■ NASA COCOMO data (Boehm et al. [2000]) Examples ■ Results from IEEE TSE (Menzies et al. [2006]). Eg1: Text Mining NASA ■ learners for continuous classes Eg1 (more) Eg2: Effort estimation ■ A study of 160 effort estimation methods Eg3: SILAP ■ 20 * { pick any 10, train on remaining, test on 10 } Eg3 (more) Eg4: static code Eg5: defect prediction 100 ∗ (pred − actual)/actual Conclusion 50% percentile 65% percentile 75% percentile Questions? Comments? mode= embedded -9 26 60 project= X -6 16 46 all -4 12 31 year= 1975 -3 19 39 mode= semi-detached -3 10 22 ground systems -3 11 29 center= 5 -3 20 50 mission planning -1 25 50 project= gro -1 9 19 center= 2 0 11 21 year= 1980 4 29 58 avionics monitoring 6 32 56 median -3 19 39 ■ i.e. usually, very accurate estimates 6 / 16
  • 7. Eg3: SILAP: Early Life cycle Severity Detection We’re Learning. Who’s Listenning? ■ NASA defect data: 5 projects, (Menzies et.al 2007) Examples ■ SILAP: predict error {potential, consequence} from project description Eg1: Text Mining NASA Eg1 (more) Eg2: Effort estimation Derived Raw features Eg3: SILAP co = Consequence am =Artifact Maturity Eg3 (more) dv = Development as =Asset Safety ep = Error Potential cl =CMM Level Eg4: static code pr = Process cx =Complexity Eg5: defect prediction sc = Software Characteristic di =Degree of Innovation Conclusion do =Development Organization dt =Use of Defect Tracking System Questions? Comments? ex =Experience fr =Use of Formal Reviews hs =Human Safety pf =Performance ra =Re-use Approach rm =Use of Risk Management System ss =Size of System uc =Use of Configuration Management us =Use of Standards function CO( tmp) { tmp=0.35*AS + 0.65 *PF; return (round((HS) < tmp) ? HS : tmp) function EP() { return round(0.579*DV() + 0.249*PR() + 0.172*SC())} function SC() { return 0.547*CX + 0.351*DI + 0.102*SS } function DV() { return 0.828*EX + 0.172*DO } function PR() { return 0.226*RA + 0.242*AM + formality() } function formality() { return 0.0955*US+ 0.0962*UC+ 0.0764*CL + 0.1119*FR +0.0873*DT + 0.0647*RM} 7 / 16
  • 8. Eg3: 2F FSS + RIPPER + SILAP data (211 components on 5 projects) We’re Learning. Who’s severity predictions Listenning? a=pd, b=prec Examples x = 2ab/(a + b) Eg1: Text Mining NASA features |F | 12 3 4 5 X = x /4 Eg1 (more) Eg2: Effort estimation A all - L1 - L2 - group(6) 8 0.97 0.95 0.97 0.99 0.97 Eg3: SILAP B all - L1 - L2 - group(5 + 6) 7 0.95 0.94 0.97 0.96 0.96 Eg3 (more) Eg4: static code C all - L1 - L2 - group(4 + 5 + 6) 6 0.93 0.95 0.98 0.93 0.95 Eg5: defect prediction D all - L1 - L2 16 0.94 0.94 0.93 0.96 0.94 E all - L1 - L2 - group(3 + 4 + 5 + 6) 4 0.93 0.97 0.90 0.87 0.92 Conclusion F {co*ep, co, ep} 3 0.94 0.84 0.55 0.70 0.76 Questions? Comments? G L1 1 0.67 0.69 0.00 0.46 0.45 H just ”us” 1 0.64 0.60 0.00 0.00 0.31 I L2 1 0.57 0.00 0.32 0.00 0.22 rule 1 if uc ≥ 2 ∧ us = 1 then severity = 5 rule 2 else if am = 3 then severity = 5 rule 3 else if uc ≥ 2 ∧ am = 1 ∧ us ≤ 2 then severity = 5 rule 4 else if am = 1 ∧ us = 2 then severity = 4 Rules from “E”: rule 5 else if us = 3 ∧ ra ≥ 4 then severity = 4 rule 6 else if us = 1 then severity = 3 rule 7 else if ra = 3 then severity = 3 rule 8 else if true then severity = 1 or 2 8 / 16
  • 9. Eg4: Defect predictors from Static code measures We’re Learning. Who’s Listenning? ■ IEEE TSE (Menzies et al. [2006]) Examples ■ Modules from eight NASA projects, MDP, described using LOC, McCabe, Eg1: Text Mining NASA Halstead metrics Eg1 (more) Eg2: Effort estimation ■ New methods Eg3: SILAP Eg3 (more) ◆ Shoot-out between : Eg4: static code Eg5: defect prediction ■ Bayesian; Conclusion ■ simple rule learners; e.g. v(g) ≥ 10 Questions? Comments? ■ complex tree learners; C4.5 ◆ Simple pre-processor on the exponential numerics ■ num = log(num < 0.000001?0.000001 : num) ■ Prior state(s)-of-the-art, percentage of defects found: ◆ IEEE Metrics 2002 panel: manual software reviews finds ≈ 60% ◆ Raffo: industrial reviews finds TR(min,mod,max) = TR(35, 50, 65)% ◆ My old data mining experiments: prob {detection,false alarm}={36,17}% 9 / 16
  • 10. Eg5: Yet more detect prediction We’re Learning. Who’s Listenning? ■ (Song et al. [2006]) Examples ■ NASA SEL defect data: than 200 projects over 15 years. Eg1: Text Mining NASA ■ Predicting defects accuracy is very high (over 95%), Eg1 (more) Eg2: Effort estimation ■ false-negative rate is very low (under 3%). Eg3: SILAP ■ Wow. Eg3 (more) Eg4: static code Eg5: defect prediction Conclusion Questions? Comments? 10 / 16
  • 11. We’re Learning. Who’s Listenning? Examples Conclusion In summary... Why? What to do? Questions? Comments? Conclusion 11 / 16
  • 12. In summary... We’re Learning. Who’s Listenning? ■ Five NASA data sources Examples ◆ Eg #1: text mining a NASA issue database (PITS) Conclusion ◆ Eg #2: effort estimation from NASA data (COCOMO) In summary... Why? ◆ Eg #3: early life cycle severity prediction (SILAP) What to do? ◆ Eg #4: defect prediction from NASA static code data (MDP) Questions? Comments? ◆ Eg #5: defect prediction (NASA SEL) ■ All of which yield strong predictors for quality (effort, defects) ■ Only one of which is still active (PITS) ■ What went wrong? ■ What to do? 12 / 16
  • 13. Why is this Data Being Ignored? We’re Learning. Who’s ■ Group 1 : easy to explain Listenning? Examples ◆ NASA SEL : Technology used in case study #5 very new Conclusion ◆ PITS : In summary... Why? ■ Accessing PITS data was hard- required much civil servant support What to do? ■ No one was crazy enough to try text mining on unstructured PITS issue Questions? Comments? reports. ◆ SILAP : ■ Newest data set of all the above ■ Never explored before since not available before ■ Data collection stopped since IV&V business model changed (now focused on model-based early lifecycle validation). ■ Group 2 : harder to explain ◆ MDP : Much interest across the agency (at GRC, JSC) in MDP (and associated tools). ◆ COCOMO: well-documented, cheap to collect, many tools available ◆ Maybe the answer lies in NASA culture: ■ NASA’s centers compete for resources. ■ Reluctance to critically evaluate and share process information. 13 / 16
  • 14. What to do? We’re Learning. Who’s ■ Stop debating what data to collect Listenning? ◆ Many loosely-defined sources will do: COCOMO, SILAP, defect reports Examples Conclusion ■ Stop debating how to store data In summary... Why? ◆ Comma-separated or ARFF format or XML , one per component, is fine What to do? Questions? Comments? ■ Stop hiding data ◆ Create a central register for all NASA’s software components ◆ Register = component name and “part-of” (super-component) ◆ Features extracted from all components, stored at a central location ◆ All reports have anonymous join key to the central register ◆ Make the anonymous data open source (lever the data mining community) ■ Stop ignoring institutional data ◆ Active repository, not data tomb ◆ Success measure: not data in, but conclusions out ■ Stop publishing vague generalities ◆ Rather, publish general methods for building specific models ◆ Open research question: how much data is enough to learn local model? 14 / 16
  • 15. We’re Learning. Who’s Listenning? Examples Conclusion Questions? Comments? References Questions? Comments? 15 / 16
  • 16. References We’re Learning. Who’s ■ Eg1: text mining Listenning? Examples ◆ SEVERIS: T. Menzies and A. Marcus. Automated severity assessment of software defect reports. In Submitted to ICMS’08, 2008. Available from http://menzies.us/pdf/08severis.pdf Conclusion ◆ RIPPER: W.W. Cohen. Fast effective rule induction. In ICML’95, pages 115–123, 1995b. Available on-line Questions? Comments? from http://www.cs.cmu.edu/~wcohen/postscript/ml-95-ripper.ps References ■ Eg2: effort estimation ◆ Barry Boehm, Ellis Horowitz, Ray Madachy, Donald Reifer, Bradford K. Clark, Bert Steece, A. Winsor Brown, Sunita Chulani, and Chris Abts. Software Cost Estimation with Cocomo II. Prentice Hall, 2000 ◆ Tim Menzies, Zhihao Chen, Jairus Hihn, and Karen Lum. Selecting best practices for effort estimation. IEEE Transactions on Software Engineering, November 2006. Available from http://menzies.us/pdf/06coseekmo.pdf ■ Eg3: issue prediction ◆ SILAP: T. Menzies, M. Benson, K. Costello, C. Moats, M. Northey, and J. Richarson. Learning better ivv practices. Innovations in Systems and Software Engineering, March 2008. Available from http://menzies.us/pdf/07ivv.pdf ■ Eg4: static code ◆ Tim Menzies, Jeremy Greenwald, and Art Frank. Data mining static code attributes to learn defect predictors. IEEE Transactions on Software Engineering, January 2007. Available from http://menzies.us/pdf/06learnPredict.pdf ■ Eg5: SEL ◆ Qinbao Song, Martin Shepperd, Michelle Cartwright, and Carolyn Mair. Software defect association mining and defect correction effort prediction. IEEE Trans. Softw. Eng., 32(2):69–82, 2006. ISSN 0098-5589 16 / 16