Learning for Testing Prioritization- an industrial case study

•

1 gefällt mir•1,496 views

Talk given by Ben Busjaeger, Software Engineering Architect at Salesforce, at FSE 2016: ACM SIGSOFT on November 2016 Modern cloud-software providers, such as Salesforce.com, increasingly adopt large-scale continuous integration environments. In such environments, assuring high developer productivity is strongly dependent on conducting testing efficiently and effectively. Specifically, to shorten feedback cycles, test prioritization is popularly used as an optimization mechanism for ranking tests to run by their likelihood of revealing failures. To apply test prioritization in industrial environments, we present a novel approach (tailored for practical applicability) that integrates multiple existing techniques via a systematic framework of machine learning to rank. Our initial empirical evaluation on a large realworld dataset from Salesforce shows that our approach significantly outperforms existing individual techniques.

Ingenieurwesen

Learning for Test Prioritization
An Industrial Case Study
Benjamin Busjaeger
Salesforce
Tao Xie
University of Illinois,
Urbana-Champaign
ACM SIGSOFT FSE 2016

Large Scale Continuous Integration
2K+
Modules
600K+
Tests
2K+
Engineers
500+
Changes/Day
Main repository ...
350K+
Source files

Motivation
Before Commit After Commit
Objective Reject faulty changes Detect test failures as quickly as possible
Current Run hand-picked tests Parallelize (8000 concurrent VMs) &
batch (1-1000 changes per run)
Problem Too complex for human Feedback time constrained by capacity &
batching complicates fault assignment
Desired Select top-k tests likely to fail Prioritize all tests by likelihood of failure

Insight: Need Multiple Existing Techniques
● Heterogeneous languages: Java, PL/SQL, JavaScript, Apex, etc.
●Non-code artifacts: metadata and configuration
●New/recently-failed tests more likely to fail
Test code coverage of change Textual similarity between test and change Test age and recent failures

Insight: Need Scalable Techniques
●Complex integration tests: concurrent execution
●Large data volume: 500+ changes, 50M+ test runs, 20TB+ results per day
→ Our approach: Periodically collect coarse-grained measurements
Test code coverage of change Textual similarity between test and change Test age and recent failures

Formal model for
learning from
past test results
New Approach: Learning for Test Prioritization
Test code coverage of change Textual similarity between test and change Test age and recent failures
Change
Test
Ranking
→ Implementation currently in pilot use at Salesforce

Empirical Study: Setup
●Test results of 45K tests for ~3 month period
●In this period, 711 changes with ≥1 failure
• 440 for training
• 271 for testing
●Collected once for each test:
• Test code coverage
• Test text content
●Collected continuously:
• Test age and recent failures

● New approach achieves highest
average recall at all cutoff points
• 50% failures detected with top 0.2%
• 75% failures detected with top 3%
Results: Test selection (before commit)
New Approach

● New approach achieves highest
APFD with least variance
• Median: 85%
• Average: 99%
Results: Test prioritization (after commit)
New Approach

Results: Side Benefits
Invalid Assignments
Flaky Tests

Summary
● Main insights gained in conducting test prioritization in industry
● Novel learning-based approach to test prioritization
● Implementation currently in pilot use at Salesforce
● Empirical evaluation using a large Salesforce dataset
● Ongoing work: add features, word2vec, deep learning, cost-aware

Weitere ähnliche Inhalte

Andere mochten auch

1 gerir o teu dinheiromaladigitalmourao

Group Mentoring Session 12: Dealing with DifficultiesApostle Dr. Norma Gray

Summit2013 choi - wise kb-introdSemantic Technology Institute International

nature of learningvigneswari1161996

Transferring Software Testing Tools to PracticeTao Xie

Best Philosophy of EducationKaplan University

The learning process- Fundamentals of InstructionHolmes Aviation Training

The learning processKerry Harrison

Drive test learningKshitij Verma

Learning Process Theories Malyn Singson

Learning presentationLara Melissa Cariño

The Teaching Learning Process: Intro, Phases, Definitions, Theories and Model...Monica P

Teaching and Learning ProcessJaser Daher

Philosophy of educationajtame

Philosophies of educationSherwin Balbuena

Major philosophies in educationBogs De Castro

Philosophy of educationDavid Deubelbeiss

Andere mochten auch (17)

1 gerir o teu dinheiro

Group Mentoring Session 12: Dealing with Difficulties

Summit2013 choi - wise kb-introd

nature of learning

Transferring Software Testing Tools to Practice

Best Philosophy of Education

The learning process- Fundamentals of Instruction

The learning process

Drive test learning

Learning Process Theories

Learning presentation

The Teaching Learning Process: Intro, Phases, Definitions, Theories and Model...

Teaching and Learning Process

Philosophy of education

Philosophies of education

Major philosophies in education

Philosophy of education

Mehr von Salesforce Engineering

Locker Service Ready Lightning Components With WebpackSalesforce Engineering

Scaling HBase for Big DataSalesforce Engineering

Techniques to Effectively Monitor the Performance of Customers in the CloudSalesforce Engineering

Predictive System Performance Data AnalysisSalesforce Engineering

Apache HBase State of the ProjectSalesforce Engineering

Hit the Trail with TrailheadSalesforce Engineering

HBase/PHOENIX @ ScaleSalesforce Engineering

Scaling up data science applicationsSalesforce Engineering

Containers and Security for DevOpsSalesforce Engineering

Aspect Oriented Programming: Hidden Toolkit That You Already HaveSalesforce Engineering

Monitoring @ Scale in SalesforceSalesforce Engineering

Performance Tuning with XHProfSalesforce Engineering

A Smarter Pig: Building a SQL interface to Pig using Apache CalciteSalesforce Engineering

Implementing a Content Strategy Is Like Running 100 MilesSalesforce Engineering

Salesforce Cloud Infrastructure and Challenges - A Brief OverviewSalesforce Engineering

Koober Preduction IO PresentationSalesforce Engineering

Finding Security Issues Fast!Salesforce Engineering

MicroservicesSalesforce Engineering

Global State Management of Micro ServicesSalesforce Engineering

The Future of HbaseSalesforce Engineering

Mehr von Salesforce Engineering (20)

Locker Service Ready Lightning Components With Webpack

Scaling HBase for Big Data

Techniques to Effectively Monitor the Performance of Customers in the Cloud

Predictive System Performance Data Analysis

Apache HBase State of the Project

Hit the Trail with Trailhead

HBase/PHOENIX @ Scale

Scaling up data science applications

Containers and Security for DevOps

Aspect Oriented Programming: Hidden Toolkit That You Already Have

Monitoring @ Scale in Salesforce

Performance Tuning with XHProf

A Smarter Pig: Building a SQL interface to Pig using Apache Calcite

Implementing a Content Strategy Is Like Running 100 Miles

Salesforce Cloud Infrastructure and Challenges - A Brief Overview

Koober Preduction IO Presentation

Finding Security Issues Fast!

Microservices

Global State Management of Micro Services

The Future of Hbase

Kürzlich hochgeladen

(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat

Introduction and different types of Ethernet.pptxupamatechverse

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor

College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234

Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR9953056974 Low Rate Call Girls In Saket, Delhi NCR

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N

UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...ranjana rawat

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile

(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEslot gacor bisa pakai pulsa

Kürzlich hochgeladen (20)

(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...

Introduction and different types of Ethernet.pptx

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik

Microscopic Analysis of Ceramic Materials.pptx

Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts

(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...

(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE

Learning for Testing Prioritization- an industrial case study

1. Learning for Test Prioritization An Industrial Case Study Benjamin Busjaeger Salesforce Tao Xie University of Illinois, Urbana-Champaign ACM SIGSOFT FSE 2016

2. Large Scale Continuous Integration 2K+ Modules 600K+ Tests 2K+ Engineers 500+ Changes/Day Main repository ... 350K+ Source files

3. Motivation Before Commit After Commit Objective Reject faulty changes Detect test failures as quickly as possible Current Run hand-picked tests Parallelize (8000 concurrent VMs) & batch (1-1000 changes per run) Problem Too complex for human Feedback time constrained by capacity & batching complicates fault assignment Desired Select top-k tests likely to fail Prioritize all tests by likelihood of failure

4. Insight: Need Multiple Existing Techniques ● Heterogeneous languages: Java, PL/SQL, JavaScript, Apex, etc. ●Non-code artifacts: metadata and configuration ●New/recently-failed tests more likely to fail Test code coverage of change Textual similarity between test and change Test age and recent failures

5. Insight: Need Scalable Techniques ●Complex integration tests: concurrent execution ●Large data volume: 500+ changes, 50M+ test runs, 20TB+ results per day → Our approach: Periodically collect coarse-grained measurements Test code coverage of change Textual similarity between test and change Test age and recent failures

6. Formal model for learning from past test results New Approach: Learning for Test Prioritization Test code coverage of change Textual similarity between test and change Test age and recent failures Change Test Ranking → Implementation currently in pilot use at Salesforce

7. Empirical Study: Setup ●Test results of 45K tests for ~3 month period ●In this period, 711 changes with ≥1 failure • 440 for training • 271 for testing ●Collected once for each test: • Test code coverage • Test text content ●Collected continuously: • Test age and recent failures

8. ● New approach achieves highest average recall at all cutoff points • 50% failures detected with top 0.2% • 75% failures detected with top 3% Results: Test selection (before commit) New Approach

9. ● New approach achieves highest APFD with least variance • Median: 85% • Average: 99% Results: Test prioritization (after commit) New Approach

10. Results: Side Benefits Invalid Assignments Flaky Tests

11. Summary ● Main insights gained in conducting test prioritization in industry ● Novel learning-based approach to test prioritization ● Implementation currently in pilot use at Salesforce ● Empirical evaluation using a large Salesforce dataset ● Ongoing work: add features, word2vec, deep learning, cost-aware

12. Thank you Q&A

13. Summary ● Main insights gained in conducting test prioritization in industry ● Novel learning-based approach to test prioritization ● Implementation currently in pilot use at Salesforce ● Empirical evaluation using a large Salesforce dataset ● Ongoing work: add features, word2vec/deep learning, cost-aware

Hinweis der Redaktion

coverage: periodically collected (slightly out-of-date), dynamic instrumentation (JaCoCo, no rebuild), approximate proximity

Learning for Testing Prioritization- an industrial case study

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Andere mochten auch

Andere mochten auch (17)

Mehr von Salesforce Engineering

Mehr von Salesforce Engineering (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Learning for Testing Prioritization- an industrial case study

Hinweis der Redaktion