The document presents complementary metrics for supporting release decisions based on model-based testing. It describes using an operational profile to generate test cases, calculating model coverage metrics, using a reliability demonstration chart to assess risk, and measuring relative proximity to compare expected and actual failure rates. Case studies apply these methods to a word processing application and a missile defense system. Key observations are that model coverage indicates minimal sufficiency of testing, reliability demonstration charts assume flat profiles and so may be optimistic, and relative proximity indicates whether operation-specific failure intensities match expectations.
1. The Tester’s Dashboard: Release Decision Support
Robert V. Binder
System Verification Associates, LLC
rbinder@ieee.org
Peter B. Lakey
Cognitive Concepts, Inc.
peterlakey@sbcglobal.net
2. Overview
• Complementary metrics for release decision support
– Model-based testing
• Operational profile
• Model coverage metrics
– Reliability Demonstration Chart
– Relative Proximity
• Case Study
• Observations
5. Model-based Reliability Estimation
• Test suites must be
– Proportional to operational profile
– Sequentially feasible
– Input feasible
• Approach
– Markov model
– Monte Carlo simulation
– Post-run analytics
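The approach above can be sketched as a Monte Carlo walk over a Markov usage model, followed by a post-run check that the generated suite is proportional to the operational profile. The model, state names, and probabilities below are illustrative assumptions, not taken from the case study:

```python
import random

# Toy Markov usage model (hypothetical): each state maps to
# {next_state: probability}; "End" is absorbing.
USAGE_MODEL = {
    "Start": {"A": 0.7, "B": 0.2, "C": 0.1},
    "A": {"End": 1.0},
    "B": {"End": 1.0},
    "C": {"End": 1.0},
}

def choose(dist, rng):
    """Sample a key from a {key: probability} map; falls back to the
    last key to guard against floating-point rounding."""
    r, acc = rng.random(), 0.0
    for key, p in dist.items():
        acc += p
        if r < acc:
            return key
    return key

def generate_test(model, rng, start="Start", end="End"):
    """Monte Carlo walk from start to end: one sequentially feasible
    test sequence, drawn in proportion to the operational profile."""
    state, path = start, []
    while state != end:
        state = choose(model[state], rng)
        path.append(state)
    return path

rng = random.Random(1)
suite = [generate_test(USAGE_MODEL, rng) for _ in range(10_000)]

# Post-run analytics: check the suite is proportional to the profile.
freq = {op: sum(t[0] == op for t in suite) / len(suite) for op in "ABC"}
print(freq)  # each frequency close to its profile probability
```

Because each test is a path through the model, sequential feasibility is enforced by construction; input feasibility would additionally require binding concrete inputs to each step.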
6. Model Coverage Metrics
• % States reached
• % State-transitions reached
[Figure: the usage profile drives the system under test; a trigger activates a latent defect in the software system (process space / data space), producing an observed system failure]
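Both percentages can be computed from executed test paths against the model's inventory of states and transitions. A minimal sketch, reusing the same kind of toy state model (all names hypothetical):

```python
# Model inventory (hypothetical): the states and state-transitions a
# test suite is expected to reach.
MODEL_STATES = {"Start", "A", "B", "C", "End"}
MODEL_TRANSITIONS = {("Start", "A"), ("Start", "B"), ("Start", "C"),
                     ("A", "End"), ("B", "End"), ("C", "End")}

def coverage(test_suite):
    """Each test is the list of states it visited after 'Start'.
    Returns (% states reached, % state-transitions reached)."""
    states, transitions = {"Start"}, set()
    for test in test_suite:
        path = ["Start"] + test
        states.update(path)
        transitions.update(zip(path, path[1:]))
    return (len(states & MODEL_STATES) / len(MODEL_STATES),
            len(transitions & MODEL_TRANSITIONS) / len(MODEL_TRANSITIONS))

state_cov, transition_cov = coverage([["A", "End"], ["B", "End"]])
print(state_cov, transition_cov)  # 0.8 0.666... (C never reached)
```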
8. Relative Proximity
• Kullback–Leibler Distance
– Information theoretic
– Characterizes the difference in variation between the message population E (expected) and the sample A (actual) as “relative entropy”
KLD = ∑ᵢ Aᵢ · log₂(Aᵢ / Eᵢ)
• Relative Proximity
– KLD math doesn’t work unless failures are modeled (the sum of the actuals must be 1.0)
– Assume the target failure rate is aggregate
– Allocate failure rate in proportion to each operation
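The formula is a direct sum over modes; a minimal sketch, where the convention that zero-count actuals contribute 0 is an assumption matching the worked table on the next slide:

```python
from math import log2

def kl_distance(actual, expected):
    """Relative entropy: sum over i of A_i * log2(A_i / E_i).
    Terms with A_i == 0 contribute 0 (the limit of x*log2(x))."""
    return sum(a * log2(a / e) for a, e in zip(actual, expected) if a > 0)

# Identical distributions give zero distance; it grows with mismatch.
print(kl_distance([0.7, 0.2, 0.1], [0.7, 0.2, 0.1]))  # 0.0
```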
9. Profile Explicit Failure Modes
• Assume a maximum acceptable failure intensity of 1 in 10,000

Operation  Mode  Standard Profile  Explicit Failure Profile  Expected Number, 10,000 Tests
A          Pass  0.7               0.6993                    6993
B          Pass  0.2               0.1998                    1998
C          Pass  0.1               0.0999                    999
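The construction above allocates an aggregate failure rate to each operation in proportion to its probability, so failures become explicit modes of the profile. In the sketch below, F = 0.001 is the per-test failure probability consistent with the expected counts shown (7 + 2 + 1 failures in 10,000 tests); it is an assumption derived from the table, not a stated input:

```python
# Profile-explicit failure modes: split each operation's probability
# into a pass mode and a fail mode, in proportion to F.
STANDARD_PROFILE = {"A": 0.7, "B": 0.2, "C": 0.1}
F = 0.001   # aggregate per-test failure probability (assumed from the table)
N = 10_000  # number of tests

explicit = {op: {"pass": p * (1 - F), "fail": p * F}
            for op, p in STANDARD_PROFILE.items()}
for op, modes in explicit.items():
    print(op, round(modes["pass"], 4),
          round(modes["pass"] * N), round(modes["fail"] * N))
```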
10. Profile Explicit Failure Modes

Operation  Mode  Expected  Actual  KL Distance  Actual  KL Distance
A          Pass  6993      7000    10.104       6990    -4.327
A          Fail  7         0       0.000        10      5.146
B          Pass  1998      1990    -11.518      2000    2.887
B          Fail  2         10      23.219       0       0.000
C          Pass  999       980     -27.149      994     -7.195
C          Fail  1         20      86.439       6       15.510
Total            10000     10000   81.094       10000   12.020
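The KL Distance entries for the first Actual column above can be reproduced term by term from the relative-entropy formula, with zero-count actuals contributing 0. A sketch:

```python
from math import log2

# Expected and actual counts per (operation, mode), from the table.
expected = {("A", "Pass"): 6993, ("A", "Fail"): 7,
            ("B", "Pass"): 1998, ("B", "Fail"): 2,
            ("C", "Pass"): 999,  ("C", "Fail"): 1}
actual   = {("A", "Pass"): 7000, ("A", "Fail"): 0,
            ("B", "Pass"): 1990, ("B", "Fail"): 10,
            ("C", "Pass"): 980,  ("C", "Fail"): 20}

# Each term is A_i * log2(A_i / E_i); A_i == 0 contributes 0.
terms = {k: (actual[k] * log2(actual[k] / expected[k]) if actual[k] else 0.0)
         for k in expected}
total = sum(terms.values())
print(round(terms[("C", "Fail")], 3), round(total, 3))  # 86.439 81.094
```

Note how the small fail modes dominate: 20 failures of operation C against an expectation of 1 contribute 86.439 of the 81.094 total.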
• Relative Proximity indicates the difference between expected and actual failure rates
• Many possible operation failure rates with better
or worse fidelity
• RDC is based on the aggregate FIO (failure intensity objective), so it is not sensitive to per-operation variance
11. Case Studies
• Stochastic Models
• Assumed Failure Rates
• Word Processing Application
• Ground-Based Midcourse Missile Defense
16. Observations
• Model coverage indicates minimal sufficiency
– Wouldn’t release without all state-transition pairs covered
– Stochastic generation can take a long time to achieve this
– Cover with N+ first
• RDC assumes “flat” profile
– With sequential constraints, may be optimistic
– Strength is explicit risk-adjustment
• Relative Proximity will indicate when operation-specific Failure Intensity is as expected (or not)