SlideShare a Scribd company logo
Walking the Talk: Random
Exploration of a Chatbot API
James Thomas
British Computer Society 2022-11-16
@qahiccupps
www.associationforsoftwaretesting.org
@qahiccupps
www.ada.com
@qahiccupps
supported by
@qahiccupps
Project: Extract a Basic Chatbot API
@qahiccupps
@qahiccupps
The Task
Presentation-agnostic API.
Integration with text-based clients.
Turn-based medical assessment.
Short deadline.
@qahiccupps
Risks, Questions, Test Ideas
@qahiccupps
Covering the Assessment Space
Unintended consequences.
Discover edges and corners.
Exercise the API extensively.
@@qahiccupps
A Walker
@qahiccupps
The idea
Run unattended medical assessments.
Can be iterated and customised quickly.
Identify places for deeper inspection.
@qahiccupps
Start
“OK”
Random
Choose
randomly
Card
type?
More
turns?
Stop
yes
no
text
input
choice
The Simplest Thing That Could Possibly Work
@qahiccupps
“Welcome to
Ada.”
“What is your
name?”
“Which option,
A, B, or C?”
✅❌
@@qahiccupps
✅❌
✅❌
✅❌
✅❌
❓
❓
❓
❓
❓ ❓ ❓
Navigate Explore
Survey
Demonstration
@qahiccupps
Implementation
@qahiccupps
Configuration
@qahiccupps
Another
symptom
~1/5 turns
Detailed, Parsable Logs
@qahiccupps
Archive
config
Archive
state
Parallel
@qahiccupps
Long Assessments
500 Server Error ❌
❌
Back end
Back end
Back end
❌
@qahiccupps
Reflection
@qahiccupps
@qahiccupps
@qahiccupps
@qahiccupps
GREAT SHOT KID,
THAT WAS ONE
IN A MILLION
@qahiccupps
YEAH!
I TOOK A MILLION
SHOTS
Questions?
@qahiccupps
What Worked Well
Verbose logs.
Asserting generally and on fixes.
Randomisation for unknowns.
Configuration for directed exploration.
Toolkit (replay, parallel, analysis, …).
Question-driven development.
@qahiccupps
Code changes and dependencies.
Card identifiers.
Medical testing.
State.
No explicit model.
What Worked Well … and What was Challenging
References
Ada screenshots: https://www.uisources.com/explainer/ada-diagnosing-via-chat-bot
Wipotec: https://www.wipotec-ocs.com/en/product-inspection/
Microscope: https://londonlaboquip.com/product/microscope-binocular-biological-sc302
Messy lab: https://imgur.com/gallery/bQiK6
Dice: https://www.richardhughesjones.com/luck-randomness/dice-gif/
Altwalker: https://altom.gitlab.io/altwalker/altwalker/
Star Wars: https://www.starwars.com/video/one-in-a-million-shot
@qahiccupps

More Related Content

Similar to James_Thomas_Walk_The_Talk_BCS.pptx

Seminar report on captcha
Seminar report on captchaSeminar report on captcha
Seminar report on captcha
kunalkiit
 

Similar to James_Thomas_Walk_The_Talk_BCS.pptx (13)

When Support Calls
When Support CallsWhen Support Calls
When Support Calls
 
Leveling up your JavaScipt - DrupalJam 2017
Leveling up your JavaScipt - DrupalJam 2017Leveling up your JavaScipt - DrupalJam 2017
Leveling up your JavaScipt - DrupalJam 2017
 
Travel Hacking 101: The ROI of Hackathons
Travel Hacking 101: The ROI of HackathonsTravel Hacking 101: The ROI of Hackathons
Travel Hacking 101: The ROI of Hackathons
 
CoffeeScript: The Good Parts
CoffeeScript: The Good PartsCoffeeScript: The Good Parts
CoffeeScript: The Good Parts
 
AppSec Pipelines and Event based Security
AppSec Pipelines and Event based SecurityAppSec Pipelines and Event based Security
AppSec Pipelines and Event based Security
 
JavaScript isn't evil.
JavaScript isn't evil.JavaScript isn't evil.
JavaScript isn't evil.
 
WAPWG Clark defining capturing_web-based_if
WAPWG Clark defining capturing_web-based_ifWAPWG Clark defining capturing_web-based_if
WAPWG Clark defining capturing_web-based_if
 
Alejandro Saucedo Presentation on IWMC 2015
Alejandro Saucedo Presentation on IWMC 2015Alejandro Saucedo Presentation on IWMC 2015
Alejandro Saucedo Presentation on IWMC 2015
 
Coding a SaaS
Coding a SaaSCoding a SaaS
Coding a SaaS
 
Leaping Forward: Finding The Future of Your API Docs
Leaping Forward: Finding The Future of Your API DocsLeaping Forward: Finding The Future of Your API Docs
Leaping Forward: Finding The Future of Your API Docs
 
Creating Your MVP (or Startup Validation Hacks)
Creating Your MVP (or Startup Validation Hacks)Creating Your MVP (or Startup Validation Hacks)
Creating Your MVP (or Startup Validation Hacks)
 
Y Pipes Mashup Camp
Y Pipes Mashup CampY Pipes Mashup Camp
Y Pipes Mashup Camp
 
Seminar report on captcha
Seminar report on captchaSeminar report on captcha
Seminar report on captcha
 

More from James Thomas

Your Testing is a Joke
Your Testing is a JokeYour Testing is a Joke
Your Testing is a Joke
James Thomas
 

More from James Thomas (12)

How to Test Anything
How to Test AnythingHow to Test Anything
How to Test Anything
 
We Don't Know?
We Don't Know?We Don't Know?
We Don't Know?
 
People problems
People problemsPeople problems
People problems
 
Testing vs Chicken
Testing vs ChickenTesting vs Chicken
Testing vs Chicken
 
James thomas
James thomasJames thomas
James thomas
 
Theoreticus Prime vs Praktikertron
Theoreticus Prime vs PraktikertronTheoreticus Prime vs Praktikertron
Theoreticus Prime vs Praktikertron
 
Testing All the Way Down, and Other Directions
Testing All the Way Down, and Other DirectionsTesting All the Way Down, and Other Directions
Testing All the Way Down, and Other Directions
 
What is What is Professional Testing?
What is What is Professional Testing?What is What is Professional Testing?
What is What is Professional Testing?
 
Bug-Free Software? Go For It!
Bug-Free Software? Go For It!Bug-Free Software? Go For It!
Bug-Free Software? Go For It!
 
Your Testing is a Joke
Your Testing is a JokeYour Testing is a Joke
Your Testing is a Joke
 
You're Having a Laugh
You're Having  a LaughYou're Having  a Laugh
You're Having a Laugh
 
It's Like That
It's Like ThatIt's Like That
It's Like That
 

Recently uploaded

Recently uploaded (20)

Buy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfBuy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Connecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAKConnecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAK
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 

James_Thomas_Walk_The_Talk_BCS.pptx

Editor's Notes

  1. **Navigating** Error handling Consistency of API E.g. Male only assessments - compromise test code to get a walking skeleton; POST vs GET, two similar but slightly different schemas in the API itself. These activities make me ask questions… what if I …? How about when? Could it possibly be that …? Problems with the walker and with the product. While automating I’m testing. Don’t be too quick to restrict to what you think the system wants. Can I get from start to finish? What assumptions are required? What workarounds are required? How might developers struggle here? … ** Checking** E.g. certain kinds of dialog turns have different properties to assert on - keys in DTO must be present, or in some kind of relationship. What can I assert specifically and generally? Where are the edge cases? (e.g. by general global assertions failing) What are the error cases? How valuable is it to check these things here? ** Exploring** When you explore you don’t know if you’ll find anything, and if you find something you won’t know whether it’s relevant, and if it’s relevant you won’t know whether it’s important. Code is a tool and a toolkit. Extend it to the next question you have. (Can I get to “call ambulance” outcomes? What would need to happen to do that? How could I avoid it?) Log paths and outcomes. Analyse outside the code for patterns (seen and missing) Don’t error check too heavily. Catch and investigate failures. (Expose assumptions) Check some positive cases by hand. Check failures by hand. Look for patterns. (Don’t have to catch all failures; TODO outcomes are fine because then you can filter them in later analysis.) Explore the data you produce. Replay for repeatability. Configuration to guide the direction of exploration. (Initially, I’d just hack the code) How can I take the extreme choice each time? How can I make the longest assessment? Can I run an assessment for ever? A toolkit to gather data … … for human analysis. Failures are targets. Patterns are indicators. …
  2. Parallel nothing clever - just run two or more copies!