9. Power of the Hybrid
[Courtesy of Murray Campbell]
10. Troubleshooting of ML Systems
training data
accuracy
test data
query
system
response
execution
data
In the lab
In the wild
What is the performance in the wild?
How does the system fail?
Why does the system fail?
How the system can be improved?
14. Where do Blind Spots Come From?
M
cats
dogs
cat
(conf = 0.96)
Unknown unknowns: Data points with confident but incorrect predictions.
Blind-spots: Feature spaces with high concentration of unknown unknowns
15. Blind-spots Detection
execution data
Beat the Machine
[Attenberg, Ipeirotis, Provost, 2011]
Exploration of Unknown Unknowns
[Lakkaraju, K., Caruana, Horvitz, 2011]
Step 1:
Descriptive
Space
Partitioning
execution data
Step 2:
Multi-armed
Bandit
based
Exploration