SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Downloaden Sie, um offline zu lesen
AI engineering
New challenges in system and software
architecting and managing lifecycle for AI-
based systems
ECSA 2020 14-18 Sept, L’Aquila
Ivica Crnkovic, ivica.crnkovic@chalmers.se
9/16/20
Chalmers 3
Types of Machine Learning
SUPERVISED UNSUPERVISED REINFORCEMENT
Task Driven
predict next value,
classification
Data Driven
Identify clusters
Learn From
Mistakes
Trial and Error
4
95% of ML
9/16/20
Chalmers 6
AI as effective as medical specialists at diagnosing diseas
AI in transportation
• Self-driving vehicles, Traffic management, delay predictions…
AI and NLP (Natural Language processes)
Recognition
Traffic Medicine Robotics
Finance
Health Space
AI success Stories
virtually no area that does not (plan to) use AI
9/16/20
AI failures 2018
https://medium.com/syncedreview/2018-in-review-10-ai-failures-c18faadf5983
2020-09-16
Chalmers University of
Technology
8
https://www.reuters.com/article/us-amazon-com-jobs-automation-insight/amazon-scraps-secret-ai-recruiting-tool-that-showed-bias-against-women-idUSKCN1MK08G
Amazon AI recruiting
tool is gender biased
BiggestAI failures 2018AI failures 2018
2020-09-16
Chalmers University of
Technology
9
AI failures 2018AI failures 2018
AI World Cup 2018 - predictions almost all wrong J
2020-09-16 10
Why these
failures?
What are the
challenges inAI
development?
9/16/20 11
• Non technical character
• Who is owner of data?
• What are the ethical aspects of using data?
• What can we allow a machine to decide?
• How do interpret the results from AI models?
New questions in AI Development
• Technical nature
• How to efficiently collect, store, process, analyse, and present data?
• How to efficiently build the AI-based systems?
• How to ensure enough resources (computation, storage, timing)
• How to ensure dependability/trustworthy of such systems?
• What system and software architecture are required for AI-systems?
• WHAT KIND OF SOFTWARE ENGINEERING SUPPORT IS NEEDED?
9/16/20 12
ML Development cycle
assemble
datasets
create
models
train and
evaluate deployment
Life cycle
ML Code –
Create model
Train model
Machine Learning: The High-Interest Credit Card of Technical Debt, D. Sculley, G. Holt, D. Golovin, E. Davydov, T. Phillips, D. Ebner, V. Chaudhary, M. Young
9/16/20
AI-System - system architecture (example)
https://www.youtube.com/watch?v=XEHG2oYdsiU
Collecting data
Analyzing Data
Controlling system
Sending data
Updating system
System evolution
Example: vehicular system (a Cyber-physical system)
Architecture of a car control system
Complex services
– distributed components
Sensor 1
Sensor 2
Sensor 3
Sensor ..
Network
resources
++++++++++
++++++++++
++++++++++
Sensor ..
Execution
resources
Node 1
Node 2
Node 3
Node …
Node …
Actuator 1
Actuator 2
Actuator 3
Actuator …
Actuator …
ECU ECU ECU
C1 C2
Shared resources
Challenges:
• Real-time requirements
• Shared resources
• Resource constraints
Software architecture
9/16/20 17
Platform
Middleware
Subsystem A
HW
Subsystem B Subsystem C
C1
C2
C3
C4
C3
Subsystem and components
Component-based and service-based approach
- Components
- Encapsulation of data
- Encapsulation of functionality
- Dependency between components defined and controlled
MP
9/16/20
18
AI-System - system architecture (example)
Platform
Middleware
Subsystem A
HW
Subsystem B Subsystem C
C1
C2
C3
C4
C3
Subsystem and components
Components – black boxes
- Components
- Encapsulation of AI-based functionality
- Dependency between components defined and controlled
- What about AI code and data?
MP
9/16/20
AI-System - system architecture (example)
https://www.youtube.com/watch?v=XEHG2oYdsiU
Collecting more data
Analyzing more data
More computation
Control system
Further data
processing
Sending data
For diagnostic
And for ML
Updating system
AI-based vehicular system System evolutionCollecting data
System training
- large amounts of data processing
- large computational requirements
With AI the existing architecture is not feasible
CPU GPU
FPGA
EDGE computing
Cloud
computing
Challenges
- New computation models
- Heterogenous platform
- Edge sensor vs. edge centralized level vs. cloud computing
- Development process
- separation of training process from operational process
- New monitoring
- New quality attributes and their metrics
- Migration process from the existing architecture
CPU GPU
FPGA
Cloud
computing
EDGE computing
9/16/20 22
AI- based components – fundamental problems
C1
controlled
Data – used for ML
Controlled?
• The results depend not only
on the algorithms and controlled data
but also on uncontrolled/unknown data
• The AI-based functions are not continuous:
small change of data can cause big changes
• Due to data impact on the model
many new challenges appear
2020-09-16 23
1. Entanglement (Data fusion)
2. Data dependencies
• Unstable data dependencies
• Underutilized Data Dependencies
3. Hidden Feedback Loops
4. Undeclared Consumers
5. Correction Cascades
Examples of Data-related challenges
DATA
DATA
DATA
AI-component
AI-component
DATA
DATA
DATA
AI-component
AI-component
Not needed
DATA
DATA
AI-component
DATA
DATA
AI-component
C1
AI-component
C2DATA
DATA
DATA
AI-component
AI-component
9/16/20 Chalmers 31
• Heterogeneity of data (different formats, accuracy, semantics) and use of
standard ML functions require a lot Glue code
• 95% of code in AI-based systems is a glue-code (empirical data)
• Requires
• Frequent refactoring of code
• Re-implementing AI models
• Pipeline Jungles
• ML-friendly format data become a jungle of scrapes, joins, and sampling steps,
intermediate files
• Requires – a close team work of data and domain engineers
System-design anti-patterns (1)
9/16/20 Chalmers 32
Dead Experimental Code paths
• AI solution requires a lot of experimentation
• A lot of code that will not be used later
• Problems
• Dead code
• Version management – how to preserve useful configuration
branches, and remove unnecessary
System-design anti-patterns (2)
9/16/20 Chalmers 33
Managing Changes in the External World
System
Cloud Data
Local Data
Local Data
Local Data
Historical data
Used in ML
Continuous change of data
Challenge:
models and system behaviour
dependent of data
Examples:
- Threshold changes
- Correlation between data
Requirements: Continuous monitoring of data and system. Continuous test.
35
Study : Overview of industrial ML systems
Case studies
A. Project EST: Real Estate Valuation
B. Project OIL: Predicting Oil and Gas Recovery Potential
C. Project RET: Predicting User Retention
D. Project WEA: Weather Forecasting
E. Project CCF: Credit Card Fraud Detection
F. Project BOT: Poker Bot Identification
G. Project REC: Media Recommendations
Software Engineering Challenges of Deep Learning, Euromicro SEAA 2018, A. Arpteg, B. Brinne, l. Crnkovic-Friis, J. Bosch
Which challenges are the most difficult
in development of AI-based systems?
2020-09-16 Chalmers University of Technology 37
Study: Engineering challenges of DL
■Dependency
Management
◆Glue code and supporting systems
●Unintentional feedback loops
●Monitoring and logging
●Troubleshooting
●Testing ●Limited transparency
●Experimental management
■ Organisational challenges
● Production/operation challenges
◆ Development challenges
◆Effort estimation
■Cultural differences
Hard to solve
Easier to solve
Lower business impact High business impact
●Resource Limitations
◆Privacy and Safety/security
Experiment
prototyping
Non-critical
deployment
Critical deployment Cascading deployment
Assemble
dataset
Issues with problem
formulation and
specifying desired
outcome
Data silos, scarcity of
labelled data, imbalanced
training set
Limitations in techniques for
gathering training data from
large-scale, non-stationary
data streams
Complex and effects of data
dependencies
Create
model
Use of non-
representative dataset,
data drifts
No critical analysis of
training data
Difficulties in building highly
scalable ML pipeline
Entanglements causing
difficulties in isolating
improvements
Train and
evaluate
model
Lack of
well-established ground
truth
No evaluation of models
with business-centric
measures
Difficulties in reproducing
models, results and debugging
DL models
Need of techniques for sliced
analysis in final model
Deploy
model No deployment
mechanism
Training- serving
skew
Adhering to stringent serving
requirements e.g., of latency,
throughput
Hidden feedback-loops and
undeclared consumers of the
models
LE Lwakatare, A Raj, J Bosch, HH Olsson, I Crnkovic, A taxonomy of software engineering challenges for machine learning systems:
An empirical investigation - International Conference on Agile Software …, 2019
The challenges in evolution of development and use ML components
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint)
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint)
Data quality
management
Design
methods and
processes
Model
performance
Deployment
and
compliance
Data versioning &
dependency
mgmt
DataOps/DevOps Automated data
labeling
Federated
learning
infrastructure
Reuse of pre-
developed ML
models
Manage multiple
models
Deployment
infrastructure
Quality attributes A/B testing of
models
Storage and
computing
infrastructure
Integration of
models and SW
components
Monitoring &
logging
Strategic focus Infrastucture Development Process
AI Engineering challenges
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint)
Data quality
management
Design
methods and
processes
Model
performance
Deployment
and
compliance
Federated collection &
storage of data
Federated ML models
Heterogenous HW
platforms
Transfer learning
Strategic focus Cyber-physical
systems
Safety-critical
systems
Autonomously
Improving systems
Data trail
Explainable models
reproducibility
Safety validation
Data generation for ML
Automated
experimentation
Exploration vs.
exploratiom
Online evaluation
Domain-specific AI Engineering challenges
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint)
43
DevOps
ML Workflow
Example – Process integration
Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
44
Conceptual ML Workflow and DevOps Process Integration
Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
9/16/20 Chalmers 46
Excitement of a researcher
ML/DL introduces many new challenges for software
• On addition to software evolution there is data evolution
• Context change – in behaviour and in data
• New (distributed) system configuration
• Continuous integration, continuous deployment
• Continuous training
Many companies will not be able to cope with new types of complexity
Many opportunities for researchers!
Conclusion
2020 09-16-ai-engineering challanges

Weitere ähnliche Inhalte

Was ist angesagt?

Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Analytics India Magazine
 
Data science for business decisions (public)
Data science for business decisions (public)Data science for business decisions (public)
Data science for business decisions (public)Lars Ahlfors
 
MedPath Designer: A Process-based Modeling Language for Designing Care Pathways
MedPath Designer: A Process-based Modeling Language for Designing Care PathwaysMedPath Designer: A Process-based Modeling Language for Designing Care Pathways
MedPath Designer: A Process-based Modeling Language for Designing Care PathwaysObeo
 
Semantic Web for Advanced Engineering
Semantic Web for Advanced EngineeringSemantic Web for Advanced Engineering
Semantic Web for Advanced EngineeringMarta Sabou
 
Semantic Web Technologies for Intelligent Engineering Applications
Semantic Web Technologies for  Intelligent Engineering ApplicationsSemantic Web Technologies for  Intelligent Engineering Applications
Semantic Web Technologies for Intelligent Engineering ApplicationsMarta Sabou
 
Resume_20160610_MarcusGohm
Resume_20160610_MarcusGohmResume_20160610_MarcusGohm
Resume_20160610_MarcusGohmMarcus Gohm
 
Csun pse-006-presentation-2013 v2.1
Csun pse-006-presentation-2013 v2.1Csun pse-006-presentation-2013 v2.1
Csun pse-006-presentation-2013 v2.1Miquel Centelles
 
Introduction to Computer Engineering
Introduction to Computer EngineeringIntroduction to Computer Engineering
Introduction to Computer EngineeringBESOR ACADEMY
 

Was ist angesagt? (11)

Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...
 
Data science for business decisions (public)
Data science for business decisions (public)Data science for business decisions (public)
Data science for business decisions (public)
 
MedPath Designer: A Process-based Modeling Language for Designing Care Pathways
MedPath Designer: A Process-based Modeling Language for Designing Care PathwaysMedPath Designer: A Process-based Modeling Language for Designing Care Pathways
MedPath Designer: A Process-based Modeling Language for Designing Care Pathways
 
Innoslate for Academia
Innoslate for AcademiaInnoslate for Academia
Innoslate for Academia
 
Semantic Web for Advanced Engineering
Semantic Web for Advanced EngineeringSemantic Web for Advanced Engineering
Semantic Web for Advanced Engineering
 
Semantic Web Technologies for Intelligent Engineering Applications
Semantic Web Technologies for  Intelligent Engineering ApplicationsSemantic Web Technologies for  Intelligent Engineering Applications
Semantic Web Technologies for Intelligent Engineering Applications
 
Resume_20160610_MarcusGohm
Resume_20160610_MarcusGohmResume_20160610_MarcusGohm
Resume_20160610_MarcusGohm
 
OA centre of excellence
OA centre of excellenceOA centre of excellence
OA centre of excellence
 
ERP vs SAP (Difference)
ERP vs SAP (Difference)ERP vs SAP (Difference)
ERP vs SAP (Difference)
 
Csun pse-006-presentation-2013 v2.1
Csun pse-006-presentation-2013 v2.1Csun pse-006-presentation-2013 v2.1
Csun pse-006-presentation-2013 v2.1
 
Introduction to Computer Engineering
Introduction to Computer EngineeringIntroduction to Computer Engineering
Introduction to Computer Engineering
 

Ähnlich wie 2020 09-16-ai-engineering challanges

AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...CARLOS III UNIVERSITY OF MADRID
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software EngineeringMiroslaw Staron
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningEdunomica
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...CARLOS III UNIVERSITY OF MADRID
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus
 
Production machine learning: Managing models, workflows and risk at scale
Production machine learning: Managing models, workflows and risk at scaleProduction machine learning: Managing models, workflows and risk at scale
Production machine learning: Managing models, workflows and risk at scaleAlex Housley
 
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...Data Con LA
 
From Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems ArchitecturesFrom Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems ArchitecturesObeo
 
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...Naoshi Uchihira
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionBATbern
 
An Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementAn Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementCognizant
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15MLconf
 
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...Justin Basilico
 
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdfSLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdfSamuelNahum1
 
Practical machine learning
Practical machine learningPractical machine learning
Practical machine learningFaizan Javed
 

Ähnlich wie 2020 09-16-ai-engineering challanges (20)

INCOSE IS 2019: AI and Systems Engineering
INCOSE IS 2019: AI and Systems EngineeringINCOSE IS 2019: AI and Systems Engineering
INCOSE IS 2019: AI and Systems Engineering
 
SESE 2021: Where Systems Engineering meets AI/ML
SESE 2021: Where Systems Engineering meets AI/MLSESE 2021: Where Systems Engineering meets AI/ML
SESE 2021: Where Systems Engineering meets AI/ML
 
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Production machine learning: Managing models, workflows and risk at scale
Production machine learning: Managing models, workflows and risk at scaleProduction machine learning: Managing models, workflows and risk at scale
Production machine learning: Managing models, workflows and risk at scale
 
Deep learning in manufacturing predicting and preventing manufacturing defect...
Deep learning in manufacturing predicting and preventing manufacturing defect...Deep learning in manufacturing predicting and preventing manufacturing defect...
Deep learning in manufacturing predicting and preventing manufacturing defect...
 
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
 
From Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems ArchitecturesFrom Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems Architectures
 
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
 
An Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementAn Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data Management
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
 
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
 
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdfSLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
 
Practical machine learning
Practical machine learningPractical machine learning
Practical machine learning
 

Mehr von Ivica Crnkovic

ICSE 2018 opening session
ICSE 2018 opening sessionICSE 2018 opening session
ICSE 2018 opening sessionIvica Crnkovic
 
Beyond digitalisation 2016-06-07
Beyond digitalisation  2016-06-07Beyond digitalisation  2016-06-07
Beyond digitalisation 2016-06-07Ivica Crnkovic
 
ICSE2018 presentation 2016-05-20
ICSE2018 presentation 2016-05-20ICSE2018 presentation 2016-05-20
ICSE2018 presentation 2016-05-20Ivica Crnkovic
 
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...Ivica Crnkovic
 
European Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementEuropean Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementIvica Crnkovic
 
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...Ivica Crnkovic
 
Software Assurance: What Should We Do next? - Software Design for Reliability
Software Assurance: What Should We Do next?  - Software Design for ReliabilitySoftware Assurance: What Should We Do next?  - Software Design for Reliability
Software Assurance: What Should We Do next? - Software Design for ReliabilityIvica Crnkovic
 
A classification framework for component models
A classification framework for component modelsA classification framework for component models
A classification framework for component modelsIvica Crnkovic
 
Teaching in multicultural classromre
Teaching in multicultural  classromreTeaching in multicultural  classromre
Teaching in multicultural classromreIvica Crnkovic
 
The challenges and opportunities in open source reuse
The challenges and opportunities in open source reuseThe challenges and opportunities in open source reuse
The challenges and opportunities in open source reuseIvica Crnkovic
 
Resilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolutionResilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolutionIvica Crnkovic
 
Empirical se 2013-01-17
Empirical se 2013-01-17Empirical se 2013-01-17
Empirical se 2013-01-17Ivica Crnkovic
 
Crnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptxCrnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptxIvica Crnkovic
 
Ten Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering EducationTen Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering EducationIvica Crnkovic
 

Mehr von Ivica Crnkovic (16)

ICSE 2018 opening session
ICSE 2018 opening sessionICSE 2018 opening session
ICSE 2018 opening session
 
Beyond digitalisation 2016-06-07
Beyond digitalisation  2016-06-07Beyond digitalisation  2016-06-07
Beyond digitalisation 2016-06-07
 
ICSE2018 presentation 2016-05-20
ICSE2018 presentation 2016-05-20ICSE2018 presentation 2016-05-20
ICSE2018 presentation 2016-05-20
 
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
 
European Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementEuropean Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 Announcement
 
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
 
Software Assurance: What Should We Do next? - Software Design for Reliability
Software Assurance: What Should We Do next?  - Software Design for ReliabilitySoftware Assurance: What Should We Do next?  - Software Design for Reliability
Software Assurance: What Should We Do next? - Software Design for Reliability
 
Sa past-future
Sa past-futureSa past-future
Sa past-future
 
A classification framework for component models
A classification framework for component modelsA classification framework for component models
A classification framework for component models
 
Teaching in multicultural classromre
Teaching in multicultural  classromreTeaching in multicultural  classromre
Teaching in multicultural classromre
 
The challenges and opportunities in open source reuse
The challenges and opportunities in open source reuseThe challenges and opportunities in open source reuse
The challenges and opportunities in open source reuse
 
Resilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolutionResilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolution
 
Empirical se 2013-01-17
Empirical se 2013-01-17Empirical se 2013-01-17
Empirical se 2013-01-17
 
SPL in Clouds
SPL in CloudsSPL in Clouds
SPL in Clouds
 
Crnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptxCrnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptx
 
Ten Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering EducationTen Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering Education
 

Kürzlich hochgeladen

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceanilsa9823
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 

Kürzlich hochgeladen (20)

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 

2020 09-16-ai-engineering challanges

  • 1. AI engineering New challenges in system and software architecting and managing lifecycle for AI- based systems ECSA 2020 14-18 Sept, L’Aquila Ivica Crnkovic, ivica.crnkovic@chalmers.se
  • 2.
  • 4. Types of Machine Learning SUPERVISED UNSUPERVISED REINFORCEMENT Task Driven predict next value, classification Data Driven Identify clusters Learn From Mistakes Trial and Error 4 95% of ML
  • 5. 9/16/20 Chalmers 6 AI as effective as medical specialists at diagnosing diseas AI in transportation • Self-driving vehicles, Traffic management, delay predictions… AI and NLP (Natural Language processes) Recognition Traffic Medicine Robotics Finance Health Space AI success Stories virtually no area that does not (plan to) use AI
  • 8. 2020-09-16 Chalmers University of Technology 9 AI failures 2018AI failures 2018 AI World Cup 2018 - predictions almost all wrong J
  • 9. 2020-09-16 10 Why these failures? What are the challenges inAI development?
  • 10. 9/16/20 11 • Non technical character • Who is owner of data? • What are the ethical aspects of using data? • What can we allow a machine to decide? • How do interpret the results from AI models? New questions in AI Development • Technical nature • How to efficiently collect, store, process, analyse, and present data? • How to efficiently build the AI-based systems? • How to ensure enough resources (computation, storage, timing) • How to ensure dependability/trustworthy of such systems? • What system and software architecture are required for AI-systems? • WHAT KIND OF SOFTWARE ENGINEERING SUPPORT IS NEEDED?
  • 11. 9/16/20 12 ML Development cycle assemble datasets create models train and evaluate deployment
  • 12. Life cycle ML Code – Create model Train model Machine Learning: The High-Interest Credit Card of Technical Debt, D. Sculley, G. Holt, D. Golovin, E. Davydov, T. Phillips, D. Ebner, V. Chaudhary, M. Young
  • 13. 9/16/20 AI-System - system architecture (example) https://www.youtube.com/watch?v=XEHG2oYdsiU Collecting data Analyzing Data Controlling system Sending data Updating system System evolution Example: vehicular system (a Cyber-physical system)
  • 14. Architecture of a car control system
  • 15. Complex services – distributed components Sensor 1 Sensor 2 Sensor 3 Sensor .. Network resources ++++++++++ ++++++++++ ++++++++++ Sensor .. Execution resources Node 1 Node 2 Node 3 Node … Node … Actuator 1 Actuator 2 Actuator 3 Actuator … Actuator … ECU ECU ECU C1 C2 Shared resources Challenges: • Real-time requirements • Shared resources • Resource constraints
  • 16. Software architecture 9/16/20 17 Platform Middleware Subsystem A HW Subsystem B Subsystem C C1 C2 C3 C4 C3 Subsystem and components Component-based and service-based approach - Components - Encapsulation of data - Encapsulation of functionality - Dependency between components defined and controlled MP
  • 17. 9/16/20 18 AI-System - system architecture (example) Platform Middleware Subsystem A HW Subsystem B Subsystem C C1 C2 C3 C4 C3 Subsystem and components Components – black boxes - Components - Encapsulation of AI-based functionality - Dependency between components defined and controlled - What about AI code and data? MP
  • 18. 9/16/20 AI-System - system architecture (example) https://www.youtube.com/watch?v=XEHG2oYdsiU Collecting more data Analyzing more data More computation Control system Further data processing Sending data For diagnostic And for ML Updating system AI-based vehicular system System evolutionCollecting data System training - large amounts of data processing - large computational requirements
  • 19. With AI the existing architecture is not feasible CPU GPU FPGA EDGE computing Cloud computing
  • 20. Challenges - New computation models - Heterogenous platform - Edge sensor vs. edge centralized level vs. cloud computing - Development process - separation of training process from operational process - New monitoring - New quality attributes and their metrics - Migration process from the existing architecture CPU GPU FPGA Cloud computing EDGE computing
  • 21. 9/16/20 22 AI- based components – fundamental problems C1 controlled Data – used for ML Controlled? • The results depend not only on the algorithms and controlled data but also on uncontrolled/unknown data • The AI-based functions are not continuous: small change of data can cause big changes • Due to data impact on the model many new challenges appear
  • 22. 2020-09-16 23 1. Entanglement (Data fusion) 2. Data dependencies • Unstable data dependencies • Underutilized Data Dependencies 3. Hidden Feedback Loops 4. Undeclared Consumers 5. Correction Cascades Examples of Data-related challenges DATA DATA DATA AI-component AI-component DATA DATA DATA AI-component AI-component Not needed DATA DATA AI-component DATA DATA AI-component C1 AI-component C2DATA DATA DATA AI-component AI-component
  • 23. 9/16/20 Chalmers 31 • Heterogeneity of data (different formats, accuracy, semantics) and use of standard ML functions require a lot Glue code • 95% of code in AI-based systems is a glue-code (empirical data) • Requires • Frequent refactoring of code • Re-implementing AI models • Pipeline Jungles • ML-friendly format data become a jungle of scrapes, joins, and sampling steps, intermediate files • Requires – a close team work of data and domain engineers System-design anti-patterns (1)
  • 24. 9/16/20 Chalmers 32 Dead Experimental Code paths • AI solution requires a lot of experimentation • A lot of code that will not be used later • Problems • Dead code • Version management – how to preserve useful configuration branches, and remove unnecessary System-design anti-patterns (2)
  • 25. 9/16/20 Chalmers 33 Managing Changes in the External World System Cloud Data Local Data Local Data Local Data Historical data Used in ML Continuous change of data Challenge: models and system behaviour dependent of data Examples: - Threshold changes - Correlation between data Requirements: Continuous monitoring of data and system. Continuous test.
  • 26. 35 Study : Overview of industrial ML systems Case studies A. Project EST: Real Estate Valuation B. Project OIL: Predicting Oil and Gas Recovery Potential C. Project RET: Predicting User Retention D. Project WEA: Weather Forecasting E. Project CCF: Credit Card Fraud Detection F. Project BOT: Poker Bot Identification G. Project REC: Media Recommendations Software Engineering Challenges of Deep Learning, Euromicro SEAA 2018, A. Arpteg, B. Brinne, l. Crnkovic-Friis, J. Bosch Which challenges are the most difficult in development of AI-based systems?
  • 27. 2020-09-16 Chalmers University of Technology 37 Study: Engineering challenges of DL ■Dependency Management ◆Glue code and supporting systems ●Unintentional feedback loops ●Monitoring and logging ●Troubleshooting ●Testing ●Limited transparency ●Experimental management ■ Organisational challenges ● Production/operation challenges ◆ Development challenges ◆Effort estimation ■Cultural differences Hard to solve Easier to solve Lower business impact High business impact ●Resource Limitations ◆Privacy and Safety/security
  • 28. Experiment prototyping Non-critical deployment Critical deployment Cascading deployment Assemble dataset Issues with problem formulation and specifying desired outcome Data silos, scarcity of labelled data, imbalanced training set Limitations in techniques for gathering training data from large-scale, non-stationary data streams Complex and effects of data dependencies Create model Use of non- representative dataset, data drifts No critical analysis of training data Difficulties in building highly scalable ML pipeline Entanglements causing difficulties in isolating improvements Train and evaluate model Lack of well-established ground truth No evaluation of models with business-centric measures Difficulties in reproducing models, results and debugging DL models Need of techniques for sliced analysis in final model Deploy model No deployment mechanism Training- serving skew Adhering to stringent serving requirements e.g., of latency, throughput Hidden feedback-loops and undeclared consumers of the models LE Lwakatare, A Raj, J Bosch, HH Olsson, I Crnkovic, A taxonomy of software engineering challenges for machine learning systems: An empirical investigation - International Conference on Agile Software …, 2019 The challenges in evolution of development and use ML components
  • 29. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint)
  • 30. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint) Data quality management Design methods and processes Model performance Deployment and compliance Data versioning & dependency mgmt DataOps/DevOps Automated data labeling Federated learning infrastructure Reuse of pre- developed ML models Manage multiple models Deployment infrastructure Quality attributes A/B testing of models Storage and computing infrastructure Integration of models and SW components Monitoring & logging Strategic focus Infrastucture Development Process AI Engineering challenges
  • 31. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint) Data quality management Design methods and processes Model performance Deployment and compliance Federated collection & storage of data Federated ML models Heterogenous HW platforms Transfer learning Strategic focus Cyber-physical systems Safety-critical systems Autonomously Improving systems Data trail Explainable models reproducibility Safety validation Data generation for ML Automated experimentation Exploration vs. exploratiom Online evaluation Domain-specific AI Engineering challenges
  • 32. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint)
  • 33. 43 DevOps ML Workflow Example – Process integration Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
  • 34. 44 Conceptual ML Workflow and DevOps Process Integration Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
  • 35. 9/16/20 Chalmers 46 Excitement of a researcher ML/DL introduces many new challenges for software • On addition to software evolution there is data evolution • Context change – in behaviour and in data • New (distributed) system configuration • Continuous integration, continuous deployment • Continuous training Many companies will not be able to cope with new types of complexity Many opportunities for researchers! Conclusion