SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Mining Business Process Stages from
Event Logs
Hoang Nguyen, Marlon Dumas, Arthur H.M. ter Hofstede,
Marcello La Rosa, and Fabrizio Maria Maggi
CAISE’17
Process Mining
2
/
event logs
discovered model
Discovery
Conformance
DeviancePerformance
input model
if A
then B
Current Challenges
Spaghetti model Complex rules with limited
predictive accuracy
3
Limited knowledge of process
performance
Event log decomposition
Decomposition
Horizontal Vertical
Region-based (genet tool)
Aggregation (Fuzzy Miner)
Divide and Conquer (ProM 6)
Mining sub-processes (BPMN Miner)
Activity Clustering for Perf. Analysis
Conforti (2014)
van Dongen (2010)
Trace Clustering
Trace Similarity
Sequence Clustering
Contextual Clustering
Mining Hierarchies
Trace Alignment
Clustering-based Predictive
Monitoring
Bose (2012)
4
Internal Quality
External Quality
Multistage processes
Passenger boarding process Kanban board used in many office processes
5
Stage-based decomposition of directly-follows graphs
BPIC2012 BPIC2013 BPIC2015-1
Event logs with ground truth of stages
6
Structure of a process stage
7
Stage identification based on graph cuts
How to find the right
graph cuts? What
measure to guide the
decomposition to
approximate the real
stage decomposition?
8
Finding communities from social networks
Newman, M. E., & Girvan, M. (2004). Finding and evaluating community structure in networks.
Physical review E, 69(2), 026113
Modularity =
Fraction of edges
within one
community in the
real network
Fraction of edges
connecting to nodes
in a community in a
random network
9
Approach – modularity-based graph cuts
Mod. 0.4 0.8 0.85 0.81 0.5 0.78
10
Experiments
Event logs
Accuracy Index (result-A vs. ground truth-B)
Baselines
Divide and Conquer framework
Simple Precedence Diagram
Discover Matrix Create Graph Create Clusters Modify Clusters
𝐹𝑜𝑤𝑙𝑘𝑒𝑠 − 𝑀𝑎𝑙𝑙𝑜𝑤𝑠 =
𝑁1𝐴1𝐵
(𝑁1𝐴1𝐵 + 𝑁1𝐴0𝐵)(𝑁1𝐴1𝐵 + 𝑁0𝐴1𝐵)
N1A1B: no. of pairs in the same
clusters in both A and B. N1A0B:
no. of pairs in the same clusters
in A but diff. clusters in B. N0A1B:
no. of pairs in diff. clusters in A
but the same clusters in B.
11
Experimental Result
Our technique (SPM) Divide & Conquer (DC) Simple Precedence Diagram (SPD)
Logs SPM DC SPD
BPI12 1.0 0.30 0.49
BPI13 0.78 0.36 0.73
BPI15-1 0.90 0.40 0.54
BPI15-2 0.92 0.40 0.52
BPI15-3 0.86 0.42 0.50
BPI15-4 0.72 0.45 0.57
BPI15-5 0.83 0.46 0.49
Fowlkes − Mallows Index
Visualization of clustering for BPI15-2
12
Summary
• Contribution
– An automated technique to discover stages from event
logs which can approximate the real division of business
process stages
• Future work
– Optimize parameters for improving the accuracy of
stage mining
– Develop techniques for stage-based process discovery
– Develop techniques for stage-based process prediction
13
How to measure process performance from stages?
(CAiSE’16)
14Cumulative Flow Diagram
How to discover process models from stages?
15
Discover Compose
event log
Decompose
How to predict process performance from stages?
Predictor 1
Predictor 2
Predictor 3
Meta -
Predictor
16
Predictor 4
Learn Compose
event log
Decompose
Thank you for your attention!
Q&A
17
References
• Tsung, F., Li, Y., & Jin, M. (2008). Statistical process control for multistage manufacturing and service
operations: a review and some extensions. International Journal of Services Operations and
Informatics, 3(2), 191-204.
• Millard, P. H., Christodoulou, G., Jagger, C., Harrison, G. W., & McClean, S. I. (2001). Modelling
hospital and social care bed occupancy and use by elderly people in an English health district. Health
Care Management Science, 4(1), 57-62.
• Cooper, R. G. (1990). Stage-gate systems: a new tool for managing new products. Business
horizons, 33(3), 44-54.
• Bose, R. J. C., & van der Aalst, W. M. (2012). Process diagnostics using trace alignment:
opportunities, issues, and challenges. Information Systems, 37(2), 117-141.
• Conforti, R., Dumas, M., García-Bañuelos, L., & La Rosa, M. (2014). Beyond tasks and gateways:
Discovering BPMN models with subprocesses, boundary events and activity markers. In Business
Process Management (pp. 101-117): Springer.
• van Dongen, B. F., & Adriansyah, A. (2010). Process mining: fuzzy clustering and performance
visualization. In Business Process Management Workshops (pp. 158-169): Springer.
• Verbeek, H., van der Aalst, W., & Munoz-Gama, J. (2017). Divide and Conquer: A Tool Framework
for Supporting Decomposed Discovery in Process Mining. The Computer Journal, 1-26
• Newman, M. E., & Girvan, M. (2004). Finding and evaluating community structure in networks.
Physical review E, 69(2), 026113
18

Weitere ähnliche Inhalte

Mehr von Marlon Dumas

Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationMarlon Dumas
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationMarlon Dumas
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Marlon Dumas
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsMarlon Dumas
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process MiningMarlon Dumas
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Marlon Dumas
 
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...Marlon Dumas
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersMarlon Dumas
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxMarlon Dumas
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business ProcessesMarlon Dumas
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datosMarlon Dumas
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementMarlon Dumas
 
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionPrescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionMarlon Dumas
 
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...Marlon Dumas
 
On the Road to AI-Infused Process Execution
On the Road to AI-Infused Process ExecutionOn the Road to AI-Infused Process Execution
On the Road to AI-Infused Process ExecutionMarlon Dumas
 
Process Mining 2.0: From Insights to Actions
Process Mining 2.0: From Insights to ActionsProcess Mining 2.0: From Insights to Actions
Process Mining 2.0: From Insights to ActionsMarlon Dumas
 
Process Mining in Action: Self-service data science for business teams
Process Mining in Action: Self-service data science for business teamsProcess Mining in Action: Self-service data science for business teams
Process Mining in Action: Self-service data science for business teamsMarlon Dumas
 
Enterprise Systems
Enterprise SystemsEnterprise Systems
Enterprise SystemsMarlon Dumas
 
Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...
Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...
Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...Marlon Dumas
 
Identifying Candidate Routines for Robotic Process Automation From Unsegmente...
Identifying Candidate Routines for Robotic Process Automation From Unsegmente...Identifying Candidate Routines for Robotic Process Automation From Unsegmente...
Identifying Candidate Routines for Robotic Process Automation From Unsegmente...Marlon Dumas
 

Mehr von Marlon Dumas (20)

Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process Simulation
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process Simulation
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process Mining
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
 
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for Practitioners
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptx
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datos
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process Improvement
 
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionPrescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
 
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
 
On the Road to AI-Infused Process Execution
On the Road to AI-Infused Process ExecutionOn the Road to AI-Infused Process Execution
On the Road to AI-Infused Process Execution
 
Process Mining 2.0: From Insights to Actions
Process Mining 2.0: From Insights to ActionsProcess Mining 2.0: From Insights to Actions
Process Mining 2.0: From Insights to Actions
 
Process Mining in Action: Self-service data science for business teams
Process Mining in Action: Self-service data science for business teamsProcess Mining in Action: Self-service data science for business teams
Process Mining in Action: Self-service data science for business teams
 
Enterprise Systems
Enterprise SystemsEnterprise Systems
Enterprise Systems
 
Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...
Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...
Process Mining Meets Causal Machine Learning: Discovering Causal Rules From E...
 
Identifying Candidate Routines for Robotic Process Automation From Unsegmente...
Identifying Candidate Routines for Robotic Process Automation From Unsegmente...Identifying Candidate Routines for Robotic Process Automation From Unsegmente...
Identifying Candidate Routines for Robotic Process Automation From Unsegmente...
 

Kürzlich hochgeladen

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 

Kürzlich hochgeladen (20)

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 

Mining Business Process Stages from Event Logs

  • 1. Mining Business Process Stages from Event Logs Hoang Nguyen, Marlon Dumas, Arthur H.M. ter Hofstede, Marcello La Rosa, and Fabrizio Maria Maggi CAISE’17
  • 2. Process Mining 2 / event logs discovered model Discovery Conformance DeviancePerformance input model if A then B
  • 3. Current Challenges Spaghetti model Complex rules with limited predictive accuracy 3 Limited knowledge of process performance
  • 4. Event log decomposition Decomposition Horizontal Vertical Region-based (genet tool) Aggregation (Fuzzy Miner) Divide and Conquer (ProM 6) Mining sub-processes (BPMN Miner) Activity Clustering for Perf. Analysis Conforti (2014) van Dongen (2010) Trace Clustering Trace Similarity Sequence Clustering Contextual Clustering Mining Hierarchies Trace Alignment Clustering-based Predictive Monitoring Bose (2012) 4 Internal Quality External Quality
  • 5. Multistage processes Passenger boarding process Kanban board used in many office processes 5
  • 6. Stage-based decomposition of directly-follows graphs BPIC2012 BPIC2013 BPIC2015-1 Event logs with ground truth of stages 6
  • 7. Structure of a process stage 7
  • 8. Stage identification based on graph cuts How to find the right graph cuts? What measure to guide the decomposition to approximate the real stage decomposition? 8
  • 9. Finding communities from social networks Newman, M. E., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical review E, 69(2), 026113 Modularity = Fraction of edges within one community in the real network Fraction of edges connecting to nodes in a community in a random network 9
  • 10. Approach – modularity-based graph cuts Mod. 0.4 0.8 0.85 0.81 0.5 0.78 10
  • 11. Experiments Event logs Accuracy Index (result-A vs. ground truth-B) Baselines Divide and Conquer framework Simple Precedence Diagram Discover Matrix Create Graph Create Clusters Modify Clusters 𝐹𝑜𝑤𝑙𝑘𝑒𝑠 − 𝑀𝑎𝑙𝑙𝑜𝑤𝑠 = 𝑁1𝐴1𝐵 (𝑁1𝐴1𝐵 + 𝑁1𝐴0𝐵)(𝑁1𝐴1𝐵 + 𝑁0𝐴1𝐵) N1A1B: no. of pairs in the same clusters in both A and B. N1A0B: no. of pairs in the same clusters in A but diff. clusters in B. N0A1B: no. of pairs in diff. clusters in A but the same clusters in B. 11
  • 12. Experimental Result Our technique (SPM) Divide & Conquer (DC) Simple Precedence Diagram (SPD) Logs SPM DC SPD BPI12 1.0 0.30 0.49 BPI13 0.78 0.36 0.73 BPI15-1 0.90 0.40 0.54 BPI15-2 0.92 0.40 0.52 BPI15-3 0.86 0.42 0.50 BPI15-4 0.72 0.45 0.57 BPI15-5 0.83 0.46 0.49 Fowlkes − Mallows Index Visualization of clustering for BPI15-2 12
  • 13. Summary • Contribution – An automated technique to discover stages from event logs which can approximate the real division of business process stages • Future work – Optimize parameters for improving the accuracy of stage mining – Develop techniques for stage-based process discovery – Develop techniques for stage-based process prediction 13
  • 14. How to measure process performance from stages? (CAiSE’16) 14Cumulative Flow Diagram
  • 15. How to discover process models from stages? 15 Discover Compose event log Decompose
  • 16. How to predict process performance from stages? Predictor 1 Predictor 2 Predictor 3 Meta - Predictor 16 Predictor 4 Learn Compose event log Decompose
  • 17. Thank you for your attention! Q&A 17
  • 18. References • Tsung, F., Li, Y., & Jin, M. (2008). Statistical process control for multistage manufacturing and service operations: a review and some extensions. International Journal of Services Operations and Informatics, 3(2), 191-204. • Millard, P. H., Christodoulou, G., Jagger, C., Harrison, G. W., & McClean, S. I. (2001). Modelling hospital and social care bed occupancy and use by elderly people in an English health district. Health Care Management Science, 4(1), 57-62. • Cooper, R. G. (1990). Stage-gate systems: a new tool for managing new products. Business horizons, 33(3), 44-54. • Bose, R. J. C., & van der Aalst, W. M. (2012). Process diagnostics using trace alignment: opportunities, issues, and challenges. Information Systems, 37(2), 117-141. • Conforti, R., Dumas, M., García-Bañuelos, L., & La Rosa, M. (2014). Beyond tasks and gateways: Discovering BPMN models with subprocesses, boundary events and activity markers. In Business Process Management (pp. 101-117): Springer. • van Dongen, B. F., & Adriansyah, A. (2010). Process mining: fuzzy clustering and performance visualization. In Business Process Management Workshops (pp. 158-169): Springer. • Verbeek, H., van der Aalst, W., & Munoz-Gama, J. (2017). Divide and Conquer: A Tool Framework for Supporting Decomposed Discovery in Process Mining. The Computer Journal, 1-26 • Newman, M. E., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical review E, 69(2), 026113 18