SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Discovering Branching
Conditions from Business
Process Execution Logs
Massimiliano de Leoni, Marlon Dumas,
Luciano García-Bañuelos
University of Tartu, Estonia
(Joint work with Eindhoven University of Technology)
Business Process Management
DiscoveringBranchingConditions
1
Start
Get Ready
Travel by CarTravel by Train
BETA PhD Day Starts
Visit Brewery
Have Dinner
Go Home
Travel by Train Pay for Parking
Travel by Car
End
Give a Talk
Implementation
Event
Log
Execution
Business Process Mining
DiscoveringBranchingConditions
2
Start
Register order
Prepare
shipment
Ship goods
(Re)send bill
Receive payment
Contact
customer
Archive order
End
Performance Analysis
Process Model
Organizational Model
Social Network
Event
Log
Slide by Ana Karla Alves de Medeiros (TU/e)
ProM
Process mining
workbench
Data perspective?
DiscoveringBranchingConditions
3
Enter Loan
Application
Retrieve
Applicant
Data
Compute
Installments
Approve
Simple
Application
Approve
Complex
Application
Notify
Rejection
Notify
Eligibility
salaryage
installment
amount
length
Branching
points
ProM’s Decision Miner
DiscoveringBranchingConditions
4
Enter Loan
Application
Retrieve
Applicant
Data
Compute
Installments
Approve
Simple
Application
Approve
Complex
Application
Notify
Rejection
Notify
Eligibility
salaryage
installment
amount
length
CID Amount Len Salary Age Installm TaskCID Amount Len Salary Age Installm Task
13219 8500 1 NULL NULL NULL ELA
Event
Log
CID Task Data Time Stamp …
13219 ELA
Amount=8500
Len=1
2007-11-09 T 11:20:10 -
13219 RAP
Salary=2000
Age=25
2007-11-09 T 11:22:15 -
13220 ELA
Amount=25000
Len=1
2007-11-09 T 11:22:40 -
13219 CI Installm=750 2007-11-09 T 11:22:45 -
13219 NE 2007-11-09 T 11:23:00 -
13219 ASA 2007-11-09 T 11:24:30 -
13220 CI Installm=1200 2007-11-09 T 11:24:35 -
… … … … …
CID Amount Len Salary Age Installm Task
13219 8500 1 NULL NULL NULL ELA
13219 8500 1 2000 25 NULL RAP
13219 8500 1 2000 25 750 RAP
13219 8500 1 2000 25 750 NE
(amount < 10000)(amount < 10000) ∨
(amount ≥ 10000 ∧ age < 35)
amount
Approve Simple
Application (ASA)
≥ 10000 < 10000
Approve Complex
Application (ACA)
Approve Simple
Application (ASA)
≥ 35
age
< 35
ProM’s decision miner / 2
CID Amount Installm Salary Age Len Task
13219 8500 750 2000 25 1 ASA
13220 12500 1200 3500 35 4 ACA
13221 9000 450 2500 27 2 ASA
… … … … … … …
5
Approve
Simple
Application
Approve
Complex
Application
Decision tree
learning
amount ≥ 10000
∧ age ≥ 35
Decision miner: Not a panacea!
• Decision tree learning cannot discover expressions of the
form “v op v”
DiscoveringBranchingConditions
6
Approve
Simple
Application
Approve
Complex
Application
Notify
Rejection
Notify
Eligibility
installment > salary
The decision miner would return:
installment ≤ 1760 ∧ salary ≤ 1750 ∨
installment ≤ 1810 ∧ salary ≤ 1800 ∨
installment ≤ 1875 ∧ salary ≤ 1850 ∨
installment ≤ 1960 ∧ salary ≤ 1950 ∨
installment ≤ 1975 ∧ salary ≤ 1970 ∨
installment ≤ 2000 ∧ salary ≤ 1990 ∨ …
Problem statement
• Discovery of branching conditions composed of atoms of
the form “v op c” and “v op v”, including linear equations
or inequalities involving multiple variables
• Our solution combines
• Tools for dynamic analysis of software
(i.e., likely invariant discovery)
• Theory of decision tree learning
DiscoveringBranchingConditions
7
Daikon
• Tool for discovering likely invariants from execution logs
• Given a set of program points, Daikon:
• Instantiates a set of invariant templates
(over certain combination of variables)
• Traverses the execution log
• Falsifying some invariants
• Gathering the statistical support for the remaining templates
• Discards some invariants based on:
• Subsumption
• Statistical support
Daikon strongly relies on code instrumentation/analysis
DiscoveringBranchingConditions
8
CID Amount Installm Salary Age Len Task
13210 20000 2000 2000 25 1 NR
13220 25000 1200 3500 35 2 NE
13221 9000 450 2500 27 2 NE
13219 8500 750 2000 25 1 ASA
13220 25000 1200 3500 35 2 ACA
13221 9000 450 2500 27 2 ASA
… … … … … … …
Daikon: Tool for mining likely invariants
DiscoveringBranchingConditions
9
Approve
Simple
Application
Approve
Complex
Application
Notify
Rejection
Notify
Eligibility
Daikon
installment > salary
amount ≥ 5000
length < age
…
installment ≤ salary
amount ≥ 5000
length < age
…
installment ≤ salary
amount ≤ 9500
length < age
…
installment ≤ salary
amount ≥ 10000
length < age
…
BranchMiner (Conjunctive)
• Information Gain (IG) quantifies the discriminating power of a
predicate (with respect to two different outcomes)
• Approach:
• Use Daikon for discovering invariants
• Combine invariants in a conjunction so as to maximize the overall IG
DiscoveringBranchingConditions
10
Approve
Simple
Application
Approve
Complex
Application
Notify
Rejection
Notify
Eligibility
a1: installment > salary
a2: amount ≥ 5000
a3: length < age
…
IG(a1) = 0.8
IG(a2) = 0.2
IG(a3) = 0
…
IG(a1∧a2) = 0.8
…
¬(P∧Q)
Disjunctions?
DiscoveringBranchingConditions
11
Approve
Simple
Application
Approve
Complex
Application
Notify
Rejection
Notify
Eligibility
P∧Q
¬P∨¬Q
• Only the negation of conjunctive expression by the de
Morgan Laws
BranchMiner (Disjunctive)
DiscoveringBranchingConditions
12
…
Partition 1 Partition 2
Conjunctive
BranchMiner
Conjunctive
BranchMiner
CONJ1 CONJ2
Partition
n
Conjunctive
BranchMiner
CONJn
Event
Log
BranchMiner (Disjunctive)
DiscoveringBranchingConditions
13
…
Partition 1 Partition 2
Conjunctive
BranchMiner
Conjunctive
BranchMiner
CONJ1 CONJ2
Event
Log
Notify
Rejection
Notify
Eligibility
Notify
Rejection
Decision Tree
…
IG(CONJ1) = 0.4
IG(CONJ2) = 0.45
IG(CONJ3) = 0.5
…
IG(CONJ1∨CONJ2) = 0.78
IG(CONJ1∨CONJ3) = 0.6
…
Linear and polynomial expressions
• Approach
• Select all numerical
variables and generate
some derived (a.k.a.
latent) variables using an
arithmetic operator
e.g.,
salary_div_installment,
meaning
“salary/installment”
• Augment the event log
with the values for latent
variables
• Run the discovery method
for conjunctive/disjunctive
conditions
DiscoveringBranchingConditions
14
CID Amount Installm Salary Age Len Task
13210 20000 2000 2000 25 1 NR
13220 25000 1200 3500 35 2 NE
13221 9000 450 2500 27 2 NE
13219 8500 750 2000 25 1 ASA
13220 25000 1200 3500 35 2 ACA
13221 9000 450 2500 27 2 ASA
… … … … … … …
CID Amount Installm Salary Sal/Inst Age Len
Age+Le
n Task
13210 20000 2000 2000 1.00 25 1 26 NR
13220 25000 1200 3500 2.92 35 2 37 NE
13221 9000 450 2500 5.56 27 2 29 NE
13219 8500 750 2000 2.67 25 1 26 ASA
13220 25000 1200 3500 2.92 35 2 37 ACA
13221 9000 450 2500 5.56 27 2 29 ASA
… … … … … … … … …
CID Amount Installm Salary Sal/Inst Age Len
Age+Le
n Task
13210 20000 2000 2000 1.00 25 1 26 NR
13220 25000 1200 3500 2.92 35 2 37 NE
13221 9000 450 2500 5.56 27 2 29 NE
13219 8500 750 2000 2.67 25 1 26 ASA
13220 25000 1200 3500 2.92 35 2 37 ACA
13221 9000 450 2500 5.56 27 2 29 ASA
… … … … … … … … …
Assessment
DiscoveringBranchingConditions
15
Conclusions
• We developed a technique for discovering branching
conditions from event logs
• Complex expressions (e.g., “v op v”, linear inequalities, etc.)
• More compact than those mined with conventional decision trees
• Integration into ProM
• Implemented as a command line tool
• Validation with real-life logs
• Assessed with synthetically generated event logs
• Areas for extensions
• Coping with noise in the event logs
• Handling of null values
• Extending the coverage to more complex types of expressions
DiscoveringBranchingConditions
16

Weitere ähnliche Inhalte

Ähnlich wie Discovering Branching Conditions from Business Process Execution Logs

Building Competing Models Using Apache Spark DataFrames with Abdulla Al-Qawasmeh
Building Competing Models Using Apache Spark DataFrames with Abdulla Al-QawasmehBuilding Competing Models Using Apache Spark DataFrames with Abdulla Al-Qawasmeh
Building Competing Models Using Apache Spark DataFrames with Abdulla Al-Qawasmeh
Databricks
 
face recognition using Principle Componet Analysis
face recognition using Principle Componet Analysisface recognition using Principle Componet Analysis
face recognition using Principle Componet Analysis
Abhilash Kotawar
 

Ähnlich wie Discovering Branching Conditions from Business Process Execution Logs (20)

Process Mining Reloaded: Event Structures as a Unified Representation of Proc...
Process Mining Reloaded: Event Structures as a Unified Representation of Proc...Process Mining Reloaded: Event Structures as a Unified Representation of Proc...
Process Mining Reloaded: Event Structures as a Unified Representation of Proc...
 
Evaluation of transport safety policies in commercial motorcycle operation in...
Evaluation of transport safety policies in commercial motorcycle operation in...Evaluation of transport safety policies in commercial motorcycle operation in...
Evaluation of transport safety policies in commercial motorcycle operation in...
 
How to build real time price adjustments in vehicle insurance on Streams ( Do...
How to build real time price adjustments in vehicle insurance on Streams ( Do...How to build real time price adjustments in vehicle insurance on Streams ( Do...
How to build real time price adjustments in vehicle insurance on Streams ( Do...
 
Bridging the Instance V3 Parametric Simulation
Bridging the Instance V3 Parametric SimulationBridging the Instance V3 Parametric Simulation
Bridging the Instance V3 Parametric Simulation
 
Using Open Source Tools for Machine Learning
Using Open Source Tools for Machine LearningUsing Open Source Tools for Machine Learning
Using Open Source Tools for Machine Learning
 
Cisco on Cisco. Defining the NG Cloud & Data Center Services
Cisco on Cisco. Defining the NG Cloud & Data Center ServicesCisco on Cisco. Defining the NG Cloud & Data Center Services
Cisco on Cisco. Defining the NG Cloud & Data Center Services
 
Demand Forcasting
Demand ForcastingDemand Forcasting
Demand Forcasting
 
AI for Business Process Management
AI for Business Process ManagementAI for Business Process Management
AI for Business Process Management
 
Autonomic Resource Provisioning for Cloud-Based Software
Autonomic Resource Provisioning for Cloud-Based SoftwareAutonomic Resource Provisioning for Cloud-Based Software
Autonomic Resource Provisioning for Cloud-Based Software
 
Cash Flow Forecaster presentation
Cash Flow Forecaster presentationCash Flow Forecaster presentation
Cash Flow Forecaster presentation
 
Example-Dependent Cost-Sensitive Credit Card Fraud Detection
Example-Dependent Cost-Sensitive Credit Card Fraud DetectionExample-Dependent Cost-Sensitive Credit Card Fraud Detection
Example-Dependent Cost-Sensitive Credit Card Fraud Detection
 
PREPAYMENT METER_PRESENTATION_Don Taylor.pdf
PREPAYMENT METER_PRESENTATION_Don Taylor.pdfPREPAYMENT METER_PRESENTATION_Don Taylor.pdf
PREPAYMENT METER_PRESENTATION_Don Taylor.pdf
 
Herding Microservices – the Atlassian Way
Herding Microservices – the Atlassian WayHerding Microservices – the Atlassian Way
Herding Microservices – the Atlassian Way
 
Building Competing Models Using Apache Spark DataFrames with Abdulla Al-Qawasmeh
Building Competing Models Using Apache Spark DataFrames with Abdulla Al-QawasmehBuilding Competing Models Using Apache Spark DataFrames with Abdulla Al-Qawasmeh
Building Competing Models Using Apache Spark DataFrames with Abdulla Al-Qawasmeh
 
Incorporation risk1
Incorporation risk1Incorporation risk1
Incorporation risk1
 
Relaxing Join and Selection Queries - VLDB 2006 Slides
Relaxing Join and Selection Queries - VLDB 2006 SlidesRelaxing Join and Selection Queries - VLDB 2006 Slides
Relaxing Join and Selection Queries - VLDB 2006 Slides
 
Fuzzy Control meets Software Engineering
Fuzzy Control meets Software EngineeringFuzzy Control meets Software Engineering
Fuzzy Control meets Software Engineering
 
Six sigma case study-a good approach with example
Six sigma case study-a good approach with exampleSix sigma case study-a good approach with example
Six sigma case study-a good approach with example
 
face recognition using Principle Componet Analysis
face recognition using Principle Componet Analysisface recognition using Principle Componet Analysis
face recognition using Principle Componet Analysis
 
Supply chain design and operation
Supply chain design and operationSupply chain design and operation
Supply chain design and operation
 

Mehr von Marlon Dumas

Walking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process OptimizationWalking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process Optimization
Marlon Dumas
 
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Marlon Dumas
 
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Marlon Dumas
 
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Marlon Dumas
 

Mehr von Marlon Dumas (20)

How GenAI will (not) change your business?
How GenAI will (not)  change your business?How GenAI will (not)  change your business?
How GenAI will (not) change your business?
 
Walking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process OptimizationWalking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process Optimization
 
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
 
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
 
Business Process Optimization: Status and Perspectives
Business Process Optimization: Status and PerspectivesBusiness Process Optimization: Status and Perspectives
Business Process Optimization: Status and Perspectives
 
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
 
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business ProcessesWhy am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
 
Augmented Business Process Management
Augmented Business Process ManagementAugmented Business Process Management
Augmented Business Process Management
 
Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process Simulation
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process Simulation
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process Mining
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
 
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for Practitioners
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptx
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datos
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process Improvement
 

Kürzlich hochgeladen

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Kürzlich hochgeladen (20)

9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Discovering Branching Conditions from Business Process Execution Logs

  • 1. Discovering Branching Conditions from Business Process Execution Logs Massimiliano de Leoni, Marlon Dumas, Luciano García-Bañuelos University of Tartu, Estonia (Joint work with Eindhoven University of Technology)
  • 2. Business Process Management DiscoveringBranchingConditions 1 Start Get Ready Travel by CarTravel by Train BETA PhD Day Starts Visit Brewery Have Dinner Go Home Travel by Train Pay for Parking Travel by Car End Give a Talk Implementation Event Log Execution
  • 3. Business Process Mining DiscoveringBranchingConditions 2 Start Register order Prepare shipment Ship goods (Re)send bill Receive payment Contact customer Archive order End Performance Analysis Process Model Organizational Model Social Network Event Log Slide by Ana Karla Alves de Medeiros (TU/e) ProM Process mining workbench
  • 5. ProM’s Decision Miner DiscoveringBranchingConditions 4 Enter Loan Application Retrieve Applicant Data Compute Installments Approve Simple Application Approve Complex Application Notify Rejection Notify Eligibility salaryage installment amount length CID Amount Len Salary Age Installm TaskCID Amount Len Salary Age Installm Task 13219 8500 1 NULL NULL NULL ELA Event Log CID Task Data Time Stamp … 13219 ELA Amount=8500 Len=1 2007-11-09 T 11:20:10 - 13219 RAP Salary=2000 Age=25 2007-11-09 T 11:22:15 - 13220 ELA Amount=25000 Len=1 2007-11-09 T 11:22:40 - 13219 CI Installm=750 2007-11-09 T 11:22:45 - 13219 NE 2007-11-09 T 11:23:00 - 13219 ASA 2007-11-09 T 11:24:30 - 13220 CI Installm=1200 2007-11-09 T 11:24:35 - … … … … … CID Amount Len Salary Age Installm Task 13219 8500 1 NULL NULL NULL ELA 13219 8500 1 2000 25 NULL RAP 13219 8500 1 2000 25 750 RAP 13219 8500 1 2000 25 750 NE
  • 6. (amount < 10000)(amount < 10000) ∨ (amount ≥ 10000 ∧ age < 35) amount Approve Simple Application (ASA) ≥ 10000 < 10000 Approve Complex Application (ACA) Approve Simple Application (ASA) ≥ 35 age < 35 ProM’s decision miner / 2 CID Amount Installm Salary Age Len Task 13219 8500 750 2000 25 1 ASA 13220 12500 1200 3500 35 4 ACA 13221 9000 450 2500 27 2 ASA … … … … … … … 5 Approve Simple Application Approve Complex Application Decision tree learning amount ≥ 10000 ∧ age ≥ 35
  • 7. Decision miner: Not a panacea! • Decision tree learning cannot discover expressions of the form “v op v” DiscoveringBranchingConditions 6 Approve Simple Application Approve Complex Application Notify Rejection Notify Eligibility installment > salary The decision miner would return: installment ≤ 1760 ∧ salary ≤ 1750 ∨ installment ≤ 1810 ∧ salary ≤ 1800 ∨ installment ≤ 1875 ∧ salary ≤ 1850 ∨ installment ≤ 1960 ∧ salary ≤ 1950 ∨ installment ≤ 1975 ∧ salary ≤ 1970 ∨ installment ≤ 2000 ∧ salary ≤ 1990 ∨ …
  • 8. Problem statement • Discovery of branching conditions composed of atoms of the form “v op c” and “v op v”, including linear equations or inequalities involving multiple variables • Our solution combines • Tools for dynamic analysis of software (i.e., likely invariant discovery) • Theory of decision tree learning DiscoveringBranchingConditions 7
  • 9. Daikon • Tool for discovering likely invariants from execution logs • Given a set of program points, Daikon: • Instantiates a set of invariant templates (over certain combination of variables) • Traverses the execution log • Falsifying some invariants • Gathering the statistical support for the remaining templates • Discards some invariants based on: • Subsumption • Statistical support Daikon strongly relies on code instrumentation/analysis DiscoveringBranchingConditions 8
  • 10. CID Amount Installm Salary Age Len Task 13210 20000 2000 2000 25 1 NR 13220 25000 1200 3500 35 2 NE 13221 9000 450 2500 27 2 NE 13219 8500 750 2000 25 1 ASA 13220 25000 1200 3500 35 2 ACA 13221 9000 450 2500 27 2 ASA … … … … … … … Daikon: Tool for mining likely invariants DiscoveringBranchingConditions 9 Approve Simple Application Approve Complex Application Notify Rejection Notify Eligibility Daikon installment > salary amount ≥ 5000 length < age … installment ≤ salary amount ≥ 5000 length < age … installment ≤ salary amount ≤ 9500 length < age … installment ≤ salary amount ≥ 10000 length < age …
  • 11. BranchMiner (Conjunctive) • Information Gain (IG) quantifies the discriminating power of a predicate (with respect to two different outcomes) • Approach: • Use Daikon for discovering invariants • Combine invariants in a conjunction so as to maximize the overall IG DiscoveringBranchingConditions 10 Approve Simple Application Approve Complex Application Notify Rejection Notify Eligibility a1: installment > salary a2: amount ≥ 5000 a3: length < age … IG(a1) = 0.8 IG(a2) = 0.2 IG(a3) = 0 … IG(a1∧a2) = 0.8 …
  • 13. BranchMiner (Disjunctive) DiscoveringBranchingConditions 12 … Partition 1 Partition 2 Conjunctive BranchMiner Conjunctive BranchMiner CONJ1 CONJ2 Partition n Conjunctive BranchMiner CONJn Event Log
  • 14. BranchMiner (Disjunctive) DiscoveringBranchingConditions 13 … Partition 1 Partition 2 Conjunctive BranchMiner Conjunctive BranchMiner CONJ1 CONJ2 Event Log Notify Rejection Notify Eligibility Notify Rejection Decision Tree … IG(CONJ1) = 0.4 IG(CONJ2) = 0.45 IG(CONJ3) = 0.5 … IG(CONJ1∨CONJ2) = 0.78 IG(CONJ1∨CONJ3) = 0.6 …
  • 15. Linear and polynomial expressions • Approach • Select all numerical variables and generate some derived (a.k.a. latent) variables using an arithmetic operator e.g., salary_div_installment, meaning “salary/installment” • Augment the event log with the values for latent variables • Run the discovery method for conjunctive/disjunctive conditions DiscoveringBranchingConditions 14 CID Amount Installm Salary Age Len Task 13210 20000 2000 2000 25 1 NR 13220 25000 1200 3500 35 2 NE 13221 9000 450 2500 27 2 NE 13219 8500 750 2000 25 1 ASA 13220 25000 1200 3500 35 2 ACA 13221 9000 450 2500 27 2 ASA … … … … … … … CID Amount Installm Salary Sal/Inst Age Len Age+Le n Task 13210 20000 2000 2000 1.00 25 1 26 NR 13220 25000 1200 3500 2.92 35 2 37 NE 13221 9000 450 2500 5.56 27 2 29 NE 13219 8500 750 2000 2.67 25 1 26 ASA 13220 25000 1200 3500 2.92 35 2 37 ACA 13221 9000 450 2500 5.56 27 2 29 ASA … … … … … … … … … CID Amount Installm Salary Sal/Inst Age Len Age+Le n Task 13210 20000 2000 2000 1.00 25 1 26 NR 13220 25000 1200 3500 2.92 35 2 37 NE 13221 9000 450 2500 5.56 27 2 29 NE 13219 8500 750 2000 2.67 25 1 26 ASA 13220 25000 1200 3500 2.92 35 2 37 ACA 13221 9000 450 2500 5.56 27 2 29 ASA … … … … … … … … …
  • 17. Conclusions • We developed a technique for discovering branching conditions from event logs • Complex expressions (e.g., “v op v”, linear inequalities, etc.) • More compact than those mined with conventional decision trees • Integration into ProM • Implemented as a command line tool • Validation with real-life logs • Assessed with synthetically generated event logs • Areas for extensions • Coping with noise in the event logs • Handling of null values • Extending the coverage to more complex types of expressions DiscoveringBranchingConditions 16