SlideShare ist ein Scribd-Unternehmen logo
1 von 19
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Event Logs
What kind of data does
process mining require?
prof.dr.ir. Wil van der Aalst
www.processmining.org
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Getting the "right" data …
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Event log
• We assume the existence of an event log where
each event refers to a case, an activity, and a
point in time.
• An event log can be seen as a collection of cases.
• A case can be seen as a trace/sequence of
events.
event data can be found everywhere!
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Event data may come from …
• a database system (e.g., patient data in a hospital),
• a comma-separated values (CSV) file or spreadsheet,
• a transaction log (e.g., a trading system),
• a business suite/ERP system (SAP, Oracle, etc.),
• a message log (e.g., from IBM middleware),
• an open API providing data from websites or social
media, …
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
An example log
student name course name exam date mark
Peter Jones Business Information systems 16-1-2014 8
Sandy Scott Business Information systems 16-1-2014 5
Bridget White Business Information systems 16-1-2014 9
John Anderson Business Information systems 16-1-2014 8
Sandy Scott BPM Systems 17-1-2014 7
Bridget White BPM Systems 17-1-2014 8
Sandy Scott Process Mining 20-1-2014 5
Bridget White Process Mining 20-1-2014 9
John Anderson Process Mining 20-1-2014 8
… … … …
case id activity name timestamp other data
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Another event log: order handling
order
number
activity timestamp user product quantity
9901 register order 22-1-2014@09.15 Sara Jones iPhone5S 1
9902 register order 22-1-2014@09.18 Sara Jones iPhone5S 2
9903 register order 22-1-2014@09.27 Sara Jones iPhone4S 1
9901 check stock 22-1-2014@09.49 Pete Scott iPhone5S 1
9901 ship order 22-1-2014@10.11 Sue Fox iPhone5S 1
9903 check stock 22-1-2014@10.34 Pete Scott iPhone4S 1
9901 handle payment 22-1-2014@10.41 Carol Hope iPhone5S 1
9902 check stock 22-1-2014@10.57 Pete Scott iPhone5S 2
9902 cancel order 22-1-2014@11.08 Carol Hope iPhone5S 2
… … … … … …
case id activity name timestamp other dataresource
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Another event log: patient treatment
patient activity timestamp doctor age cost
5781 make X-ray 23-1-2014@10.30 Dr. Jones 45 70.00
5541 blood test 23-1-2014@10.18 Dr. Scott 61 40.00
5833 blood test 23-1-2014@10.27 Dr. Scott 24 40.00
5781 blood test 23-1-2014@10.49 Dr. Scott 45 40.00
5781 CT scan 23-1-2014@11.10 Dr. Fox 45 1200.00
5833 surgery 23-1-2014@12.34 Dr. Scott 24 2300.00
5781 handle payment 23-1-2014@12.41 Carol Hope 45 0.00
5541 radiation therapy 23-1-2014@13.57 Dr. Jones 61 140.00
5541 radiation therapy 23-1-2014@13.08 Dr. Jones 61 140.00
… … … … … …
case id activity name timestamp other dataresource
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Minimal requirements in terms of a CSV file
• Each row corresponds to an event.
• There are at least three columns:
• Case id (patient id, order number, claim number, …)
• Activity name (approve, reject, request, send, …)
• Timestamp (2015-08-18T06:36:40, …)
• There may be many other (optional) columns:
resource, transaction type, age, costs, etc.
order number activity timestamp user product quantity
9901 register order 22-1-2014@09.15 Sara Jones iPhone5S 1
9902 register order 22-1-2014@09.18 Sara Jones iPhone5S 2
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Extensions
• Transactional information on activity instances:
An event can represent
a start, complete,
suspend, resume,
abort, etc.
• Case versus event attributes:
− case attributes do not change, e.g., the birth date or
gender of a patient,
− event attributes are related to a particular step in the
process.
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
XES (eXtensible Event Stream)
• Adopted by the IEEE Task Force on Process Mining.
• The format is supported by tools such as ProM and
Disco (used in this course).
• Predecessors: MXML and SA-MXML.
• Conversion from other formats (CSV) is easy if the
right data are available.
• XML syntax and OpenXES library available.
• See www.xes-standard.org.
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Simplistic view on event data
order
number
activity timestamp user product quantity
9901 register order 22-1-2014@09.15 Sara Jones iPhone5S 1
9902 register order 22-1-2014@09.18 Sara Jones iPhone5S 2
9903 register order 22-1-2014@09.27 Sara Jones iPhone4S 1
9901 check stock 22-1-2014@09.49 Pete Scott iPhone5S 1
9901 ship order 22-1-2014@10.11 Sue Fox iPhone5S 1
9903 check stock 22-1-2014@10.34 Pete Scott iPhone4S 1
9901 handle payment 22-1-2014@10.41 Carol Hope iPhone5S 1
9902 check stock 22-1-2014@10.57 Pete Scott iPhone5S 2
9902 cancel order 22-1-2014@11.08 Carol Hope iPhone5S 2
… … … … … …
case id activity name timestamp other dataresource
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
A more refined view
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Transactional model for activities
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Atomic activities
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Activities that have a duration
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Converting Event Data to XES
• Easy with tools like ProM and Disco!
Import CSV file, then apply
“Convert CSV to XES” plug-in.
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Learn more about event logs
• Introduction: Chapter 4 of Process Mining: Discovery,
Conformance and Enhancement of Business Processes
by W.M.P. van der Aalst, Springer Verlag, 2011 (ISBN
978-3-642-19344-6)
• XES standard: www.xes-standard.org
• More advanced: W.M.P. van der Aalst. Extracting Event
Data from Databases to Unleash Process Mining. In BPM
- Driving Innovation in a Digital World, Springer-Verlag,
Berlin, 2015. http://dx.doi.org/10.1007/978-3-319-14430-6_8
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Example logs files (XES)
• http://data.3tu.nl/repository/collection:event_logs_real
• http://data.3tu.nl/repository/collection:event_logs_synthetic
• http://www.win.tue.nl/ieeetfpm/doku.php?
id=shared:process_mining_logs
• http://www.processmining.org/logs/start
Event data are everywhere!
What is your excuse?

Weitere ähnliche Inhalte

Was ist angesagt?

Process Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - IntroductionProcess Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - IntroductionWil van der Aalst
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Databricks
 
BI & Big data use case for banking - by rully feranata
BI & Big data use case for banking - by rully feranataBI & Big data use case for banking - by rully feranata
BI & Big data use case for banking - by rully feranataRully Feranata
 
The Roadmap for SQL Server 2019
The Roadmap for SQL Server 2019The Roadmap for SQL Server 2019
The Roadmap for SQL Server 2019Amit Banerjee
 
Process Mining - Chapter 3 - Data Mining
Process Mining - Chapter 3 - Data MiningProcess Mining - Chapter 3 - Data Mining
Process Mining - Chapter 3 - Data MiningWil van der Aalst
 
스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS
스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS
스타트업 사례로 본 로그 데이터 분석 : Tajo on AWSMatthew (정재화)
 
Real-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiReal-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiManish Gupta
 
Process Mining - Chapter 2 - Process Modeling and Analysis
Process Mining - Chapter 2 - Process Modeling and AnalysisProcess Mining - Chapter 2 - Process Modeling and Analysis
Process Mining - Chapter 2 - Process Modeling and AnalysisWil van der Aalst
 
[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?
[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?
[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?Juhong Park
 
Process Mining and Predictive Process Monitoring
Process Mining and Predictive Process MonitoringProcess Mining and Predictive Process Monitoring
Process Mining and Predictive Process MonitoringMarlon Dumas
 
Splunk Overview
Splunk OverviewSplunk Overview
Splunk OverviewSplunk
 
Process Mining: Understanding and Improving Desire Lines in Big Data
Process Mining: Understanding and Improving Desire Lines in Big DataProcess Mining: Understanding and Improving Desire Lines in Big Data
Process Mining: Understanding and Improving Desire Lines in Big DataWil van der Aalst
 
Road to NODES - Healthcare Analytics
Road to NODES - Healthcare AnalyticsRoad to NODES - Healthcare Analytics
Road to NODES - Healthcare AnalyticsNeo4j
 
Honu/Big Data @ Riot Games
Honu/Big Data @ Riot GamesHonu/Big Data @ Riot Games
Honu/Big Data @ Riot GamesJerome Boulon
 
Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020Harald Erb
 
Fifth Elephant Apache Atlas Talk
Fifth Elephant Apache Atlas TalkFifth Elephant Apache Atlas Talk
Fifth Elephant Apache Atlas TalkVimal Sharma
 

Was ist angesagt? (20)

Process Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - IntroductionProcess Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - Introduction
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
 
BI & Big data use case for banking - by rully feranata
BI & Big data use case for banking - by rully feranataBI & Big data use case for banking - by rully feranata
BI & Big data use case for banking - by rully feranata
 
The Roadmap for SQL Server 2019
The Roadmap for SQL Server 2019The Roadmap for SQL Server 2019
The Roadmap for SQL Server 2019
 
Process Mining - Chapter 3 - Data Mining
Process Mining - Chapter 3 - Data MiningProcess Mining - Chapter 3 - Data Mining
Process Mining - Chapter 3 - Data Mining
 
스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS
스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS
스타트업 사례로 본 로그 데이터 분석 : Tajo on AWS
 
Observability at Spotify
Observability at SpotifyObservability at Spotify
Observability at Spotify
 
Real-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiReal-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFi
 
Process Mining - Chapter 2 - Process Modeling and Analysis
Process Mining - Chapter 2 - Process Modeling and AnalysisProcess Mining - Chapter 2 - Process Modeling and Analysis
Process Mining - Chapter 2 - Process Modeling and Analysis
 
Data streaming fundamentals
Data streaming fundamentalsData streaming fundamentals
Data streaming fundamentals
 
[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?
[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?
[KAIST 채용설명회] 데이터 엔지니어는 무슨 일을 하나요?
 
Process Mining and Predictive Process Monitoring
Process Mining and Predictive Process MonitoringProcess Mining and Predictive Process Monitoring
Process Mining and Predictive Process Monitoring
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Splunk Overview
Splunk OverviewSplunk Overview
Splunk Overview
 
Process Mining: Understanding and Improving Desire Lines in Big Data
Process Mining: Understanding and Improving Desire Lines in Big DataProcess Mining: Understanding and Improving Desire Lines in Big Data
Process Mining: Understanding and Improving Desire Lines in Big Data
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
 
Road to NODES - Healthcare Analytics
Road to NODES - Healthcare AnalyticsRoad to NODES - Healthcare Analytics
Road to NODES - Healthcare Analytics
 
Honu/Big Data @ Riot Games
Honu/Big Data @ Riot GamesHonu/Big Data @ Riot Games
Honu/Big Data @ Riot Games
 
Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020
 
Fifth Elephant Apache Atlas Talk
Fifth Elephant Apache Atlas TalkFifth Elephant Apache Atlas Talk
Fifth Elephant Apache Atlas Talk
 

Mehr von Wil van der Aalst

Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)
Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)
Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)Wil van der Aalst
 
Everything You Always Wanted To Know About Petri Nets, But Were Afraid To Ask
Everything You Always Wanted To Know About Petri Nets, But Were Afraid To AskEverything You Always Wanted To Know About Petri Nets, But Were Afraid To Ask
Everything You Always Wanted To Know About Petri Nets, But Were Afraid To AskWil van der Aalst
 
Earth Movers’ Stochastic Conformance Checking
Earth Movers’ Stochastic Conformance CheckingEarth Movers’ Stochastic Conformance Checking
Earth Movers’ Stochastic Conformance CheckingWil van der Aalst
 
Using Process Mining to Remove Operational Friction in Shared Services
Using Process Mining to Remove Operational Friction in Shared ServicesUsing Process Mining to Remove Operational Friction in Shared Services
Using Process Mining to Remove Operational Friction in Shared ServicesWil van der Aalst
 
Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...
Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...
Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...Wil van der Aalst
 
Configurable Declare: Designing Customizable Flexible Models
Configurable Declare: Designing Customizable Flexible ModelsConfigurable Declare: Designing Customizable Flexible Models
Configurable Declare: Designing Customizable Flexible ModelsWil van der Aalst
 
On the Role of Fitness, Precision, Generalization and Simplicity in Process D...
On the Role of Fitness, Precision, Generalization and Simplicity in Process D...On the Role of Fitness, Precision, Generalization and Simplicity in Process D...
On the Role of Fitness, Precision, Generalization and Simplicity in Process D...Wil van der Aalst
 
A Decade of Business Process Management Conferences: Reflections on a Develop...
A Decade of Business Process Management Conferences: Reflections on a Develop...A Decade of Business Process Management Conferences: Reflections on a Develop...
A Decade of Business Process Management Conferences: Reflections on a Develop...Wil van der Aalst
 
Business Process Configuration in the Cloud: How to Support and Analyze Multi...
Business Process Configuration in the Cloud: How to Support and Analyze Multi...Business Process Configuration in the Cloud: How to Support and Analyze Multi...
Business Process Configuration in the Cloud: How to Support and Analyze Multi...Wil van der Aalst
 
Discovering Concurrency: Learning (Business) Process Models from Examples
Discovering Concurrency: Learning (Business) Process Models from ExamplesDiscovering Concurrency: Learning (Business) Process Models from Examples
Discovering Concurrency: Learning (Business) Process Models from ExamplesWil van der Aalst
 
Distributed Process Discovery and Conformance Checking
Distributed Process Discovery and Conformance CheckingDistributed Process Discovery and Conformance Checking
Distributed Process Discovery and Conformance CheckingWil van der Aalst
 
Service Interaction: Patterns, Formalization, and Analysis
Service Interaction: Patterns, Formalization, and AnalysisService Interaction: Patterns, Formalization, and Analysis
Service Interaction: Patterns, Formalization, and AnalysisWil van der Aalst
 
Keynote Gartner Business Process Management Summit, February 2009, London
Keynote Gartner Business Process Management Summit, February 2009, London Keynote Gartner Business Process Management Summit, February 2009, London
Keynote Gartner Business Process Management Summit, February 2009, London Wil van der Aalst
 
Keynote on Process Mining at SSCI 2010 / CIDM 2011
Keynote on Process Mining at SSCI 2010 / CIDM 2011Keynote on Process Mining at SSCI 2010 / CIDM 2011
Keynote on Process Mining at SSCI 2010 / CIDM 2011Wil van der Aalst
 
Discovering Petri Nets: Evidence-Based Business Process Management
Discovering Petri Nets: Evidence-Based Business Process ManagementDiscovering Petri Nets: Evidence-Based Business Process Management
Discovering Petri Nets: Evidence-Based Business Process ManagementWil van der Aalst
 
TomTom for Business Process Managment (TomTom4BPM)
TomTom for Business Process Managment (TomTom4BPM)TomTom for Business Process Managment (TomTom4BPM)
TomTom for Business Process Managment (TomTom4BPM)Wil van der Aalst
 
Keynote at 18th International Conference on Cooperative Information Systems (...
Keynote at 18th International Conference on Cooperative Information Systems (...Keynote at 18th International Conference on Cooperative Information Systems (...
Keynote at 18th International Conference on Cooperative Information Systems (...Wil van der Aalst
 
Process Mining - Chapter 14 - Epilogue
Process Mining - Chapter 14 - EpilogueProcess Mining - Chapter 14 - Epilogue
Process Mining - Chapter 14 - EpilogueWil van der Aalst
 
Process Mining - Chapter 12 - Analyzing Spaghetti Processes
Process Mining - Chapter 12 - Analyzing Spaghetti ProcessesProcess Mining - Chapter 12 - Analyzing Spaghetti Processes
Process Mining - Chapter 12 - Analyzing Spaghetti ProcessesWil van der Aalst
 
Process Mining - Chapter 10 - Tool Support
Process Mining - Chapter 10 - Tool SupportProcess Mining - Chapter 10 - Tool Support
Process Mining - Chapter 10 - Tool SupportWil van der Aalst
 

Mehr von Wil van der Aalst (20)

Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)
Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)
Process Mining: BPM on Steroids (CPOs@BPM&O 2019 Keynote)
 
Everything You Always Wanted To Know About Petri Nets, But Were Afraid To Ask
Everything You Always Wanted To Know About Petri Nets, But Were Afraid To AskEverything You Always Wanted To Know About Petri Nets, But Were Afraid To Ask
Everything You Always Wanted To Know About Petri Nets, But Were Afraid To Ask
 
Earth Movers’ Stochastic Conformance Checking
Earth Movers’ Stochastic Conformance CheckingEarth Movers’ Stochastic Conformance Checking
Earth Movers’ Stochastic Conformance Checking
 
Using Process Mining to Remove Operational Friction in Shared Services
Using Process Mining to Remove Operational Friction in Shared ServicesUsing Process Mining to Remove Operational Friction in Shared Services
Using Process Mining to Remove Operational Friction in Shared Services
 
Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...
Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...
Object-Centric Process Mining: Dealing With Divergence and Convergence in Eve...
 
Configurable Declare: Designing Customizable Flexible Models
Configurable Declare: Designing Customizable Flexible ModelsConfigurable Declare: Designing Customizable Flexible Models
Configurable Declare: Designing Customizable Flexible Models
 
On the Role of Fitness, Precision, Generalization and Simplicity in Process D...
On the Role of Fitness, Precision, Generalization and Simplicity in Process D...On the Role of Fitness, Precision, Generalization and Simplicity in Process D...
On the Role of Fitness, Precision, Generalization and Simplicity in Process D...
 
A Decade of Business Process Management Conferences: Reflections on a Develop...
A Decade of Business Process Management Conferences: Reflections on a Develop...A Decade of Business Process Management Conferences: Reflections on a Develop...
A Decade of Business Process Management Conferences: Reflections on a Develop...
 
Business Process Configuration in the Cloud: How to Support and Analyze Multi...
Business Process Configuration in the Cloud: How to Support and Analyze Multi...Business Process Configuration in the Cloud: How to Support and Analyze Multi...
Business Process Configuration in the Cloud: How to Support and Analyze Multi...
 
Discovering Concurrency: Learning (Business) Process Models from Examples
Discovering Concurrency: Learning (Business) Process Models from ExamplesDiscovering Concurrency: Learning (Business) Process Models from Examples
Discovering Concurrency: Learning (Business) Process Models from Examples
 
Distributed Process Discovery and Conformance Checking
Distributed Process Discovery and Conformance CheckingDistributed Process Discovery and Conformance Checking
Distributed Process Discovery and Conformance Checking
 
Service Interaction: Patterns, Formalization, and Analysis
Service Interaction: Patterns, Formalization, and AnalysisService Interaction: Patterns, Formalization, and Analysis
Service Interaction: Patterns, Formalization, and Analysis
 
Keynote Gartner Business Process Management Summit, February 2009, London
Keynote Gartner Business Process Management Summit, February 2009, London Keynote Gartner Business Process Management Summit, February 2009, London
Keynote Gartner Business Process Management Summit, February 2009, London
 
Keynote on Process Mining at SSCI 2010 / CIDM 2011
Keynote on Process Mining at SSCI 2010 / CIDM 2011Keynote on Process Mining at SSCI 2010 / CIDM 2011
Keynote on Process Mining at SSCI 2010 / CIDM 2011
 
Discovering Petri Nets: Evidence-Based Business Process Management
Discovering Petri Nets: Evidence-Based Business Process ManagementDiscovering Petri Nets: Evidence-Based Business Process Management
Discovering Petri Nets: Evidence-Based Business Process Management
 
TomTom for Business Process Managment (TomTom4BPM)
TomTom for Business Process Managment (TomTom4BPM)TomTom for Business Process Managment (TomTom4BPM)
TomTom for Business Process Managment (TomTom4BPM)
 
Keynote at 18th International Conference on Cooperative Information Systems (...
Keynote at 18th International Conference on Cooperative Information Systems (...Keynote at 18th International Conference on Cooperative Information Systems (...
Keynote at 18th International Conference on Cooperative Information Systems (...
 
Process Mining - Chapter 14 - Epilogue
Process Mining - Chapter 14 - EpilogueProcess Mining - Chapter 14 - Epilogue
Process Mining - Chapter 14 - Epilogue
 
Process Mining - Chapter 12 - Analyzing Spaghetti Processes
Process Mining - Chapter 12 - Analyzing Spaghetti ProcessesProcess Mining - Chapter 12 - Analyzing Spaghetti Processes
Process Mining - Chapter 12 - Analyzing Spaghetti Processes
 
Process Mining - Chapter 10 - Tool Support
Process Mining - Chapter 10 - Tool SupportProcess Mining - Chapter 10 - Tool Support
Process Mining - Chapter 10 - Tool Support
 

Kürzlich hochgeladen

Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...HyderabadDolls
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...SOFTTECHHUB
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样wsppdmt
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...kumargunjan9515
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...HyderabadDolls
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...HyderabadDolls
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxronsairoathenadugay
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...gajnagarg
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...gajnagarg
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...HyderabadDolls
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 

Kürzlich hochgeladen (20)

Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 

Event Logs: What kind of data does process mining require?

  • 1. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Event Logs What kind of data does process mining require? prof.dr.ir. Wil van der Aalst www.processmining.org
  • 2. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Getting the "right" data …
  • 3. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Event log • We assume the existence of an event log where each event refers to a case, an activity, and a point in time. • An event log can be seen as a collection of cases. • A case can be seen as a trace/sequence of events.
  • 4. event data can be found everywhere!
  • 5. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Event data may come from … • a database system (e.g., patient data in a hospital), • a comma-separated values (CSV) file or spreadsheet, • a transaction log (e.g., a trading system), • a business suite/ERP system (SAP, Oracle, etc.), • a message log (e.g., from IBM middleware), • an open API providing data from websites or social media, …
  • 6. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) An example log student name course name exam date mark Peter Jones Business Information systems 16-1-2014 8 Sandy Scott Business Information systems 16-1-2014 5 Bridget White Business Information systems 16-1-2014 9 John Anderson Business Information systems 16-1-2014 8 Sandy Scott BPM Systems 17-1-2014 7 Bridget White BPM Systems 17-1-2014 8 Sandy Scott Process Mining 20-1-2014 5 Bridget White Process Mining 20-1-2014 9 John Anderson Process Mining 20-1-2014 8 … … … … case id activity name timestamp other data
  • 7. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Another event log: order handling order number activity timestamp user product quantity 9901 register order 22-1-2014@09.15 Sara Jones iPhone5S 1 9902 register order 22-1-2014@09.18 Sara Jones iPhone5S 2 9903 register order 22-1-2014@09.27 Sara Jones iPhone4S 1 9901 check stock 22-1-2014@09.49 Pete Scott iPhone5S 1 9901 ship order 22-1-2014@10.11 Sue Fox iPhone5S 1 9903 check stock 22-1-2014@10.34 Pete Scott iPhone4S 1 9901 handle payment 22-1-2014@10.41 Carol Hope iPhone5S 1 9902 check stock 22-1-2014@10.57 Pete Scott iPhone5S 2 9902 cancel order 22-1-2014@11.08 Carol Hope iPhone5S 2 … … … … … … case id activity name timestamp other dataresource
  • 8. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Another event log: patient treatment patient activity timestamp doctor age cost 5781 make X-ray 23-1-2014@10.30 Dr. Jones 45 70.00 5541 blood test 23-1-2014@10.18 Dr. Scott 61 40.00 5833 blood test 23-1-2014@10.27 Dr. Scott 24 40.00 5781 blood test 23-1-2014@10.49 Dr. Scott 45 40.00 5781 CT scan 23-1-2014@11.10 Dr. Fox 45 1200.00 5833 surgery 23-1-2014@12.34 Dr. Scott 24 2300.00 5781 handle payment 23-1-2014@12.41 Carol Hope 45 0.00 5541 radiation therapy 23-1-2014@13.57 Dr. Jones 61 140.00 5541 radiation therapy 23-1-2014@13.08 Dr. Jones 61 140.00 … … … … … … case id activity name timestamp other dataresource
  • 9. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Minimal requirements in terms of a CSV file • Each row corresponds to an event. • There are at least three columns: • Case id (patient id, order number, claim number, …) • Activity name (approve, reject, request, send, …) • Timestamp (2015-08-18T06:36:40, …) • There may be many other (optional) columns: resource, transaction type, age, costs, etc. order number activity timestamp user product quantity 9901 register order 22-1-2014@09.15 Sara Jones iPhone5S 1 9902 register order 22-1-2014@09.18 Sara Jones iPhone5S 2
  • 10. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Extensions • Transactional information on activity instances: An event can represent a start, complete, suspend, resume, abort, etc. • Case versus event attributes: − case attributes do not change, e.g., the birth date or gender of a patient, − event attributes are related to a particular step in the process.
  • 11. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) XES (eXtensible Event Stream) • Adopted by the IEEE Task Force on Process Mining. • The format is supported by tools such as ProM and Disco (used in this course). • Predecessors: MXML and SA-MXML. • Conversion from other formats (CSV) is easy if the right data are available. • XML syntax and OpenXES library available. • See www.xes-standard.org.
  • 12. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Simplistic view on event data order number activity timestamp user product quantity 9901 register order 22-1-2014@09.15 Sara Jones iPhone5S 1 9902 register order 22-1-2014@09.18 Sara Jones iPhone5S 2 9903 register order 22-1-2014@09.27 Sara Jones iPhone4S 1 9901 check stock 22-1-2014@09.49 Pete Scott iPhone5S 1 9901 ship order 22-1-2014@10.11 Sue Fox iPhone5S 1 9903 check stock 22-1-2014@10.34 Pete Scott iPhone4S 1 9901 handle payment 22-1-2014@10.41 Carol Hope iPhone5S 1 9902 check stock 22-1-2014@10.57 Pete Scott iPhone5S 2 9902 cancel order 22-1-2014@11.08 Carol Hope iPhone5S 2 … … … … … … case id activity name timestamp other dataresource
  • 13. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) A more refined view
  • 14. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Transactional model for activities
  • 15. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Atomic activities
  • 16. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Activities that have a duration
  • 17. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Converting Event Data to XES • Easy with tools like ProM and Disco! Import CSV file, then apply “Convert CSV to XES” plug-in.
  • 18. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Learn more about event logs • Introduction: Chapter 4 of Process Mining: Discovery, Conformance and Enhancement of Business Processes by W.M.P. van der Aalst, Springer Verlag, 2011 (ISBN 978-3-642-19344-6) • XES standard: www.xes-standard.org • More advanced: W.M.P. van der Aalst. Extracting Event Data from Databases to Unleash Process Mining. In BPM - Driving Innovation in a Digital World, Springer-Verlag, Berlin, 2015. http://dx.doi.org/10.1007/978-3-319-14430-6_8
  • 19. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Example logs files (XES) • http://data.3tu.nl/repository/collection:event_logs_real • http://data.3tu.nl/repository/collection:event_logs_synthetic • http://www.win.tue.nl/ieeetfpm/doku.php? id=shared:process_mining_logs • http://www.processmining.org/logs/start Event data are everywhere! What is your excuse?