SlideShare ist ein Scribd-Unternehmen logo
1 von 22
IBM Research - India, New Delhi, India‡
IBM TJ Watson Research Center, New York, USA†
   Free form diagramming tools (e.g., Visio, Powerpoint) are preferred in
    creation for initial process models
     Ease of use, Intuitiveness
     Ubiquity
     Doesn’t hinder your creativity

   Process modeling software (e.g., WBM, ARIS) create models with formal
    underpinnings
       Allow formal analysis, model checking
       Process Reuse
       Process Improvement
       Traceability with realized executable process

   Sound, automatic approach to convert process diagrams to formal process
    models is essential
     A bridge between the worlds of diagramming and formal modeling


          September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Challenges
     Ambiguities in diagrams
     Limitation of existing capabilities

   Approach
     Structure Inference
     Semantic Interpretation

   Empirical Study

   Related Work & Future directions
        September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Human can interpret different visual cues in
drawings to correctly resolve the structure and
semantics of the models, but machines cannot
do the same!


  September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Connectors not glued to shapes at their endpoints


                                          Missing
                                          Edge




                                          Missing
                                          Edge



                                          Missing
                                          Edge
Text annotations not explicitly part of any shape for node/edge
Same shape conveys multiple semantics




                           Same semantic conveyed in multiple shapes




September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Popular BPM tools such as Websphere Business
    Modeler, ARIS, Lombardi, Telelogic System Architect,
    have Visio import capabilities
   Create imprecise flow structure when faced with
    structural ambiguities
   Employ a simple mapping (fixed or pluggable) from a
    set of diagram shapes to a target set of process
    semantics to interpret semantics
     Such an approach cannot deal with under-specification
     Building an exhaustive mapping is painful in presence of
      over-specification

        September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Process                                           Diagram Parsing
Diagram                                             Attributes such as             Use format specific
                     Parse information
                                                 coordinates, dimensions,          SDKs or parse XML
                   about diagram shapes
                                                     text, geometry                     formats


Shapes &
Attributes
                                               Structure Inference
                   Precisely determine the          Deal with structural       Extract features for each
                    underlying flow graph              ambiguities                  node and edge
 Flow
 Graph

                                            Semantic Interpretation
                     Assign process semantics to every
Process            node and edge in the flow graph using
                                                                Supervised and unsupervised schemes
 Model                                                                 to train such a classifier
                             a trained classifier

      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Process                                           Diagram Parsing
Diagram                                             Attributes such as             Use format specific
                     Parse information
                                                 coordinates, dimensions,          SDKs or parse XML
                   about diagram shapes
                                                     text, geometry                     formats


Shapes &
Attributes
                                               Structure Inference
                   Precisely determine the          Deal with structural       Extract features for each
                    underlying flow graph              ambiguities                  node and edge
 Flow
 Graph

                                            Semantic Interpretation
                     Assign process semantics to every
Process            node and edge in the flow graph using
                                                                Supervised and unsupervised schemes
 Model                                                                 to train such a classifier
                             a trained classifier

      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
A                                         B A                                       B




                          C               D                        C                    D




September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
    Uses notion of connection
                                                                  points created at node – line
    SRC    SRC            SRC        SRC
                                     TGT
                                                                  and line – line intersections
    SRC    NEU            NEU
    C1     C2             C5         C8
A                                             B                  Assign direction to connection
          C3 TGT        UNK C6
                                                                  points
                         SRC
          SRC C4         TGT   C7                                Starting at connection points
                                                                  attached to nodes, propagate
            C               D                                     their directions along paths in
                                                                  which the directions are
                                                                  consistent and identifies the
                                                                  reached nodes
                                                                 Create edges if connection
                                                                  point at reached node has a
                                                                  different direction


          September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Process                                           Diagram Parsing
Diagram                                             Attributes such as             Use format specific
                     Parse information
                                                 coordinates, dimensions,          SDKs or parse XML
                   about diagram shapes
                                                     text, geometry                     formats


Shapes &
Attributes
                                               Structure Inference
                   Precisely determine the          Deal with structural       Extract features for each
                    underlying flow graph              ambiguities                  node and edge
 Flow
 Graph

                                            Semantic Interpretation
                     Assign process semantics to every
Process            node and edge in the flow graph using
                                                                Supervised and unsupervised schemes
 Model                                                                 to train such a classifier
                             a trained classifier

      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Train a classifier to mimic human reasoning
    to decide process semantics
   Features used for classification:
     Relational: Indegree, Outdegree, Count of nodes
      contained within
     Geometric: Shape name, Count of horizontal,
      vertical, diagonal lines
     Textual: Count of cue words for every target entity


    September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Structure                             {Nodes, Edges}
        Flow                                                                    Annotated by
      Diagrams                           Inference
                                                                                  Features




              Classifier                     {Nodes, Edges}
                                          Annotated by Features
                                           + Process Semantic




Classifier establishes correspondence                                      An expert labels all nodes &
between the features and labels for                                        edges in the input set of
process semantics                                                          diagrams by their semantics

           September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Flow                         Structure                      {Nodes, Edges}                     Clusterer
Diagrams                       Inference                       Annotated by
                                                                 Features




                                                   Cluster A = Semantic X
       {Nodes, Edges}                                                                                          Cluster A
    Annotated by Features
     + Process Semantic                            Cluster B = Semantic Y                                      Cluster B



                                                                                                              Clusters
                                                                                                                have
                                                                    An expert looks at                        common
                                                                    exemplars from each                      semantics
           Classifier                                               cluster to label process
                                                                    semantic of the cluster


           September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Data Set: 185 Visio process diagrams created in real
    business-transformation projects
   Objective: Compare accuracy of our tool iDISCOVER
    and a popular modeling tool (called PMT for
    proprietary reasons)
   Method: Compare tool outputs with models created
    manually by human experts to measure precision &
    recall
   Precision =         |Actual ∩ Retrieved| , Recall = |Actual ∩ Retrieved|
                            |Retrieved|                      |Actual|

        September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Node                 96.93                   95.91                    70.44                   86.29
Edge                 93.26                   90.86                    63.43                   59.87




Dangling                   47 (100%)                     3 (14%)                        56%
Connector
Unlinked Labels            46 (39%)                      2 (3.7%)                       38%


Count of dangling connectors has a greater correlation with the edge recall of
(ρ = −0.48) than with the edge recall of           (ρ = −0.08).
            September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
Our            (Overall Δ ≈30%) and           (Overall Δ ≈20%) for all process semantic classes
 •Accuracy is low only for scarce entities like Intermediate Events and Data Objects (together
are greater than that of
 less than 3% of the data set)
                 is almost as good as
 •Better results possible with a more equitable distribution of entities work almost
        Size of the training data need not be huge. Classification could
        as well with only a third of the dataset size


             September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Large body of work in the area of
    understanding line drawings and hand
    sketches (e.g., Futrelle, Gross, Barbu)
     Focus on identifying shape geometry
     Semantic interpretation follows directly from a
      fixed mapping between shape geometry and target
      semantics
     Visual Language theory prescribes geometry
      detection with grammar rules.
      September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   More efficient modeling of textual cues
     Text is the only reliable feature in highly ambiguous
      scenarios

   Tracking spatial patterns of shapes and labels
    that emerge due to local styles
   Identification of higher-level relations (block
    structures) between model entities (e.g., sub-
    process, loop, and fork-merge)
   Extend to other diagram types
       September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
   Informal process diagrams contain structural and
    semantic ambiguities – need to be dealt with in order to
    discover precise formal models
   Existing capabilities are limited because:
       Do not resolve structural ambiguities
       Interpreting semantic based on shape name does not suffice

   Standard pattern-classification techniques can be
    successfully employed in interpreting process semantics if
    the feature space is carefully modeled to mimic human
    reasoning
     Unsupervised clustering can almost match supervised
        techniques in performance

          September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA

Weitere ähnliche Inhalte

Ähnlich wie From Informal Process Diagrams To Formal Process Models

Software Design_Se lect16 btech
Software Design_Se lect16 btechSoftware Design_Se lect16 btech
Software Design_Se lect16 btechIIITA
 
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)Denis Gagné
 
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...Sotiris Koussouris
 
Event-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesEvent-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesIstvan Rath
 
scenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug Walkerscenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug WalkerOpenPlans
 
Open source print quality software
Open source print quality softwareOpen source print quality software
Open source print quality softwareChristophe Mercier
 
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)Steve Lange
 
Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Marco Brambilla
 
Axel uhl sap@md-day2011
Axel uhl sap@md-day2011Axel uhl sap@md-day2011
Axel uhl sap@md-day2011MDDAY11
 
Advanced Excel
Advanced Excel Advanced Excel
Advanced Excel eduCBA
 
Star Ccm604 Features Final
Star Ccm604 Features FinalStar Ccm604 Features Final
Star Ccm604 Features FinalEd Erb
 
An interactive exploration environment for complex process design
An interactive exploration environment for complex process designAn interactive exploration environment for complex process design
An interactive exploration environment for complex process designEric Fraga
 
BA conf presentation 2010
BA conf presentation 2010BA conf presentation 2010
BA conf presentation 2010Julen Mohanty
 
01.egovFrame Training Book II
01.egovFrame Training Book II01.egovFrame Training Book II
01.egovFrame Training Book IIChuong Nguyen
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architecturedrewz lin
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architectureTot Bob
 
Software Architecture: views and viewpoints
Software Architecture: views and viewpointsSoftware Architecture: views and viewpoints
Software Architecture: views and viewpointsHenry Muccini
 
On the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented SoftwareOn the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented SoftwareRaffi Khatchadourian
 
Framework Engineering_Final
Framework Engineering_FinalFramework Engineering_Final
Framework Engineering_FinalYoungSu Son
 

Ähnlich wie From Informal Process Diagrams To Formal Process Models (20)

Software Design_Se lect16 btech
Software Design_Se lect16 btechSoftware Design_Se lect16 btech
Software Design_Se lect16 btech
 
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)BPM Standards -  What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
BPM Standards - What is new in BPMN 2.0 and XPDL 2.2 (BBC 2011)
 
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
Heterogeneous Domains’ e-Business Transactions Interoperability with the use ...
 
Event-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesEvent-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling Languages
 
scenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug Walkerscenario planning at the city scale | Doug Walker
scenario planning at the city scale | Doug Walker
 
Open source print quality software
Open source print quality softwareOpen source print quality software
Open source print quality software
 
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
PHX Session #5 : Architecture Without Big Design Up Front (Garibay)
 
Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...Industrial and Academic Experiences with a User Interaction Modeling Language...
Industrial and Academic Experiences with a User Interaction Modeling Language...
 
Axel uhl sap@md-day2011
Axel uhl sap@md-day2011Axel uhl sap@md-day2011
Axel uhl sap@md-day2011
 
Advanced Excel
Advanced Excel Advanced Excel
Advanced Excel
 
Star Ccm604 Features Final
Star Ccm604 Features FinalStar Ccm604 Features Final
Star Ccm604 Features Final
 
An interactive exploration environment for complex process design
An interactive exploration environment for complex process designAn interactive exploration environment for complex process design
An interactive exploration environment for complex process design
 
BA conf presentation 2010
BA conf presentation 2010BA conf presentation 2010
BA conf presentation 2010
 
01.egovFrame Training Book II
01.egovFrame Training Book II01.egovFrame Training Book II
01.egovFrame Training Book II
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architecture
 
4+1view architecture
4+1view architecture4+1view architecture
4+1view architecture
 
Software Architecture: views and viewpoints
Software Architecture: views and viewpointsSoftware Architecture: views and viewpoints
Software Architecture: views and viewpoints
 
On the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented SoftwareOn the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
On the Assessment of Pointcut Design in Evolving Aspect-Oriented Software
 
Software design
Software designSoftware design
Software design
 
Framework Engineering_Final
Framework Engineering_FinalFramework Engineering_Final
Framework Engineering_Final
 

Mehr von Debdoot Mukherjee

Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersDebdoot Mukherjee
 
Determining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDetermining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDebdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Debdoot Mukherjee
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Debdoot Mukherjee
 
Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Debdoot Mukherjee
 

Mehr von Debdoot Mukherjee (6)

Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge Workers
 
Determining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL CompositionsDetermining QoS of WS-BPEL Compositions
Determining QoS of WS-BPEL Compositions
 
Scc talk
Scc talkScc talk
Scc talk
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
 
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
 
Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?Which Work-Item Updates Need Your Response?
Which Work-Item Updates Need Your Response?
 

Kürzlich hochgeladen

Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Kürzlich hochgeladen (20)

Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

From Informal Process Diagrams To Formal Process Models

  • 1. IBM Research - India, New Delhi, India‡ IBM TJ Watson Research Center, New York, USA†
  • 2. Free form diagramming tools (e.g., Visio, Powerpoint) are preferred in creation for initial process models  Ease of use, Intuitiveness  Ubiquity  Doesn’t hinder your creativity  Process modeling software (e.g., WBM, ARIS) create models with formal underpinnings  Allow formal analysis, model checking  Process Reuse  Process Improvement  Traceability with realized executable process  Sound, automatic approach to convert process diagrams to formal process models is essential  A bridge between the worlds of diagramming and formal modeling September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 3. Challenges  Ambiguities in diagrams  Limitation of existing capabilities  Approach  Structure Inference  Semantic Interpretation  Empirical Study  Related Work & Future directions September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 4. Human can interpret different visual cues in drawings to correctly resolve the structure and semantics of the models, but machines cannot do the same! September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 5. Connectors not glued to shapes at their endpoints Missing Edge Missing Edge Missing Edge
  • 6. Text annotations not explicitly part of any shape for node/edge
  • 7. Same shape conveys multiple semantics Same semantic conveyed in multiple shapes September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 8. Popular BPM tools such as Websphere Business Modeler, ARIS, Lombardi, Telelogic System Architect, have Visio import capabilities  Create imprecise flow structure when faced with structural ambiguities  Employ a simple mapping (fixed or pluggable) from a set of diagram shapes to a target set of process semantics to interpret semantics  Such an approach cannot deal with under-specification  Building an exhaustive mapping is painful in presence of over-specification September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 9. Process Diagram Parsing Diagram Attributes such as Use format specific Parse information coordinates, dimensions, SDKs or parse XML about diagram shapes text, geometry formats Shapes & Attributes Structure Inference Precisely determine the Deal with structural Extract features for each underlying flow graph ambiguities node and edge Flow Graph Semantic Interpretation Assign process semantics to every Process node and edge in the flow graph using Supervised and unsupervised schemes Model to train such a classifier a trained classifier September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 10. Process Diagram Parsing Diagram Attributes such as Use format specific Parse information coordinates, dimensions, SDKs or parse XML about diagram shapes text, geometry formats Shapes & Attributes Structure Inference Precisely determine the Deal with structural Extract features for each underlying flow graph ambiguities node and edge Flow Graph Semantic Interpretation Assign process semantics to every Process node and edge in the flow graph using Supervised and unsupervised schemes Model to train such a classifier a trained classifier September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 11. A B A B C D C D September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 12. Uses notion of connection points created at node – line SRC SRC SRC SRC TGT and line – line intersections SRC NEU NEU C1 C2 C5 C8 A B  Assign direction to connection C3 TGT UNK C6 points SRC SRC C4 TGT C7  Starting at connection points attached to nodes, propagate C D their directions along paths in which the directions are consistent and identifies the reached nodes  Create edges if connection point at reached node has a different direction September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 13. Process Diagram Parsing Diagram Attributes such as Use format specific Parse information coordinates, dimensions, SDKs or parse XML about diagram shapes text, geometry formats Shapes & Attributes Structure Inference Precisely determine the Deal with structural Extract features for each underlying flow graph ambiguities node and edge Flow Graph Semantic Interpretation Assign process semantics to every Process node and edge in the flow graph using Supervised and unsupervised schemes Model to train such a classifier a trained classifier September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 14. Train a classifier to mimic human reasoning to decide process semantics  Features used for classification:  Relational: Indegree, Outdegree, Count of nodes contained within  Geometric: Shape name, Count of horizontal, vertical, diagonal lines  Textual: Count of cue words for every target entity September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 15. Structure {Nodes, Edges} Flow Annotated by Diagrams Inference Features Classifier {Nodes, Edges} Annotated by Features + Process Semantic Classifier establishes correspondence An expert labels all nodes & between the features and labels for edges in the input set of process semantics diagrams by their semantics September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 16. Flow Structure {Nodes, Edges} Clusterer Diagrams Inference Annotated by Features Cluster A = Semantic X {Nodes, Edges} Cluster A Annotated by Features + Process Semantic Cluster B = Semantic Y Cluster B Clusters have An expert looks at common exemplars from each semantics Classifier cluster to label process semantic of the cluster September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 17. Data Set: 185 Visio process diagrams created in real business-transformation projects  Objective: Compare accuracy of our tool iDISCOVER and a popular modeling tool (called PMT for proprietary reasons)  Method: Compare tool outputs with models created manually by human experts to measure precision & recall  Precision = |Actual ∩ Retrieved| , Recall = |Actual ∩ Retrieved| |Retrieved| |Actual| September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 18. Node 96.93 95.91 70.44 86.29 Edge 93.26 90.86 63.43 59.87 Dangling 47 (100%) 3 (14%) 56% Connector Unlinked Labels 46 (39%) 2 (3.7%) 38% Count of dangling connectors has a greater correlation with the edge recall of (ρ = −0.48) than with the edge recall of (ρ = −0.08). September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 19. Our (Overall Δ ≈30%) and (Overall Δ ≈20%) for all process semantic classes •Accuracy is low only for scarce entities like Intermediate Events and Data Objects (together are greater than that of less than 3% of the data set) is almost as good as •Better results possible with a more equitable distribution of entities work almost Size of the training data need not be huge. Classification could as well with only a third of the dataset size September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 20. Large body of work in the area of understanding line drawings and hand sketches (e.g., Futrelle, Gross, Barbu)  Focus on identifying shape geometry  Semantic interpretation follows directly from a fixed mapping between shape geometry and target semantics  Visual Language theory prescribes geometry detection with grammar rules. September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 21. More efficient modeling of textual cues  Text is the only reliable feature in highly ambiguous scenarios  Tracking spatial patterns of shapes and labels that emerge due to local styles  Identification of higher-level relations (block structures) between model entities (e.g., sub- process, loop, and fork-merge)  Extend to other diagram types September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA
  • 22. Informal process diagrams contain structural and semantic ambiguities – need to be dealt with in order to discover precise formal models  Existing capabilities are limited because:  Do not resolve structural ambiguities  Interpreting semantic based on shape name does not suffice  Standard pattern-classification techniques can be successfully employed in interpreting process semantics if the feature space is carefully modeled to mimic human reasoning  Unsupervised clustering can almost match supervised techniques in performance September 14 ,2010, International Conference on Business Process Management, Hoboken, NJ, USA