SlideShare ist ein Scribd-Unternehmen logo
1 von 61
Downloaden Sie, um offline zu lesen
INTERACTIVE EXPLORATION
OF EVENT SEQUENCES
IN TEMPORAL CATEGORICAL DATA

KRIST WONGSUPHASAWAT
DEPARTMENT OF COMPUTER SCIENCE &
HUMAN-COMPUTER INTERACTION LAB
UNIVERSITY OF MARYLAND                                   al!
                                                      pos
                                          at  ion Pro
                                   Dissert 2010!
                                            ,
                                   April 30
INTERACTIVE EXPLORATION
OF EVENT SEQUENCES
IN TEMPORAL CATEGORICAL DATA

KRIST WONGSUPHASAWAT
DEPARTMENT OF COMPUTER SCIENCE &
HUMAN-COMPUTER INTERACTION LAB
UNIVERSITY OF MARYLAND                                   al!
                                                      pos
                                          at  ion Pro
                                   Dissert 2010!
                                            ,
                                   April 30
TEMPORAL CATEGORICAL DATA
•  A type of time series                Category!
 Numerical                    Categorical
                                                          Event!
       Stock: Microsoft           Patient ID: 45851737
 04/26/2010&10:00   &31.03&   12/02/2008&14:26   &Admit&
 04/26/2010&10:15   &31.01&   12/02/2008&14:26   &Emergency&
 04/26/2010&10:30   &31.02&   12/02/2008&22:44   &ICU&
 04/26/2010&10:45   &31.08&   12/05/2008&05:07   &Floor&
 04/26/2010&11:00   &31.16&   12/08/2008&10:02   &Floor&
 04/26/2010&11:15   &31.15&   12/14/2008&06:19   &Exit&
                              &           Time
                                   Admit
                                   Emergency
                                        ICU
                                                  Floor
                                                           Exit
TEMPORAL CATEGORICAL DATA




•    Electronic Health Records: symptoms, treatment, lab test
•    Traffic incident logs: arrival/departure time of each unit
•    Student records: course, paper, proposal, defense, etc.
•    Usability study logs
•    Etc.
working with physicians at
WASHINGTON HOSPITAL CENTER
INTERACTIVE EXPLORATION
OF EVENT SEQUENCES
IN TEMPORAL CATEGORICAL DATA

KRIST WONGSUPHASAWAT
DEPARTMENT OF COMPUTER SCIENCE &
HUMAN-COMPUTER INTERACTION LAB
UNIVERSITY OF MARYLAND                                   al!
                                                      pos
                                          at  ion Pro
                                   Dissert 2010!
                                            ,
                                   April 30
EVENT SEQUENCES
•  Examples: “Bounce backs”

              ICU      Floor      ICU



                         within 2 days
RESEARCH STATEMENT
           “my research aims to design
             effective visualization
           and interaction techniques
          to support users in exploring
 event sequences in temporal categorical data.”

       •  a flexible temporal search approach
•  a novel visualization that provides an overview
                  of multiple records
RESEARCH
              QUESTION#1
             PRELIM. + PROPOSED
MOTIVATION         WORK             CONCLUSION




        RESEARCH           RESEARCH
        QUESTIONS         QUESTION#2
                         PRELIM. + PROPOSED
                               WORK
TRADITIONAL SEARCH MODEL

Know exactly what they want



                         Query                          Data     *
   How to specify the query?                            How to display the data?
   SQL, TSQL, GUI, ...

                         How to process the query?
                         How to make the search efficient?
                         Indexing methods, algorithms
EXPLORATORY SEARCH MODEL
Uncertain about what they
are looking for
Know exactly what they want
                                  Insight, Hypotheses   *
                         search
               Query                 Data
                         refine,
                         formulate
UNCERTAIN
                     Floor        ICU      Exit-Alive



                     QUERY            within 2 days

Find something
useful and display


                     RESULTS
                                                        Frustrated!
                     found 0 record
EXPLORATORY SEARCH MODEL
Uncertain about what they
are looking for
Know exactly what they want
                                  Insight, Hypotheses             *
                         search
               Query                  Data
                         refine,
                         formulate



                                       How to display the data?
                                  Difficult to understand the
                                  event sequences in the data
                                  quickly
NEEDS A STARTING POINT
 Show overview
 or summary




Where should I start?
RESEARCH QUESTIONS
1.  How to support the users when they are
    uncertain about what they are looking for?
2.  How to provide an overview of event
    sequences for temporal categorical data?
RESEARCH QUESTIONS
1.  How to support the users when they are
    uncertain about what they are looking for?
2.  How to provide an overview of event
    sequences for temporal categorical data?
APPROACHES
Exact Search!       Similarity-based Search!
MUST have A, B, C      SHOULD have A, B, C


      Query                  Query


    Record#1                Record#2

                                             more
    Record#2                Record#1
                                             similar

    Record#3                Record#3
SIMILARITY-BASED SEARCH


                     How to specify query?
What is “similar”?
                     How to display results?




                         User Interface!
Similarity Measure
                         & Visualization
SIMILARITY MEASURE
Gives a score that shows how much a record is similar to the query.
                       Min = 0 / Max = 1



             Query                     Record#1      0.80

                                       Record#2      0.63

                                       Record#3      0.96

                                       Record#4      0.77
SIMILARITY MEASURE (2)



      Query             Record#3   0.96

                        Record#1   0.80
                more
              similar
                        Record#4   0.77

                        Record#2   0.63
CHALLENGE
•  What is “similar”?       depends&on&users/tasks&
    Record#1
    (Query)       A                           B   C

    Record#2                    missing
                  A                           B   C

    Record#3                                              extra
                  A                           B   C   D

    Record#4
                            Time difference
                  A     B                         C
               time
MATCH & MISMATCH MEASURE
                time


  Record#1
  (Query)              A             C    B              C



  Record#2
                       A         B        C                   B


            Matched Events           Missing Event      Extra Event




                                          }
      Time Difference
      Number of Swapping                      Total Score
      Number of Missing Events                       0 to 1
      Number of Extra Events
RELATED WORK
•  Similarity measure
  –  For numerical time series
     •  [Ding et al. 2008], [Liu et al. 2006], [Berndt and Clifford 1994], ...

  –  Edit distance
     •  [Levenshtein 1966], [Winkler 1999], [Chen 2004], ...

  –  Biological sequence searching
     •  [Pearson and Lipman 1988], [Altschul et al. 1990], ...

  –  Approximate graph matching
     •  [Tian et al. 2007], [Tian et al. 2008] , ...

  –  Temporal Categorical Data
     •  [Sherkat 2006], ...
PROTOTYPE: SIMILAN
DEMO - DATA
•  Patient transfers in the emergency department
•  Real data from hospital (Jan – Mar 2010)

      ADMIT           Admission time
      EMERGENCY       Emergency room
      ICU             Intensive Care Unit
      INTERMEDIATE    Intermediate Medical Care
      FLOOR           Normal room
      EXIT-ALIVE      Leave the hospital alive
      EXIT-DEAD       Leave the hospital dead
DEMO - SIMILAN
This is when bugs are more likely to occur.!
CONTROLLED EXPERIMENT
•  18 participants
•  2 interfaces by 5 tasks
  –  Similarity-based Search (Similan)
  –  Exact Search (LifeLines2)




•  Time, error, preference
X = Exact Search , S = Similarity-based Search
RESULTS
         Sequence         Counting              Time const.           Uncertainty

        Sequence          CountSpeed (seconds)
                                            Time                      Uncertain
60                  15                  150                   150

40                  10                  100                   100

20                  5                     50                  50

    0               0                       0                     0
          X     S          X       S
                                   R                X    S                 X    S
                         Preference (7-points likert scale)
8                   8                   8                     8
6                   6                   6                     6
4                   4                   4                     4
2                   2                   2                     2
0                   0                   0                     0
         X      S         X        S            X       S
                                                        R              X       S
FLEXIBLE TEMPORAL SEARCH
(FTS)
PROPOSED WORK: FTS
•  Draw an example to specify query
•  Specify flexible and inflexible constraints

          MUST HAVE              RANGE      ?


                 MUST NOT HAVE           optional



•  Algorithms
PROPOSED WORK: FTS (2)
      •  Combine exact and similarity-based search



                                              Primary Bin
similar




                                           Pass all constraints




                                             Secondary Bin
similar




                                                The rest
RESEARCH QUESTIONS
1.  How to support the users when they are
    uncertain about what they are looking for?
2.  How to provide an overview of event
    sequences for temporal categorical data?
WHEN YOU READ A PAPER…
•  What is it about?




         INFORMATION VISUALIZATION MANTRA
         OVERVIEW FIRST,
         ZOOM AND FILTER,
         THEN DETAILS ON DEMAND
RELATED WORK
•  Single-record visualizations

      Patient ID: 45851737
   12/02/2008&14:26   &Admit&
   12/02/2008&14:26   &Emergency&   Single-record"
   12/02/2008&22:44   &ICU&         Visualization
   12/05/2008&05:07   &Floor&
   12/08/2008&10:02   &Floor&
   12/14/2008&06:19   &Exit&
   &

•  E.g. LifeLines, MIDGAARD, etc.
RELATED WORK (2)
•  Multiple instances of single-record visualizations

    Patient ID: 45851737
     Patient ID: 45851737
      Patient ID: 45851737
 12/02/2008&14:26 &Admit&
  12/02/2008&14:26 45851737
       Patient ID: &Admit&
 12/02/2008&14:26 &Emergency&
    12/02/2008&14:26 &Admit&
 12/02/2008&22:44 &Emergency&
   12/02/2008&14:26 &ICU&
      12/02/2008&14:26 &Admit&
    12/02/2008&14:26 &Emergency&
   12/02/2008&22:44 &ICU&
      12/02/2008&14:26 &Emergency&                Visualization
 12/05/2008&05:07 &Floor&
    12/02/2008&22:44 &ICU&                         Visualization
                                                     Single-record"
                                                     Visualization
   12/05/2008&05:07 &Floor&
      12/02/2008&22:44 &ICU&
 12/08/2008&10:02 &Floor&
    12/05/2008&05:07 &Floor&                         Visualization
   12/08/2008&10:02 &Floor&
      12/05/2008&05:07 &Floor&
 12/14/2008&06:19 &Exit&
    12/08/2008&10:02 &Floor&
 & 12/14/2008&06:19 &Exit&
      12/08/2008&10:02 &Floor&
    12/14/2008&06:19 &Exit&
   & 12/14/2008&06:19 &Exit&
    &
      &
•  E.g. LifeLines2, Continuum, ActiviTree, etc.
More space please....
RELATED WORK (3)
•  Multiple-record visualizations

    Patient ID: 45851737
     Patient ID: 45851737
      Patient ID: 45851737
 12/02/2008&14:26 &Admit&
  12/02/2008&14:26 45851737
       Patient ID: &Admit&
 12/02/2008&14:26 &Emergency&
    12/02/2008&14:26 &Admit&
 12/02/2008&22:44 &Emergency&
   12/02/2008&14:26 &ICU&
      12/02/2008&14:26 &Admit&
    12/02/2008&14:26 &Emergency&     Multiple-record"
   12/02/2008&22:44 &ICU&
      12/02/2008&14:26 &Emergency&
 12/05/2008&05:07 &Floor&
    12/02/2008&22:44 &ICU&            Visualization
   12/05/2008&05:07 &Floor&
      12/02/2008&22:44 &ICU&
 12/08/2008&10:02 &Floor&
    12/05/2008&05:07 &Floor&
   12/08/2008&10:02 &Floor&
      12/05/2008&05:07 &Floor&
 12/14/2008&06:19 &Exit&
    12/08/2008&10:02 &Floor&
 & 12/14/2008&06:19 &Exit&
      12/08/2008&10:02 &Floor&
    12/14/2008&06:19 &Exit&
   & 12/14/2008&06:19 &Exit&
    &
      &
•  E.g. LifeLines2, Continuum
LifeLines2’s Temporal Summary [Wang et al. 2009]




           Continuum’s Histogram [Andre 2007]



 Not so useful with many event types
Does not help exploring event sequences
OVERVIEW OF EVENT SEQUENCES
•  Scalability vs. Loss of information

    Patient ID: 45851737
     Patient ID: 45851737
      Patient ID: 45851737
 12/02/2008&14:26 &Admit&
  12/02/2008&14:26 &Admit&
 12/02/2008&14:26 &Emergency&


                                         ?
    12/02/2008&14:26 &Admit&
 12/02/2008&22:44 &Emergency&
  12/02/2008&14:26 &ICU&
    12/02/2008&14:26 &Emergency&
  12/02/2008&22:44 &ICU&
 12/05/2008&05:07 &Floor&
    12/02/2008&22:44 &ICU&
 12/08/2008&10:02 &Floor&
  12/05/2008&05:07 &Floor&
    12/05/2008&05:07 &Floor&
  12/08/2008&10:02 &Floor&
 12/14/2008&06:19 &Exit&
    12/08/2008&10:02 &Floor&
  12/14/2008&06:19 &Exit&
 & 12/14/2008&06:19 &Exit&
  &
    &

•  Goal: Show common trends, outliers
LIFEFLOW
         AGGREGATE

Merge multiple records into tree




          VISUALIZE
        Display the tree
AGGREGATE
•  Aggregate by prefix

     #1
     #2
     #3
     #4




               Example with 4 records
AGGREGATE
•  Aggregate by prefix

     #1
     #2
     #3
     #4
VISUALIZE
•  Inspired by the Icicle tree




        Number of files!
VISUALIZE (2)
•  Modify: Use horizontal axis to represent time
•  Video
DEMO - LIFEFLOW
When the lines are combined into flow!
LIFEFLOW
•  Overview
  –  See trends
  –  Spot outliers
•  Query
•  Compare data
COMPARISON                     ICU
    ICU         Intermediate               Intermediate




          Jan-Mar 2008               April-June 2008
Floor
PROPOSED WORK: LIFEFLOW
•  Designing interaction
PROPOSED WORK: LIFEFLOW
•  Primary
  –  Including non-temporal attributes
     •  Age, Gender, etc.
  –  Link with Flexible Temporal Search
•  Optional
  –  Rank-by-feature
  –  Complex aggregation
  –  Multiple alignments
PROPOSED WORK: LIFEFLOW
•  Including non-temporal attributes
  –  [Pinto et al. 2001]

           Male

           Male

           Female


                                       Male

                                  Male

                                   Female
PROPOSED WORK: LIFEFLOW
•  Complex aggregation


    #7
    #8


                    Not just merge by prefix

    #9

    #10


           Do not merge if the time gap is too different
PROPOSED WORK: LIFEFLOW
•  Multiple alignments
            ICU     Floor   ICU
EVALUATION PLAN
•  Flexible Temporal Search
   –  Controlled Experiment
      •  18-20 participants / 60-90 mins
•  LifeFlow
   –  Usability Study
      •  15-20 participants / 60-90 mins
   –  MILCs
      •  3-5 domain experts / 5-12 weeks
CONCLUSION
Summary and expected contributions!
EXPLORATORY SEARCH MODEL
Uncertain about what they
are looking for
Know exactly what they want
                                  Insight, Hypotheses             *
                         search
               Query                  Data
                         refine,
                         formulate



                                       How to display the data?
                                  Difficult to understand the
                                  event sequences in the data
                                  quickly
RESEARCH QUESTIONS

1.  How to support the users when they are
    uncertain about what they are looking for?
   Flexible Temporal Search

2.  How to provide an overview of event
    sequences for temporal categorical data?
   LifeFlow: an overview visualization
EXPECTED CONTRIBUTIONS
1.  Design of visual representations, user
    interfaces and interaction techniques
2.  Algorithms for flexible temporal search
3.  Evaluation results
4.  Open new directions for exploring
    temporal categorical data
ACKNOWLEDGEMENT
  DR. PHUONG HO, DR. MARK SMITH, DAVID ROSEMAN
           WASHINGTON HOSPITAL CENTER
             http://www.whcenter.org

       NATIONAL INSTITUTES OF HEALTH (NIH)
                http://www.nih.gov

        MICHAEL PACK, MICHAEL VANDANIKER
CENTER FOR ADVANCED TRANPORTATION TECHNOLOGY LAB
                   (CATT LAB)
            http://www.cattlab.umd.edu
Q&A
Questions?!
THANK YOU
Thank you!

Weitere ähnliche Inhalte

Ähnlich wie Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration of Event Sequences in Temporal Categorical Data

Lecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfLecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdf
Jojo314349
 

Ähnlich wie Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration of Event Sequences in Temporal Categorical Data (20)

The Business Research Method
The Business Research MethodThe Business Research Method
The Business Research Method
 
Lecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfLecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdf
 
KDD, Data Mining, Data Science_I.pptx
KDD, Data Mining, Data Science_I.pptxKDD, Data Mining, Data Science_I.pptx
KDD, Data Mining, Data Science_I.pptx
 
Research and collection of data
Research and collection of dataResearch and collection of data
Research and collection of data
 
The Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalThe Semantic Web - This time... its Personal
The Semantic Web - This time... its Personal
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?
 
Data Science Workshop - day 1
Data Science Workshop - day 1Data Science Workshop - day 1
Data Science Workshop - day 1
 
Research
ResearchResearch
Research
 
Learning from Time-to-Event Data from Online Learning Contexts
Learning from Time-to-Event Data from Online Learning Contexts Learning from Time-to-Event Data from Online Learning Contexts
Learning from Time-to-Event Data from Online Learning Contexts
 
Csci 6530 2016 fall presentation
Csci 6530 2016 fall presentationCsci 6530 2016 fall presentation
Csci 6530 2016 fall presentation
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformatics
 
Reproducibility for IR evaluation
Reproducibility for IR evaluationReproducibility for IR evaluation
Reproducibility for IR evaluation
 
Reproducibility for IR evaluation
Reproducibility for IR evaluationReproducibility for IR evaluation
Reproducibility for IR evaluation
 
Baselining Logs
Baselining LogsBaselining Logs
Baselining Logs
 
Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirShare and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
 
CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scale
 
Brainwave Feature Extraction, Classification & Prediction
Brainwave Feature Extraction, Classification & PredictionBrainwave Feature Extraction, Classification & Prediction
Brainwave Feature Extraction, Classification & Prediction
 
Secondary Data presentation.pptx
Secondary Data presentation.pptxSecondary Data presentation.pptx
Secondary Data presentation.pptx
 

Mehr von Krist Wongsuphasawat

Logs & Visualizations at Twitter
Logs & Visualizations at TwitterLogs & Visualizations at Twitter
Logs & Visualizations at Twitter
Krist Wongsuphasawat
 
A Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapA Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament Recap
Krist Wongsuphasawat
 

Mehr von Krist Wongsuphasawat (20)

What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizing
 
Navigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesNavigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization Libraries
 
Encodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsEncodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization Components
 
6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)
 
Increasing the Impact of Visualization Research
Increasing the Impact of Visualization ResearchIncreasing the Impact of Visualization Research
Increasing the Impact of Visualization Research
 
6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizing
 
What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)
 
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Dataร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
 
Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...
 
What to expect when you are visualizing
What to expect when you are visualizingWhat to expect when you are visualizing
What to expect when you are visualizing
 
Adventure in Data: A tour of visualization projects at Twitter
Adventure in Data: A tour of visualization projects at TwitterAdventure in Data: A tour of visualization projects at Twitter
Adventure in Data: A tour of visualization projects at Twitter
 
Logs & Visualizations at Twitter
Logs & Visualizations at TwitterLogs & Visualizations at Twitter
Logs & Visualizations at Twitter
 
d3Kit
d3Kitd3Kit
d3Kit
 
Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science Enthusiasts
 
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
 
Data Visualization at Twitter
Data Visualization at TwitterData Visualization at Twitter
Data Visualization at Twitter
 
Making Sense of Millions of Thoughts: Finding Patterns in the Tweets
Making Sense of Millions of Thoughts: Finding Patterns in the TweetsMaking Sense of Millions of Thoughts: Finding Patterns in the Tweets
Making Sense of Millions of Thoughts: Finding Patterns in the Tweets
 
From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?
 
A Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapA Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament Recap
 
Visualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationVisualization for Event Sequences Exploration
Visualization for Event Sequences Exploration
 

KĂźrzlich hochgeladen

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Enterprise Knowledge
 

KĂźrzlich hochgeladen (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration of Event Sequences in Temporal Categorical Data

  • 1. INTERACTIVE EXPLORATION OF EVENT SEQUENCES IN TEMPORAL CATEGORICAL DATA KRIST WONGSUPHASAWAT DEPARTMENT OF COMPUTER SCIENCE & HUMAN-COMPUTER INTERACTION LAB UNIVERSITY OF MARYLAND al! pos at ion Pro Dissert 2010! , April 30
  • 2. INTERACTIVE EXPLORATION OF EVENT SEQUENCES IN TEMPORAL CATEGORICAL DATA KRIST WONGSUPHASAWAT DEPARTMENT OF COMPUTER SCIENCE & HUMAN-COMPUTER INTERACTION LAB UNIVERSITY OF MARYLAND al! pos at ion Pro Dissert 2010! , April 30
  • 3. TEMPORAL CATEGORICAL DATA •  A type of time series Category! Numerical Categorical Event! Stock: Microsoft Patient ID: 45851737 04/26/2010&10:00 &31.03& 12/02/2008&14:26 &Admit& 04/26/2010&10:15 &31.01& 12/02/2008&14:26 &Emergency& 04/26/2010&10:30 &31.02& 12/02/2008&22:44 &ICU& 04/26/2010&10:45 &31.08& 12/05/2008&05:07 &Floor& 04/26/2010&11:00 &31.16& 12/08/2008&10:02 &Floor& 04/26/2010&11:15 &31.15& 12/14/2008&06:19 &Exit& & Time Admit Emergency ICU Floor Exit
  • 4. TEMPORAL CATEGORICAL DATA •  Electronic Health Records: symptoms, treatment, lab test •  Traffic incident logs: arrival/departure time of each unit •  Student records: course, paper, proposal, defense, etc. •  Usability study logs •  Etc.
  • 5. working with physicians at WASHINGTON HOSPITAL CENTER
  • 6. INTERACTIVE EXPLORATION OF EVENT SEQUENCES IN TEMPORAL CATEGORICAL DATA KRIST WONGSUPHASAWAT DEPARTMENT OF COMPUTER SCIENCE & HUMAN-COMPUTER INTERACTION LAB UNIVERSITY OF MARYLAND al! pos at ion Pro Dissert 2010! , April 30
  • 7. EVENT SEQUENCES •  Examples: “Bounce backs” ICU Floor ICU within 2 days
  • 8. RESEARCH STATEMENT “my research aims to design effective visualization and interaction techniques to support users in exploring event sequences in temporal categorical data.” •  a flexible temporal search approach •  a novel visualization that provides an overview of multiple records
  • 9. RESEARCH QUESTION#1 PRELIM. + PROPOSED MOTIVATION WORK CONCLUSION RESEARCH RESEARCH QUESTIONS QUESTION#2 PRELIM. + PROPOSED WORK
  • 10. TRADITIONAL SEARCH MODEL Know exactly what they want Query Data * How to specify the query? How to display the data? SQL, TSQL, GUI, ... How to process the query? How to make the search efficient? Indexing methods, algorithms
  • 11. EXPLORATORY SEARCH MODEL Uncertain about what they are looking for Know exactly what they want Insight, Hypotheses * search Query Data refine, formulate
  • 12. UNCERTAIN Floor ICU Exit-Alive QUERY within 2 days Find something useful and display RESULTS Frustrated! found 0 record
  • 13. EXPLORATORY SEARCH MODEL Uncertain about what they are looking for Know exactly what they want Insight, Hypotheses * search Query Data refine, formulate How to display the data? Difficult to understand the event sequences in the data quickly
  • 14. NEEDS A STARTING POINT Show overview or summary Where should I start?
  • 15. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? 2.  How to provide an overview of event sequences for temporal categorical data?
  • 16. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? 2.  How to provide an overview of event sequences for temporal categorical data?
  • 17. APPROACHES Exact Search! Similarity-based Search! MUST have A, B, C SHOULD have A, B, C Query Query Record#1 Record#2 more Record#2 Record#1 similar Record#3 Record#3
  • 18. SIMILARITY-BASED SEARCH How to specify query? What is “similar”? How to display results? User Interface! Similarity Measure & Visualization
  • 19. SIMILARITY MEASURE Gives a score that shows how much a record is similar to the query. Min = 0 / Max = 1 Query Record#1 0.80 Record#2 0.63 Record#3 0.96 Record#4 0.77
  • 20. SIMILARITY MEASURE (2) Query Record#3 0.96 Record#1 0.80 more similar Record#4 0.77 Record#2 0.63
  • 21. CHALLENGE •  What is “similar”? depends&on&users/tasks& Record#1 (Query) A B C Record#2 missing A B C Record#3 extra A B C D Record#4 Time difference A B C time
  • 22. MATCH & MISMATCH MEASURE time Record#1 (Query) A C B C Record#2 A B C B Matched Events Missing Event Extra Event } Time Difference Number of Swapping Total Score Number of Missing Events 0 to 1 Number of Extra Events
  • 23. RELATED WORK •  Similarity measure –  For numerical time series •  [Ding et al. 2008], [Liu et al. 2006], [Berndt and Clifford 1994], ... –  Edit distance •  [Levenshtein 1966], [Winkler 1999], [Chen 2004], ... –  Biological sequence searching •  [Pearson and Lipman 1988], [Altschul et al. 1990], ... –  Approximate graph matching •  [Tian et al. 2007], [Tian et al. 2008] , ... –  Temporal Categorical Data •  [Sherkat 2006], ...
  • 25. DEMO - DATA •  Patient transfers in the emergency department •  Real data from hospital (Jan – Mar 2010) ADMIT Admission time EMERGENCY Emergency room ICU Intensive Care Unit INTERMEDIATE Intermediate Medical Care FLOOR Normal room EXIT-ALIVE Leave the hospital alive EXIT-DEAD Leave the hospital dead
  • 26. DEMO - SIMILAN This is when bugs are more likely to occur.!
  • 27. CONTROLLED EXPERIMENT •  18 participants •  2 interfaces by 5 tasks –  Similarity-based Search (Similan) –  Exact Search (LifeLines2) •  Time, error, preference
  • 28. X = Exact Search , S = Similarity-based Search RESULTS Sequence Counting Time const. Uncertainty Sequence CountSpeed (seconds) Time Uncertain 60 15 150 150 40 10 100 100 20 5 50 50 0 0 0 0 X S X S R X S X S Preference (7-points likert scale) 8 8 8 8 6 6 6 6 4 4 4 4 2 2 2 2 0 0 0 0 X S X S X S R X S
  • 30. PROPOSED WORK: FTS •  Draw an example to specify query •  Specify flexible and inflexible constraints MUST HAVE RANGE ? MUST NOT HAVE optional •  Algorithms
  • 31. PROPOSED WORK: FTS (2) •  Combine exact and similarity-based search Primary Bin similar Pass all constraints Secondary Bin similar The rest
  • 32. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? 2.  How to provide an overview of event sequences for temporal categorical data?
  • 33. WHEN YOU READ A PAPER… •  What is it about? INFORMATION VISUALIZATION MANTRA OVERVIEW FIRST, ZOOM AND FILTER, THEN DETAILS ON DEMAND
  • 34. RELATED WORK •  Single-record visualizations Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& Single-record" 12/02/2008&22:44 &ICU& Visualization 12/05/2008&05:07 &Floor& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & •  E.g. LifeLines, MIDGAARD, etc.
  • 35.
  • 36. RELATED WORK (2) •  Multiple instances of single-record visualizations Patient ID: 45851737 Patient ID: 45851737 Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 45851737 Patient ID: &Admit& 12/02/2008&14:26 &Emergency& 12/02/2008&14:26 &Admit& 12/02/2008&22:44 &Emergency& 12/02/2008&14:26 &ICU& 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& 12/02/2008&22:44 &ICU& 12/02/2008&14:26 &Emergency& Visualization 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& Visualization Single-record" Visualization 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& Visualization 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& & 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & 12/14/2008&06:19 &Exit& & & •  E.g. LifeLines2, Continuum, ActiviTree, etc.
  • 38. RELATED WORK (3) •  Multiple-record visualizations Patient ID: 45851737 Patient ID: 45851737 Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 45851737 Patient ID: &Admit& 12/02/2008&14:26 &Emergency& 12/02/2008&14:26 &Admit& 12/02/2008&22:44 &Emergency& 12/02/2008&14:26 &ICU& 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& Multiple-record" 12/02/2008&22:44 &ICU& 12/02/2008&14:26 &Emergency& 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& Visualization 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& & 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & 12/14/2008&06:19 &Exit& & & •  E.g. LifeLines2, Continuum
  • 39. LifeLines2’s Temporal Summary [Wang et al. 2009] Continuum’s Histogram [Andre 2007] Not so useful with many event types Does not help exploring event sequences
  • 40. OVERVIEW OF EVENT SEQUENCES •  Scalability vs. Loss of information Patient ID: 45851737 Patient ID: 45851737 Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& ? 12/02/2008&14:26 &Admit& 12/02/2008&22:44 &Emergency& 12/02/2008&14:26 &ICU& 12/02/2008&14:26 &Emergency& 12/02/2008&22:44 &ICU& 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/05/2008&05:07 &Floor& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & 12/14/2008&06:19 &Exit& & & •  Goal: Show common trends, outliers
  • 41. LIFEFLOW AGGREGATE Merge multiple records into tree VISUALIZE Display the tree
  • 42. AGGREGATE •  Aggregate by prefix #1 #2 #3 #4 Example with 4 records
  • 44. VISUALIZE •  Inspired by the Icicle tree Number of files!
  • 45. VISUALIZE (2) •  Modify: Use horizontal axis to represent time •  Video
  • 46. DEMO - LIFEFLOW When the lines are combined into flow!
  • 47. LIFEFLOW •  Overview –  See trends –  Spot outliers •  Query •  Compare data
  • 48. COMPARISON ICU ICU Intermediate Intermediate Jan-Mar 2008 April-June 2008 Floor
  • 49. PROPOSED WORK: LIFEFLOW •  Designing interaction
  • 50. PROPOSED WORK: LIFEFLOW •  Primary –  Including non-temporal attributes •  Age, Gender, etc. –  Link with Flexible Temporal Search •  Optional –  Rank-by-feature –  Complex aggregation –  Multiple alignments
  • 51. PROPOSED WORK: LIFEFLOW •  Including non-temporal attributes –  [Pinto et al. 2001] Male Male Female Male Male Female
  • 52. PROPOSED WORK: LIFEFLOW •  Complex aggregation #7 #8 Not just merge by prefix #9 #10 Do not merge if the time gap is too different
  • 53. PROPOSED WORK: LIFEFLOW •  Multiple alignments ICU Floor ICU
  • 54. EVALUATION PLAN •  Flexible Temporal Search –  Controlled Experiment •  18-20 participants / 60-90 mins •  LifeFlow –  Usability Study •  15-20 participants / 60-90 mins –  MILCs •  3-5 domain experts / 5-12 weeks
  • 56. EXPLORATORY SEARCH MODEL Uncertain about what they are looking for Know exactly what they want Insight, Hypotheses * search Query Data refine, formulate How to display the data? Difficult to understand the event sequences in the data quickly
  • 57. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? Flexible Temporal Search 2.  How to provide an overview of event sequences for temporal categorical data? LifeFlow: an overview visualization
  • 58. EXPECTED CONTRIBUTIONS 1.  Design of visual representations, user interfaces and interaction techniques 2.  Algorithms for flexible temporal search 3.  Evaluation results 4.  Open new directions for exploring temporal categorical data
  • 59. ACKNOWLEDGEMENT DR. PHUONG HO, DR. MARK SMITH, DAVID ROSEMAN WASHINGTON HOSPITAL CENTER http://www.whcenter.org NATIONAL INSTITUTES OF HEALTH (NIH) http://www.nih.gov MICHAEL PACK, MICHAEL VANDANIKER CENTER FOR ADVANCED TRANPORTATION TECHNOLOGY LAB (CATT LAB) http://www.cattlab.umd.edu