SlideShare ist ein Scribd-Unternehmen logo
1 von 47
Sequential Patterns Wen-Chih Peng ( 彭文志 ) Dept. of Computer Science National Chiao-Tung University
Sequence Data Sequence Database: Object Timestamp Events A 10 2, 3, 5 A 20 6, 1 A 23 1 B 11 4, 5, 6 B 17 2 B 21 7, 8, 1, 2 B 28 1, 6 C 14 1, 8, 7
Examples of Sequence Data Sequence E1 E2 E1 E3 E2 E3 E4 E2 Element (Transaction) Event  (Item) Bases A,T,G,C An element of the DNA sequence  DNA sequence of a particular species Genome sequences Types of alarms generated by sensors  Events triggered by a sensor at time t History of events generated by a given sensor Event data Home page, index page, contact info, etc A collection of files viewed by a Web visitor after a single mouse click Browsing activity of a particular Web visitor Web Data Books, diary products, CDs, etc A set of items bought by a customer at time t Purchase history of a given customer Customer Event (Item) Element (Transaction) Sequence Sequence Database
Formal Definition of a Sequence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Examples of Sequence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Formal Definition of a Subsequence ,[object Object],[object Object],[object Object],< {2,4} {2,4} {2,5} > < {1,2} {3,4} >  < {2,4} {3,5,6} {8} > Data sequence Yes < {2} {4} > No < {1} {2} > Yes < {2} {3,5} > Contain? Subsequence
What Is Sequential Pattern Mining? ,[object Object],A  sequence database   A  sequence  : < (ef) (ab)  (df) c b > An element may contain a set of items. Items within an element are unordered and we list them alphabetically.   <a(bc)dc> is a  subsequence  of  < a (a bc )(ac) d ( c f)> Given  support threshold   min_sup  =2, <(ab)c> is a  sequential pattern <eg(af)cbc> 40 <(ef)( ab )(df) c b> 30 <(ad)c(bc)(ae)> 20 <a( ab c)(a c )d(cf)> 10 sequence SID
Sequential Pattern Mining: Definition ,[object Object],[object Object],[object Object],[object Object],[object Object]
Extracting Sequential Patterns ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Sequential Pattern Mining: Challenge ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Mining Sequential Patterns ,[object Object],[object Object],[object Object],[object Object],[object Object]
Sequential Patterns (cont’d) ,[object Object],[object Object],[object Object]
Algorithm ,[object Object],[object Object],[object Object],[object Object]
Algorithm (cont’d) ,[object Object],[object Object],[object Object],[object Object]
Original Database
Sort Phase
Customer-Sequence Version  of the Database
Large itemset Phase (support:2)
Transformation Phase
Sequence Phase ,[object Object],[object Object],2 <2 3 4> 2 <1 3 5> 3 <1 3 4>  2 <1 2 4> 2 <1 2 3> Support Sequence <1 3 5 4> <1 3 4 5> <1 2 4 3> <1 2 3 4>
<{1 5} {2} {3} {4}> <{1} {3} {4} {3 5}> <{1} {2} {3} {4}> <{1} {3} {5}> <{4} {5}> Customer Sequences Sequence Support <1> 4 <2> 2 <3> 4 <4> 4 <5> 4 Large 1-Sequences Sequence Support <1 2> 2 <1 3> 4 <1 4> 3 <1 5> 2 <2 3> 2 <2 4> 2 <3 4> 3 <3 5> 2 <4 5> 2 Large 2-Sequences Sequence Support <1 2 3> 2 <1 2 4> 2 <1 3 4> 3 <1 3 5> 2 <2 3 4> 2 Large 3-Sequences Sequence Support <1 2 3 4> 2 Large 4-Sequences Sequence Support <1 2 3 4> 2 <1 3 5> 2 <4 5> 2 Maximal Large Sequences
Maximal Sequence ,[object Object],[object Object],[object Object]
Sequential Patterns With support > 25% < (30) (90) > < (30) (40 70) > The answer set
Notes ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Rule Discovery from Time Sequences ,[object Object],[object Object],[object Object],[object Object],[object Object]
Episode Mining ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],E B A F B A C
Example of Episode Mining  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],0  10  20  30  40  50  60  70  80  90  100  110  120 130 140  150 D  C  A  B  D  A  B  C  A  D  C  A  B  D  A
Event Sequence ,[object Object],[object Object]
Sliding Window ,[object Object],[object Object],0  10  20  30  40  50  60  70  80  90 D  C  A  B  D  A  B  C
Frequency of an Episode ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Find Frequent Episodes ,[object Object],[object Object],[object Object],[object Object]
FreeSpan ,[object Object],[object Object],[object Object],[object Object]
Example of FreeSpan f_list = a: 4 ,b: 4 ,c: 4 ,d: 3 ,e: 3 ,f: 3 g is deleted because of support of g <2 . Example database: min support = 2 <eg(af)cbc> 40 <(ef)(ab)(df)cb> 30 <(ad)c(bc)(ae)> 20 <a(abc)(ac)d(cf)> 10 Sequence Sequence id
Example of FreeSpan (cont’d) ,[object Object],Frequent Patterns <a> <aa> => {a}-projected database <e( a f)cbc> 40 <(ef)( a b)(df)cb> 30 <( a d)c(bc)( a e)> 20 < a ( a bc)( a c)d(cf)> 10 Sequence Sequence id <a> 40 <a> 30 <aa> 20 <aaa> 10
Example of FreeSpan (cont’d) ,[object Object],Frequent Patterns <b> <ab> <ba> <(ab)> => {b}-projected database <e( a f)c b c> 40 <(ef)( ab )(df)c b > 30 <( a d)c( b c)( a e)> 20 < a ( ab c)( a c)d(cf)> 10 Sequence Sequence id <ab> 40 <(ab)b> 30 <aba> 20 <a(ab)a> 10
Example of FreeSpan (cont’d) ,[object Object],Frequent Patterns <c> <ac> <bc> <(bc)> <ca> <cb> <(ab)c> <acc> <acb> => {c}-projected database <e( a f) cbc > 40 <(ef)( ab )(df) cb > 30 <( a d) c ( bc )( a e)> 20 < a ( abc )( ac )d( c f)> 10 Sequence Sequence id <acbc> 40 <(ab)cb> 30 <ac(bc)a> 20 <a(abc)(ac)c> 10
PrefixSpan ,[object Object],[object Object],[object Object]
PrefixSpan - Concepts ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
PrefixSpan – Concepts ,[object Object],[object Object],[object Object],[object Object]
Example of PrefixSpan <a>-projected database By scanning <a>-projected database once, all the length-2 sequential patterns having prefix <a> can be found. <aa>:2  <ab>:4  <(ab)>:2  <ac>:4  <ad>:2  <af>:2 Recursively, patterns with prefix <a> can be partitioned into 6 subsets. => <e( a f)cbc > 40 <(ef)( a b)(df)cb > 30 <( a d)c(bc)(ae )> 20 < a (abc)(ac)d(cf) > 10 Sequence Sequence id <(_f)cbc> 40 <(_b)(df)cb> 30 <(_d)c(bc)(ae)> 20 <(abc)(ac)d(cf)> 10
Example of PrefixSpan (cont’d) < aa >-projected database => < ab >-projected database => Sequential patterns of <ab>-projected db: <(_c)>,<(_c)a>,<a><c> <(_f)c b c> 40 <(_b)(df)cb> 30 <(_d)c( b c)( a e)> 20 <( a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10 <(_e)> 20 <(_bc)(ac)d(cf)> 10
Example of PrefixSpan (cont’d) <b>-projected database Sequential patterns <b> <ba> <bc> <(bc)> <(bc)a> <bd> <bdc> <bf> => <e(af)c b c > 40 <(ef)(a b )(df)cb > 30 <(ad)c( b c)(ae )> 20 <a(a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(df)cb> 30 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10
References ,[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object],[object Object]

Weitere ähnliche Inhalte

Was ist angesagt?

Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3
guesta3202
 
Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)
James Clause
 
Обзор фреймворка Twisted
Обзор фреймворка TwistedОбзор фреймворка Twisted
Обзор фреймворка Twisted
Maxim Kulsha
 

Was ist angesagt? (20)

AST: threats and opportunities
AST: threats and opportunitiesAST: threats and opportunities
AST: threats and opportunities
 
Python memory management_v2
Python memory management_v2Python memory management_v2
Python memory management_v2
 
From java to kotlin beyond alt+shift+cmd+k - Droidcon italy
From java to kotlin beyond alt+shift+cmd+k - Droidcon italyFrom java to kotlin beyond alt+shift+cmd+k - Droidcon italy
From java to kotlin beyond alt+shift+cmd+k - Droidcon italy
 
Python 표준 라이브러리
Python 표준 라이브러리Python 표준 라이브러리
Python 표준 라이브러리
 
Yoyak ScalaDays 2015
Yoyak ScalaDays 2015Yoyak ScalaDays 2015
Yoyak ScalaDays 2015
 
The Ring programming language version 1.8 book - Part 53 of 202
The Ring programming language version 1.8 book - Part 53 of 202The Ring programming language version 1.8 book - Part 53 of 202
The Ring programming language version 1.8 book - Part 53 of 202
 
Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3
 
Python fundamentals - basic | WeiYuan
Python fundamentals - basic | WeiYuanPython fundamentals - basic | WeiYuan
Python fundamentals - basic | WeiYuan
 
Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)
 
Haskellで学ぶ関数型言語
Haskellで学ぶ関数型言語Haskellで学ぶ関数型言語
Haskellで学ぶ関数型言語
 
PDBC
PDBCPDBC
PDBC
 
java 8 Hands on Workshop
java 8 Hands on Workshopjava 8 Hands on Workshop
java 8 Hands on Workshop
 
집단지성 프로그래밍 08-가격모델링
집단지성 프로그래밍 08-가격모델링집단지성 프로그래밍 08-가격모델링
집단지성 프로그래밍 08-가격모델링
 
Introducción a Elixir
Introducción a ElixirIntroducción a Elixir
Introducción a Elixir
 
MySQL 5.7 NF – JSON Datatype 활용
MySQL 5.7 NF – JSON Datatype 활용MySQL 5.7 NF – JSON Datatype 활용
MySQL 5.7 NF – JSON Datatype 활용
 
Advanced Python, Part 2
Advanced Python, Part 2Advanced Python, Part 2
Advanced Python, Part 2
 
Intro
IntroIntro
Intro
 
Getting started cpp full
Getting started cpp   fullGetting started cpp   full
Getting started cpp full
 
Tokyo APAC Groundbreakers tour - The Complete Java Developer
Tokyo APAC Groundbreakers tour - The Complete Java DeveloperTokyo APAC Groundbreakers tour - The Complete Java Developer
Tokyo APAC Groundbreakers tour - The Complete Java Developer
 
Обзор фреймворка Twisted
Обзор фреймворка TwistedОбзор фреймворка Twisted
Обзор фреймворка Twisted
 

Andere mochten auch

Preview book the rabbit who forgot to fly
Preview book the rabbit who forgot to flyPreview book the rabbit who forgot to fly
Preview book the rabbit who forgot to fly
s.anshuman
 
Evaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea PsihometricaEvaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea Psihometrica
1Leu
 
Tulburari De Personalitate
Tulburari De PersonalitateTulburari De Personalitate
Tulburari De Personalitate
1Leu
 
Dementastd
DementastdDementastd
Dementastd
1Leu
 
Pdi abril2011 (1)
Pdi abril2011 (1)Pdi abril2011 (1)
Pdi abril2011 (1)
lpalacin
 
Toxicostd
ToxicostdToxicostd
Toxicostd
1Leu
 
Pendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman PurbaPendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman Purba
kaloky
 
Comportamentele Pozitive
Comportamentele PozitiveComportamentele Pozitive
Comportamentele Pozitive
1Leu
 
New York Bluejays
New York BluejaysNew York Bluejays
New York Bluejays
jawad723
 

Andere mochten auch (20)

Analyzing Time Series Data with Apache Spark and Cassandra
Analyzing Time Series Data with Apache Spark and CassandraAnalyzing Time Series Data with Apache Spark and Cassandra
Analyzing Time Series Data with Apache Spark and Cassandra
 
Preview - The Mind of Business Plan
Preview - The Mind of Business PlanPreview - The Mind of Business Plan
Preview - The Mind of Business Plan
 
Videopillola Eurostoxx: E ora si riparte al rialzo?
Videopillola Eurostoxx: E ora si riparte al rialzo?Videopillola Eurostoxx: E ora si riparte al rialzo?
Videopillola Eurostoxx: E ora si riparte al rialzo?
 
The Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
The Push-Pull of Communications and Marketing: Reduce Spending, Extend ReachThe Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
The Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
 
Preview book the rabbit who forgot to fly
Preview book the rabbit who forgot to flyPreview book the rabbit who forgot to fly
Preview book the rabbit who forgot to fly
 
Diario Blog 0119 0123
Diario Blog 0119 0123Diario Blog 0119 0123
Diario Blog 0119 0123
 
Evaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea PsihometricaEvaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea Psihometrica
 
Tulburari De Personalitate
Tulburari De PersonalitateTulburari De Personalitate
Tulburari De Personalitate
 
Architecture Enforcement Aspects Itarc2009
Architecture Enforcement Aspects Itarc2009Architecture Enforcement Aspects Itarc2009
Architecture Enforcement Aspects Itarc2009
 
Dementastd
DementastdDementastd
Dementastd
 
Pdi abril2011 (1)
Pdi abril2011 (1)Pdi abril2011 (1)
Pdi abril2011 (1)
 
Toxicostd
ToxicostdToxicostd
Toxicostd
 
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLDPresentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
 
Diario Operativo Blog Gennaio Marzo 2010
Diario Operativo Blog Gennaio Marzo 2010Diario Operativo Blog Gennaio Marzo 2010
Diario Operativo Blog Gennaio Marzo 2010
 
Pendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman PurbaPendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman Purba
 
Comportamentele Pozitive
Comportamentele PozitiveComportamentele Pozitive
Comportamentele Pozitive
 
New York Bluejays
New York BluejaysNew York Bluejays
New York Bluejays
 
Preview - Fifty Two Steps: Discover a New You
Preview - Fifty Two Steps: Discover a New YouPreview - Fifty Two Steps: Discover a New You
Preview - Fifty Two Steps: Discover a New You
 
Preview - Stop! Playing the Dice: Devices to Design Amazing Products
Preview - Stop! Playing the Dice: Devices to Design Amazing ProductsPreview - Stop! Playing the Dice: Devices to Design Amazing Products
Preview - Stop! Playing the Dice: Devices to Design Amazing Products
 
EFT On Feeling Disappointed
EFT On Feeling DisappointedEFT On Feeling Disappointed
EFT On Feeling Disappointed
 

Ähnlich wie 03

Team activity analysis / visualization
Team activity analysis / visualizationTeam activity analysis / visualization
Team activity analysis / visualization
Nicolas Maisonneuve
 
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, TuningJava 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
Carol McDonald
 

Ähnlich wie 03 (20)

Team activity analysis / visualization
Team activity analysis / visualizationTeam activity analysis / visualization
Team activity analysis / visualization
 
chapter-8.ppt
chapter-8.pptchapter-8.ppt
chapter-8.ppt
 
Reverse Engineering Dojo: Enhancing Assembly Reading Skills
Reverse Engineering Dojo: Enhancing Assembly Reading SkillsReverse Engineering Dojo: Enhancing Assembly Reading Skills
Reverse Engineering Dojo: Enhancing Assembly Reading Skills
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
 
04 sequentialbasics 1
04 sequentialbasics 104 sequentialbasics 1
04 sequentialbasics 1
 
SMDMS'13
SMDMS'13SMDMS'13
SMDMS'13
 
Data in Motion: Streaming Static Data Efficiently
Data in Motion: Streaming Static Data EfficientlyData in Motion: Streaming Static Data Efficiently
Data in Motion: Streaming Static Data Efficiently
 
Data Presentations Cassandra Sigmod
Data  Presentations  Cassandra SigmodData  Presentations  Cassandra Sigmod
Data Presentations Cassandra Sigmod
 
RxJava In Baby Steps
RxJava In Baby StepsRxJava In Baby Steps
RxJava In Baby Steps
 
Mining of time series data base using fuzzy neural information systems
Mining of time series data base using fuzzy neural information systemsMining of time series data base using fuzzy neural information systems
Mining of time series data base using fuzzy neural information systems
 
dfl
dfldfl
dfl
 
Odp
OdpOdp
Odp
 
Extracting a Micro State Transition Table Using KLEE
Extracting a Micro State Transition Table Using KLEEExtracting a Micro State Transition Table Using KLEE
Extracting a Micro State Transition Table Using KLEE
 
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
 
Writing Domain-Specific Languages for BeepBeep
Writing Domain-Specific Languages for BeepBeepWriting Domain-Specific Languages for BeepBeep
Writing Domain-Specific Languages for BeepBeep
 
Basics & asymptotic notations
Basics & asymptotic notationsBasics & asymptotic notations
Basics & asymptotic notations
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
 
Data Structure: Algorithm and analysis
Data Structure: Algorithm and analysisData Structure: Algorithm and analysis
Data Structure: Algorithm and analysis
 
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, TuningJava 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
 
Scala en
Scala enScala en
Scala en
 

Kürzlich hochgeladen

Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 

Kürzlich hochgeladen (20)

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 

03

  • 1. Sequential Patterns Wen-Chih Peng ( 彭文志 ) Dept. of Computer Science National Chiao-Tung University
  • 2. Sequence Data Sequence Database: Object Timestamp Events A 10 2, 3, 5 A 20 6, 1 A 23 1 B 11 4, 5, 6 B 17 2 B 21 7, 8, 1, 2 B 28 1, 6 C 14 1, 8, 7
  • 3. Examples of Sequence Data Sequence E1 E2 E1 E3 E2 E3 E4 E2 Element (Transaction) Event (Item) Bases A,T,G,C An element of the DNA sequence DNA sequence of a particular species Genome sequences Types of alarms generated by sensors Events triggered by a sensor at time t History of events generated by a given sensor Event data Home page, index page, contact info, etc A collection of files viewed by a Web visitor after a single mouse click Browsing activity of a particular Web visitor Web Data Books, diary products, CDs, etc A set of items bought by a customer at time t Purchase history of a given customer Customer Event (Item) Element (Transaction) Sequence Sequence Database
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 17. Customer-Sequence Version of the Database
  • 18. Large itemset Phase (support:2)
  • 20.
  • 21. <{1 5} {2} {3} {4}> <{1} {3} {4} {3 5}> <{1} {2} {3} {4}> <{1} {3} {5}> <{4} {5}> Customer Sequences Sequence Support <1> 4 <2> 2 <3> 4 <4> 4 <5> 4 Large 1-Sequences Sequence Support <1 2> 2 <1 3> 4 <1 4> 3 <1 5> 2 <2 3> 2 <2 4> 2 <3 4> 3 <3 5> 2 <4 5> 2 Large 2-Sequences Sequence Support <1 2 3> 2 <1 2 4> 2 <1 3 4> 3 <1 3 5> 2 <2 3 4> 2 Large 3-Sequences Sequence Support <1 2 3 4> 2 Large 4-Sequences Sequence Support <1 2 3 4> 2 <1 3 5> 2 <4 5> 2 Maximal Large Sequences
  • 22.
  • 23. Sequential Patterns With support > 25% < (30) (90) > < (30) (40 70) > The answer set
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33. Example of FreeSpan f_list = a: 4 ,b: 4 ,c: 4 ,d: 3 ,e: 3 ,f: 3 g is deleted because of support of g <2 . Example database: min support = 2 <eg(af)cbc> 40 <(ef)(ab)(df)cb> 30 <(ad)c(bc)(ae)> 20 <a(abc)(ac)d(cf)> 10 Sequence Sequence id
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40. Example of PrefixSpan <a>-projected database By scanning <a>-projected database once, all the length-2 sequential patterns having prefix <a> can be found. <aa>:2 <ab>:4 <(ab)>:2 <ac>:4 <ad>:2 <af>:2 Recursively, patterns with prefix <a> can be partitioned into 6 subsets. => <e( a f)cbc > 40 <(ef)( a b)(df)cb > 30 <( a d)c(bc)(ae )> 20 < a (abc)(ac)d(cf) > 10 Sequence Sequence id <(_f)cbc> 40 <(_b)(df)cb> 30 <(_d)c(bc)(ae)> 20 <(abc)(ac)d(cf)> 10
  • 41. Example of PrefixSpan (cont’d) < aa >-projected database => < ab >-projected database => Sequential patterns of <ab>-projected db: <(_c)>,<(_c)a>,<a><c> <(_f)c b c> 40 <(_b)(df)cb> 30 <(_d)c( b c)( a e)> 20 <( a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10 <(_e)> 20 <(_bc)(ac)d(cf)> 10
  • 42. Example of PrefixSpan (cont’d) <b>-projected database Sequential patterns <b> <ba> <bc> <(bc)> <(bc)a> <bd> <bdc> <bf> => <e(af)c b c > 40 <(ef)(a b )(df)cb > 30 <(ad)c( b c)(ae )> 20 <a(a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(df)cb> 30 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.