SlideShare ist ein Scribd-Unternehmen logo
1 von 31
Downloaden Sie, um offline zu lesen
Change	
  Point	
  Detec.on	
  
with	
  Bayesian	
  Inference	
  
By	
  Frank	
  Kelly	
  
Py	
  data	
  
6th	
  January	
  2015	
  
Overview	
  
•  Nigeria,	
  oil	
  wells	
  &	
  drilling	
  
•  Noisy	
  data	
  
•  Some	
  maths	
  
•  Python	
  implementaDon	
  
•  Examples	
  in	
  different	
  domains	
  
FPSO	
  (oil	
  plaIorm	
  picture)	
  
Mud	
  pulse	
  telemetry	
  
•  InformaDon	
  
encoded	
  digitally,	
  
transmiOed	
  via	
  
pressure	
  pulses	
  
through	
  mud	
  fluid.	
  
•  Alert	
  drillers	
  that	
  
they	
  have	
  reached	
  
oil,	
  detect	
  rock	
  types	
  
and	
  general	
  
monitoring.	
  
The	
  problem	
  
•  Poor	
  bit	
  rate	
  and	
  
resoluDon	
  
•  Time	
  consuming	
  
analysis	
  
Approaches	
  to	
  staDsDcs	
  
•  FrequenDst	
  
– Data	
  gathered	
  is	
  a	
  
repeatable	
  random	
  
sample.	
  “Frequency”	
  
– Underlying	
  
parameters	
  are	
  
constant	
  
– Fisher’s	
  0.05	
  
•  Bayesian	
  
– Data	
  are,	
  fixed	
  and	
  
observed	
  from	
  the	
  
realised	
  sample	
  
– Parameters	
  unknown	
  
and	
  described	
  
probabilisDcally	
  
– Introduce	
  
“subjecDvity”	
  
	
  
FrequenDst	
  vs.	
  Bayesian	
  
The	
  Theory:	
  Bayesian	
  inference	
  
•  Methodology	
  of	
  mathemaDcal	
  inference:	
  	
  
–  Choosing	
  between	
  several	
  possible	
  models	
  
–  ExtracDng	
  parameters	
  for	
  these	
  models	
  
•  Bayes’	
  Theorem:	
  
Rev	
  Thomas	
  Bayes	
  1702	
  
-­‐	
  1761	
  
p(w | D) =
p(D | w)p(w)
p(D)
Likelihood	
  
Prior	
  
Probability	
  
Posterior	
  
Probability	
   Evidence	
  
-­‐  Remove	
  nuisance	
  
parameters	
  by	
  
marginalisaDon	
  
-­‐  InteresDng	
  ones	
  
remain	
  
Modelling	
  the	
  problem	
  
µ2
1µ
m
N
0	
   20	
   40	
   60	
   80	
   100	
   120	
   140	
   160	
   180	
   200	
  
0.5	
  
1	
  
1.5	
  
2	
  
2.5	
  
data	
  =	
  model	
  +	
  noise	
  
	
  
•  a	
  sequence	
  of	
  N	
  
samples	
  of	
  data	
  
from	
  a	
  piecewise	
  
constant	
  source	
  
with	
  added	
  
Gaussian	
  noise.	
  
•  Noise	
  independent	
  
of	
  mean,	
  idenDcally	
  
distributed	
  and	
  S.D.	
  
=	
  σ	
  
•  Heterogenous:	
  
divide	
  into	
  two	
  
homogenous	
  
segments	
  
µ2
⎩
⎨
⎧
+
+
=
i
i
i
e
e
d
2
1
µ
µ
Nim
mi
≤<
≤
1µ
Nm
Single	
  changepoint	
  detector:	
  
How	
  does	
  it	
  work?	
  
	
  
•  SubsDtute	
  likelihood	
  into	
  Bayes’ Law	
  
–  Simple	
  model-­‐	
  consider	
  Ockham’s	
  Razor	
  
•  Interested	
  in	
  changepoint	
  locaDon	
  m,	
  integrate	
  w.r.t.	
  the	
  
nuisance	
  parameters	
  (µ1,	
  µ2	
  and	
  σ)…rearrange	
  this…	
  
•  …get	
  a	
  BIG	
  expression	
  for	
  p({m}|dI),	
  code	
  in	
  Python	
  
•  On	
  running	
  obtain	
  most	
  likely	
  changepoint	
  locaDon	
  
Ockham’s	
  razor:	
  
hOp://www.jstor.org/discover/10.2307/29774559?sid=21105568247973&uid=3738032&uid=4&uid=2	
  	
  
The	
  maths	
  
More	
  maths	
  
•  Integrate	
  w.r.t.	
  (and	
  thereby	
  remove)	
  
nuisance	
  parameters	
  
Other	
  applicaDons…	
  
hOp://moz.com/google-­‐algorithm-­‐change	
  
“Google’s	
  algorithm	
  is	
  the	
  “secret	
  sauce	
  recipe”	
  that	
  has	
  enabled	
  it	
  to	
  dominate	
  search.”	
  	
  
	
  
-­‐	
  FT.com	
  16th	
  Sept	
  2014	
  
hOp://www.p.com/cms/s/0/9615661c-­‐3ce1-­‐11e4-­‐9733-­‐00144feabdc0.html?
siteediDon=uk#axzz3DSwXYAW8	
  
Any	
  business	
  with	
  an	
  online	
  presence	
  today	
  open	
  struggles	
  to	
  accurately	
  evaluate:	
  	
  
	
  
●	
  The	
  quality	
  of	
  their	
  website	
  and	
  associated	
  linking	
  pages,	
  as	
  perceived	
  by	
  Google	
  
	
  
●	
  The	
  robustness	
  of	
  their	
  website	
  to	
  a	
  sudden	
  change	
  in	
  Google’s	
  search	
  algorithm	
  
Web	
  traffic	
  
30000	
  
35000	
  
40000	
  
45000	
  
50000	
  
55000	
  
60000	
  
raw	
  daily	
  google	
  search-­‐sourced	
  pageviews	
  
Web	
  traffic	
  (2)	
  
30000	
  
35000	
  
40000	
  
45000	
  
50000	
  
55000	
  
60000	
  
smoothed	
  data	
  using	
  moving	
  average	
  
Web	
  traffic	
  (3)	
  
30000	
  
35000	
  
40000	
  
45000	
  
50000	
  
55000	
  
60000	
  
smoothed	
  data	
  with	
  cyclicality	
  removed	
  
Web	
  traffic	
  (4)	
  
-­‐838	
  
-­‐837.5	
  
-­‐837	
  
-­‐836.5	
  
-­‐836	
  
-­‐835.5	
  
-­‐835	
  
-­‐834.5	
  
-­‐834	
  
-­‐833.5	
  
-­‐833	
  
30000	
  
35000	
  
40000	
  
45000	
  
50000	
  
55000	
  
60000	
  
likelihood	
  of	
  change	
  in	
  data	
  plo>ed	
  over	
  .me	
  
day	
  removed	
   likelihood	
  CP	
  
number	
  of	
  tropical	
  storms	
  per	
  year	
  in	
  the	
  North	
  AtlanDc	
  
Data	
  obtained	
  from	
  ibtracs	
  database:	
  
hOps://www.ncdc.noaa.gov/ibtracs/	
  
"Amo	
  Dmeseries	
  1856-­‐present"	
  by	
  Rosentod,	
  Marsupilami	
  -­‐	
  hOp://www.cdc.noaa.gov/CorrelaDon/amon.us.long.data.	
  Licensed	
  under	
  Public	
  
Domain	
  via	
  Wikimedia	
  Commons	
  -­‐	
  hOp://commons.wikimedia.org/wiki/File:Amo_Dmeseries_1856-­‐present.svg#mediaviewer/
File:Amo_Dmeseries_1856-­‐present.svg	
  
Other	
  applicaDons	
  /	
  possibiliDes	
  
•  Financial	
  markets	
  and	
  poliDcal	
  events	
  
•  Combine	
  with	
  frequenDst	
  staDcal	
  methods:	
  
– Use	
  of	
  GLR	
  in	
  online	
  (moving	
  window)	
  detecDon	
  
applicaDon	
  
•  Your	
  own	
  data/	
  ideas	
  !	
  
Thank	
  you	
  
•  Link	
  to	
  Python	
  code	
  on	
  github:	
  
hOps://github.com/swhustla/pydata-­‐bayes-­‐changepoint	
  	
  
–  Single	
  changepoint	
  detector	
  (as	
  seen	
  tonight)	
  
–  Dual	
  changepoint	
  detector	
  
–  Ramp	
  detector	
  
•  Further	
  reading:	
  
–  Numerical	
  Bayesian	
  Methods	
  Applied	
  to	
  Signal	
  Processing	
  
(StaDsDcs	
  and	
  CompuDng)	
  by	
  Fitzgerald,	
  O’Ruanaidh,	
  1996	
  :	
  
hOp://www.amazon.co.uk/Numerical-­‐Bayesian-­‐Processing-­‐
StaDsDcs-­‐CompuDng/dp/0387946292	
  	
  	
  
–  Bayesian	
  Inference	
  on	
  Change	
  Point	
  Problems	
  (2007)
hOp://www.cs.ubc.ca/~murphyk/Students/Xuan_MSc07.pdf	
  	
  
	
  
TwiOer:	
  @norhustla	
  
Email:	
  frank.kelly@cantab.net	
  
Thank	
  you	
  
•  AddiDonal	
  links:	
  
–  Google	
  Algo	
  updates:	
  	
  hOp://moz.com/google-­‐algorithm-­‐change	
  	
  
–  Mathsight	
  -­‐>	
  insights	
  into	
  algorithm	
  changes	
  hOp://mathsight.org	
  	
  
–  AtlanDc	
  mulD-­‐decadal	
  oscillaDon	
  spaDal	
  paOern:
hOp://commons.wikimedia.org/wiki/File:AMO_PaOern.png	
  
–  NaDonal	
  climaDc	
  data	
  center	
  hOps://www.ncdc.noaa.gov/ibtracs/	
  	
  
–  Ockham’s	
  Razor	
  and	
  Bayesian	
  Inference:	
  
hOp://www.jstor.org/discover/10.2307/29774559?
sid=21105568247973&uid=3738032&uid=4&uid=2	
  
–  ConverDng	
  from	
  Matlab	
  to	
  Python:	
  
hOp://mathesaurus.sourceforge.net/matlab-­‐numpy.html	
  	
  
	
  
TwiOer:	
  @norhustla	
  
Email:	
  frank.kelly@cantab.net	
  

Weitere ähnliche Inhalte

Was ist angesagt?

Causality without headaches
Causality without headachesCausality without headaches
Causality without headachesBenoît Rostykus
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?Smarten Augmented Analytics
 
Machine Learning for Survival Analysis
Machine Learning for Survival AnalysisMachine Learning for Survival Analysis
Machine Learning for Survival AnalysisChandan Reddy
 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsDerek Kane
 
Real-Time Entity Resolution with Elasticsearch - Haystack 2018
Real-Time Entity Resolution with Elasticsearch - Haystack 2018Real-Time Entity Resolution with Elasticsearch - Haystack 2018
Real-Time Entity Resolution with Elasticsearch - Haystack 2018zentity.io
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysismothersafe
 
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksModel-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksYoonho Lee
 
GTFSを時刻表に可視化
GTFSを時刻表に可視化GTFSを時刻表に可視化
GTFSを時刻表に可視化Kazuaki Sano
 
ラビットチャレンジレポート 機械学習
ラビットチャレンジレポート 機械学習ラビットチャレンジレポート 機械学習
ラビットチャレンジレポート 機械学習HiroyukiTerada4
 
Classical decomposition
Classical decompositionClassical decomposition
Classical decompositionAzzuriey Ahmad
 
Latent Dirichlet Allocation
Latent Dirichlet AllocationLatent Dirichlet Allocation
Latent Dirichlet AllocationKyeongUkJang
 
Markov model for the online multichannel attribution problem
Markov model for the online multichannel attribution problemMarkov model for the online multichannel attribution problem
Markov model for the online multichannel attribution problemadavide1982
 
boosting 기법 이해 (bagging vs boosting)
boosting 기법 이해 (bagging vs boosting)boosting 기법 이해 (bagging vs boosting)
boosting 기법 이해 (bagging vs boosting)SANG WON PARK
 
capsule network
capsule networkcapsule network
capsule network민기 정
 
Representing and comparing probabilities
Representing and comparing probabilitiesRepresenting and comparing probabilities
Representing and comparing probabilitiesMLReview
 
Learning to Rank - From pairwise approach to listwise
Learning to Rank - From pairwise approach to listwiseLearning to Rank - From pairwise approach to listwise
Learning to Rank - From pairwise approach to listwiseHasan H Topcu
 
[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지
[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지
[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지Haesun Park
 

Was ist angesagt? (20)

Causality without headaches
Causality without headachesCausality without headaches
Causality without headaches
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
 
Machine Learning for Survival Analysis
Machine Learning for Survival AnalysisMachine Learning for Survival Analysis
Machine Learning for Survival Analysis
 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
 
Robust ai
Robust aiRobust ai
Robust ai
 
Node.js in 2019
Node.js in 2019Node.js in 2019
Node.js in 2019
 
Real-Time Entity Resolution with Elasticsearch - Haystack 2018
Real-Time Entity Resolution with Elasticsearch - Haystack 2018Real-Time Entity Resolution with Elasticsearch - Haystack 2018
Real-Time Entity Resolution with Elasticsearch - Haystack 2018
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysis
 
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksModel-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
 
GTFSを時刻表に可視化
GTFSを時刻表に可視化GTFSを時刻表に可視化
GTFSを時刻表に可視化
 
ラビットチャレンジレポート 機械学習
ラビットチャレンジレポート 機械学習ラビットチャレンジレポート 機械学習
ラビットチャレンジレポート 機械学習
 
Classical decomposition
Classical decompositionClassical decomposition
Classical decomposition
 
Chapter 06
Chapter 06 Chapter 06
Chapter 06
 
Latent Dirichlet Allocation
Latent Dirichlet AllocationLatent Dirichlet Allocation
Latent Dirichlet Allocation
 
Markov model for the online multichannel attribution problem
Markov model for the online multichannel attribution problemMarkov model for the online multichannel attribution problem
Markov model for the online multichannel attribution problem
 
boosting 기법 이해 (bagging vs boosting)
boosting 기법 이해 (bagging vs boosting)boosting 기법 이해 (bagging vs boosting)
boosting 기법 이해 (bagging vs boosting)
 
capsule network
capsule networkcapsule network
capsule network
 
Representing and comparing probabilities
Representing and comparing probabilitiesRepresenting and comparing probabilities
Representing and comparing probabilities
 
Learning to Rank - From pairwise approach to listwise
Learning to Rank - From pairwise approach to listwiseLearning to Rank - From pairwise approach to listwise
Learning to Rank - From pairwise approach to listwise
 
[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지
[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지
[홍대 머신러닝 스터디 - 핸즈온 머신러닝] 2장. 머신러닝 프로젝트 처음부터 끝까지
 

Ähnlich wie Changepoint Detection with Bayesian Inference

Meteo I/O Introduction
Meteo I/O IntroductionMeteo I/O Introduction
Meteo I/O IntroductionRiccardo Rigon
 
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...Cheng Chen
 
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler..."Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...Dataconomy Media
 
flat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationflat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationLuís Moreira-Matias
 
"Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler...
"Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler..."Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler...
"Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler...Dataconomy Media
 
ODSC 2019: Sessionisation via stochastic periods for root event identification
ODSC 2019: Sessionisation via stochastic periods for root event identificationODSC 2019: Sessionisation via stochastic periods for root event identification
ODSC 2019: Sessionisation via stochastic periods for root event identificationKuldeep Jiwani
 
5.1 mining data streams
5.1 mining data streams5.1 mining data streams
5.1 mining data streamsKrish_ver2
 
Christian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big dataChristian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big datajins0618
 
Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Integrated Carbon Observation System (ICOS)
 
Evaluating Classification Algorithms Applied To Data Streams Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams   Esteban DonatoEvaluating Classification Algorithms Applied To Data Streams   Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams Esteban DonatoEsteban Donato
 
Alerting mechanism and algorithms introduction
Alerting mechanism and algorithms introductionAlerting mechanism and algorithms introduction
Alerting mechanism and algorithms introductionFEG
 
Approximation Data Structures for Streaming Applications
Approximation Data Structures for Streaming ApplicationsApproximation Data Structures for Streaming Applications
Approximation Data Structures for Streaming ApplicationsDebasish Ghosh
 
Big&open data challenges for smartcity-PIC2014 Shanghai
Big&open data challenges for smartcity-PIC2014 ShanghaiBig&open data challenges for smartcity-PIC2014 Shanghai
Big&open data challenges for smartcity-PIC2014 ShanghaiVictoria López
 
A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...Institute of Agricultural Machinery, NARO
 
A Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap NetworkA Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap NetworkHa Phuong
 
Object Detection and Tracking using Statistical and Stochastic Techniques
Object Detection and Tracking using Statistical and Stochastic TechniquesObject Detection and Tracking using Statistical and Stochastic Techniques
Object Detection and Tracking using Statistical and Stochastic TechniquesVasuhiSamydurai1
 

Ähnlich wie Changepoint Detection with Bayesian Inference (20)

Meteo I/O Introduction
Meteo I/O IntroductionMeteo I/O Introduction
Meteo I/O Introduction
 
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
 
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler..."Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
 
flat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationflat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimation
 
"Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler...
"Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler..."Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler...
"Quantum clustering - physics inspired clustering algorithm", Sigalit Bechler...
 
01-pengantar.pdf
01-pengantar.pdf01-pengantar.pdf
01-pengantar.pdf
 
ODSC 2019: Sessionisation via stochastic periods for root event identification
ODSC 2019: Sessionisation via stochastic periods for root event identificationODSC 2019: Sessionisation via stochastic periods for root event identification
ODSC 2019: Sessionisation via stochastic periods for root event identification
 
5.1 mining data streams
5.1 mining data streams5.1 mining data streams
5.1 mining data streams
 
Christian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big dataChristian jensen advanced routing in spatial networks using big data
Christian jensen advanced routing in spatial networks using big data
 
Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...
 
Evaluating Classification Algorithms Applied To Data Streams Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams   Esteban DonatoEvaluating Classification Algorithms Applied To Data Streams   Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams Esteban Donato
 
Alerting mechanism and algorithms introduction
Alerting mechanism and algorithms introductionAlerting mechanism and algorithms introduction
Alerting mechanism and algorithms introduction
 
Temporal data mining
Temporal data miningTemporal data mining
Temporal data mining
 
Approximation Data Structures for Streaming Applications
Approximation Data Structures for Streaming ApplicationsApproximation Data Structures for Streaming Applications
Approximation Data Structures for Streaming Applications
 
t10_part1.pptx
t10_part1.pptxt10_part1.pptx
t10_part1.pptx
 
Big&open data challenges for smartcity-PIC2014 Shanghai
Big&open data challenges for smartcity-PIC2014 ShanghaiBig&open data challenges for smartcity-PIC2014 Shanghai
Big&open data challenges for smartcity-PIC2014 Shanghai
 
A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...
 
Introduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEASTIntroduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEAST
 
A Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap NetworkA Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap Network
 
Object Detection and Tracking using Statistical and Stochastic Techniques
Object Detection and Tracking using Statistical and Stochastic TechniquesObject Detection and Tracking using Statistical and Stochastic Techniques
Object Detection and Tracking using Statistical and Stochastic Techniques
 

Kürzlich hochgeladen

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 

Kürzlich hochgeladen (20)

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 

Changepoint Detection with Bayesian Inference

  • 1. Change  Point  Detec.on   with  Bayesian  Inference   By  Frank  Kelly   Py  data   6th  January  2015  
  • 2. Overview   •  Nigeria,  oil  wells  &  drilling   •  Noisy  data   •  Some  maths   •  Python  implementaDon   •  Examples  in  different  domains  
  • 3. FPSO  (oil  plaIorm  picture)  
  • 4.
  • 5.
  • 6. Mud  pulse  telemetry   •  InformaDon   encoded  digitally,   transmiOed  via   pressure  pulses   through  mud  fluid.   •  Alert  drillers  that   they  have  reached   oil,  detect  rock  types   and  general   monitoring.  
  • 7. The  problem   •  Poor  bit  rate  and   resoluDon   •  Time  consuming   analysis  
  • 8. Approaches  to  staDsDcs   •  FrequenDst   – Data  gathered  is  a   repeatable  random   sample.  “Frequency”   – Underlying   parameters  are   constant   – Fisher’s  0.05   •  Bayesian   – Data  are,  fixed  and   observed  from  the   realised  sample   – Parameters  unknown   and  described   probabilisDcally   – Introduce   “subjecDvity”    
  • 10. The  Theory:  Bayesian  inference   •  Methodology  of  mathemaDcal  inference:     –  Choosing  between  several  possible  models   –  ExtracDng  parameters  for  these  models   •  Bayes’  Theorem:   Rev  Thomas  Bayes  1702   -­‐  1761   p(w | D) = p(D | w)p(w) p(D) Likelihood   Prior   Probability   Posterior   Probability   Evidence   -­‐  Remove  nuisance   parameters  by   marginalisaDon   -­‐  InteresDng  ones   remain  
  • 11. Modelling  the  problem   µ2 1µ m N
  • 12. 0   20   40   60   80   100   120   140   160   180   200   0.5   1   1.5   2   2.5   data  =  model  +  noise     •  a  sequence  of  N   samples  of  data   from  a  piecewise   constant  source   with  added   Gaussian  noise.   •  Noise  independent   of  mean,  idenDcally   distributed  and  S.D.   =  σ   •  Heterogenous:   divide  into  two   homogenous   segments   µ2 ⎩ ⎨ ⎧ + + = i i i e e d 2 1 µ µ Nim mi ≤< ≤ 1µ Nm
  • 13. Single  changepoint  detector:   How  does  it  work?     •  SubsDtute  likelihood  into  Bayes’ Law   –  Simple  model-­‐  consider  Ockham’s  Razor   •  Interested  in  changepoint  locaDon  m,  integrate  w.r.t.  the   nuisance  parameters  (µ1,  µ2  and  σ)…rearrange  this…   •  …get  a  BIG  expression  for  p({m}|dI),  code  in  Python   •  On  running  obtain  most  likely  changepoint  locaDon   Ockham’s  razor:   hOp://www.jstor.org/discover/10.2307/29774559?sid=21105568247973&uid=3738032&uid=4&uid=2    
  • 15. More  maths   •  Integrate  w.r.t.  (and  thereby  remove)   nuisance  parameters  
  • 16.
  • 17.
  • 20. “Google’s  algorithm  is  the  “secret  sauce  recipe”  that  has  enabled  it  to  dominate  search.”       -­‐  FT.com  16th  Sept  2014   hOp://www.p.com/cms/s/0/9615661c-­‐3ce1-­‐11e4-­‐9733-­‐00144feabdc0.html? siteediDon=uk#axzz3DSwXYAW8   Any  business  with  an  online  presence  today  open  struggles  to  accurately  evaluate:       ●  The  quality  of  their  website  and  associated  linking  pages,  as  perceived  by  Google     ●  The  robustness  of  their  website  to  a  sudden  change  in  Google’s  search  algorithm  
  • 21. Web  traffic   30000   35000   40000   45000   50000   55000   60000   raw  daily  google  search-­‐sourced  pageviews  
  • 22. Web  traffic  (2)   30000   35000   40000   45000   50000   55000   60000   smoothed  data  using  moving  average  
  • 23. Web  traffic  (3)   30000   35000   40000   45000   50000   55000   60000   smoothed  data  with  cyclicality  removed  
  • 24. Web  traffic  (4)   -­‐838   -­‐837.5   -­‐837   -­‐836.5   -­‐836   -­‐835.5   -­‐835   -­‐834.5   -­‐834   -­‐833.5   -­‐833   30000   35000   40000   45000   50000   55000   60000   likelihood  of  change  in  data  plo>ed  over  .me   day  removed   likelihood  CP  
  • 25.
  • 26. number  of  tropical  storms  per  year  in  the  North  AtlanDc   Data  obtained  from  ibtracs  database:   hOps://www.ncdc.noaa.gov/ibtracs/  
  • 27. "Amo  Dmeseries  1856-­‐present"  by  Rosentod,  Marsupilami  -­‐  hOp://www.cdc.noaa.gov/CorrelaDon/amon.us.long.data.  Licensed  under  Public   Domain  via  Wikimedia  Commons  -­‐  hOp://commons.wikimedia.org/wiki/File:Amo_Dmeseries_1856-­‐present.svg#mediaviewer/ File:Amo_Dmeseries_1856-­‐present.svg  
  • 28.
  • 29. Other  applicaDons  /  possibiliDes   •  Financial  markets  and  poliDcal  events   •  Combine  with  frequenDst  staDcal  methods:   – Use  of  GLR  in  online  (moving  window)  detecDon   applicaDon   •  Your  own  data/  ideas  !  
  • 30. Thank  you   •  Link  to  Python  code  on  github:   hOps://github.com/swhustla/pydata-­‐bayes-­‐changepoint     –  Single  changepoint  detector  (as  seen  tonight)   –  Dual  changepoint  detector   –  Ramp  detector   •  Further  reading:   –  Numerical  Bayesian  Methods  Applied  to  Signal  Processing   (StaDsDcs  and  CompuDng)  by  Fitzgerald,  O’Ruanaidh,  1996  :   hOp://www.amazon.co.uk/Numerical-­‐Bayesian-­‐Processing-­‐ StaDsDcs-­‐CompuDng/dp/0387946292       –  Bayesian  Inference  on  Change  Point  Problems  (2007) hOp://www.cs.ubc.ca/~murphyk/Students/Xuan_MSc07.pdf       TwiOer:  @norhustla   Email:  frank.kelly@cantab.net  
  • 31. Thank  you   •  AddiDonal  links:   –  Google  Algo  updates:    hOp://moz.com/google-­‐algorithm-­‐change     –  Mathsight  -­‐>  insights  into  algorithm  changes  hOp://mathsight.org     –  AtlanDc  mulD-­‐decadal  oscillaDon  spaDal  paOern: hOp://commons.wikimedia.org/wiki/File:AMO_PaOern.png   –  NaDonal  climaDc  data  center  hOps://www.ncdc.noaa.gov/ibtracs/     –  Ockham’s  Razor  and  Bayesian  Inference:   hOp://www.jstor.org/discover/10.2307/29774559? sid=21105568247973&uid=3738032&uid=4&uid=2   –  ConverDng  from  Matlab  to  Python:   hOp://mathesaurus.sourceforge.net/matlab-­‐numpy.html       TwiOer:  @norhustla   Email:  frank.kelly@cantab.net