SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Clustering CDS: algorithms, distances,
stability and convergence rates
CMStatistics 2016, University of Seville, Spain
Gautier Marti, Frank Nielsen, Philippe Donnat
HELLEBORECAPITAL
December 9, 2016
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
1 Introduction
2 The standard methodology
3 Exploring dependence between returns
4 Copula-based dependence coeļ¬ƒcients (clustering distances)
5 Empirical convergence rates
6 Beyond dependence: a (copula,margins) representation
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Introduction
Goal: Finding groups of ā€™homogeneousā€™ assets that can help to:
ā€¢ build alternative measures of risk,
ā€¢ elaborate trading strategies. . .
But, we need a high conļ¬dence in these clusters (networks).
So, we need appropriate AND fast converging methodologies [8]:
to be consistent yet eļ¬ƒcient (biasā€“variance tradeoļ¬€),
to avoid non-stationarity of the time series (too large sample).
A good model selection criterion:
Minimum sample size to reach a given ā€™accuracyā€™.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
1 Introduction
2 The standard methodology
3 Exploring dependence between returns
4 Copula-based dependence coeļ¬ƒcients (clustering distances)
5 Empirical convergence rates
6 Beyond dependence: a (copula,margins) representation
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
The standard methodology - description
The methodology widely adopted in empirical studies: [7].
Let N be the number of assets.
Let Pi (t) be the price at time t of asset i, 1 ā‰¤ i ā‰¤ N.
Let ri (t) be the log-return at time t of asset i:
ri (t) = log Pi (t) āˆ’ log Pi (t āˆ’ 1).
For each pair i, j of assets, compute their correlation:
Ļij =
ri rj āˆ’ ri rj
( r2
i āˆ’ ri
2) r2
j āˆ’ rj
2
.
Convert the correlation coeļ¬ƒcients Ļij into distances:
dij = 2(1 āˆ’ Ļij ).
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
The standard methodology - description
From all the distances dij , compute a minimum spanning tree:
Figure: A minimum spanning tree of stocks (from [1]); stocks from the
same industry (represented by color) tend to cluster together
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
The standard methodology - limitations
ā€¢ MST clustering equivalent to Single Linkage clustering:
ā€¢ chaining phenomenon
ā€¢ not stable to noise / small perturbations [11]
ā€¢ Use of the Pearson correlation:
ā€¢ can take value 0 whereas variables are strongly dependent
ā€¢ not invariant to variable monotone transformations
ā€¢ not robust to outliers
Is it still useful for ļ¬nancial time series? stocks? CDS??!
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
The standard methodology - limitations
ā€¢ MST clustering equivalent to Single Linkage clustering:
ā€¢ chaining phenomenon
ā€¢ not stable to noise / small perturbations [11]
ā€¢ Use of the Pearson correlation:
ā€¢ can take value 0 whereas variables are strongly dependent
ā€¢ not invariant to variables monotone transformations
ā€¢ not robust to outliers
Is it still useful for ļ¬nancial time series? stocks? CDS??!
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
1 Introduction
2 The standard methodology
3 Exploring dependence between returns
4 Copula-based dependence coeļ¬ƒcients (clustering distances)
5 Empirical convergence rates
6 Beyond dependence: a (copula,margins) representation
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Copulas
Sklarā€™s Theorem [13]
For (Xi , Xj ) having continuous marginal cdfs FXi
, FXj
, its joint cumulative
distribution F is uniquely expressed as
F(Xi , Xj ) = C(FXi
(Xi ), FXj
(Xj )),
where C is known as the copula of (Xi , Xj ).
Copulaā€™s uniform marginals jointly encode all the dependence.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
From ranks to empirical copula
ri , rj are the rank statistics of Xi , Xj respectively, i.e. rt
i is the rank
of Xt
i in {X1
i , . . . , XT
i }: rt
i = T
k=1 1{Xk
i ā‰¤ Xt
i }.
Deheuvelsā€™ empirical copula [3]
Any copula Ė†C deļ¬ned on the lattice L = {( ti
T ,
tj
T ) : ti , tj = 0, . . . , T} by
Ė†C( ti
T ,
tj
T ) = 1
T
T
t=1 1{rt
i ā‰¤ ti , rt
j ā‰¤ tj } is an empirical copula.
Ė†C is a consistent estimator of C with uniform convergence [4].
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Clustering of bivariate empirical copulas
Generate the N
2 bivariate empirical copulas
Find clusters of copulas using optimal transport [10, 9]
Compute and display the clustersā€™ centroids [2]
Some code available at www.datagrapple.com/Tech.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Copula-centers for stocks (CAC 40)
Figure: Stocks: More mass in the bottom-left corner, i.e. lower tail
dependence. Stock prices tend to plummet together.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Copula-centers for Credit Default Swaps (XO index)
Figure: Credit default swaps: More mass in the top-right corner, i.e.
upper tail dependence. Insurance cost against entitiesā€™ default tends to
soar in stressed market.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
1 Introduction
2 The standard methodology
3 Exploring dependence between returns
4 Copula-based dependence coeļ¬ƒcients (clustering distances)
5 Empirical convergence rates
6 Beyond dependence: a (copula,margins) representation
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Dependence as relative distances between copulas
C copula of (Xi , Xj ),
|u āˆ’ v|/
āˆš
2 distance between (u, v) to the diagonal
Spearmanā€™s ĻS :
ĻS (Xi , Xj ) = 12
1
0
1
0
(C(u, v) āˆ’ uv)dudv
= 1 āˆ’ 6
1
0
1
0
(u āˆ’ v)2
dC(u, v)
Many correlation coeļ¬ƒcients can be expressed as distances to the
FrĀ“echetā€“Hoeļ¬€ding bounds or the independence [6]. Some are explicitely
built this way (e.g. [12, 5, 9]).
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
A metric space for copulas: Optimal Transport
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
The Target/Forget Dependence Coeļ¬ƒcient (TFDC)
Now, we can deļ¬ne our bespoke dependence coeļ¬ƒcient:
Build the forget-dependence copulas {CF
l }l
Build the target-dependence copulas {CT
k }k
Compute the empirical copula Cij from xi , xj
TFDC(Cij ) =
minl D(CF
l , Cij )
minl D(CF
l , Cij ) + mink D(Cij , CT
k )
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Spearman vs. TFDC
0.0 0.2 0.4 0.6 0.8 1.0
discontinuity position a
0.0
0.2
0.4
0.6
0.8
1.0
Estimatedpositivedependence
Spearman & TFDC values as a function of a
TFDC
Spearman
Figure: Empirical copulas for (X, Y ) where
X = Z1{Z < a} + X 1{Z > a},
Y = Z1{Z < a + 0.25} + Y 1{Z > a + 0.25}, a = 0, 0.05, . . . , 0.95, 1,
and where Z is uniform on [0, 1] and X , Y are independent noises (left).
TFDC and Spearman coeļ¬ƒcients estimated between X and Y as a
function of a (right).
For a = 0.75, Spearman coeļ¬ƒcient yields a negative value, yet X = Y
over [0, a].
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
1 Introduction
2 The standard methodology
3 Exploring dependence between returns
4 Copula-based dependence coeļ¬ƒcients (clustering distances)
5 Empirical convergence rates
6 Beyond dependence: a (copula,margins) representation
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Process: Recovering a simulated ground-truth [8]
A simulation & benchmark process that needs to be reļ¬ned:
Extract (using a large sample) a ļ¬ltered correlation matrix R
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Process: Recovering a simulated ground-truth [8]
A simulation & benchmark process that needs to be reļ¬ned:
Generate samples of size T = 10, . . . , 20, . . . from a relevant
distribution (parameterized by R)
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Process: Recovering a simulated ground-truth [8]
A simulation & benchmark process that needs to be reļ¬ned:
Compute the ratio of the number of correct clustering
obtained over the number of trials as a function of T
100 200 300 400 500
Sample size
0.0
0.2
0.4
0.6
0.8
1.0
Score
Empirical rates of convergence for Single Linkage
Gaussian - Pearson
Gaussian - Spearman
Student - Pearson
Student - Spearman
100 200 300 400 500
Sample size
0.0
0.2
0.4
0.6
0.8
1.0
Score
Empirical rates of convergence for Average Linkage
Gaussian - Pearson
Gaussian - Spearman
Student - Pearson
Student - Spearman
100 200 300 400 500
Sample size
0.0
0.2
0.4
0.6
0.8
1.0
Score
Empirical rates of convergence for Ward
Gaussian - Pearson
Gaussian - Spearman
Student - Pearson
Student - Spearman
A full comparative study will be posted online at www.datagrapple.com/Tech.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
1 Introduction
2 The standard methodology
3 Exploring dependence between returns
4 Copula-based dependence coeļ¬ƒcients (clustering distances)
5 Empirical convergence rates
6 Beyond dependence: a (copula,margins) representation
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
ON CLUSTERING FINANCIAL TIME SERIES
GAUTIER MARTI, PHILIPPE DONNAT AND FRANK NIELSEN
NOISY CORRELATION MATRICES
Let X be the matrix storing the standardized re-
turns of N = 560 assets (credit default swaps)
over a period of T = 2500 trading days.
Then, the empirical correlation matrix of the re-
turns is
C =
1
T
XX .
We can compute the empirical density of its
eigenvalues
Ļ(Ī») =
1
N
dn(Ī»)
dĪ»
,
where n(Ī») counts the number of eigenvalues of
C less than Ī».
From random matrix theory, the Marchenko-
Pastur distribution gives the limit distribution as
N ā†’ āˆž, T ā†’ āˆž and T/N ļ¬xed. It reads:
Ļ(Ī») =
T/N
2Ļ€
(Ī»max āˆ’ Ī»)(Ī» āˆ’ Ī»min)
Ī»
,
where Ī»max
min = 1 + N/T Ā± 2 N/T, and Ī» āˆˆ
[Ī»min, Ī»max].
0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
Ī»
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
Ļ(Ī»)
Figure 1: Marchenko-Pastur density vs. empirical den-
sity of the correlation matrix eigenvalues
Notice that the Marchenko-Pastur density ļ¬ts
well the empirical density meaning that most of
the information contained in the empirical corre-
lation matrix amounts to noise: only 26 eigenval-
ues are greater than Ī»max.
The highest eigenvalue corresponds to the ā€˜mar-
ketā€™, the 25 others can be associated to ā€˜industrial
sectorsā€™.
CLUSTERING TIME SERIES
Given a correlation matrix of the returns,
0 100 200 300 400 500
0
100
200
300
400
500
Figure 2: An empirical and noisy correlation matrix
one can re-order assets using a hierarchical clus-
tering algorithm to make the hierarchical correla-
tion pattern blatant,
0 100 200 300 400 500
0
100
200
300
400
500
Figure 3: The same noisy correlation matrix re-ordered
by a hierarchical clustering algorithm
and ļ¬nally ļ¬lter the noise according to the corre-
lation pattern:
0 100 200 300 400 500
0
100
200
300
400
500
Figure 4: The resulting ļ¬ltered correlation matrix
BEYOND CORRELATION
Sklarā€™s Theorem. For any random vector X = (X1, . . . , XN ) having continuous marginal cumulative
distribution functions Fi, its joint cumulative distribution F is uniquely expressed as
F(X1, . . . , XN ) = C(F1(X1), . . . , FN (XN )),
where C, the multivariate distribution of uniform marginals, is known as the copula of X.
Figure 5: ArcelorMittal and SociĆ©tĆ© gĆ©nĆ©rale prices are projected on dependence āŠ• distribution space; notice their
heavy-tailed exponential distribution.
Let Īø āˆˆ [0, 1]. Let (X, Y ) āˆˆ V2
. Let G = (GX, GY ), where GX and GY are respectively X and Y marginal
cdf. We deļ¬ne the following distance
d2
Īø(X, Y ) = Īød2
1(GX(X), GY (Y )) + (1 āˆ’ Īø)d2
0(GX, GY ),
where d2
1(GX(X), GY (Y )) = 3E[|GX(X) āˆ’ GY (Y )|2
], and d2
0(GX, GY ) = 1
2 R
dGX
dĪ» āˆ’ dGY
dĪ»
2
dĪ».
CLUSTERING RESULTS & STABILITY
0 5 10 15 20 25 30
Standard Deviation in basis points
0
5
10
15
20
25
30
35
Numberofoccurrences
Standard Deviations Histogram
Figure 6: (Top) The returns correlation structure ap-
pears more clearly using rank correlation; (Bottom)
Clusters of returns distributions can be partly described
by the returns volatility
Figure 7: Stability test on Odd/Even trading days sub-
sampling: our approach (GNPR) yields more stable
clusters with respect to this perturbation than standard
approaches (using Pearson correlation or L2 distances).
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Ricardo Coelho, Przemyslaw Repetowicz, Stefan Hutzler, and
Peter Richmond.
Investigation of Cluster Structure in the London Stock
Exchange.
Marco Cuturi and Arnaud Doucet.
Fast computation of wasserstein barycenters.
In Proceedings of the 31th International Conference on
Machine Learning, ICML 2014, Beijing, China, 21-26 June
2014, pages 685ā€“693, 2014.
Paul Deheuvels.
La fonction de dĀ“ependance empirique et ses propriĀ“etĀ“es. un test
non paramĀ“etrique dā€™indĀ“ependance.
Acad. Roy. Belg. Bull. Cl. Sci.(5), 65(6):274ā€“292, 1979.
Paul Deheuvels.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
A non-parametric test for independence.
Publications de lā€™Institut de Statistique de lā€™UniversitĀ“e de
Paris, 26:29ā€“50, 1981.
Fabrizio Durante and Roberta Pappada.
Cluster analysis of time series via kendall distribution.
In Strengthening Links Between Data Analysis and Soft
Computing, pages 209ā€“216. Springer, 2015.
Eckhard Liebscher et al.
Copula-based dependence measures.
Dependence Modeling, 2(1):49ā€“64, 2014.
Rosario N Mantegna.
Hierarchical structure in ļ¬nancial markets.
The European Physical Journal B-Condensed Matter and
Complex Systems, 11(1):193ā€“197, 1999.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Gautier Marti, SĀ“ebastien Andler, Frank Nielsen, and Philippe
Donnat.
Clustering ļ¬nancial time series: How long is enough?
Proceedings of the Twenty-Fifth International Joint
Conference on Artiļ¬cial Intelligence, IJCAI 2016, New York,
NY, USA, 9-15 July 2016, pages 2583ā€“2589, 2016.
Gautier Marti, Sebastien Andler, Frank Nielsen, and Philippe
Donnat.
Exploring and measuring non-linear correlations: Copulas,
lightspeed transportation and clustering.
NIPS 2016 Time Series Workshop, 55, 2016.
Gautier Marti, SĀ“ebastien Andler, Frank Nielsen, and Philippe
Donnat.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
Optimal transport vs. ļ¬sher-rao distance between copulas for
clustering multivariate time series.
In IEEE Statistical Signal Processing Workshop, SSP 2016,
Palma de Mallorca, Spain, June 26-29, 2016, pages 1ā€“5, 2016.
Gautier Marti, Philippe Very, Philippe Donnat, and Frank
Nielsen.
A proposal of a methodological framework with experimental
guidelines to investigate clustering stability on ļ¬nancial time
series.
In 14th IEEE International Conference on Machine Learning
and Applications, ICMLA 2015, Miami, FL, USA, December
9-11, 2015, pages 32ā€“37, 2015.
BarnabĀ“as PĀ“oczos, Zoubin Ghahramani, and Jeļ¬€ G. Schneider.
Copula-based kernel dependency measures.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
HELLEBORECAPITAL
Introduction
The standard methodology
Exploring dependence between returns
Copula-based dependence coeļ¬ƒcients (clustering distances)
Empirical convergence rates
Beyond dependence: a (copula,margins) representation
In Proceedings of the 29th International Conference on
Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June
26 - July 1, 2012, 2012.
A Sklar.
Fonctions de rĀ“epartition `a n dimensions et leurs marges.
UniversitĀ“e Paris 8, 1959.
Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r

Weitere Ƥhnliche Inhalte

Was ist angesagt?

A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...Gautier Marti
Ā 
Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time SeriesGautier Marti
Ā 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesGautier Marti
Ā 
A Maximum Entropy Approach to the Loss Data Aggregation Problem
A Maximum Entropy Approach to the Loss Data Aggregation ProblemA Maximum Entropy Approach to the Loss Data Aggregation Problem
A Maximum Entropy Approach to the Loss Data Aggregation ProblemErika G. G.
Ā 
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Mokhtar SELLAMI
Ā 
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Chiheb Ben Hammouda
Ā 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...Chiheb Ben Hammouda
Ā 
Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...Chiheb Ben Hammouda
Ā 
Bayesian model choice in cosmology
Bayesian model choice in cosmologyBayesian model choice in cosmology
Bayesian model choice in cosmologyChristian Robert
Ā 
31 Machine Learning Unsupervised Cluster Validity
31 Machine Learning Unsupervised Cluster Validity31 Machine Learning Unsupervised Cluster Validity
31 Machine Learning Unsupervised Cluster ValidityAndres Mendez-Vazquez
Ā 
Using Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication FlowUsing Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication FlowMartin Harrigan
Ā 
Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...
Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...
Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...Lake Como School of Advanced Studies
Ā 
Affine Term Structure Model with Stochastic Market Price of Risk
Affine Term Structure Model with Stochastic Market Price of RiskAffine Term Structure Model with Stochastic Market Price of Risk
Affine Term Structure Model with Stochastic Market Price of RiskSwati Mital
Ā 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsJulyan Arbel
Ā 
Affine cascade models for term structure dynamics of sovereign yield curves
Affine cascade models for term structure dynamics of sovereign yield curvesAffine cascade models for term structure dynamics of sovereign yield curves
Affine cascade models for term structure dynamics of sovereign yield curvesLAURAMICHAELA
Ā 
11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...Alexander Decker
Ā 
Uncertain Volatility Models
Uncertain Volatility ModelsUncertain Volatility Models
Uncertain Volatility ModelsSwati Mital
Ā 
Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)Swati Mital
Ā 

Was ist angesagt? (20)

A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...
Ā 
Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time Series
Ā 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Ā 
ABC in Varanasi
ABC in VaranasiABC in Varanasi
ABC in Varanasi
Ā 
A Maximum Entropy Approach to the Loss Data Aggregation Problem
A Maximum Entropy Approach to the Loss Data Aggregation ProblemA Maximum Entropy Approach to the Loss Data Aggregation Problem
A Maximum Entropy Approach to the Loss Data Aggregation Problem
Ā 
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Ā 
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Ā 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
Ā 
Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...
Ā 
Bayesian model choice in cosmology
Bayesian model choice in cosmologyBayesian model choice in cosmology
Bayesian model choice in cosmology
Ā 
SwingOptions
SwingOptionsSwingOptions
SwingOptions
Ā 
31 Machine Learning Unsupervised Cluster Validity
31 Machine Learning Unsupervised Cluster Validity31 Machine Learning Unsupervised Cluster Validity
31 Machine Learning Unsupervised Cluster Validity
Ā 
Using Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication FlowUsing Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication Flow
Ā 
Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...
Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...
Econophysics III: Financial Correlations and Portfolio Optimization - Thomas ...
Ā 
Affine Term Structure Model with Stochastic Market Price of Risk
Affine Term Structure Model with Stochastic Market Price of RiskAffine Term Structure Model with Stochastic Market Price of Risk
Affine Term Structure Model with Stochastic Market Price of Risk
Ā 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian Nonparametrics
Ā 
Affine cascade models for term structure dynamics of sovereign yield curves
Affine cascade models for term structure dynamics of sovereign yield curvesAffine cascade models for term structure dynamics of sovereign yield curves
Affine cascade models for term structure dynamics of sovereign yield curves
Ā 
11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...
Ā 
Uncertain Volatility Models
Uncertain Volatility ModelsUncertain Volatility Models
Uncertain Volatility Models
Ā 
Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)Pricing interest rate derivatives (ext)
Pricing interest rate derivatives (ext)
Ā 

Ƅhnlich wie Clustering CDS: algorithms, distances, stability and convergence rates

Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale GraphsVertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale GraphsUniversidade de SĆ£o Paulo
Ā 
ders 3.3 Unit root testing section 3 .pptx
ders 3.3 Unit root testing section 3 .pptxders 3.3 Unit root testing section 3 .pptx
ders 3.3 Unit root testing section 3 .pptxErgin Akalpler
Ā 
An Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAn Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAngie Miller
Ā 
Statistical quality__control_2
Statistical  quality__control_2Statistical  quality__control_2
Statistical quality__control_2Tech_MX
Ā 
Multiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsMultiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsChristian Robert
Ā 
Usp chemical medicines & excipients - evolution of validation practices
Usp    chemical medicines & excipients - evolution of validation practicesUsp    chemical medicines & excipients - evolution of validation practices
Usp chemical medicines & excipients - evolution of validation practicesNational Institute of Biologics
Ā 
Automatic Visualization
Automatic VisualizationAutomatic Visualization
Automatic VisualizationSri Ambati
Ā 
Automatic Visualization - Leland Wilkinson, Chief Scientist, H2O.ai
Automatic Visualization - Leland Wilkinson, Chief Scientist, H2O.aiAutomatic Visualization - Leland Wilkinson, Chief Scientist, H2O.ai
Automatic Visualization - Leland Wilkinson, Chief Scientist, H2O.aiSri Ambati
Ā 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfAlemAyahu
Ā 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spssDr Nisha Arora
Ā 
Identification of Outliersin Time Series Data via Simulation Study
Identification of Outliersin Time Series Data via Simulation StudyIdentification of Outliersin Time Series Data via Simulation Study
Identification of Outliersin Time Series Data via Simulation Studyiosrjce
Ā 
Measure of Dispersion in statistics
Measure of Dispersion in statisticsMeasure of Dispersion in statistics
Measure of Dispersion in statisticsMd. Mehadi Hassan Bappy
Ā 
Conducting and reporting the results of a cfd simulation
Conducting and reporting the results of a cfd simulationConducting and reporting the results of a cfd simulation
Conducting and reporting the results of a cfd simulationMalik Abdul Wahab
Ā 
Direct use of hydroclimatic information for reservoir operation
Direct use of hydroclimatic information for reservoir operationDirect use of hydroclimatic information for reservoir operation
Direct use of hydroclimatic information for reservoir operationAndrea Castelletti
Ā 
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxDistribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxmadlynplamondon
Ā 

Ƅhnlich wie Clustering CDS: algorithms, distances, stability and convergence rates (20)

Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale GraphsVertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Ā 
ders 3.3 Unit root testing section 3 .pptx
ders 3.3 Unit root testing section 3 .pptxders 3.3 Unit root testing section 3 .pptx
ders 3.3 Unit root testing section 3 .pptx
Ā 
An Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAn Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer Design
Ā 
SEM
SEMSEM
SEM
Ā 
Statistical quality__control_2
Statistical  quality__control_2Statistical  quality__control_2
Statistical quality__control_2
Ā 
Multiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsMultiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximations
Ā 
Usp chemical medicines & excipients - evolution of validation practices
Usp    chemical medicines & excipients - evolution of validation practicesUsp    chemical medicines & excipients - evolution of validation practices
Usp chemical medicines & excipients - evolution of validation practices
Ā 
Automatic Visualization
Automatic VisualizationAutomatic Visualization
Automatic Visualization
Ā 
Automatic Visualization - Leland Wilkinson, Chief Scientist, H2O.ai
Automatic Visualization - Leland Wilkinson, Chief Scientist, H2O.aiAutomatic Visualization - Leland Wilkinson, Chief Scientist, H2O.ai
Automatic Visualization - Leland Wilkinson, Chief Scientist, H2O.ai
Ā 
GARCH
GARCHGARCH
GARCH
Ā 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Are Reported Likelihoo...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Are Reported Likelihoo...MUMS: Bayesian, Fiducial, and Frequentist Conference - Are Reported Likelihoo...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Are Reported Likelihoo...
Ā 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdf
Ā 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spss
Ā 
Identification of Outliersin Time Series Data via Simulation Study
Identification of Outliersin Time Series Data via Simulation StudyIdentification of Outliersin Time Series Data via Simulation Study
Identification of Outliersin Time Series Data via Simulation Study
Ā 
Autocorrelation (1)
Autocorrelation (1)Autocorrelation (1)
Autocorrelation (1)
Ā 
Measure of Dispersion in statistics
Measure of Dispersion in statisticsMeasure of Dispersion in statistics
Measure of Dispersion in statistics
Ā 
icpr_2012
icpr_2012icpr_2012
icpr_2012
Ā 
Conducting and reporting the results of a cfd simulation
Conducting and reporting the results of a cfd simulationConducting and reporting the results of a cfd simulation
Conducting and reporting the results of a cfd simulation
Ā 
Direct use of hydroclimatic information for reservoir operation
Direct use of hydroclimatic information for reservoir operationDirect use of hydroclimatic information for reservoir operation
Direct use of hydroclimatic information for reservoir operation
Ā 
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docxDistribution of EstimatesLinear Regression ModelAssume (yt,.docx
Distribution of EstimatesLinear Regression ModelAssume (yt,.docx
Ā 

Mehr von Gautier Marti

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeGautier Marti
Ā 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...Gautier Marti
Ā 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsGautier Marti
Ā 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...Gautier Marti
Ā 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?Gautier Marti
Ā 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGautier Marti
Ā 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in FinanceGautier Marti
Ā 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsGautier Marti
Ā 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaGautier Marti
Ā 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationGautier Marti
Ā 

Mehr von Gautier Marti (10)

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of Code
Ā 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...
Ā 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptions
Ā 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
Ā 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?
Ā 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in Finance
Ā 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in Finance
Ā 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returns
Ā 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, California
Ā 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond Correlation
Ā 

KĆ¼rzlich hochgeladen

Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
Ā 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
Ā 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
Ā 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
Ā 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
Ā 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
Ā 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
Ā 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
Ā 
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...amitlee9823
Ā 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...shivangimorya083
Ā 
Delhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip Callshivangimorya083
Ā 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
Ā 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
Ā 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
Ā 
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Delhi Call girls
Ā 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
Ā 

KĆ¼rzlich hochgeladen (20)

Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
Ā 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Ā 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
Ā 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
Ā 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
Ā 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
Ā 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
Ā 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
Ā 
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Ā 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
Ā 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171āœ”ļøBody to body massage wit...
Ā 
Delhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ā˜Žāœ”šŸ‘Œāœ” Whatsapp Hard And Sexy Vip Call
Ā 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
Ā 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
Ā 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
Ā 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
Ā 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Ā 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Ā 
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Ā 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
Ā 

Clustering CDS: algorithms, distances, stability and convergence rates

  • 1. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Clustering CDS: algorithms, distances, stability and convergence rates CMStatistics 2016, University of Seville, Spain Gautier Marti, Frank Nielsen, Philippe Donnat HELLEBORECAPITAL December 9, 2016 Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 2. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation 1 Introduction 2 The standard methodology 3 Exploring dependence between returns 4 Copula-based dependence coeļ¬ƒcients (clustering distances) 5 Empirical convergence rates 6 Beyond dependence: a (copula,margins) representation Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 3. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Introduction Goal: Finding groups of ā€™homogeneousā€™ assets that can help to: ā€¢ build alternative measures of risk, ā€¢ elaborate trading strategies. . . But, we need a high conļ¬dence in these clusters (networks). So, we need appropriate AND fast converging methodologies [8]: to be consistent yet eļ¬ƒcient (biasā€“variance tradeoļ¬€), to avoid non-stationarity of the time series (too large sample). A good model selection criterion: Minimum sample size to reach a given ā€™accuracyā€™. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 4. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation 1 Introduction 2 The standard methodology 3 Exploring dependence between returns 4 Copula-based dependence coeļ¬ƒcients (clustering distances) 5 Empirical convergence rates 6 Beyond dependence: a (copula,margins) representation Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 5. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation The standard methodology - description The methodology widely adopted in empirical studies: [7]. Let N be the number of assets. Let Pi (t) be the price at time t of asset i, 1 ā‰¤ i ā‰¤ N. Let ri (t) be the log-return at time t of asset i: ri (t) = log Pi (t) āˆ’ log Pi (t āˆ’ 1). For each pair i, j of assets, compute their correlation: Ļij = ri rj āˆ’ ri rj ( r2 i āˆ’ ri 2) r2 j āˆ’ rj 2 . Convert the correlation coeļ¬ƒcients Ļij into distances: dij = 2(1 āˆ’ Ļij ). Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 6. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation The standard methodology - description From all the distances dij , compute a minimum spanning tree: Figure: A minimum spanning tree of stocks (from [1]); stocks from the same industry (represented by color) tend to cluster together Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 7. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation The standard methodology - limitations ā€¢ MST clustering equivalent to Single Linkage clustering: ā€¢ chaining phenomenon ā€¢ not stable to noise / small perturbations [11] ā€¢ Use of the Pearson correlation: ā€¢ can take value 0 whereas variables are strongly dependent ā€¢ not invariant to variable monotone transformations ā€¢ not robust to outliers Is it still useful for ļ¬nancial time series? stocks? CDS??! Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 8. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation The standard methodology - limitations ā€¢ MST clustering equivalent to Single Linkage clustering: ā€¢ chaining phenomenon ā€¢ not stable to noise / small perturbations [11] ā€¢ Use of the Pearson correlation: ā€¢ can take value 0 whereas variables are strongly dependent ā€¢ not invariant to variables monotone transformations ā€¢ not robust to outliers Is it still useful for ļ¬nancial time series? stocks? CDS??! Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 9. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation 1 Introduction 2 The standard methodology 3 Exploring dependence between returns 4 Copula-based dependence coeļ¬ƒcients (clustering distances) 5 Empirical convergence rates 6 Beyond dependence: a (copula,margins) representation Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 10. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Copulas Sklarā€™s Theorem [13] For (Xi , Xj ) having continuous marginal cdfs FXi , FXj , its joint cumulative distribution F is uniquely expressed as F(Xi , Xj ) = C(FXi (Xi ), FXj (Xj )), where C is known as the copula of (Xi , Xj ). Copulaā€™s uniform marginals jointly encode all the dependence. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 11. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation From ranks to empirical copula ri , rj are the rank statistics of Xi , Xj respectively, i.e. rt i is the rank of Xt i in {X1 i , . . . , XT i }: rt i = T k=1 1{Xk i ā‰¤ Xt i }. Deheuvelsā€™ empirical copula [3] Any copula Ė†C deļ¬ned on the lattice L = {( ti T , tj T ) : ti , tj = 0, . . . , T} by Ė†C( ti T , tj T ) = 1 T T t=1 1{rt i ā‰¤ ti , rt j ā‰¤ tj } is an empirical copula. Ė†C is a consistent estimator of C with uniform convergence [4]. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 12. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Clustering of bivariate empirical copulas Generate the N 2 bivariate empirical copulas Find clusters of copulas using optimal transport [10, 9] Compute and display the clustersā€™ centroids [2] Some code available at www.datagrapple.com/Tech. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 13. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Copula-centers for stocks (CAC 40) Figure: Stocks: More mass in the bottom-left corner, i.e. lower tail dependence. Stock prices tend to plummet together. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 14. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Copula-centers for Credit Default Swaps (XO index) Figure: Credit default swaps: More mass in the top-right corner, i.e. upper tail dependence. Insurance cost against entitiesā€™ default tends to soar in stressed market. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 15. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation 1 Introduction 2 The standard methodology 3 Exploring dependence between returns 4 Copula-based dependence coeļ¬ƒcients (clustering distances) 5 Empirical convergence rates 6 Beyond dependence: a (copula,margins) representation Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 16. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Dependence as relative distances between copulas C copula of (Xi , Xj ), |u āˆ’ v|/ āˆš 2 distance between (u, v) to the diagonal Spearmanā€™s ĻS : ĻS (Xi , Xj ) = 12 1 0 1 0 (C(u, v) āˆ’ uv)dudv = 1 āˆ’ 6 1 0 1 0 (u āˆ’ v)2 dC(u, v) Many correlation coeļ¬ƒcients can be expressed as distances to the FrĀ“echetā€“Hoeļ¬€ding bounds or the independence [6]. Some are explicitely built this way (e.g. [12, 5, 9]). Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 17. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation A metric space for copulas: Optimal Transport Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 18. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation The Target/Forget Dependence Coeļ¬ƒcient (TFDC) Now, we can deļ¬ne our bespoke dependence coeļ¬ƒcient: Build the forget-dependence copulas {CF l }l Build the target-dependence copulas {CT k }k Compute the empirical copula Cij from xi , xj TFDC(Cij ) = minl D(CF l , Cij ) minl D(CF l , Cij ) + mink D(Cij , CT k ) Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 19. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Spearman vs. TFDC 0.0 0.2 0.4 0.6 0.8 1.0 discontinuity position a 0.0 0.2 0.4 0.6 0.8 1.0 Estimatedpositivedependence Spearman & TFDC values as a function of a TFDC Spearman Figure: Empirical copulas for (X, Y ) where X = Z1{Z < a} + X 1{Z > a}, Y = Z1{Z < a + 0.25} + Y 1{Z > a + 0.25}, a = 0, 0.05, . . . , 0.95, 1, and where Z is uniform on [0, 1] and X , Y are independent noises (left). TFDC and Spearman coeļ¬ƒcients estimated between X and Y as a function of a (right). For a = 0.75, Spearman coeļ¬ƒcient yields a negative value, yet X = Y over [0, a]. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 20. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation 1 Introduction 2 The standard methodology 3 Exploring dependence between returns 4 Copula-based dependence coeļ¬ƒcients (clustering distances) 5 Empirical convergence rates 6 Beyond dependence: a (copula,margins) representation Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 21. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Process: Recovering a simulated ground-truth [8] A simulation & benchmark process that needs to be reļ¬ned: Extract (using a large sample) a ļ¬ltered correlation matrix R Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 22. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Process: Recovering a simulated ground-truth [8] A simulation & benchmark process that needs to be reļ¬ned: Generate samples of size T = 10, . . . , 20, . . . from a relevant distribution (parameterized by R) Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 23. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Process: Recovering a simulated ground-truth [8] A simulation & benchmark process that needs to be reļ¬ned: Compute the ratio of the number of correct clustering obtained over the number of trials as a function of T 100 200 300 400 500 Sample size 0.0 0.2 0.4 0.6 0.8 1.0 Score Empirical rates of convergence for Single Linkage Gaussian - Pearson Gaussian - Spearman Student - Pearson Student - Spearman 100 200 300 400 500 Sample size 0.0 0.2 0.4 0.6 0.8 1.0 Score Empirical rates of convergence for Average Linkage Gaussian - Pearson Gaussian - Spearman Student - Pearson Student - Spearman 100 200 300 400 500 Sample size 0.0 0.2 0.4 0.6 0.8 1.0 Score Empirical rates of convergence for Ward Gaussian - Pearson Gaussian - Spearman Student - Pearson Student - Spearman A full comparative study will be posted online at www.datagrapple.com/Tech. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 24. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation 1 Introduction 2 The standard methodology 3 Exploring dependence between returns 4 Copula-based dependence coeļ¬ƒcients (clustering distances) 5 Empirical convergence rates 6 Beyond dependence: a (copula,margins) representation Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 25. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation ON CLUSTERING FINANCIAL TIME SERIES GAUTIER MARTI, PHILIPPE DONNAT AND FRANK NIELSEN NOISY CORRELATION MATRICES Let X be the matrix storing the standardized re- turns of N = 560 assets (credit default swaps) over a period of T = 2500 trading days. Then, the empirical correlation matrix of the re- turns is C = 1 T XX . We can compute the empirical density of its eigenvalues Ļ(Ī») = 1 N dn(Ī») dĪ» , where n(Ī») counts the number of eigenvalues of C less than Ī». From random matrix theory, the Marchenko- Pastur distribution gives the limit distribution as N ā†’ āˆž, T ā†’ āˆž and T/N ļ¬xed. It reads: Ļ(Ī») = T/N 2Ļ€ (Ī»max āˆ’ Ī»)(Ī» āˆ’ Ī»min) Ī» , where Ī»max min = 1 + N/T Ā± 2 N/T, and Ī» āˆˆ [Ī»min, Ī»max]. 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 Ī» 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 Ļ(Ī») Figure 1: Marchenko-Pastur density vs. empirical den- sity of the correlation matrix eigenvalues Notice that the Marchenko-Pastur density ļ¬ts well the empirical density meaning that most of the information contained in the empirical corre- lation matrix amounts to noise: only 26 eigenval- ues are greater than Ī»max. The highest eigenvalue corresponds to the ā€˜mar- ketā€™, the 25 others can be associated to ā€˜industrial sectorsā€™. CLUSTERING TIME SERIES Given a correlation matrix of the returns, 0 100 200 300 400 500 0 100 200 300 400 500 Figure 2: An empirical and noisy correlation matrix one can re-order assets using a hierarchical clus- tering algorithm to make the hierarchical correla- tion pattern blatant, 0 100 200 300 400 500 0 100 200 300 400 500 Figure 3: The same noisy correlation matrix re-ordered by a hierarchical clustering algorithm and ļ¬nally ļ¬lter the noise according to the corre- lation pattern: 0 100 200 300 400 500 0 100 200 300 400 500 Figure 4: The resulting ļ¬ltered correlation matrix BEYOND CORRELATION Sklarā€™s Theorem. For any random vector X = (X1, . . . , XN ) having continuous marginal cumulative distribution functions Fi, its joint cumulative distribution F is uniquely expressed as F(X1, . . . , XN ) = C(F1(X1), . . . , FN (XN )), where C, the multivariate distribution of uniform marginals, is known as the copula of X. Figure 5: ArcelorMittal and SociĆ©tĆ© gĆ©nĆ©rale prices are projected on dependence āŠ• distribution space; notice their heavy-tailed exponential distribution. Let Īø āˆˆ [0, 1]. Let (X, Y ) āˆˆ V2 . Let G = (GX, GY ), where GX and GY are respectively X and Y marginal cdf. We deļ¬ne the following distance d2 Īø(X, Y ) = Īød2 1(GX(X), GY (Y )) + (1 āˆ’ Īø)d2 0(GX, GY ), where d2 1(GX(X), GY (Y )) = 3E[|GX(X) āˆ’ GY (Y )|2 ], and d2 0(GX, GY ) = 1 2 R dGX dĪ» āˆ’ dGY dĪ» 2 dĪ». CLUSTERING RESULTS & STABILITY 0 5 10 15 20 25 30 Standard Deviation in basis points 0 5 10 15 20 25 30 35 Numberofoccurrences Standard Deviations Histogram Figure 6: (Top) The returns correlation structure ap- pears more clearly using rank correlation; (Bottom) Clusters of returns distributions can be partly described by the returns volatility Figure 7: Stability test on Odd/Even trading days sub- sampling: our approach (GNPR) yields more stable clusters with respect to this perturbation than standard approaches (using Pearson correlation or L2 distances). Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 26. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Ricardo Coelho, Przemyslaw Repetowicz, Stefan Hutzler, and Peter Richmond. Investigation of Cluster Structure in the London Stock Exchange. Marco Cuturi and Arnaud Doucet. Fast computation of wasserstein barycenters. In Proceedings of the 31th International Conference on Machine Learning, ICML 2014, Beijing, China, 21-26 June 2014, pages 685ā€“693, 2014. Paul Deheuvels. La fonction de dĀ“ependance empirique et ses propriĀ“etĀ“es. un test non paramĀ“etrique dā€™indĀ“ependance. Acad. Roy. Belg. Bull. Cl. Sci.(5), 65(6):274ā€“292, 1979. Paul Deheuvels. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 27. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation A non-parametric test for independence. Publications de lā€™Institut de Statistique de lā€™UniversitĀ“e de Paris, 26:29ā€“50, 1981. Fabrizio Durante and Roberta Pappada. Cluster analysis of time series via kendall distribution. In Strengthening Links Between Data Analysis and Soft Computing, pages 209ā€“216. Springer, 2015. Eckhard Liebscher et al. Copula-based dependence measures. Dependence Modeling, 2(1):49ā€“64, 2014. Rosario N Mantegna. Hierarchical structure in ļ¬nancial markets. The European Physical Journal B-Condensed Matter and Complex Systems, 11(1):193ā€“197, 1999. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 28. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Gautier Marti, SĀ“ebastien Andler, Frank Nielsen, and Philippe Donnat. Clustering ļ¬nancial time series: How long is enough? Proceedings of the Twenty-Fifth International Joint Conference on Artiļ¬cial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016, pages 2583ā€“2589, 2016. Gautier Marti, Sebastien Andler, Frank Nielsen, and Philippe Donnat. Exploring and measuring non-linear correlations: Copulas, lightspeed transportation and clustering. NIPS 2016 Time Series Workshop, 55, 2016. Gautier Marti, SĀ“ebastien Andler, Frank Nielsen, and Philippe Donnat. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 29. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation Optimal transport vs. ļ¬sher-rao distance between copulas for clustering multivariate time series. In IEEE Statistical Signal Processing Workshop, SSP 2016, Palma de Mallorca, Spain, June 26-29, 2016, pages 1ā€“5, 2016. Gautier Marti, Philippe Very, Philippe Donnat, and Frank Nielsen. A proposal of a methodological framework with experimental guidelines to investigate clustering stability on ļ¬nancial time series. In 14th IEEE International Conference on Machine Learning and Applications, ICMLA 2015, Miami, FL, USA, December 9-11, 2015, pages 32ā€“37, 2015. BarnabĀ“as PĀ“oczos, Zoubin Ghahramani, and Jeļ¬€ G. Schneider. Copula-based kernel dependency measures. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r
  • 30. HELLEBORECAPITAL Introduction The standard methodology Exploring dependence between returns Copula-based dependence coeļ¬ƒcients (clustering distances) Empirical convergence rates Beyond dependence: a (copula,margins) representation In Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26 - July 1, 2012, 2012. A Sklar. Fonctions de rĀ“epartition `a n dimensions et leurs marges. UniversitĀ“e Paris 8, 1959. Gautier Marti Clustering CDS: algorithms, distances, stability and convergence r