SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Project Report:
Statistical Arbitrage
Shubham Patil • 19-03-2020
Eckovation
Machine Learning
Overview
Deep Learning Strategy
● RNN Overview
● Feature and Label Generation
● Model Formation
● Strategy
● Results
Statistical Arbitrage Strategy
● Statistical Arbitrage Overview
● Finding Correlated Pairs
● Stochastic Control
● Parameter Tuning
● Results
Recurrent Neural Networks (RNN)
What is RNN?
● Family of Neural Network
specialized for sequence
data.
● ‘Many-to-One’ architecture.
● ‘Vanilla’ vs. Long short-term
memory (LSTM)
RNN: Feature and Label Generation
Feature
● Bid/Ask Prices and Spread
(10 levels)
● Volumes (10 levels)
● Mean Prices and Volumes
● Accumulated Price and
Volume Differences
● Price and Volume
Changes
● Order Imbalance Changes
● VWAP
Label Generation
● Mid-Price Movement
● Volume Weighted Average
Price (VWAP) Movement
● Settled on classifying
VWAP movement over the
next time ‘window’
RNN: Model Formation
Model:
● Cost Function: Weighted Cross
Entropy ○ Helps solve challenge
of having an imbalanced dataset
● Output: Softmax Layer
Outputs a predicted probability
for each label
● Unit: LSTM
Long short-term memory (LSTM)
units to model longer term
dependencies
Hyperparameter:
● Number of Units
● Prediction Window for
Label
● Trade Probability Cutoff
● Cross Entropy Weights
● Other (e.g. Learning Rate,
Dropout)
RNN: Strategy
Statistical Arbitrage
Strategy
Baseline model
Linearly regress the mid-price returns of a pair of historically
correlated stocks.
This type of trading strategy assigns stocks a desirability ranking
and then constructs a portfolio to reduce risk as much as
possible.
Statistical arbitrage is heavily reliant on computer models and
analysis and is known as one of the most rigorous approach to
investing.
Example of execution process
What's new?
● Identifying most correlated pairs to trade
● Stochastic control to incorporate dynamically optimal
thresholds
● Hyperparameter tuning (frequency, training size,
leverage, etc.)
Parameter Tuning
Two approaches to parameter tuning:
● Grid search
○ Systematic exploration
○ Enables for sensitivity analysis
○ Inefficient
● Random search
○ Black-box method
○ Explore larger subspace
Validation set
● Evaluated 4 models
a. Fixed thresholds, pairs picked by performance
b. Fixed thresholds, pairs picked by correlation
c. Stochastic control thresholds, pairs picked by
performance d. Stochastic control thresholds, pairs picked
by correlation
● Model
a. performed best on validation set
Test Set Result
Future Work
To trade based on factorsfrom PCA eigen portfolio and its
eigenvalues:
○ take a variable number of eigenvectors, truncate to explain
a given percentage of the total variance of the system
Implement a more dynamic strategy
○ Using the correlation from yesterday to decide which pairs
to trade today.
○ Or observe the market for a couple of hours and then start
trading based on earlier correlation
Thank You.
Shubham Balasaheb Patil
Eckovation
Machine Learning

Weitere ähnliche Inhalte

Ähnlich wie Statistical Arbitrage

Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...Universitat Politècnica de Catalunya
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionKishor Datta Gupta
 
Ad Click Prediction - Paper review
Ad Click Prediction - Paper reviewAd Click Prediction - Paper review
Ad Click Prediction - Paper reviewMazen Aly
 
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016MLconf
 
MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkSigOpt
 
Query optimization in Apache Tajo
Query optimization in Apache TajoQuery optimization in Apache Tajo
Query optimization in Apache TajoJihoon Son
 
Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...
Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...
Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...Christopher Sneed, MSDS, PMP, CSPO
 
CART Classification and Regression Trees Experienced User Guide
CART Classification and Regression Trees Experienced User GuideCART Classification and Regression Trees Experienced User Guide
CART Classification and Regression Trees Experienced User GuideSalford Systems
 
Application_of_Deep_Learning_Techniques.pptx
Application_of_Deep_Learning_Techniques.pptxApplication_of_Deep_Learning_Techniques.pptx
Application_of_Deep_Learning_Techniques.pptxKiranKumar918931
 
Buy and Hold Strategy
Buy and Hold StrategyBuy and Hold Strategy
Buy and Hold StrategyK.D Manshadi
 
Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...
Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...
Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...Andrew Kreimer
 
A brief introduction to Searn Algorithm
A brief introduction to Searn AlgorithmA brief introduction to Searn Algorithm
A brief introduction to Searn AlgorithmSupun Abeysinghe
 
Mathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementMathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementZehra Kendir
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsScott Clark
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsSigOpt
 
Implementing cost accounting tools
Implementing cost accounting tools Implementing cost accounting tools
Implementing cost accounting tools Ahmad Nadeem Syed
 
Algorithmic pricing: Forecasting and Pricing
Algorithmic pricing: Forecasting and PricingAlgorithmic pricing: Forecasting and Pricing
Algorithmic pricing: Forecasting and PricingTofigh Naghibi
 
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowAdvanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowDatabricks
 
Study and development of methods and tools for testing, validation and verif...
 Study and development of methods and tools for testing, validation and verif... Study and development of methods and tools for testing, validation and verif...
Study and development of methods and tools for testing, validation and verif...Emilio Serrano
 

Ähnlich wie Statistical Arbitrage (20)

Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detection
 
Ad Click Prediction - Paper review
Ad Click Prediction - Paper reviewAd Click Prediction - Paper review
Ad Click Prediction - Paper review
 
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
 
MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott Clark
 
Query optimization in Apache Tajo
Query optimization in Apache TajoQuery optimization in Apache Tajo
Query optimization in Apache Tajo
 
Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...
Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...
Presentation - Predicting Online Purchases Using Conversion Prediction Modeli...
 
CART Classification and Regression Trees Experienced User Guide
CART Classification and Regression Trees Experienced User GuideCART Classification and Regression Trees Experienced User Guide
CART Classification and Regression Trees Experienced User Guide
 
Application_of_Deep_Learning_Techniques.pptx
Application_of_Deep_Learning_Techniques.pptxApplication_of_Deep_Learning_Techniques.pptx
Application_of_Deep_Learning_Techniques.pptx
 
Buy and Hold Strategy
Buy and Hold StrategyBuy and Hold Strategy
Buy and Hold Strategy
 
Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...
Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...
Ensemble Trend Classification in the Foreign Exchange Market Using Class Vari...
 
A brief introduction to Searn Algorithm
A brief introduction to Searn AlgorithmA brief introduction to Searn Algorithm
A brief introduction to Searn Algorithm
 
Mathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementMathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer Management
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning Models
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning Models
 
Implementing cost accounting tools
Implementing cost accounting tools Implementing cost accounting tools
Implementing cost accounting tools
 
Algorithmic pricing: Forecasting and Pricing
Algorithmic pricing: Forecasting and PricingAlgorithmic pricing: Forecasting and Pricing
Algorithmic pricing: Forecasting and Pricing
 
Advanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflowAdvanced Hyperparameter Optimization for Deep Learning with MLflow
Advanced Hyperparameter Optimization for Deep Learning with MLflow
 
rerngvit_phd_seminar
rerngvit_phd_seminarrerngvit_phd_seminar
rerngvit_phd_seminar
 
Study and development of methods and tools for testing, validation and verif...
 Study and development of methods and tools for testing, validation and verif... Study and development of methods and tools for testing, validation and verif...
Study and development of methods and tools for testing, validation and verif...
 

Kürzlich hochgeladen

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 

Kürzlich hochgeladen (20)

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 

Statistical Arbitrage

  • 1. Project Report: Statistical Arbitrage Shubham Patil • 19-03-2020 Eckovation Machine Learning
  • 2. Overview Deep Learning Strategy ● RNN Overview ● Feature and Label Generation ● Model Formation ● Strategy ● Results Statistical Arbitrage Strategy ● Statistical Arbitrage Overview ● Finding Correlated Pairs ● Stochastic Control ● Parameter Tuning ● Results
  • 3. Recurrent Neural Networks (RNN) What is RNN? ● Family of Neural Network specialized for sequence data. ● ‘Many-to-One’ architecture. ● ‘Vanilla’ vs. Long short-term memory (LSTM)
  • 4. RNN: Feature and Label Generation Feature ● Bid/Ask Prices and Spread (10 levels) ● Volumes (10 levels) ● Mean Prices and Volumes ● Accumulated Price and Volume Differences ● Price and Volume Changes ● Order Imbalance Changes ● VWAP Label Generation ● Mid-Price Movement ● Volume Weighted Average Price (VWAP) Movement ● Settled on classifying VWAP movement over the next time ‘window’
  • 5. RNN: Model Formation Model: ● Cost Function: Weighted Cross Entropy ○ Helps solve challenge of having an imbalanced dataset ● Output: Softmax Layer Outputs a predicted probability for each label ● Unit: LSTM Long short-term memory (LSTM) units to model longer term dependencies Hyperparameter: ● Number of Units ● Prediction Window for Label ● Trade Probability Cutoff ● Cross Entropy Weights ● Other (e.g. Learning Rate, Dropout)
  • 8. Baseline model Linearly regress the mid-price returns of a pair of historically correlated stocks. This type of trading strategy assigns stocks a desirability ranking and then constructs a portfolio to reduce risk as much as possible. Statistical arbitrage is heavily reliant on computer models and analysis and is known as one of the most rigorous approach to investing.
  • 10. What's new? ● Identifying most correlated pairs to trade ● Stochastic control to incorporate dynamically optimal thresholds ● Hyperparameter tuning (frequency, training size, leverage, etc.)
  • 11. Parameter Tuning Two approaches to parameter tuning: ● Grid search ○ Systematic exploration ○ Enables for sensitivity analysis ○ Inefficient ● Random search ○ Black-box method ○ Explore larger subspace
  • 12. Validation set ● Evaluated 4 models a. Fixed thresholds, pairs picked by performance b. Fixed thresholds, pairs picked by correlation c. Stochastic control thresholds, pairs picked by performance d. Stochastic control thresholds, pairs picked by correlation ● Model a. performed best on validation set
  • 14. Future Work To trade based on factorsfrom PCA eigen portfolio and its eigenvalues: ○ take a variable number of eigenvectors, truncate to explain a given percentage of the total variance of the system Implement a more dynamic strategy ○ Using the correlation from yesterday to decide which pairs to trade today. ○ Or observe the market for a couple of hours and then start trading based on earlier correlation
  • 15. Thank You. Shubham Balasaheb Patil Eckovation Machine Learning