Stock Market Prediction
Using Data Mining
By
Shivakumar Soppannavar
CMPE 239
Under the Guidance of
Prof. Eirinaki Magdalini
11/10/2015
Various machine learning algorithms are used to predict stock market movements.
Text and Data Mining (TDM) is applied to text from different sources to extract patterns,
information, or hidden data of interest that help predict the ups and downs of the targeted
stocks.
Then
Data Mining Isn't a Good Bet For Stock-Market Predictions [2]
Aug. 8, 2009 - Jason Zweig, Wall Street Journal
Now
How Traders Are Using Text and Data Mining to Beat the Market [3]
Feb. 12, 2015 - Market Roy Kaufman, The Street
Applying Machine Learning to Stock Market Trading - Bryce Taylor [1]
Machine learning algorithm to read headlines from financial news magazines and
make predictions on the directional change of stock prices after a moderate-length
time interval
[Stanford Student project 2013, CS 229]
Introduction
Data Sources and Research question
Twitter data to predict stock market changes
Change in management, M&A
Intermittent headlines to react to the first headlines up or down ???
Data sources:
• Headlines from financial analysts
http://seekingalpha.com/
• Historic stock prices
http://www.nasdaq.com/
• 7 targeted companies
IBM, NFLX, GOOG, ANF, MCD, SHLD, AAPL
Research Questions:
“Given a headline released today about some company X, will the stock price of X
rise by more than P percent over the next time period T?”
T = 3 months
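The research question above amounts to a binary labeling rule for each headline. A minimal sketch of that rule follows; the function name, argument names, and prices are hypothetical and chosen only for illustration, and the slides do not say how the original study computed its labels.

```python
def label_headline(price_at_release, price_after_t, p_percent):
    """Return 1 if the price rose by more than p_percent over period T, else 0."""
    if price_at_release <= 0:
        raise ValueError("price_at_release must be positive")
    change_percent = (price_after_t - price_at_release) / price_at_release * 100.0
    return 1 if change_percent > p_percent else 0


# Hypothetical prices for illustration only (not data from the study)
print(label_headline(100.0, 106.0, 5.0))  # 1: a 6% rise exceeds P = 5
print(label_headline(100.0, 103.0, 5.0))  # 0: a 3% rise does not
```

Raising P makes positive labels rarer, which is worth keeping in mind when reading the error rates reported later.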
Bayesian Classifiers
Bayesian Classifier
• Simple multinomial Bayesian classifier that analyzes the headlines based on the
presence of each token in the headline
• 51,202 tokens -> Laplace smoothing -> 693 tokens -> top 10 tokens
• Classification error for reduced features < 0.5
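A token-presence Naive Bayes classifier with Laplace (add-one) smoothing can be sketched in plain Python as below. This is an illustrative sketch, not the original study's code: the toy headlines, the function names, and the log-likelihood-ratio ranking used to pick the "most indicative" tokens are all assumptions.

```python
import math
from collections import Counter


def train_naive_bayes(headlines, labels, alpha=1.0):
    """Fit a multinomial Naive Bayes model with add-alpha (Laplace) smoothing."""
    vocab = sorted({tok for h in headlines for tok in h.lower().split()})
    classes = sorted(set(labels))
    token_counts = {c: Counter() for c in classes}
    doc_counts = Counter(labels)
    for text, label in zip(headlines, labels):
        # set(): count each token once per headline (presence, not frequency)
        token_counts[label].update(set(text.lower().split()))
    log_prior = {c: math.log(doc_counts[c] / len(labels)) for c in classes}
    log_likelihood = {}
    for c in classes:
        total = sum(token_counts[c].values()) + alpha * len(vocab)
        log_likelihood[c] = {
            t: math.log((token_counts[c][t] + alpha) / total) for t in vocab
        }
    return log_prior, log_likelihood


def predict(model, headline):
    """Return the class with the highest posterior log-score."""
    log_prior, log_likelihood = model
    scores = {}
    for c in log_prior:
        score = log_prior[c]
        for tok in set(headline.lower().split()):
            if tok in log_likelihood[c]:  # tokens unseen in training are ignored
                score += log_likelihood[c][tok]
        scores[c] = score
    return max(scores, key=scores.get)


def most_indicative(model, k, positive=1, negative=0):
    """Rank tokens by log-likelihood ratio between the two classes (an assumed criterion)."""
    _, ll = model
    ratio = {t: ll[positive][t] - ll[negative][t] for t in ll[positive]}
    return sorted(ratio, key=ratio.get, reverse=True)[:k]


# Toy, made-up headlines; label 1 means the price rose by more than P percent
headlines = [
    "profits surge on strong sales",
    "record profits beat forecast",
    "shares plunge after weak sales",
    "lawsuit fears hit shares",
]
labels = [1, 1, 0, 0]
model = train_naive_bayes(headlines, labels)
print(predict(model, "strong profits forecast"))
print(most_indicative(model, 1))
```

Keeping only the top-ranked tokens mirrors the feature reduction on the slide (51,202 -> 693 -> 10), although the exact selection rule used there is not stated.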
Precision/Recall
• Increasing P increases the positive error and decreases the negative error
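The sketch below shows one way to compute these quantities. It assumes that "positive error" means the fraction of truly positive examples that are misclassified and "negative error" the fraction of truly negative examples that are misclassified; the slides do not define the terms, and the labels used here are invented.

```python
def error_rates(y_true, y_pred):
    """Compute precision, recall, and per-class error rates for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return {
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        # Assumed definitions, see the note above
        "positive_error": fn / (tp + fn) if tp + fn else 0.0,
        "negative_error": fp / (fp + tn) if fp + tn else 0.0,
    }


# Made-up labels: 3 true positives in the data, 5 true negatives
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 1]
rates = error_rates(y_true, y_pred)
print(rates)
```

When P is raised, positive examples become scarcer, so a classifier that leans towards predicting "no rise" would show a low negative error and a high positive error, consistent with the trend noted above.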
Support Vector Machines
• SVMs (polynomial, linear, etc.) were used on the reduced data set but didn’t beat the
result obtained from the Bayesian classifier
Naïve Bayes Testing Error
Table 1: Bayesian classifier results, run for the top 10 most indicative symbols
A few more ways of analysis!
Natural Language Processing
• Stanford has a publicly available Natural Language Processing toolkit that
provides sentiment analysis of sentences with high accuracy (>80%)
• Using NLP didn’t achieve much success
• Natural language processors would need to be specifically tailored to processing
headline-like data to be able to make a meaningful contribution towards
answering my research questions.
Principal Component Analysis
• Principal component analysis was run on the data, and linear SVMs were then tested on
several of the top principal components.
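As a rough illustration of the PCA step, the sketch below estimates the first principal component of a small feature matrix by power iteration and projects a row onto it. The data, function names, and the choice of power iteration are assumptions for illustration; the slides do not say how the components were computed. The projected values would then serve as inputs to a linear classifier.

```python
import math


def top_principal_component(rows, iterations=100):
    """Estimate the first principal component by power iteration."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d)
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]
    # Deterministic, non-uniform start vector (an arbitrary choice), normalized
    v = [float(j + 1) for j in range(d)]
    start_norm = math.sqrt(sum(x * x for x in v))
    v = [x / start_norm for x in v]
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # no variance along the current estimate; keep it as-is
        v = [x / norm for x in w]
    return means, v


def project(row, means, component):
    """Project a centered row onto the component."""
    return sum((row[j] - means[j]) * component[j] for j in range(len(row)))


# Made-up 2-feature data lying on the line y = x
rows = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]
means, component = top_principal_component(rows)
print(component)
print(project([3.0, 3.0], means, component))
```

Further components could be obtained by removing the first component's contribution from the covariance matrix and repeating; that step is omitted here for brevity.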
Manual Keyword Selection
• Keywords are selected manually
A few more ways of analysis: results
Principal Component Analysis; Manual Keyword Selection
Conclusion
A model able to beat overall market trends by reading financial news
headlines cannot easily be found without fairly sophisticated, human-like processing
of the headlines. –By Author
Examples:
Tweet on a credit card breach at Home Depot (HD) -> stock down 2%. (9/2/2014) [3]
Nate Silver's uncannily accurate predictions of the U.S. national elections. (2012) [3]
Yes: by using Text and Data Mining and better algorithms, we may in the near future be
able to predict the stock market with greater accuracy.
Thank you
References
1. B. Taylor (2013). “Applying Machine Learning to Stock Market Trading”. Retrieved from
Stanford CS229 project lists 2013.
http://cs229.stanford.edu/proj2013/Taylor-Applying%20Machine%20Learning%20to%20Stock%20Market%20Trading.pdf
2. J. Zweig (Aug. 8, 2009). “Data Mining Isn't a Good Bet For Stock-Market Predictions”.
Retrieved from Wall Street Journal website.
http://www.wsj.com/articles/SB124967937642715417
3. M. R. Kaufman (Feb. 12, 2015). “How Traders Are Using Text and Data Mining to Beat the
Market”. Retrieved from The Street website.
http://www.thestreet.com/story/13044694/2/how-traders-are-using-text-and-data-mining-to-beat-the-market.html
4. http://cs229.stanford.edu/projects2013.html


Editor's Notes

  1. Text mining is the data analysis of natural language works (articles, books, etc.), using text as a form of data. It is often joined with data mining, the numeric analysis of data works (like filings and reports), and referred to as "text and data mining" or, simply, "TDM." [3]
  2. https://en.wikipedia.org/wiki/Laplacian_smoothing
     Support vector machines (SVMs) are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis.