SlideShare ist ein Scribd-Unternehmen logo
1 von 41
A Dream of Predicting Elections 
and Trading Stocks using Twitter 
@yelenamm Yelena Mejova 
Yet Another Conference 
Moscow Nov 30 2014
Money and Power 
Financial Indexes Political Opinion 
Movie box office sales 
Consumer confidence 
Dow Jones Industrial Average 
Individual stocks 
Political leaning 
Polarization 
User classification 
Predicting elections!
More… 
CIKM 2013 Tutorial 
TWITTER AND THE REAL WORLD 
with Ingmar Weber 
https://sites.google.com/site/twitterandtherealworld/home 
Finance, Politics, Public Health, Event Detection
Can I get rich on the stock market?
Answer: NO 
• Efficient Market Hypothesis: 
– Financial markets are information efficient: 
prices fully reflect all available information 
– Cannot be predicted 
JUST AS WELL
Answer: NO MAYBE? 
A non-random walk down Wall Street 
(1999) Lo & MacKinlay 
• Behavioral Economics: 
overconfidence, overreaction, 
information bias… 
• Insider trading, governmental 
manipulation… 
• Speculative bubbles: information be 
damned! 
• Bitcoin: where is the value? 
– pure bubble
http://nymag.com/daily/intelligencer/2013/04/ http://dataminr.com/ 
bloombergs-vip-terminal-tweeters.html 
2. specialized providers 3. data analytics 
Self-reported Gains http://www.caymanatlantic.com/ 
1. content providers 
http://gnip.com/ 
4. traders
Movies 
Predicting the Future with Social Media 
@sitaramasur Asur, Huberman @ WI-IAT 2010 
Hollywood Stock 
Exchange 
• 2.89 million tweets 
• 24 movies 
Correl (tweet rate 
& box office gross) = 0.90 
using previous week’s tweets 
to predict weekend box office gross: 
Adj R2 = 0.973 
…and sentiment (positive/negative) score to 
predict second weekend box office gross: 
Adj R2 = 0.94 
least squares linear regression 
using previous week’s HSX scores 
to predict weekend box office gross: 
Adj R2 = 0.967
Consumer Confidence 
From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series 
@brendan642 O’Connor, Balasubramanyan, Routledge, Smith @ ICWSM (2011) 
• Index of Consumer Sentiment (ICS) (Reuters/UMich) 
• Economic Confidence Index (ECI) (Gallup) 
• Subjectivity Lexicon: Opinion Finder 
• High day-to-day volatility. 
• Average last k days. 
• Keyword “jobs” 
k = 1, 7, 30 
• @ k=15 correlates with 
ECI (Gallup) at r = 0.731 
[some figures from authors’ original slides]
Consumer Confidence 
From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series 
@brendan642 O’Connor, Balasubramanyan, Routledge, Smith @ ICWSM (2011) 
• Predicting 1 month in 
the future using 
previous 15 days 
• Correlation with 
Gallup poll: 
– Twitter model: 77.5% 
– Poll model: 80.4% 
• As Twitter grows, so is 
its accuracy
Twitter mood predicts the stock market 
@jlbollen Bollen, Mao, Zeng @ Journal of Computational Science (2011) 
Twitter 2008 (~10M tweets) 
DJIA 
• Opinion Finder: positive / negative 
• GPOMS: calm, alert, sure, vital, kind and happy 
[some figures from authors’ original slides] 
888 citations! 
Slight correlation only 
with Calm GPOMS 
mood (0.065 at 6 day 
lag)
Stocks Tweets and trades: The information content of stock microblogs 
@timmsprenger Sprenger, Tumasjan, Sandner, Welpe 
@ European Financial Management (2013) 
• Tracking stocks $STOCK
Stocks 
Tweets and trades: The information content of stock microblogs 
@timmsprenger Sprenger, Tumasjan, Sandner, Welpe 
@ European Financial Management (2013) 
• Tweets: Jan 1 – Jun 30, 2010 
• S&P100 companies using $STOCK (price change & volume) 
• Naïve Bayes classifier trained on 2,500 tweets 
(buy/sell/hold): 81.2% accuracy 
-0.022 p<0.05 
BULLISH STOCK RETURNS 
0.091 p<0.001 
0.073 p<0.001 
VOLUME TRADING VOLUME 
0.312 p<0.001 
1.5% posted 53.7% of all messages 
– Their quality is not much better!
Stocks 
Correlating financial time series with micro-blogging activity 
Ruiz, Hristidis, Castillo, Gionis, Jaimes @ WSDM (2012) 
• Twitter: Jan 1 – Jun 30, 2010 
• 150 (randomly selected) companies in S&P 500 
= 
– Daily relative price change 
– Traded volume normalized by mean traded volume for 
that company for entire time period 
represent tweets 
as a GRAPH 
[some figures from authors’ original slides] 
constrain graph to a company 
and a time window 
+ similarity nodes 
connecting very 
similar tweets (RTs) 
using Jaccard distance
Trading Simulation 
[some figures from authors’ original slides]
Correlating financial time series with micro-blogging activity 
Ruiz, Hristidis, Castillo, Gionis, Jaimes @ WSDM (2012) 
• the only one that obtains 
a profit during which the 
Dow Jones fell -5.8% 
• Best performance for vector auto-regression 
with the number of 
connected components 
proposed
Don’t fire your stock broker yet 
High-Speed 
Trading No 
Longer Hurtling 
Forward 
http://www.nytimes.com/interactive/20 
12/10/15/business/Declining-US-High- 
Frequency-Trading.html?ref=business 
Computer Flaws 
Get Wry Smile 
From Humans 
Displaced 
http://dealbook.nytimes.com/2013/09/19/com 
puter-flaws-get-wry-smile-from-humans-displaced/? 
ref=highfrequencyalgorithmictrading 
How a Trading 
Algorithm Went 
Awry 
http://online.wsj.com/article/SB10 
0014240527487040293045755263 
90131916792.html
Can we track & 
predict political 
sentiment?
Elections 
“the crowning of the Internet as the king of all political 
media” 
“the beginning of the Internet presidency” 
- on Obama's 2008 victory 
Mitch Wagner, InformationWeek 
Transparency 
“Instantaneous tweeting of shady government 
practices -- and the resulting uproar -- means that 
public bodies are more responsive than ever”. 
- Wesley Donehue, CNN 
Mobilization 
“This exercise of power has produced a template for 
political action on a massive scale fueled by social 
media.” 
- on PIPA and SOPA 
Vivek Wadhwa, Washington Post 
bloggeruniversity.wordpress.com
US politics 
• Most research will be presented 
• Clear left/right distinction 
• Popular political figures 
• High(ish) Twitter engagement REPUBLICAN 
(right) 
DEMOCRAT 
(left)
lets talk politics 
• Sampling Twitter for political speech 
– general keywords: #current 
– event keywords: #debate08, #tweetdebate 
– people: obama, romney, merkel 
– parties: democrat, republican, pirate 
– accounts: wefollow, twellow 
– news stories, known URL retweets 
• Caveats 
– requires expert knowledge 
– known best after the event 
– selection bias (who do you want to ignore?)
political leaning classification 
1. Text (text classification) 
2. Network (label propagation)
political leaning classification 
Predicting the political alignment of twitter users 
@vagabondjack Conover, Gonçalves, Ratkiewicz, Flammini, Menczer @ SocialCom (2011) 
• Bootstrapped hashtag-based sample of political discussion 
• Gardenhose Sep 14 - Nov 4, 2010 
• Classes: right, left, ambiguous 
TEXT-BASED 
• remove stopwords, hashtags, mentions, urls, all words occurring 
once in the corpus 
• TFIDF weighting: 
HASHTAG-BASED 
• remove hashtags used by only one user
political leaning classification 
Predicting the political alignment of twitter users 
@vagabondjack Conover, Gonçalves, Ratkiewicz, Flammini, Menczer @ SocialCom (2011) 
NETWORK-BASED 
• Label propagation 
– Initialize cluster membership 
arbitrarily 
– Iteratively update each node’s 
label according to the majority of 
its neighbors 
– Ties are broken randomly 
• Cluster assignment by majority 
cluster label (using manually 
labeled data) 
retweet network
political leaning classification 
Predicting the political alignment of twitter users 
@vagabondjack Conover, Gonçalves, Ratkiewicz, Flammini, Menczer @ SocialCom (2011) 
• Classifier: Support Vector Machine
political leaning classification 
Political hashtag hijacking in the US 
Hadgu, Garimella, Weber @ WWW (2013) 
SEED-BASED (highly precise) 
1. Start with few seed users of known leaning 
2. The leaning of their followers is determined by which side 
they retweet more 
3. Propagate users’ leaning to their tweets/hashtags/etc 
hashtag accuracy: 98.6%, 93%, 90% (by source)
political leaning classification 
Visualizing media bias through Twitter 
@JisunAn An, Cha, Gummadi, Crowcroft, Quercia @ AAAI (2012) 
• Position news sources in leaning by considering the 
overlap in common audience (followers on Twitter) 
Correlates with ADA (Americans 
for Democratic Action score): 
– Spearman rank order 
correlation: .44 
– Pearson product-moment 
correlation coefficient: .51 
Jaccard similarity 
of their audience 
distance between (co-subscribers) 
two media
political leaning classification 
Russia, Ukraine, and the West: Social Media Sentiment in 
• Nov 21, 2013 – Feb 26, 2014 
• Classifier labeled to identify pro-and 
anti- protest sentiment 
• Twitter, blogs, news, forums, 
Facebook 
the Euromaidan Protests 
@bretling Etling @ Berkman Center Research (2014) 
US & UK Russia Ukraine 
Does it reflect the overall 
sentiment of the people?
look who’s talking 
Vocal Minority versus Silent Majority: Discovering the Opinions of the Long Tail 
@enimust Mustafaraj, Finn, Whitlock, Metaxas @ SocialCom (2011) 
• 2010 US Senate special election in 
Massachusetts 
• Silent majority & vocal minority 
tweet differently (different 
agendas?) 
• Spamming, fake grassroots 
movements 
number of tweets per user
look who’s talking 
Detecting and Tracking Political Abuse in Social Media 
Ratkiewicz, Conover, Meiss, Goncalves, Flammini, Menczer @ ICWSM (2011) 
• Truthiness is a quality characterizing a "truth" that a person making 
an argument or assertion claims to know intuitively "from the gut" 
or because it "feels right" without regard to evidence, logic, 
intellectual examination, or facts. 
Classifying memes for astroturf 
Truthy project by Indiana University
look who’s talking 
#ampat @PeaceKaren_25 & 
@HopeMarie_25 
gopleader.gov Chris Coons 
#Truthy @senjohnmccain on.cnn.com/aVMu5y “Obama said…” 
LEGITIMATE TRUTHY 
Detecting and Tracking Political Abuse in Social Media 
Ratkiewicz, Conover, Meiss, Goncalves, Flammini, Menczer @ ICWSM (2011)
elections 
Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment 
Tumasjan, Sprenger, Sandner, Welpe @ AAAI (2010) 
• 2009 German federal elections 
sentiment profiles of leading candidates in 
tweets mentioning them (using LIWC2007) “The mere number of tweets reflects 
voter preferences and comes close to 
traditional election polls” 
CONTROVERSY! 
638 citations!
elections 
Why the Pirate Party won the German election of 2009 or the trouble with predictions: A 
response to Tumasjan, Sprenger, Sander, & Welpe, "Predicting elections with twitter: What 
140 characters reveal about political sentiment" 
@ajungherr Jungherr, Jürgens, Schoen @ SSCR V30/N2 (2012) 
“show that the results of TSSW are contingent on arbitrary choices of the authors” 
Choice of Parties Choice of Dates 
If results of polls played a role in 
deciding upon the inclusion of particular 
parties, the TSSW method is dependent 
on public opinion surveys 
prediction analysis […] between [13.9] 
and [27.9], the day of the election, 
produces a MAE of of 2.13, significantly 
higher than the MAE for TSSW
• Non-US elections: 
elections 
– Irish: On using twitter to monitor political sentiment and predict election 
results, Bermingham, Smeaton (2011) 
• "Our approach however has demonstrated an error which is not competitive with the 
traditional polling methods.” 
– Dutch: Predicting the 2011 Dutch senate election results with twitter, Sang, 
Bos (2012) 
• Uses polls for demographic imbalances, yet performance still below traditional polls 
– Singapore: Tweets and votes: A study of the 2011 singapore general election, 
Skoric, Poor, Achananuparp, Lim, Jiang (2012) 
• Not as accurate as traditional polls, performance at local government levels 
– New Zealand: Can Social Media Predict Election Results? Evidence from New 
Zealand, Michael P. Cameron (2013) 
• “the size of the effect is small and it appears that social media presence will therefore 
only make a difference in closely contested elections” 
– many more coming out each day!
! " #$" %#&' ! ! " ( 
! "#$%&' (#)#&'%* +, (- %' . (/ - %' ' #" 
! "# "$%&' (&)*+,$' -. *&/ -+0",$"' 1%&2%"13&45"$$+- 
6. 1"+*&7. 8' 9: ;+**' &<!"#$%&'()=& >1";?&' (&@; "+0' &<AB. "1= 
/ . 1. 3"' $"%&4?&C+$.D.%&<!*'+,-./0*'1'-=& E+**+%*+8&F' **+3+&<>A: = 
)1"&C2%$. (. -.G&<!02,/3-*=& E+**+%*+8&F' **+3+&<>A: = 
Metaxas et al. @ SocialCom (2011)
elections 
How (Not) To Predict Elections @takis_metaxas Metaxas et al. @ SocialCom (2011) 
• A method of prediction should be an algorithm 
finalized before the election 
– specify data collection, cleaning, analysis, interpretation… 
• Data from social media are fundamentally different 
than data from natural phenomena 
– people change their behavior next time around 
– spammers & activists will try to take advantage 
• From a testable theory on why and when it predicts 
(avoid self-deception!) 
• (maybe) Learn from professional pollsters 
– tweet ≠ user 
– user ≠ eligible voter 
– eligible voter ≠ voter 
[from authors’ original slides]
What now? 
Now-casting  Fore-casting 
Show improvement over baseline 
or that you could make money / a difference 
Publish a paper: let us know! 
(or go to Wall Street / Political Thinktank )
thank you 
Yelena Mejova 
@yelenamm 
ymejova@qf.org.qa
day of the week market index 
Fixed-effects panel regressions at 1 and 2 day lags 
1. Bullishness is affected more strongly by returns than vice versa 
2. Message volume predicts trading volume 
3. … but high trading volume and volatility predict message volume 
more 
4. Agreement among traders leads to lower trading volumes

Weitere ähnliche Inhalte

Was ist angesagt?

Governmental trust final report_ver.1.0
Governmental trust final report_ver.1.0Governmental trust final report_ver.1.0
Governmental trust final report_ver.1.0Juyoung An
 
Grasso Fake News Final Presentation
Grasso Fake News Final PresentationGrasso Fake News Final Presentation
Grasso Fake News Final PresentationBrian Grasso
 
Fake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sitesFake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sitesPetter Bae Brandtzæg
 
2016Election Final Draft
2016Election Final Draft2016Election Final Draft
2016Election Final DraftSarah Abel
 
Are Americans worried about the NSA?
Are Americans worried about the NSA? Are Americans worried about the NSA?
Are Americans worried about the NSA? AEI
 
Cyber-enabled Information Operations 4-27-17 -- Senate Armed Service Commi...
Cyber-enabled Information Operations   4-27-17  -- Senate Armed Service Commi...Cyber-enabled Information Operations   4-27-17  -- Senate Armed Service Commi...
Cyber-enabled Information Operations 4-27-17 -- Senate Armed Service Commi...David Sweigert
 
FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...
FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...
FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...Nathan J Stone
 
DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017
DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017
DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017Janet Parkinson
 
Social Media and Fake News in the 2016 Election
Social Media and Fake News in the 2016 ElectionSocial Media and Fake News in the 2016 Election
Social Media and Fake News in the 2016 ElectionAjay Ohri
 
Are Filter Bubbles Real?
Are Filter Bubbles Real?Are Filter Bubbles Real?
Are Filter Bubbles Real?Axel Bruns
 
Analysis of sources and effect of fake news on society
Analysis of sources and effect of fake news on society Analysis of sources and effect of fake news on society
Analysis of sources and effect of fake news on society Arpit Khurana
 
Curating and Contextualizing Twitter Stories to Assist with Social Newsgathering
Curating and Contextualizing Twitter Stories to Assist with Social NewsgatheringCurating and Contextualizing Twitter Stories to Assist with Social Newsgathering
Curating and Contextualizing Twitter Stories to Assist with Social Newsgatheringazubiaga
 
Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...
Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...
Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...Gabriela Agustini
 
"Binders full of Tweets:" Twitter, gender and the 2012 elections
"Binders full of Tweets:" Twitter, gender and the 2012 elections"Binders full of Tweets:" Twitter, gender and the 2012 elections
"Binders full of Tweets:" Twitter, gender and the 2012 electionsRachel Reis Mourao
 
Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...
Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...
Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...Matthew J. Kushin, Ph.D.
 

Was ist angesagt? (20)

NLP journal paper
NLP journal paperNLP journal paper
NLP journal paper
 
Document(2)
Document(2)Document(2)
Document(2)
 
Governmental trust final report_ver.1.0
Governmental trust final report_ver.1.0Governmental trust final report_ver.1.0
Governmental trust final report_ver.1.0
 
Grasso Fake News Final Presentation
Grasso Fake News Final PresentationGrasso Fake News Final Presentation
Grasso Fake News Final Presentation
 
Pj 2016.05.05 long_form_final
Pj 2016.05.05 long_form_finalPj 2016.05.05 long_form_final
Pj 2016.05.05 long_form_final
 
Kottler Thesis 2011
Kottler Thesis 2011Kottler Thesis 2011
Kottler Thesis 2011
 
Fake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sitesFake news and trust and distrust in fact checking sites
Fake news and trust and distrust in fact checking sites
 
2016Election Final Draft
2016Election Final Draft2016Election Final Draft
2016Election Final Draft
 
Are Americans worried about the NSA?
Are Americans worried about the NSA? Are Americans worried about the NSA?
Are Americans worried about the NSA?
 
Cyber-enabled Information Operations 4-27-17 -- Senate Armed Service Commi...
Cyber-enabled Information Operations   4-27-17  -- Senate Armed Service Commi...Cyber-enabled Information Operations   4-27-17  -- Senate Armed Service Commi...
Cyber-enabled Information Operations 4-27-17 -- Senate Armed Service Commi...
 
FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...
FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...
FINAL PRINT -Engagement in the Details - AN ANALYSIS OF READER INTERACTION AC...
 
DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017
DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017
DataSwarm: Lessons learned from Brexit and US elections - Berlin March 2017
 
Social Media and Fake News in the 2016 Election
Social Media and Fake News in the 2016 ElectionSocial Media and Fake News in the 2016 Election
Social Media and Fake News in the 2016 Election
 
Are Filter Bubbles Real?
Are Filter Bubbles Real?Are Filter Bubbles Real?
Are Filter Bubbles Real?
 
Analysis of sources and effect of fake news on society
Analysis of sources and effect of fake news on society Analysis of sources and effect of fake news on society
Analysis of sources and effect of fake news on society
 
A research paper on Twitter_Intrinsic versus image related utility in social ...
A research paper on Twitter_Intrinsic versus image related utility in social ...A research paper on Twitter_Intrinsic versus image related utility in social ...
A research paper on Twitter_Intrinsic versus image related utility in social ...
 
Curating and Contextualizing Twitter Stories to Assist with Social Newsgathering
Curating and Contextualizing Twitter Stories to Assist with Social NewsgatheringCurating and Contextualizing Twitter Stories to Assist with Social Newsgathering
Curating and Contextualizing Twitter Stories to Assist with Social Newsgathering
 
Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...
Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...
Newspaper Editors vs the Crowd: On the Appropriateness of Front Page News Sel...
 
"Binders full of Tweets:" Twitter, gender and the 2012 elections
"Binders full of Tweets:" Twitter, gender and the 2012 elections"Binders full of Tweets:" Twitter, gender and the 2012 elections
"Binders full of Tweets:" Twitter, gender and the 2012 elections
 
Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...
Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...
Intro to Sentiment Analysis: What it is, how to conduct it, and what are its ...
 

Andere mochten auch

Kак_продать Agile команде
Kак_продать Agile командеKак_продать Agile команде
Kак_продать Agile командеYandex
 
Интернет-реклама: итоги 2010, тренды 2011
Интернет-реклама: итоги 2010, тренды 2011Интернет-реклама: итоги 2010, тренды 2011
Интернет-реклама: итоги 2010, тренды 2011Yandex
 
Recession-2009: results and lessons
Recession-2009: results and lessonsRecession-2009: results and lessons
Recession-2009: results and lessonsYandex
 
Как_ответить_на_вопрос_«где?»
Как_ответить_на_вопрос_«где?»Как_ответить_на_вопрос_«где?»
Как_ответить_на_вопрос_«где?»Yandex
 
Пользователи, данные_нам_в_ощущениях
Пользователи, данные_нам_в_ощущенияхПользователи, данные_нам_в_ощущениях
Пользователи, данные_нам_в_ощущенияхYandex
 
Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...
Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...
Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...Yandex
 
К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"
К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"
К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"Yandex
 
Михаил Карпов "Планирование для продуктового менеджера"
Михаил Карпов "Планирование для продуктового менеджера"Михаил Карпов "Планирование для продуктового менеджера"
Михаил Карпов "Планирование для продуктового менеджера"Yandex
 
Обзор_про_Яндекс_как_компанию
Обзор_про_Яндекс_как_компаниюОбзор_про_Яндекс_как_компанию
Обзор_про_Яндекс_как_компаниюYandex
 
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...Yandex
 
Сильный текст для нелингвистов, Максим Ильяхов
Сильный текст для нелингвистов, Максим ИльяховСильный текст для нелингвистов, Максим Ильяхов
Сильный текст для нелингвистов, Максим ИльяховYandex
 
Эволюция мобильных устройств и поведение пользователей геосервисов
Эволюция мобильных устройств и поведение пользователей геосервисовЭволюция мобильных устройств и поведение пользователей геосервисов
Эволюция мобильных устройств и поведение пользователей геосервисовYandex
 
«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней
«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней
«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_нейYandex
 
Анализ изображений и видео. Морфологическая обработка изображений
Анализ изображений и видео. Морфологическая обработка изображенийАнализ изображений и видео. Морфологическая обработка изображений
Анализ изображений и видео. Морфологическая обработка изображенийYandex
 
Experimental marketing: why and how
Experimental marketing: why and how Experimental marketing: why and how
Experimental marketing: why and how Yandex
 
Вебинар по БЭМ: сборка и оптимизация проекта
Вебинар по БЭМ: сборка и оптимизация проектаВебинар по БЭМ: сборка и оптимизация проекта
Вебинар по БЭМ: сборка и оптимизация проектаYandex
 
Statistical Arbitrage Strategies
Statistical Arbitrage StrategiesStatistical Arbitrage Strategies
Statistical Arbitrage Strategiesguest8fde7a
 
Introduction to pairtrading
Introduction to pairtradingIntroduction to pairtrading
Introduction to pairtradingKohta Ishikawa
 

Andere mochten auch (20)

Real Options KSM Case Study
Real Options KSM Case StudyReal Options KSM Case Study
Real Options KSM Case Study
 
Kак_продать Agile команде
Kак_продать Agile командеKак_продать Agile команде
Kак_продать Agile команде
 
Интернет-реклама: итоги 2010, тренды 2011
Интернет-реклама: итоги 2010, тренды 2011Интернет-реклама: итоги 2010, тренды 2011
Интернет-реклама: итоги 2010, тренды 2011
 
Recession-2009: results and lessons
Recession-2009: results and lessonsRecession-2009: results and lessons
Recession-2009: results and lessons
 
Как_ответить_на_вопрос_«где?»
Как_ответить_на_вопрос_«где?»Как_ответить_на_вопрос_«где?»
Как_ответить_на_вопрос_«где?»
 
Пользователи, данные_нам_в_ощущениях
Пользователи, данные_нам_в_ощущенияхПользователи, данные_нам_в_ощущениях
Пользователи, данные_нам_в_ощущениях
 
Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...
Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...
Максим Никитин - Умные молекулы: от игры в крестики-нолики до адресной достав...
 
К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"
К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"
К.В. Воронцов "Орг. вопросы и методика тестирования методов обучения"
 
Михаил Карпов "Планирование для продуктового менеджера"
Михаил Карпов "Планирование для продуктового менеджера"Михаил Карпов "Планирование для продуктового менеджера"
Михаил Карпов "Планирование для продуктового менеджера"
 
Fuzzy Presentation
Fuzzy PresentationFuzzy Presentation
Fuzzy Presentation
 
Обзор_про_Яндекс_как_компанию
Обзор_про_Яндекс_как_компаниюОбзор_про_Яндекс_как_компанию
Обзор_про_Яндекс_как_компанию
 
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
 
Сильный текст для нелингвистов, Максим Ильяхов
Сильный текст для нелингвистов, Максим ИльяховСильный текст для нелингвистов, Максим Ильяхов
Сильный текст для нелингвистов, Максим Ильяхов
 
Эволюция мобильных устройств и поведение пользователей геосервисов
Эволюция мобильных устройств и поведение пользователей геосервисовЭволюция мобильных устройств и поведение пользователей геосервисов
Эволюция мобильных устройств и поведение пользователей геосервисов
 
«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней
«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней
«Яндекс_внутри»: немного_о_компании_и_организации_внутренней_жизни_в_ней
 
Анализ изображений и видео. Морфологическая обработка изображений
Анализ изображений и видео. Морфологическая обработка изображенийАнализ изображений и видео. Морфологическая обработка изображений
Анализ изображений и видео. Морфологическая обработка изображений
 
Experimental marketing: why and how
Experimental marketing: why and how Experimental marketing: why and how
Experimental marketing: why and how
 
Вебинар по БЭМ: сборка и оптимизация проекта
Вебинар по БЭМ: сборка и оптимизация проектаВебинар по БЭМ: сборка и оптимизация проекта
Вебинар по БЭМ: сборка и оптимизация проекта
 
Statistical Arbitrage Strategies
Statistical Arbitrage StrategiesStatistical Arbitrage Strategies
Statistical Arbitrage Strategies
 
Introduction to pairtrading
Introduction to pairtradingIntroduction to pairtrading
Introduction to pairtrading
 

Ähnlich wie A Dream of Predicting Elections and Trading Stocks using Twitter - Yelena Mejova, Qatar Computing Research Institute

Language of Politics on Twitter - 03 Analysis
Language of Politics on Twitter - 03 AnalysisLanguage of Politics on Twitter - 03 Analysis
Language of Politics on Twitter - 03 AnalysisYelena Mejova
 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsAmit Sheth
 
Outreach Through Social Media | Ocean Sciences 2014
Outreach Through Social Media | Ocean Sciences 2014Outreach Through Social Media | Ocean Sciences 2014
Outreach Through Social Media | Ocean Sciences 2014Christie Wilcox
 
Big data, small data and everything in between
Big data, small data and everything in betweenBig data, small data and everything in between
Big data, small data and everything in betweenWeiai Wayne Xu
 
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1crore projects
 
Election days and social media practices: Tweeting as Australia decides
Election days and social media practices: Tweeting as Australia decidesElection days and social media practices: Tweeting as Australia decides
Election days and social media practices: Tweeting as Australia decidesTim Highfield
 
The Media Zones Where People Live And How To Connect With Them
The Media Zones Where People Live And How To Connect With ThemThe Media Zones Where People Live And How To Connect With Them
The Media Zones Where People Live And How To Connect With ThemKDMC
 
Mapping Online Publics: New Methods for Twitter Research
Mapping Online Publics: New Methods for Twitter ResearchMapping Online Publics: New Methods for Twitter Research
Mapping Online Publics: New Methods for Twitter ResearchAxel Bruns
 
Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Axel Bruns
 
Twitter Data Analytics
Twitter Data AnalyticsTwitter Data Analytics
Twitter Data Analyticsrupika08
 
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
Easy Data, Hard Data? Twitter Research and the Politics of Data AccessEasy Data, Hard Data? Twitter Research and the Politics of Data Access
Easy Data, Hard Data? Twitter Research and the Politics of Data AccessAxel Bruns
 
Digging for data: opportunities and challenges in an open research landscape_...
Digging for data: opportunities and challenges in an open research landscape_...Digging for data: opportunities and challenges in an open research landscape_...
Digging for data: opportunities and challenges in an open research landscape_...Platforma Otwartej Nauki
 
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other SourcesAxel Bruns
 
Social Media and the U.S. Election: Introduction
Social Media and the U.S. Election: IntroductionSocial Media and the U.S. Election: Introduction
Social Media and the U.S. Election: IntroductionJanelle Ward
 
50,000,00 Twitter fans can't be wrong
50,000,00 Twitter fans can't be wrong50,000,00 Twitter fans can't be wrong
50,000,00 Twitter fans can't be wrongMarie Boran
 
Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....
Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....
Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....KDMC
 

Ähnlich wie A Dream of Predicting Elections and Trading Stocks using Twitter - Yelena Mejova, Qatar Computing Research Institute (20)

Language of Politics on Twitter - 03 Analysis
Language of Politics on Twitter - 03 AnalysisLanguage of Politics on Twitter - 03 Analysis
Language of Politics on Twitter - 03 Analysis
 
O'Neill: Good Connections Are Always Worth Preserving: Publishing and Social ...
O'Neill: Good Connections Are Always Worth Preserving: Publishing and Social ...O'Neill: Good Connections Are Always Worth Preserving: Publishing and Social ...
O'Neill: Good Connections Are Always Worth Preserving: Publishing and Social ...
 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
 
Outreach Through Social Media | Ocean Sciences 2014
Outreach Through Social Media | Ocean Sciences 2014Outreach Through Social Media | Ocean Sciences 2014
Outreach Through Social Media | Ocean Sciences 2014
 
Big data, small data and everything in between
Big data, small data and everything in betweenBig data, small data and everything in between
Big data, small data and everything in between
 
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
 
Election days and social media practices: Tweeting as Australia decides
Election days and social media practices: Tweeting as Australia decidesElection days and social media practices: Tweeting as Australia decides
Election days and social media practices: Tweeting as Australia decides
 
Tkclass Social Media Data for Research, Reporting
Tkclass Social Media Data for Research, ReportingTkclass Social Media Data for Research, Reporting
Tkclass Social Media Data for Research, Reporting
 
The Media Zones Where People Live And How To Connect With Them
The Media Zones Where People Live And How To Connect With ThemThe Media Zones Where People Live And How To Connect With Them
The Media Zones Where People Live And How To Connect With Them
 
Mapping Online Publics: New Methods for Twitter Research
Mapping Online Publics: New Methods for Twitter ResearchMapping Online Publics: New Methods for Twitter Research
Mapping Online Publics: New Methods for Twitter Research
 
Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?
 
Twitter Data Analytics
Twitter Data AnalyticsTwitter Data Analytics
Twitter Data Analytics
 
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
Easy Data, Hard Data? Twitter Research and the Politics of Data AccessEasy Data, Hard Data? Twitter Research and the Politics of Data Access
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
 
Digging for data: opportunities and challenges in an open research landscape_...
Digging for data: opportunities and challenges in an open research landscape_...Digging for data: opportunities and challenges in an open research landscape_...
Digging for data: opportunities and challenges in an open research landscape_...
 
Nato2015
Nato2015Nato2015
Nato2015
 
Personal. Portable. Participatory. Pervasive.
Personal. Portable. Participatory. Pervasive.Personal. Portable. Participatory. Pervasive.
Personal. Portable. Participatory. Pervasive.
 
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
‘Big Social Data’ in Context: Connecting Social Media Data and Other Sources
 
Social Media and the U.S. Election: Introduction
Social Media and the U.S. Election: IntroductionSocial Media and the U.S. Election: Introduction
Social Media and the U.S. Election: Introduction
 
50,000,00 Twitter fans can't be wrong
50,000,00 Twitter fans can't be wrong50,000,00 Twitter fans can't be wrong
50,000,00 Twitter fans can't be wrong
 
Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....
Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....
Denver Event - 2013 - New Media Ecosystem: Personal. Portable. Participatory....
 

Mehr von Yandex

Предсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of TanksПредсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of TanksYandex
 
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...Yandex
 
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров ЯндексаСтруктурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров ЯндексаYandex
 
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров ЯндексаПредставление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров ЯндексаYandex
 
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...Yandex
 
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...Yandex
 
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...Yandex
 
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...Yandex
 
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...Yandex
 
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...Yandex
 
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...Yandex
 
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...Yandex
 
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеровКак защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеровYandex
 
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...Yandex
 
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...Yandex
 
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...Yandex
 
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...Yandex
 
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...Yandex
 
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...Yandex
 
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...Yandex
 

Mehr von Yandex (20)

Предсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of TanksПредсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of Tanks
 
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
 
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров ЯндексаСтруктурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
 
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров ЯндексаПредставление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
 
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
 
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
 
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
 
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
 
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
 
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
 
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
 
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
 
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеровКак защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
 
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
 
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
 
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
 
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
 
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
 
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
 
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
 

Kürzlich hochgeladen

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Kürzlich hochgeladen (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

A Dream of Predicting Elections and Trading Stocks using Twitter - Yelena Mejova, Qatar Computing Research Institute

  • 1. A Dream of Predicting Elections and Trading Stocks using Twitter @yelenamm Yelena Mejova Yet Another Conference Moscow Nov 30 2014
  • 2.
  • 3. Money and Power Financial Indexes Political Opinion Movie box office sales Consumer confidence Dow Jones Industrial Average Individual stocks Political leaning Polarization User classification Predicting elections!
  • 4. More… CIKM 2013 Tutorial TWITTER AND THE REAL WORLD with Ingmar Weber https://sites.google.com/site/twitterandtherealworld/home Finance, Politics, Public Health, Event Detection
  • 5. Can I get rich on the stock market?
  • 6. Answer: NO • Efficient Market Hypothesis: – Financial markets are information efficient: prices fully reflect all available information – Cannot be predicted JUST AS WELL
  • 7. Answer: NO MAYBE? A non-random walk down Wall Street (1999) Lo & MacKinlay • Behavioral Economics: overconfidence, overreaction, information bias… • Insider trading, governmental manipulation… • Speculative bubbles: information be damned! • Bitcoin: where is the value? – pure bubble
  • 8. http://nymag.com/daily/intelligencer/2013/04/ http://dataminr.com/ bloombergs-vip-terminal-tweeters.html 2. specialized providers 3. data analytics Self-reported Gains http://www.caymanatlantic.com/ 1. content providers http://gnip.com/ 4. traders
  • 9. Movies Predicting the Future with Social Media @sitaramasur Asur, Huberman @ WI-IAT 2010 Hollywood Stock Exchange • 2.89 million tweets • 24 movies Correl (tweet rate & box office gross) = 0.90 using previous week’s tweets to predict weekend box office gross: Adj R2 = 0.973 …and sentiment (positive/negative) score to predict second weekend box office gross: Adj R2 = 0.94 least squares linear regression using previous week’s HSX scores to predict weekend box office gross: Adj R2 = 0.967
  • 10. Consumer Confidence From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series @brendan642 O’Connor, Balasubramanyan, Routledge, Smith @ ICWSM (2011) • Index of Consumer Sentiment (ICS) (Reuters/UMich) • Economic Confidence Index (ECI) (Gallup) • Subjectivity Lexicon: Opinion Finder • High day-to-day volatility. • Average last k days. • Keyword “jobs” k = 1, 7, 30 • @ k=15 correlates with ECI (Gallup) at r = 0.731 [some figures from authors’ original slides]
  • 11. Consumer Confidence From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series @brendan642 O’Connor, Balasubramanyan, Routledge, Smith @ ICWSM (2011) • Predicting 1 month in the future using previous 15 days • Correlation with Gallup poll: – Twitter model: 77.5% – Poll model: 80.4% • As Twitter grows, so is its accuracy
  • 12. Twitter mood predicts the stock market @jlbollen Bollen, Mao, Zeng @ Journal of Computational Science (2011) Twitter 2008 (~10M tweets) DJIA • Opinion Finder: positive / negative • GPOMS: calm, alert, sure, vital, kind and happy [some figures from authors’ original slides] 888 citations! Slight correlation only with Calm GPOMS mood (0.065 at 6 day lag)
  • 13. Stocks Tweets and trades: The information content of stock microblogs @timmsprenger Sprenger, Tumasjan, Sandner, Welpe @ European Financial Management (2013) • Tracking stocks $STOCK
  • 14. Stocks Tweets and trades: The information content of stock microblogs @timmsprenger Sprenger, Tumasjan, Sandner, Welpe @ European Financial Management (2013) • Tweets: Jan 1 – Jun 30, 2010 • S&P100 companies using $STOCK (price change & volume) • Naïve Bayes classifier trained on 2,500 tweets (buy/sell/hold): 81.2% accuracy -0.022 p<0.05 BULLISH STOCK RETURNS 0.091 p<0.001 0.073 p<0.001 VOLUME TRADING VOLUME 0.312 p<0.001 1.5% posted 53.7% of all messages – Their quality is not much better!
  • 15. Stocks Correlating financial time series with micro-blogging activity Ruiz, Hristidis, Castillo, Gionis, Jaimes @ WSDM (2012) • Twitter: Jan 1 – Jun 30, 2010 • 150 (randomly selected) companies in S&P 500 = – Daily relative price change – Traded volume normalized by mean traded volume for that company for entire time period represent tweets as a GRAPH [some figures from authors’ original slides] constrain graph to a company and a time window + similarity nodes connecting very similar tweets (RTs) using Jaccard distance
  • 16. Trading Simulation [some figures from authors’ original slides]
  • 17. Correlating financial time series with micro-blogging activity Ruiz, Hristidis, Castillo, Gionis, Jaimes @ WSDM (2012) • the only one that obtains a profit during which the Dow Jones fell -5.8% • Best performance for vector auto-regression with the number of connected components proposed
  • 18. Don’t fire your stock broker yet High-Speed Trading No Longer Hurtling Forward http://www.nytimes.com/interactive/20 12/10/15/business/Declining-US-High- Frequency-Trading.html?ref=business Computer Flaws Get Wry Smile From Humans Displaced http://dealbook.nytimes.com/2013/09/19/com puter-flaws-get-wry-smile-from-humans-displaced/? ref=highfrequencyalgorithmictrading How a Trading Algorithm Went Awry http://online.wsj.com/article/SB10 0014240527487040293045755263 90131916792.html
  • 19. Can we track & predict political sentiment?
  • 20. Elections “the crowning of the Internet as the king of all political media” “the beginning of the Internet presidency” - on Obama's 2008 victory Mitch Wagner, InformationWeek Transparency “Instantaneous tweeting of shady government practices -- and the resulting uproar -- means that public bodies are more responsive than ever”. - Wesley Donehue, CNN Mobilization “This exercise of power has produced a template for political action on a massive scale fueled by social media.” - on PIPA and SOPA Vivek Wadhwa, Washington Post bloggeruniversity.wordpress.com
  • 21. US politics • Most research will be presented • Clear left/right distinction • Popular political figures • High(ish) Twitter engagement REPUBLICAN (right) DEMOCRAT (left)
  • 22. lets talk politics • Sampling Twitter for political speech – general keywords: #current – event keywords: #debate08, #tweetdebate – people: obama, romney, merkel – parties: democrat, republican, pirate – accounts: wefollow, twellow – news stories, known URL retweets • Caveats – requires expert knowledge – known best after the event – selection bias (who do you want to ignore?)
  • 23. political leaning classification 1. Text (text classification) 2. Network (label propagation)
  • 24. political leaning classification Predicting the political alignment of twitter users @vagabondjack Conover, Gonçalves, Ratkiewicz, Flammini, Menczer @ SocialCom (2011) • Bootstrapped hashtag-based sample of political discussion • Gardenhose Sep 14 - Nov 4, 2010 • Classes: right, left, ambiguous TEXT-BASED • remove stopwords, hashtags, mentions, urls, all words occurring once in the corpus • TFIDF weighting: HASHTAG-BASED • remove hashtags used by only one user
  • 25. political leaning classification Predicting the political alignment of twitter users @vagabondjack Conover, Gonçalves, Ratkiewicz, Flammini, Menczer @ SocialCom (2011) NETWORK-BASED • Label propagation – Initialize cluster membership arbitrarily – Iteratively update each node’s label according to the majority of its neighbors – Ties are broken randomly • Cluster assignment by majority cluster label (using manually labeled data) retweet network
  • 26. political leaning classification Predicting the political alignment of twitter users @vagabondjack Conover, Gonçalves, Ratkiewicz, Flammini, Menczer @ SocialCom (2011) • Classifier: Support Vector Machine
  • 27. political leaning classification Political hashtag hijacking in the US Hadgu, Garimella, Weber @ WWW (2013) SEED-BASED (highly precise) 1. Start with few seed users of known leaning 2. The leaning of their followers is determined by which side they retweet more 3. Propagate users’ leaning to their tweets/hashtags/etc hashtag accuracy: 98.6%, 93%, 90% (by source)
  • 28. political leaning classification Visualizing media bias through Twitter @JisunAn An, Cha, Gummadi, Crowcroft, Quercia @ AAAI (2012) • Position news sources in leaning by considering the overlap in common audience (followers on Twitter) Correlates with ADA (Americans for Democratic Action score): – Spearman rank order correlation: .44 – Pearson product-moment correlation coefficient: .51 Jaccard similarity of their audience distance between (co-subscribers) two media
  • 29. political leaning classification Russia, Ukraine, and the West: Social Media Sentiment in • Nov 21, 2013 – Feb 26, 2014 • Classifier labeled to identify pro-and anti- protest sentiment • Twitter, blogs, news, forums, Facebook the Euromaidan Protests @bretling Etling @ Berkman Center Research (2014) US & UK Russia Ukraine Does it reflect the overall sentiment of the people?
  • 30. look who’s talking Vocal Minority versus Silent Majority: Discovering the Opinions of the Long Tail @enimust Mustafaraj, Finn, Whitlock, Metaxas @ SocialCom (2011) • 2010 US Senate special election in Massachusetts • Silent majority & vocal minority tweet differently (different agendas?) • Spamming, fake grassroots movements number of tweets per user
  • 31. look who’s talking Detecting and Tracking Political Abuse in Social Media Ratkiewicz, Conover, Meiss, Goncalves, Flammini, Menczer @ ICWSM (2011) • Truthiness is a quality characterizing a "truth" that a person making an argument or assertion claims to know intuitively "from the gut" or because it "feels right" without regard to evidence, logic, intellectual examination, or facts. Classifying memes for astroturf Truthy project by Indiana University
  • 32. look who’s talking #ampat @PeaceKaren_25 & @HopeMarie_25 gopleader.gov Chris Coons #Truthy @senjohnmccain on.cnn.com/aVMu5y “Obama said…” LEGITIMATE TRUTHY Detecting and Tracking Political Abuse in Social Media Ratkiewicz, Conover, Meiss, Goncalves, Flammini, Menczer @ ICWSM (2011)
  • 33. elections Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment Tumasjan, Sprenger, Sandner, Welpe @ AAAI (2010) • 2009 German federal elections sentiment profiles of leading candidates in tweets mentioning them (using LIWC2007) “The mere number of tweets reflects voter preferences and comes close to traditional election polls” CONTROVERSY! 638 citations!
  • 34. elections Why the Pirate Party won the German election of 2009 or the trouble with predictions: A response to Tumasjan, Sprenger, Sander, & Welpe, "Predicting elections with twitter: What 140 characters reveal about political sentiment" @ajungherr Jungherr, Jürgens, Schoen @ SSCR V30/N2 (2012) “show that the results of TSSW are contingent on arbitrary choices of the authors” Choice of Parties Choice of Dates If results of polls played a role in deciding upon the inclusion of particular parties, the TSSW method is dependent on public opinion surveys prediction analysis […] between [13.9] and [27.9], the day of the election, produces a MAE of of 2.13, significantly higher than the MAE for TSSW
  • 35. • Non-US elections: elections – Irish: On using twitter to monitor political sentiment and predict election results, Bermingham, Smeaton (2011) • "Our approach however has demonstrated an error which is not competitive with the traditional polling methods.” – Dutch: Predicting the 2011 Dutch senate election results with twitter, Sang, Bos (2012) • Uses polls for demographic imbalances, yet performance still below traditional polls – Singapore: Tweets and votes: A study of the 2011 singapore general election, Skoric, Poor, Achananuparp, Lim, Jiang (2012) • Not as accurate as traditional polls, performance at local government levels – New Zealand: Can Social Media Predict Election Results? Evidence from New Zealand, Michael P. Cameron (2013) • “the size of the effect is small and it appears that social media presence will therefore only make a difference in closely contested elections” – many more coming out each day!
  • 36. ! " #$" %#&' ! ! " ( ! "#$%&' (#)#&'%* +, (- %' . (/ - %' ' #" ! "# "$%&' (&)*+,$' -. *&/ -+0",$"' 1%&2%"13&45"$$+- 6. 1"+*&7. 8' 9: ;+**' &<!"#$%&'()=& >1";?&' (&@; "+0' &<AB. "1= / . 1. 3"' $"%&4?&C+$.D.%&<!*'+,-./0*'1'-=& E+**+%*+8&F' **+3+&<>A: = )1"&C2%$. (. -.G&<!02,/3-*=& E+**+%*+8&F' **+3+&<>A: = Metaxas et al. @ SocialCom (2011)
  • 37. elections How (Not) To Predict Elections @takis_metaxas Metaxas et al. @ SocialCom (2011) • A method of prediction should be an algorithm finalized before the election – specify data collection, cleaning, analysis, interpretation… • Data from social media are fundamentally different than data from natural phenomena – people change their behavior next time around – spammers & activists will try to take advantage • From a testable theory on why and when it predicts (avoid self-deception!) • (maybe) Learn from professional pollsters – tweet ≠ user – user ≠ eligible voter – eligible voter ≠ voter [from authors’ original slides]
  • 38. What now? Now-casting  Fore-casting Show improvement over baseline or that you could make money / a difference Publish a paper: let us know! (or go to Wall Street / Political Thinktank )
  • 39. thank you Yelena Mejova @yelenamm ymejova@qf.org.qa
  • 40.
  • 41. day of the week market index Fixed-effects panel regressions at 1 and 2 day lags 1. Bullishness is affected more strongly by returns than vice versa 2. Message volume predicts trading volume 3. … but high trading volume and volatility predict message volume more 4. Agreement among traders leads to lower trading volumes

Hinweis der Redaktion

  1. Image: http://www.hdwallcloud.com/wp-content/uploads/2014/02/place_of_dreams-in-hd.jpg
  2. Dollar: http://www.circul8.com.au/wp-content/uploads/2014/07/Facebook-Money-1.jpg Cat: http://cdn29.elitedaily.com/wp-content/uploads/2014/05/Svetlana-Petrova-cat-hope-elite-daily.jpg
  3. http://en.wikipedia.org/wiki/Efficient-market_hypothesis
  4. http://en.wikipedia.org/wiki/Efficient-market_hypothesis A non-random walk down Wall Street: http://press.princeton.edu/chapters/i6558.html
  5. GNIP Provides firehose access to major social media APIs including Twitter http://gnip.com/sources/ Bloomberg Twitter terminal Added in April 2013, it shows tweets from a selection of users, including news services, financial writers, economists, and bloggers selected by Bloomberg’s terminal team. http://nymag.com/daily/intelligencer/2013/04/bloombergs-vip-terminal-tweeters.html Dataminr social analytics company with clients in finance and government which use firehose access to find tweets which may be newsworthy and relevant to a particular market. Article on Dataminr: http://www.fastcoexist.com/1681873/twitter-can-predict-the-stock-market-if-youre-reading-the-right-tweets Derwent Capital Markets and Cayman Atlantic are firms which first pioneered in the use of social media sentiment analysis for financial trading. As their inspiration they cite Bollen/Mao/Zeng study that we will talk about later in this talk, which establishes some connection between emotion-related words in Twitter to subsequent moves in the Dow Jones Industrial Average. (“Twitter mood predicts the stock market”) http://en.wikipedia.org/wiki/Derwent_Capital_Markets http://money.cnn.com/2013/07/10/investing/twitter-fund-trading/index.html
  6. Much interest to internet community Viral marketing by producers Box-office revenues are an easy indicator of market success Collected using a manually compiled term list tweet rate: # twts referring to a movie per hour linear regression uses 7 variables, each for the twt rate for the day predicting the box office gross during the opening weekend Using LingPipe for sentiment classification HSX – holywood stock exchange http://www.hsx.com/security/view/POPEY
  7. Dataset: 1 billion tweets 2008-2009 (message volume increased by a factor of 50 during this period) using Gardenhose ICS: five questions administered monthly in telephone interviews ECI: two questions administered daily (reported in 3-day averages) a message is positive (/neg) if it has a pos (/neg) lex word score = pos / neg
  8. Prediction using 44 through 30 days before the target date In a model with both variables, at first the importance of twt text is small (the coefficient), but starting in mid-2009 text becomes a much better predictor.
  9. Dataset: ~10M tweets in 2008 Models DJIA closing values Dow Jones Industrial Average - a stock market index -- price-weighted average of 30 significant stocks traded on the New York Stock Exchange and the Nasdaq Granger Causality: whether lagged values of X provide statistically significant information about future values of Y
  10. Twitter is mostly REACTIVE Financial indicators: returns, abnormal returns, cumulative abnormal returns, trading volume, daily volatility Twitter features: bullishness, message volume, agreement among messages Built a fixed-effects panel regressions at 1 and 2 day lags (coefficients are standardized) Bullishness is affected more strongly by returns than vice versa Message volume predicts trading volume … but high trading volume and volatility predict message volume more Agreement among traders leads to lower trading volumes
  11. Vector autoregression (VAR) is an econometric model used to capture the linear interdependencies among multiple time series. Is the improvement enough to compensate for the fees associated with trading stocks?
  12. High-speed trading article: http://www.nytimes.com/2012/10/15/business/with-profits-dropping-high-speed-trading-cools-down.html?ref=highfrequencyalgorithmictrading graphic: http://www.nytimes.com/interactive/2012/10/15/business/Declining-US-High-Frequency-Trading.html?ref=business Computer Flaws article: http://dealbook.nytimes.com/2013/09/19/computer-flaws-get-wry-smile-from-humans-displaced/?ref=highfrequencyalgorithmictrading
  13. http://www.informationweek.com/news/government/212000815 http://edition.cnn.com/2012/04/24/opinion/donehue-social-media-politics/index.html http://www.washingtonpost.com/national/on-innovations/social-medias-role-in-politics/2012/01/25/gIQAQvZgdQ_story.html#
  14. For Full-text classification they represent text using TFIDF (removing stopwords, hashtags, mentions, urls, and all words occurring once in the corpus T_ij – importance of a term I in the set of tweets produced by user j
  15. Using retweet network where there is an undirected link between two users if either user mentions the other during the analysis period Clusters: accept the majority cluster label Adjusted Rand Index: similarity of two cluster label assignments (-1 when totally disagree and +1 when totally agree) Clusters + Tags: topological information with 19 hashtags selected using Hall’s feature selection algorithm
  16. ADA: Americans for Democratic Action score, calculated based on various quantities such as the number of times a media outlet cites various think-tanks and other policy groups
  17. http://www.cbsnews.com/stories/2006/12/12/opinion/meyer/main2250923.shtml
  18. Dashed lines: retweets, Yellow: mentions #ampat – retweeted between two accounts who seemed to be owned by the same person @PeaceKaren_25 (and @HopeMarie_25) – two colluding accounts gopleader.gov – promoted by the two *_25 accounts above Chris Coons – a tweet smearing Chris Coons using bot accounts #Truthy – injected by NPR Science Friday radio program @senjohnmccain -- retweets from @ladygaga (don’t ask don’t tell) and mentions
  19. LIWC – Linguistic Inquiry and Word Count
  20. Second table: absolute errors
  21. method of prediction should be an algorithm finalized before the elections: (input) how Social Media data are to be collected, including the dates of data collection, (filter) the way in which the cleanup of the data is to be performed (e.g., the selection of keywords relevant to the election), (method) the algorithms to be applied on the data along with their input parameters, and (output) the semantics under which the results are to be interpreted Data from social media are fundamentally different than data from natural phenomena people will change their behavior the next time around spammers & activists will try to take advantage
  22. Fixed-effects panel regressions of market the three tweet features as independent variables at 1 and 2 day lags NWK: dummy variable signifying first trading day of the week Market: market index as control 1. There is almost no effect of bullishness on next day returns, however bullishness 2 days ago is associated with negative returns. (bold: actual values, italicized: standardized)