Thou Shalt not Share Collections of Tweets: Should we give a TOS?
“Thou Shalt not Share Collections ...”
  • Interest sparked by AoIR discussion
  • Post by Prof Stuart Shulman on May 5th
The Original Post (OP) [Posted: Thu May 5 05:24:10 PDT 2011]
What Twitter said
Twitter-History a.k.a. ‘Twistory’
  • “We hope Twitter will realize the value of enabling researchers, journalists and citizens better ways to search, sort and analyze clusters of this important historical information.”
Twitter appears to think so too!
Twitter says “desist!”
  • Prohibited other services from offering archives (for download), e.g., 140kit, TwapperKeeper, DiscoverText, ...
  • Shut down 3rd party clients (Twidroyd & UberTwitter) for:
      • Private Direct Messages longer than 140 characters
      • Trademark infringement
      • Changing the content of users' Tweets in order to make money
Twitter responds ...
  • “... abide by a simple set of rules that are in the interests of our users, as well as the health and vitality of the platform as a whole.”
  • “... on an average day we turn off more than one hundred services that violate our API rules of the road.”
  • “You can download Twitter for Blackberry, Twitter for Android and other official Twitter apps here. You can also try our mobile web site or apps from other third-party developers.”
Why now?
  • Perspectives: online social messaging service (user); open ecosystem infrastructure (developer); historical social record (researchers)
  • Post “tweets” with max. 140 characters in real-time
  • Publicly accessible (cf. CB radios) with some privacy
  • Provides search (limited)
  • Uses & develops open-source software (e.g., Cassandra, Lucene, FlockDB, ...)
Some Twitter numbers
  • Valuation: $4 billion (January 2011)
  • Investment: $360 million ($200m, Dec 2010)
  • Employees: 400 (Jan 2011); 200 are engineers
  • Revenue: ad estimates of $150 million for 2011
  • No. of tweets: 140-150 million per day
  • Users/Accounts: 200 million (approx.)
  • Website ranking: Top 10 to Top 20
  • Twitter search: one billion queries per day
2006 (late)-2008
2009-2010
2011
A quick aside ...
Twitter Research
  • Services: 140kit, TwapperKeeper, DiscoverText, The Archivist, ...
  • Some hundreds of publications
  • Areas: social network analysis, recommendation systems, social influence, user sentiment, business strategy, disaster prediction & alerts, education, software engineering, politics, ...
  • Using: content analysis (narrative), ethnography, SVMs, TextRank, TFIDF, BoW, POS, ...
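To make two of the listed methods concrete, here is a minimal sketch of bag-of-words (BoW) counting and TF-IDF weighting over a handful of tweet-like strings. It assumes whitespace tokenization and the plain `tf * log(N / df)` weighting, which is only one common variant and not necessarily what any of the cited studies used. The function names and the sample texts are made up for illustration.

```python
import math
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text, split on whitespace, and count each token."""
    return Counter(text.lower().split())


def tf_idf(docs: list[str]) -> list[dict[str, float]]:
    """Return one {term: weight} dict per document.

    Weight = (term count / document length) * log(N / document frequency).
    """
    bags = [bag_of_words(d) for d in docs]
    n_docs = len(bags)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for bag in bags for term in bag)
    scores = []
    for bag in bags:
        total = sum(bag.values())
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in bag.items()
        })
    return scores


# Hypothetical sample texts, not real tweets.
sample = ["twitter api limits", "twitter data archive"]
weights = tf_idf(sample)
```

A term that occurs in every document (here "twitter") gets weight zero, while terms unique to one document get a positive weight, which is why TF-IDF is often used to surface the distinguishing vocabulary of a tweet collection.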
The Twitter API
  • REST API uses HTTP protocol
  • All website features supported through API
  • Programming libraries available
  • Rate limiting (user & IP):
      • Anonymous: 150 requests per hour
      • OAuth: 350 requests per hour
      • Whitelist: e.g. 20,000 requests
  • Streaming offerings:
      • Spritzer (1%)
      • Gardenhose (10%)
      • Firehose (100%)
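The limits above translate into a simple request budget. The sketch below is not Twitter client code; it only works out the minimum spacing between requests needed to stay under an hourly limit, and the approximate daily volume of each streaming tier. It assumes the lower figure of 140 million tweets per day quoted in the numbers slide, and the function names are hypothetical.

```python
# Hourly REST limits and streaming percentages as listed on the slide.
RATE_LIMITS_PER_HOUR = {"anonymous": 150, "oauth": 350}
STREAM_PERCENT = {"spritzer": 1, "gardenhose": 10, "firehose": 100}


def min_seconds_between_requests(limit_per_hour: int) -> float:
    """Spacing that spreads the allowed requests evenly across one hour."""
    return 3600 / limit_per_hour


def tweets_per_day(tier: str, total_per_day: int = 140_000_000) -> int:
    """Approximate daily tweet volume delivered by a streaming tier.

    total_per_day is an assumption taken from the deck's own estimate.
    """
    return total_per_day * STREAM_PERCENT[tier] // 100
```

Under these assumptions an anonymous client can issue one request every 24 seconds, an OAuth client roughly one every 10.3 seconds, and the Spritzer stream carries about 1.4 million tweets per day.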
General Terms of Service (Nov 2010)
  • Under “Your Rights”: “... You grant us a worldwide, non-exclusive, royalty-free license (with the right to sublicense) to use, copy, reproduce, process, adapt, modify, publish, transmit, display and distribute such Content in any and all media or distribution methods (now known or later developed).”
TOS tips
  • “This license is you authorizing us to make your Tweets available to the rest of the world and to let others do the same. But what’s yours is yours – you own your content.”
  • “Twitter has an evolving set of rules for how API developers can interact with your content. These rules exist to enable an open ecosystem with your rights in mind.”
API TOS (Feb 2011)
  • Access to Twitter Content: You will not attempt or encourage others to sell, rent, lease, sublicense, redistribute, or syndicate the Twitter API or Twitter Content to any third party for such party to develop additional products or services without prior written approval from Twitter
  • Content = “All use of the Twitter API and content, documentation, code, and related materials made available to you on or through Twitter.”
Authorised resyndication = GNIP
  • First authorized reseller of Twitter data, Nov 2010
  • Offerings:
      • Halfhose (50%, $30k / mo)
      • Decahose (10%, $5k / mo)
      • Power Track ($.10 per 1,000 Tweets)
      • Link Stream ($50k / mo)
      • User Mention Stream ($20k / mo)
      • Keyword Search
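A quick way to compare the metered and flat-fee offerings is to compute where they break even. The sketch below assumes Power Track is billed purely per tweet with no base fee, which the slide does not confirm; the function names are hypothetical.

```python
# Prices as listed on the slide (US dollars).
POWER_TRACK_USD_PER_1000 = 0.10
DECAHOSE_USD_PER_MONTH = 5_000
HALFHOSE_USD_PER_MONTH = 30_000


def power_track_cost(n_tweets: int) -> float:
    """Cost of n_tweets at the per-1,000-tweet rate (assumes no base fee)."""
    return n_tweets / 1000 * POWER_TRACK_USD_PER_1000


def break_even_tweets(flat_fee_usd: float) -> int:
    """Monthly tweet count at which Power Track costs as much as a flat fee."""
    return round(flat_fee_usd / POWER_TRACK_USD_PER_1000 * 1000)
```

Under that assumption one million tweets cost about $100 through Power Track, and the metered option only becomes more expensive than the Decahose once a project needs more than 50 million tweets in a month.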
Potential consequences
  • Obstructs peer review of datasets
  • Prevents researchers from getting access to data (in a timely way, if at all)
  • Stifles innovation (most comes from the user community & 3rd party developers!)
  • Users become more cautious about using social media
  • Twitter becomes less useful (protest, reporting, ...)
  • Twitter services become hacking targets (unreliable, unstable, slow, ...)
  • Social science researchers twiddle their thumbs
One solution ... One solution?
Talking points
  • Is there a problem here?
  • Does Twitter have any obligation to users, developers & researchers?
  • Is it worthwhile (or even ethical) to violate Twitter’s TOS to get access to researchable data?
  • Should users’ content even be available to researchers?
Thanks!
