SlideShare ist ein Scribd-Unternehmen logo
1 von 38
GRAMPA, WHAT'S A
DELETED TWEET?
Mohammed Nauman Siddique
Web Archiving Forensics (CS 895)
Spring, 2019
Web Science and Digital Libraries Group
Old Dominion University
Norfolk, Virginia, USA
@WebSciDL
Presidential tweets are now government records
@m_nsiddique, @WebSciDL 2
Source: https://web.archive.org/web/20170121171210/http:/twitter.com/realDonaldTrump/status/822853741040771072
News Article: https://theconversation.com/donald-trumps-tweets-are-now-presidential-records-71973
11% of the social media resources are lost in their first year
@m_nsiddique, @WebSciDL 3
Source: SalahEldeen H.M., Nelson M.L. (2012) Losing My Revolution: How Many Resources Shared on Social Media
Have Been Lost?. TPDL 2012. Springer, Berlin, Heidelberg
Blog Link: http://ws-dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html
Politwoops: Tracks deleted tweets by public officials
@m_nsiddique, @WebSciDL 4
Source: https://projects.propublica.org/politwoops/
The best way to find a typo is to hit send
@m_nsiddique, @WebSciDL 5
Source: https://projects.propublica.org/politwoops/tweet/1056626382548156416
Fixing typos only introduces more typos
@m_nsiddique, @WebSciDL 6
Source: https://twitter.com/RepDannyDavis/status/1056627582148530177
Unretweeted after a year!!!
@m_nsiddique, @WebSciDL 7
Source: https://projects.propublica.org/politwoops/tweet/910352940749254657
Twitter revoked access to Politwoops
@m_nsiddique, @WebSciDL 8
Source: https://www.businessinsider.com/twitter-bans-politwoops-diplotwoops-global-api-access-revoked-open-state-foundation-
2015-8/
Politwoops resumes after 6 months
@m_nsiddique, @WebSciDL 9
Tweet is deleted
Source: https://blog.twitter.com/official/en_us/a/2015/holding-public-officials-accountable-with-twitter-and-politwoops.html
Flight handle is gone
@m_nsiddique, @WebSciDL 10
Source: https://twitter.com/Flight/status/656882929923059713
No worries web archives come to the rescue
@m_nsiddique, @WebSciDL 11
Source: https://web.archive.org/web/20160205000405/https://twitter.com/Flight/status/656882929923059713
Web archives include social media too
@m_nsiddique, @WebSciDL 12
Source: https://web.archive.org/web/20180929210711/https:/twitter.com/RepDannyDavis
Nauman, you are not archived
@m_nsiddique, @WebSciDL 13
Source: https://web.archive.org/web/*/https://twitter.com/m_nsiddique
@BreitbartNews is well archived
@m_nsiddique, @WebSciDL 14
Source: https://web.archive.org/web/*/https://twitter.com/BreitbartNews
@realDonaldTrump is very heavily archived
@m_nsiddique, @WebSciDL 15
Source: https://web.archive.org/web/*/https://twitter.com/realDonaldTrump
Archival captures for top level pages have approximately 20 tweets
@m_nsiddique, @WebSciDL 16
Source: https://web.archive.org/web/20190202074656/https:/twitter.com/realDonaldTrump
Tweet Ids are just a single tweet
@m_nsiddique, @WebSciDL 17
Source: https://web.archive.org/web/20190202054351/https://twitter.com/realdonaldtrump/status/1091427927475085312
Not enough to take screenshots
@m_nsiddique, @WebSciDL 18
Source: https://twitter.com/CasMudde/status/960546130684768256
News Article: https://www.huffingtonpost.com/entry/
breitbart-anti-muslim-tweet_us_5a78b426e4b0164659c70e15
Monday Morning Quarterbacking
@m_nsiddique, @WebSciDL 19
Source: https://twitter.com/BreitbartNews/status/960565890336149504
Penalty: Unsportsmanlike conduct
@m_nsiddique, @WebSciDL 20
Source: https://web.archive.org/web/20180205041213/http:/twitter.com/BreitbartNews/status/960353573581283329
How did we find the deleted tweets?
• Used Twitter API to fetch recent 3200 tweets
• Tweets spanned from Oct 22, 2017 to Feb 18, 2018
• Used Memgator, memento aggregator service to fetch
mementos
@m_nsiddique, @WebSciDL 21
Code to fetch recent tweets using Python-TwitterAPI
import twitter
api = twitter.Api(consumer_key='xxxxxx',
consumer_secret='xxxxxx',
access_token_key='xxxxxx',
access_token_secret='xxxxxx',
sleep_on_rate_limit=True)
twitter_response = api.GetUserTimeline(screen_name=screen_name,
count=200, include_rts=True)
@m_nsiddique, @WebSciDL 22
Run MemGator locally
$ memgator --contimeout=10s --agent=XXXXXX server
MemGator 1.0-rc7
_____ _______ __
/  _____ _____ / _____/______/ |___________
/ Y Y / __ / /  _____  _/ _ _ _ 
/ | |  ___/ Y Y  _ / __ | | |_| | | /
__/_____/______|_|__/_______/_____|__|___/|__|
TimeMap : http://localhost:1208/timemap/{FORMAT}/{URI-R}
TimeGate : http://localhost:1208/timegate/{URI-R} [Accept-
Datetime]
Memento :
http://localhost:1208/memento[/{FORMAT}|proxy]/{DATETIME}/{UR
I-R}
# FORMAT => link|json|cdxj
# DATETIME => YYYY[MM[DD[hh[mm[ss]]]]]
# Accept-Datetime => Header in RFC1123 format
@m_nsiddique, @WebSciDL 23
Source: https://github.com/oduwsdl/MemGator
TimeGate
@m_nsiddique, @WebSciDL 24
RFC: https://tools.ietf.org/html/rfc7089
Source: http://mementoweb.org/guide/quick-intro/
TimeMap
msiddique@atria:~$ curl -i
https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html
HTTP/1.1 200 OK
Content-Type: application/link-format
Date: Wed, 06 Feb 2019 18:46:16 GMT
X-Generator: MemGator:1.0-rc7
X-Memento-Count: 93
Transfer-Encoding: chunked
<http://example.org/index.html>; rel="original",
<https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html>;
rel="self"; type="application/link-format",
<http://web.archive.org/web/20021016101337/http://www.example.org:80/index.html>;
rel="first memento"; datetime="Wed, 16 Oct 2002 10:13:37 GMT",
<http://web.archive.org/web/20031207031049/http://www.example.org:80/index.html>;
rel="memento"; datetime="Sun, 07 Dec 2003 03:10:49 GMT",
….Deletia…
<http://web.archive.org/web/20190131051056/http://www.example.org/index.html>;
rel="last memento"; datetime="Thu, 31 Jan 2019 05:10:56 GMT",
@m_nsiddique, @WebSciDL 25
Play with TimeMap and TimeGate
@m_nsiddique, @WebSciDL 26
Source: http://memgator.cs.odu.edu/api.html
Code to fetch TimeMap for any Twitter handle
url = "http://localhost:1208/timemap/"
data_format = "cdxj"
command = url + data_format +
"/http://twitter.com/<screen-name>" +
response = requests.get(command)
@m_nsiddique, @WebSciDL 27
Code to parse tweet-related information
import bs4
soup = bs4.BeautifulSoup(open(<HTML representation of
Memento>),"html.parser")
match_tweet_div_tag = soup.select('div.js-stream-tweet')
for tag in match_tweet_div_tag:
if tag.has_attr("data-tweet-id"):
# Get Tweet id
...........
# Parse tweets
match_timeline_tweets = tag.select('p.js-tweet-
text.tweet-text')
...........
# Parse tweet timestamps
match_tweet_timestamp = tag.find("span", {"class":
"js-short-timestamp"})
...........
@m_nsiddique, @WebSciDL 28
Analysis of Breitbart News Deleted Tweets
• Of the 22 deleted tweets, 20 were of the form where
Breitbart News retweeted someone's tweet but the
original tweet was lost.
• Of those 20 tweets, 18 were from two affiliates of Breitbart
News, @NolteNC and @carney. Therefore, we decided to
have a look at both the accounts to determine the reason
for their deleted tweets.
@m_nsiddique, @WebSciDL 29
Unretweeted tweet by Breitbart News
@m_nsiddique, @WebSciDL 30
Source: https://web.archive.org/web/20180111162729/https:/twitter.com/BreitbartNews
Original Live Tweet
@m_nsiddique, @WebSciDL 31
Source: https://twitter.com/LibertarianBlue/status/951465189161029632
Breitbart News retweets another tweet
@m_nsiddique, @WebSciDL 32
Source: https://twitter.com/LibertarianBlue/status/951578248986710016
Analysis on @carney and @NolteNC
• Mementos fetched between Nov 3, 2017 and Feb 17,
2018
• Low number of mementos for @carney
• @NolteNC had 169 live tweets and 3569 deleted tweets
• Fetched live tweets using Twitter API for both accounts for
over two weeks
@m_nsiddique, @WebSciDL 33
Tweets older than a week on Tuesday and Saturday are deleted
@m_nsiddique, @WebSciDL 34
Tweets older than a week on Wednesday and Saturday are deleted
@m_nsiddique, @WebSciDL 35
• With 1000s of deleted tweets, it seemed unlikely that he
was manually deleting tweets.
• We have all the reasons to believe that @carney and
@NolteNC deleted tweets automatically using some tweet
deletion service.
@m_nsiddique, @WebSciDL 36
Deletion Behavior
Take Away
• It is not enough to make screen shots of controversial
tweets, rather we need to push it to the web archives for
longer retention capability than our personal archives.
• For finding deleted tweets, web archives work effectively
for popular accounts but for less popular accounts this
approach might not work.
• For finding deleted tweets, top level page works better
than individual Tweet Id URLs.
• Most deletions for Breitbart News come from automatic
deletion of tweets by some of its correspondents.
@m_nsiddique, @WebSciDL 37
You can read more on the blog
http://ws-dl.blogspot.com/2018/04/2018-04-23-grampa-
whats-deleted-tweet.html
@m_nsiddique, @WebSciDL 38

Weitere ähnliche Inhalte

Ähnlich wie Grampa, What's a deleted tweet?

Challenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter PagesChallenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter PagesKritika Garg
 
Web Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter ScreenshotsWeb Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter ScreenshotsTarannum Zaki
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Michael Nelson
 
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGESMINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGESIAEME Publication
 
DataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveDataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveEugene Yan Ziyou
 
OSINT using Twitter & Python
OSINT using Twitter & PythonOSINT using Twitter & Python
OSINT using Twitter & Python37point2
 
Uncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesUncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesMichael Nelson
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Farida Vis
 
GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1Harald Meier
 
From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...Debanjan Mahata
 
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...Shawn Jones
 
All You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITINGAll You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITINGHeather Lee
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Michael Nelson
 
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22News Leaders Association's NewsTrain
 
Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15spiderQube
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia UniversityTunghai University
 
Extracting Information from Twitter Screenshots
Extracting Information from Twitter ScreenshotsExtracting Information from Twitter Screenshots
Extracting Information from Twitter ScreenshotsTarannum Zaki
 

Ähnlich wie Grampa, What's a deleted tweet? (20)

Challenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter PagesChallenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter Pages
 
Web Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter ScreenshotsWeb Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter Screenshots
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
 
Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Social Media Mining and Analytics
 
Ijciet 08 02_024
Ijciet 08 02_024Ijciet 08 02_024
Ijciet 08 02_024
 
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGESMINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
 
DataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveDataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDive
 
OSINT using Twitter & Python
OSINT using Twitter & PythonOSINT using Twitter & Python
OSINT using Twitter & Python
 
Congressional Deleted Tweets
Congressional Deleted TweetsCongressional Deleted Tweets
Congressional Deleted Tweets
 
Uncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesUncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pages
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
 
GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1
 
From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...
 
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
 
All You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITINGAll You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITING
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
 
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
 
Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia University
 
Extracting Information from Twitter Screenshots
Extracting Information from Twitter ScreenshotsExtracting Information from Twitter Screenshots
Extracting Information from Twitter Screenshots
 

Kürzlich hochgeladen

Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Sheetaleventcompany
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Servicesexy call girls service in goa
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024APNIC
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...APNIC
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607dollysharma2066
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsstephieert
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024APNIC
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)Damian Radcliffe
 
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts servicevipmodelshub1
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...tanu pandey
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGAPNIC
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersDamian Radcliffe
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.soniya singh
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsThierry TROUIN ☁
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 

Kürzlich hochgeladen (20)

Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girls
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOG
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with Flows
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 

Grampa, What's a deleted tweet?

  • 1. GRAMPA, WHAT'S A DELETED TWEET? Mohammed Nauman Siddique Web Archiving Forensics (CS 895) Spring, 2019 Web Science and Digital Libraries Group Old Dominion University Norfolk, Virginia, USA @WebSciDL
  • 2. Presidential tweets are now government records @m_nsiddique, @WebSciDL 2 Source: https://web.archive.org/web/20170121171210/http:/twitter.com/realDonaldTrump/status/822853741040771072 News Article: https://theconversation.com/donald-trumps-tweets-are-now-presidential-records-71973
  • 3. 11% of the social media resources are lost in their first year @m_nsiddique, @WebSciDL 3 Source: SalahEldeen H.M., Nelson M.L. (2012) Losing My Revolution: How Many Resources Shared on Social Media Have Been Lost?. TPDL 2012. Springer, Berlin, Heidelberg Blog Link: http://ws-dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html
  • 4. Politwoops: Tracks deleted tweets by public officials @m_nsiddique, @WebSciDL 4 Source: https://projects.propublica.org/politwoops/
  • 5. The best way to find a typo is to hit send @m_nsiddique, @WebSciDL 5 Source: https://projects.propublica.org/politwoops/tweet/1056626382548156416
  • 6. Fixing typos only introduces more typos @m_nsiddique, @WebSciDL 6 Source: https://twitter.com/RepDannyDavis/status/1056627582148530177
  • 7. Unretweeted after a year!!! @m_nsiddique, @WebSciDL 7 Source: https://projects.propublica.org/politwoops/tweet/910352940749254657
  • 8. Twitter revoked access to Politwoops @m_nsiddique, @WebSciDL 8 Source: https://www.businessinsider.com/twitter-bans-politwoops-diplotwoops-global-api-access-revoked-open-state-foundation- 2015-8/
  • 9. Politwoops resumes after 6 months @m_nsiddique, @WebSciDL 9 Tweet is deleted Source: https://blog.twitter.com/official/en_us/a/2015/holding-public-officials-accountable-with-twitter-and-politwoops.html
  • 10. Flight handle is gone @m_nsiddique, @WebSciDL 10 Source: https://twitter.com/Flight/status/656882929923059713
  • 11. No worries web archives come to the rescue @m_nsiddique, @WebSciDL 11 Source: https://web.archive.org/web/20160205000405/https://twitter.com/Flight/status/656882929923059713
  • 12. Web archives include social media too @m_nsiddique, @WebSciDL 12 Source: https://web.archive.org/web/20180929210711/https:/twitter.com/RepDannyDavis
  • 13. Nauman, you are not archived @m_nsiddique, @WebSciDL 13 Source: https://web.archive.org/web/*/https://twitter.com/m_nsiddique
  • 14. @BreitbartNews is well archived @m_nsiddique, @WebSciDL 14 Source: https://web.archive.org/web/*/https://twitter.com/BreitbartNews
  • 15. @realDonaldTrump is very heavily archived @m_nsiddique, @WebSciDL 15 Source: https://web.archive.org/web/*/https://twitter.com/realDonaldTrump
  • 16. Archival captures for top level pages have approximately 20 tweets @m_nsiddique, @WebSciDL 16 Source: https://web.archive.org/web/20190202074656/https:/twitter.com/realDonaldTrump
  • 17. Tweet Ids are just a single tweet @m_nsiddique, @WebSciDL 17 Source: https://web.archive.org/web/20190202054351/https://twitter.com/realdonaldtrump/status/1091427927475085312
  • 18. Not enough to take screenshots @m_nsiddique, @WebSciDL 18 Source: https://twitter.com/CasMudde/status/960546130684768256 News Article: https://www.huffingtonpost.com/entry/ breitbart-anti-muslim-tweet_us_5a78b426e4b0164659c70e15
  • 19. Monday Morning Quarterbacking @m_nsiddique, @WebSciDL 19 Source: https://twitter.com/BreitbartNews/status/960565890336149504
  • 20. Penalty: Unsportsmanlike conduct @m_nsiddique, @WebSciDL 20 Source: https://web.archive.org/web/20180205041213/http:/twitter.com/BreitbartNews/status/960353573581283329
  • 21. How did we find the deleted tweets? • Used Twitter API to fetch recent 3200 tweets • Tweets spanned from Oct 22, 2017 to Feb 18, 2018 • Used Memgator, memento aggregator service to fetch mementos @m_nsiddique, @WebSciDL 21
  • 22. Code to fetch recent tweets using Python-TwitterAPI import twitter api = twitter.Api(consumer_key='xxxxxx', consumer_secret='xxxxxx', access_token_key='xxxxxx', access_token_secret='xxxxxx', sleep_on_rate_limit=True) twitter_response = api.GetUserTimeline(screen_name=screen_name, count=200, include_rts=True) @m_nsiddique, @WebSciDL 22
  • 23. Run MemGator locally $ memgator --contimeout=10s --agent=XXXXXX server MemGator 1.0-rc7 _____ _______ __ / _____ _____ / _____/______/ |___________ / Y Y / __ / / _____ _/ _ _ _ / | | ___/ Y Y _ / __ | | |_| | | / __/_____/______|_|__/_______/_____|__|___/|__| TimeMap : http://localhost:1208/timemap/{FORMAT}/{URI-R} TimeGate : http://localhost:1208/timegate/{URI-R} [Accept- Datetime] Memento : http://localhost:1208/memento[/{FORMAT}|proxy]/{DATETIME}/{UR I-R} # FORMAT => link|json|cdxj # DATETIME => YYYY[MM[DD[hh[mm[ss]]]]] # Accept-Datetime => Header in RFC1123 format @m_nsiddique, @WebSciDL 23 Source: https://github.com/oduwsdl/MemGator
  • 24. TimeGate @m_nsiddique, @WebSciDL 24 RFC: https://tools.ietf.org/html/rfc7089 Source: http://mementoweb.org/guide/quick-intro/
  • 25. TimeMap msiddique@atria:~$ curl -i https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html HTTP/1.1 200 OK Content-Type: application/link-format Date: Wed, 06 Feb 2019 18:46:16 GMT X-Generator: MemGator:1.0-rc7 X-Memento-Count: 93 Transfer-Encoding: chunked <http://example.org/index.html>; rel="original", <https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html>; rel="self"; type="application/link-format", <http://web.archive.org/web/20021016101337/http://www.example.org:80/index.html>; rel="first memento"; datetime="Wed, 16 Oct 2002 10:13:37 GMT", <http://web.archive.org/web/20031207031049/http://www.example.org:80/index.html>; rel="memento"; datetime="Sun, 07 Dec 2003 03:10:49 GMT", ….Deletia… <http://web.archive.org/web/20190131051056/http://www.example.org/index.html>; rel="last memento"; datetime="Thu, 31 Jan 2019 05:10:56 GMT", @m_nsiddique, @WebSciDL 25
  • 26. Play with TimeMap and TimeGate @m_nsiddique, @WebSciDL 26 Source: http://memgator.cs.odu.edu/api.html
  • 27. Code to fetch TimeMap for any Twitter handle url = "http://localhost:1208/timemap/" data_format = "cdxj" command = url + data_format + "/http://twitter.com/<screen-name>" + response = requests.get(command) @m_nsiddique, @WebSciDL 27
  • 28. Code to parse tweet-related information import bs4 soup = bs4.BeautifulSoup(open(<HTML representation of Memento>),"html.parser") match_tweet_div_tag = soup.select('div.js-stream-tweet') for tag in match_tweet_div_tag: if tag.has_attr("data-tweet-id"): # Get Tweet id ........... # Parse tweets match_timeline_tweets = tag.select('p.js-tweet- text.tweet-text') ........... # Parse tweet timestamps match_tweet_timestamp = tag.find("span", {"class": "js-short-timestamp"}) ........... @m_nsiddique, @WebSciDL 28
  • 29. Analysis of Breitbart News Deleted Tweets • Of the 22 deleted tweets, 20 were of the form where Breitbart News retweeted someone's tweet but the original tweet was lost. • Of those 20 tweets, 18 were from two affiliates of Breitbart News, @NolteNC and @carney. Therefore, we decided to have a look at both the accounts to determine the reason for their deleted tweets. @m_nsiddique, @WebSciDL 29
  • 30. Unretweeted tweet by Breitbart News @m_nsiddique, @WebSciDL 30 Source: https://web.archive.org/web/20180111162729/https:/twitter.com/BreitbartNews
  • 31. Original Live Tweet @m_nsiddique, @WebSciDL 31 Source: https://twitter.com/LibertarianBlue/status/951465189161029632
  • 32. Breitbart News retweets another tweet @m_nsiddique, @WebSciDL 32 Source: https://twitter.com/LibertarianBlue/status/951578248986710016
  • 33. Analysis on @carney and @NolteNC • Mementos fetched between Nov 3, 2017 and Feb 17, 2018 • Low number of mementos for @carney • @NolteNC had 169 live tweets and 3569 deleted tweets • Fetched live tweets using Twitter API for both accounts for over two weeks @m_nsiddique, @WebSciDL 33
  • 34. Tweets older than a week on Tuesday and Saturday are deleted @m_nsiddique, @WebSciDL 34
  • 35. Tweets older than a week on Wednesday and Saturday are deleted @m_nsiddique, @WebSciDL 35
  • 36. • With 1000s of deleted tweets, it seemed unlikely that he was manually deleting tweets. • We have all the reasons to believe that @carney and @NolteNC deleted tweets automatically using some tweet deletion service. @m_nsiddique, @WebSciDL 36 Deletion Behavior
  • 37. Take Away • It is not enough to make screen shots of controversial tweets, rather we need to push it to the web archives for longer retention capability than our personal archives. • For finding deleted tweets, web archives work effectively for popular accounts but for less popular accounts this approach might not work. • For finding deleted tweets, top level page works better than individual Tweet Id URLs. • Most deletions for Breitbart News come from automatic deletion of tweets by some of its correspondents. @m_nsiddique, @WebSciDL 37
  • 38. You can read more on the blog http://ws-dl.blogspot.com/2018/04/2018-04-23-grampa- whats-deleted-tweet.html @m_nsiddique, @WebSciDL 38