GRAMPA, WHAT'S A
DELETED TWEET?
Mohammed Nauman Siddique
Web Archiving Forensics (CS 895)
Spring, 2019
Web Science and Digital Libraries Group
Old Dominion University
Norfolk, Virginia, USA
@WebSciDL
Presidential tweets are now government records
@m_nsiddique, @WebSciDL 2
Source: https://web.archive.org/web/20170121171210/http:/twitter.com/realDonaldTrump/status/822853741040771072
News Article: https://theconversation.com/donald-trumps-tweets-are-now-presidential-records-71973
11% of the social media resources are lost in their first year
Source: SalahEldeen H.M., Nelson M.L. (2012) Losing My Revolution: How Many Resources Shared on Social Media Have Been Lost? TPDL 2012. Springer, Berlin, Heidelberg
Blog Link: http://ws-dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html
Politwoops: Tracks deleted tweets by public officials
Source: https://projects.propublica.org/politwoops/
The best way to find a typo is to hit send
Source: https://projects.propublica.org/politwoops/tweet/1056626382548156416
Fixing typos only introduces more typos
Source: https://twitter.com/RepDannyDavis/status/1056627582148530177
Unretweeted after a year!!!
Source: https://projects.propublica.org/politwoops/tweet/910352940749254657
Twitter revoked access to Politwoops
Source: https://www.businessinsider.com/twitter-bans-politwoops-diplotwoops-global-api-access-revoked-open-state-foundation-2015-8/
Politwoops resumes after 6 months
Tweet is deleted
Source: https://blog.twitter.com/official/en_us/a/2015/holding-public-officials-accountable-with-twitter-and-politwoops.html
Flight handle is gone
Source: https://twitter.com/Flight/status/656882929923059713
No worries, web archives come to the rescue
Source: https://web.archive.org/web/20160205000405/https://twitter.com/Flight/status/656882929923059713
Web archives include social media too
Source: https://web.archive.org/web/20180929210711/https:/twitter.com/RepDannyDavis
Nauman, you are not archived
Source: https://web.archive.org/web/*/https://twitter.com/m_nsiddique
@BreitbartNews is well archived
Source: https://web.archive.org/web/*/https://twitter.com/BreitbartNews
@realDonaldTrump is very heavily archived
Source: https://web.archive.org/web/*/https://twitter.com/realDonaldTrump
Archival captures for top level pages have approximately 20 tweets
Source: https://web.archive.org/web/20190202074656/https:/twitter.com/realDonaldTrump
Tweet ID URLs capture just a single tweet
Source: https://web.archive.org/web/20190202054351/https://twitter.com/realdonaldtrump/status/1091427927475085312
Not enough to take screenshots
Source: https://twitter.com/CasMudde/status/960546130684768256
News Article: https://www.huffingtonpost.com/entry/breitbart-anti-muslim-tweet_us_5a78b426e4b0164659c70e15
Monday Morning Quarterbacking
Source: https://twitter.com/BreitbartNews/status/960565890336149504
Penalty: Unsportsmanlike conduct
Source: https://web.archive.org/web/20180205041213/http:/twitter.com/BreitbartNews/status/960353573581283329
How did we find the deleted tweets?
• Used the Twitter API to fetch the 3,200 most recent tweets
• Tweets spanned from Oct 22, 2017 to Feb 18, 2018
• Used MemGator, a Memento aggregator service, to fetch mementos
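The comparison behind these steps can be sketched as a set difference: tweet IDs seen in archived captures that the live timeline no longer returns. This is a minimal sketch, not the authors' code. The function name `find_deleted` and the `oldest_id` cutoff are assumptions for illustration. The cutoff relies on tweet IDs increasing over time, so archived tweets older than the oldest tweet the API returned are skipped rather than counted as deleted.

```python
def find_deleted(archived_ids, live_ids, oldest_id=None):
    """Return archived tweet IDs that are missing from the live timeline.

    archived_ids: tweet IDs extracted from mementos (hypothetical input).
    live_ids: tweet IDs returned by the Twitter API.
    oldest_id: ignore archived IDs below this value; defaults to the
               smallest live ID, since older tweets are outside the
               window the API could return.
    """
    live = set(live_ids)
    if oldest_id is None and live:
        oldest_id = min(live)
    return sorted(
        tid for tid in set(archived_ids)
        if tid not in live and (oldest_id is None or tid >= oldest_id)
    )


if __name__ == "__main__":
    # ID 5 predates the live window, so only 11 is reported.
    print(find_deleted([5, 10, 11, 12, 20], [10, 12, 20]))
```

A tweet flagged this way is only a candidate: a retweet whose original was removed shows up the same way as a tweet the account itself deleted.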
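Code to fetch recent tweets using python-twitter
import twitter

# Credentials are placeholders; substitute your own application keys.
api = twitter.Api(consumer_key='xxxxxx',
                  consumer_secret='xxxxxx',
                  access_token_key='xxxxxx',
                  access_token_secret='xxxxxx',
                  sleep_on_rate_limit=True)

screen_name = 'BreitbartNews'  # example handle
# A single call returns at most 200 tweets, retweets included.
twitter_response = api.GetUserTimeline(screen_name=screen_name,
                                       count=200, include_rts=True)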
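A single timeline call returns at most 200 tweets, so reaching the 3,200 most recent tweets takes repeated calls that walk backwards with `max_id`. The sketch below shows only the paging logic. `fetch_timeline` and the `get_page` callable are hypothetical names, and tweets are represented as dicts with an integer "id" key for illustration. With python-twitter, `get_page` could wrap `api.GetUserTimeline(..., max_id=max_id)` and read each status's `id` attribute instead.

```python
def fetch_timeline(get_page, max_tweets=3200, page_size=200):
    """Collect tweets newest-first by paging backwards with max_id.

    get_page(max_id, count) is a hypothetical callable that returns a list
    of dicts with an integer "id" key, newest first. max_id=None means
    "start from the newest tweet".
    """
    tweets = []
    max_id = None
    while len(tweets) < max_tweets:
        page = get_page(max_id, min(page_size, max_tweets - len(tweets)))
        if not page:
            break  # timeline exhausted
        tweets.extend(page)
        # Next page must end strictly before the oldest tweet seen so far.
        max_id = page[-1]["id"] - 1
    return tweets


if __name__ == "__main__":
    # Stand-in for the API: 1,000 tweets with IDs 1000 down to 1.
    timeline = [{"id": i} for i in range(1000, 0, -1)]

    def fake_page(max_id, count):
        pool = [t for t in timeline if max_id is None or t["id"] <= max_id]
        return pool[:count]

    print(len(fetch_timeline(fake_page, max_tweets=450)))
```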
Run MemGator locally
$ memgator --contimeout=10s --agent=XXXXXX server
MemGator 1.0-rc7
[MemGator ASCII-art banner]
TimeMap  : http://localhost:1208/timemap/{FORMAT}/{URI-R}
TimeGate : http://localhost:1208/timegate/{URI-R} [Accept-Datetime]
Memento  : http://localhost:1208/memento[/{FORMAT}|proxy]/{DATETIME}/{URI-R}
# FORMAT => link|json|cdxj
# DATETIME => YYYY[MM[DD[hh[mm[ss]]]]]
# Accept-Datetime => Header in RFC1123 format
Source: https://github.com/oduwsdl/MemGator
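The endpoint templates in the MemGator banner can be filled in programmatically. The following is a sketch under assumptions: the helper names are invented for illustration, the base address is the local default shown in the banner, and the caller is expected to pass a timezone-aware datetime. It builds a TimeMap URL, a TimeGate URL with an RFC 1123 `Accept-Datetime` header, and a Memento URL with the 14-digit `DATETIME` form.

```python
from datetime import datetime, timezone
from email.utils import format_datetime

BASE = "http://localhost:1208"  # local MemGator address from the banner


def timemap_url(uri_r, fmt="link"):
    """Build a TimeMap URL; fmt must be one of link, json, cdxj."""
    if fmt not in ("link", "json", "cdxj"):
        raise ValueError("unsupported TimeMap format: " + fmt)
    return f"{BASE}/timemap/{fmt}/{uri_r}"


def timegate_request(uri_r, when):
    """Return (url, headers) for a TimeGate lookup near `when`.

    `when` must be timezone-aware; it is converted to UTC and rendered
    in RFC 1123 form for the Accept-Datetime header.
    """
    stamp = format_datetime(when.astimezone(timezone.utc), usegmt=True)
    return f"{BASE}/timegate/{uri_r}", {"Accept-Datetime": stamp}


def memento_url(uri_r, when):
    """Build a Memento URL using the 14-digit YYYYMMDDhhmmss form."""
    stamp = when.astimezone(timezone.utc).strftime("%Y%m%d%H%M%S")
    return f"{BASE}/memento/{stamp}/{uri_r}"


if __name__ == "__main__":
    when = datetime(2002, 10, 16, 10, 13, 37, tzinfo=timezone.utc)
    print(timemap_url("http://twitter.com/BreitbartNews", "cdxj"))
    print(timegate_request("http://example.org/index.html", when))
    print(memento_url("http://example.org/index.html", when))
```

The returned URL and headers can be passed to any HTTP client, for example `requests.get(url, headers=headers)`.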
TimeGate
RFC: https://tools.ietf.org/html/rfc7089
Source: http://mementoweb.org/guide/quick-intro/
TimeMap
msiddique@atria:~$ curl -i https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html
HTTP/1.1 200 OK
Content-Type: application/link-format
Date: Wed, 06 Feb 2019 18:46:16 GMT
X-Generator: MemGator:1.0-rc7
X-Memento-Count: 93
Transfer-Encoding: chunked

<http://example.org/index.html>; rel="original",
<https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html>; rel="self"; type="application/link-format",
<http://web.archive.org/web/20021016101337/http://www.example.org:80/index.html>; rel="first memento"; datetime="Wed, 16 Oct 2002 10:13:37 GMT",
<http://web.archive.org/web/20031207031049/http://www.example.org:80/index.html>; rel="memento"; datetime="Sun, 07 Dec 2003 03:10:49 GMT",
….Deletia…
<http://web.archive.org/web/20190131051056/http://www.example.org/index.html>; rel="last memento"; datetime="Thu, 31 Jan 2019 05:10:56 GMT",
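A link-format TimeMap like the one above can be reduced to a list of memento URIs with a small parser. This is a minimal sketch that handles the shape of the response shown on this slide, not a full implementation of the link format. The function name `parse_timemap` is an assumption for illustration. Entries are matched by their angle brackets rather than split on commas, because the `datetime` values themselves contain commas.

```python
import re

# One entry: <URI> followed by zero or more ; key="value" parameters.
LINK_RE = re.compile(r'<([^>]+)>((?:\s*;\s*[\w-]+="[^"]*")*)')
PARAM_RE = re.compile(r'([\w-]+)="([^"]*)"')


def parse_timemap(text):
    """Return (datetime, uri) pairs for every memento entry in the text."""
    mementos = []
    for uri, params in LINK_RE.findall(text):
        attrs = dict(PARAM_RE.findall(params))
        # rel may be "memento", "first memento", or "last memento".
        if "memento" in attrs.get("rel", "").split():
            mementos.append((attrs.get("datetime"), uri))
    return mementos


if __name__ == "__main__":
    sample = (
        '<http://example.org/index.html>; rel="original",\n'
        '<http://web.archive.org/web/20021016101337/'
        'http://www.example.org:80/index.html>;\n'
        'rel="first memento"; datetime="Wed, 16 Oct 2002 10:13:37 GMT",\n'
    )
    for when, uri in parse_timemap(sample):
        print(when, uri)
```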
Play with TimeMap and TimeGate
Source: http://memgator.cs.odu.edu/api.html
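Code to fetch TimeMap for any Twitter handle
import requests

url = "http://localhost:1208/timemap/"
data_format = "cdxj"
screen_name = "BreitbartNews"  # example handle
# TimeMap URL pattern: <base>/timemap/<format>/<URI-R>
command = url + data_format + "/http://twitter.com/" + screen_name
response = requests.get(command)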
Code to parse tweet-related information
import bs4

# memento_html_path is a placeholder for the saved HTML representation of a memento.
with open(memento_html_path) as f:
    soup = bs4.BeautifulSoup(f, "html.parser")
match_tweet_div_tag = soup.select('div.js-stream-tweet')
for tag in match_tweet_div_tag:
    if tag.has_attr("data-tweet-id"):
        # Get Tweet id
        ...
        # Parse tweets
        match_timeline_tweets = tag.select('p.js-tweet-text.tweet-text')
        ...
        # Parse tweet timestamps
        match_tweet_timestamp = tag.find("span", {"class": "js-short-timestamp"})
        ...
Analysis of Breitbart News Deleted Tweets
• Of the 22 deleted tweets, 20 were retweets by Breitbart News whose original tweets were lost.
• Of those 20 tweets, 18 were from two affiliates of Breitbart News, @NolteNC and @carney. We therefore examined both accounts to determine why their tweets were deleted.
Unretweeted tweet by Breitbart News
Source: https://web.archive.org/web/20180111162729/https:/twitter.com/BreitbartNews
Original Live Tweet
Source: https://twitter.com/LibertarianBlue/status/951465189161029632
Breitbart News retweets another tweet
Source: https://twitter.com/LibertarianBlue/status/951578248986710016
Analysis of @carney and @NolteNC
• Mementos fetched between Nov 3, 2017 and Feb 17, 2018
• Low number of mementos for @carney
• @NolteNC had 169 live tweets and 3,569 deleted tweets
• Fetched live tweets for both accounts using the Twitter API for over two weeks
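Restricting mementos to a date window such as Nov 3, 2017 to Feb 17, 2018 can be done from the 14-digit capture timestamp embedded in Wayback-style memento URLs. The sketch below assumes URLs of the form `.../web/<YYYYMMDDhhmmss>/<URI-R>`, as in the sources cited throughout these slides. The function names are hypothetical, and URLs without a plain 14-digit timestamp (for example calendar pages using `*`) are skipped.

```python
import re
from datetime import datetime

# Capture timestamp in a Wayback-style URI-M: /web/YYYYMMDDhhmmss/
URIM_RE = re.compile(r"/web/(\d{14})/")


def capture_time(urim):
    """Return the capture datetime encoded in a memento URL, or None."""
    match = URIM_RE.search(urim)
    if match is None:
        return None
    return datetime.strptime(match.group(1), "%Y%m%d%H%M%S")


def in_window(urims, start, end):
    """Keep memento URLs captured between start and end, inclusive."""
    kept = []
    for urim in urims:
        when = capture_time(urim)
        if when is not None and start <= when <= end:
            kept.append(urim)
    return kept


if __name__ == "__main__":
    urims = [
        "https://web.archive.org/web/20180111162729/https:/twitter.com/BreitbartNews",
        "https://web.archive.org/web/20180929210711/https:/twitter.com/RepDannyDavis",
    ]
    print(in_window(urims, datetime(2017, 11, 3), datetime(2018, 2, 17, 23, 59, 59)))
```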
Tweets older than a week on Tuesday and Saturday are deleted
Tweets older than a week on Wednesday and Saturday are deleted
Deletion Behavior
• With thousands of deleted tweets, it seemed unlikely that the tweets were being deleted manually.
• We have every reason to believe that @carney and @NolteNC deleted tweets automatically using some tweet deletion service.
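One way to surface a weekly deletion pattern like this is to compare successive snapshots of an account's live tweet IDs and tally disappearances by weekday. This is a sketch under assumptions, not the analysis code behind the slides: the snapshot structure and the function name `deletion_weekdays` are invented for illustration, and a tweet that vanishes between two snapshots is attributed to the day of the later snapshot.

```python
from collections import Counter
from datetime import date

DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]


def deletion_weekdays(snapshots):
    """Count disappeared tweets per weekday.

    snapshots: dict mapping a date to the set of tweet IDs that were
    live on that day (hypothetical structure, e.g. one API fetch per day).
    """
    counts = Counter()
    days = sorted(snapshots)
    for previous, current in zip(days, days[1:]):
        gone = snapshots[previous] - snapshots[current]
        if gone:
            counts[DAY_NAMES[current.weekday()]] += len(gone)
    return counts


if __name__ == "__main__":
    snapshots = {
        date(2018, 2, 5): {1, 2, 3, 4},
        date(2018, 2, 6): {3, 4, 5},
        date(2018, 2, 7): {3, 4, 5},
        date(2018, 2, 10): {5},
    }
    print(deletion_weekdays(snapshots))
```

Deletions that cluster on the same weekdays across several weeks are consistent with a scheduled deletion service rather than manual cleanup.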
Take Away
• It is not enough to take screenshots of controversial tweets; we need to push them to web archives, which can retain them longer than our personal archives.
• For finding deleted tweets, web archives work well for popular accounts, but the approach might not work for less popular accounts.
• For finding deleted tweets, the top-level account page works better than individual tweet ID URLs.
• Most deletions for Breitbart News come from automatic deletion of tweets by some of its correspondents.
You can read more on the blog
http://ws-dl.blogspot.com/2018/04/2018-04-23-grampa-whats-deleted-tweet.html

