2. Why scraping
How to spot
opportunities for scraping
Tools and traits: what can
be scraped, and how
Why and how
3. Automating the repetitive
gathering of data, e.g.
Data from same page every
day (e.g. social media)âš
Data from multiple pagesâš
Multiple documents:
spreadsheets, PDFs
What is scraping?
4. Why is a government
website carrying fake
jobs?
Aron Pilhofer, News Rewired
25. Data outside scope of FOI
Repetitive work, e.g. 1000s
of documents
Information that changes
regularly
Sources that follow a pattern
Stories that suit scraping
26. Data that can be obtained
through asking, FOI, API etc.
Multiple sites with different
CMS
Small amounts of data/
repetition
Stories that donât.