3. moving to linked data
• moving from static HTML to dynamic,
responsive site
• introducing linked data to power content
aggregations around related topics
• starting to embed linked open data in every
page as RDFa
• using the IPTC rNews vocabulary to
describe contnet in a machine-readable way
4. impact on journalists
• annotating (“tagging”) content
with topics
• tool embedded into existing
CMS
• concept extraction/NLP for
topic suggestion
• journalists accept/reject
suggested topics for
annotation
6. learning from the pilot
• generally - it works
• but duplication for
big events
• also need pinning
• concept extraction
poor
• journalists gaming
the system
12. next steps
• rolling out tagging to journalists throughout
BBC News
• making better use of rNews/RDFa - full
mark-up integration
• piloting the use of storyline in data-driven
news
UK's most popular news website - 6 million unique browsers every day (3rd biggest site in the UK after Google and Facebook) publish around 500 articles every day - local, national global publish in 27 languages as World Service (+ 2 UK languages alongside English) hundreds of journalists, many working cross-media (TV/radio/online)
articles created in a home-grown Content Management System flat page publishing via FTP - good for high load events but limits our UX and data potential
- need to minimise impact on journalists - integration with existing tools and workflow as much as possible
pilot - can we automate the production of the local news region sub-index pages? (currently manual task to maintain these pages) GET articles about or mentioning places that fall within the BBC News region
- a simple ontology for people, organisations, places and intangibles (themes) and their intersection with events - based on rNews, the Event ontology and PA ’ s SNaP Stuff ontology - annotate articles with events, where the event:place is Birmingham etc.
- IPTC rNews terms in RDFa - basic publishing metadata in the <head> for rich snippets - linked open data in the body
- immediate results - rich snippets for articles - apparently better ranking by topic (anecdotal)
- we introduced the change in the first week of May - by the end of may we were seeing some positive press coverage, people were noticing