The document discusses the National Innovation Centre (NIC) in the UK, which aims to speed up innovation in healthcare technology for the NHS and UK economy. The NIC uses linked data, ontologies, and semantic web technologies like RDFa to create a platform for publishing and connecting health innovation data from various sources. This platform includes tools for visualizing and searching the data to help identify opportunities for collaboration and development.
2. About the National Health
Service
Established in 1948
Comprehensive, universal, free at point of delivery
World’s largest health service
Covers entire population of 51 million
Serves, on average,1 million people every 36 hours
World’s 4th largest employer
1.7m people employed, including:
120,000 hospital doctors
40,000 general practitioners (GPs)
400,000 nurses
25,000 ambulance staff
55,000 scientists
2
3. UK Research Environment
UK Life Science Industry
Employs over 120,000 people
Invests more than £4.6 billion in R&D
Global turnover more than £31.4 billion
World University Rankings
1: Harvard University
2: University of Cambridge
3: Yale University
4: University College London
5=: Imperial College London
5=: University of Oxford
3
4. Who we are
Funded by Department of Health
Response to Industry’s complaint to Government that
NHS was unresponsive to innovation
In current economic climate, Government, NHS and
Industry all see innovation as a key driver to:
improve NHS
productivity
quality
support UK plc
wealth and job creation
inward investment and export trade
4
5. What we do
We aim to speed-up the development and delivery of
technological innovations likely to benefit the NHS and
UK plc
We focus on:
Pre-commercial innovation (e.g., SBRI, Collaborative, Classic)
Linked Data
5
10. Competition Tool
NIC Define Design Develop Demonstrate Distribute
SBRI
AUCTIONS
Prizes
Competition
Std Tenders
Tool Grants
Negotiated
Collaborations
procedures
Need 1 Need 2 Need 3 Need 4 Need 5
18. Linked Data Platform
Ontology
Linked data
Triple-store
Applications
Architecture
14
19. Platform: Ontology
Open Health Innovation Ontology (OHIO) Developed by
the NIC
Spans the entire innovation pipeline
Models concepts and relationships within the domain of
innovation
Builds on existing ontologies, such as FOAF and DC
Extremely modular
TDD approach (using Pellet)
Comprises both OWL (Web Ontology Language) and
SKOS (Simple Knowledge Organization System)
15
20. Platform: Linked data
Sources listed on data.gov.uk
Unstructured, such as PubMed
Semi-structured, such as Connecting for Health CSV files
Structured, using RDFa:
Partner with others, such as the Chief Scientist's Office...
...and NHS Networks
Some data is entered manually, and augmented (e.g.,
with geocoding and entity extraction)
16
28. Platform: Triple-store
Key part of the architecture
Driven by the OHIO ontology
Publicly available via SPARQL end-point
22
29. Platform: Applications
Many ways to visualise and understand the data, so no
'one size fits all'
Open source community offers plethora of tools to help:
MIT's Exhibit
Google Visualizations API
So build a widget platform:
Find and link to data
Create widgets to visualise the data
Embed those in your own site
23
32. Platform: Applications
Many ways to visualise and understand the data
Open Source community offers plethora of tools to
exploit, such as:
MIT's Exhibit
Google Visualizations API
Widget platform:
Find and upload data
Create widgets to visualise the data
Embedded in their own sites
26
33. Platform: Architecture
Search is provided by Solr
Nutch does the crawling
Triple-store is 4Store
Management of the platform is via Drupal
Drupal also provides the user interface layer:
Views
Google Visualizations API
Exhibit
Faceted browsing (from Solr)
Embedding widgets
User registration
Ratings
Comments
Forums......could go on and on! 27
34. Well-known CMS with strong community;
good foundation for pretty much anything;
increasing number of semantic modules:
Open Calais support;
RDF import;
import results of SPARQL queries;
RDF to Apache Solr mapping;
RDFa support in core in Drupal 7;
still on Drupal 6 though (many unported modules);
manages our system, and provides end-user UI.
28
35. part of Lucene project at Apache;
full-text search;
'more like this';
'did you mean?';
hit highlighting;
faceted search;
rich document (e.g., Word, PDF) handling;
...and well integrated with Drupal.
29
36. part of Lucene project at Apache;
crawler (managing links and schedules);
parsers for HTML and other document formats (e.g.,
Word, PDF);
a Drupal module is available.
30
47. Next steps
Release OHIO as open source ontology
Launch dataShuttle beta
Establish more RDFa publishing partners
Crawl more specialist sites
NLP on some sites (probably with GATE or Mahout)
Take platform open source so that others can get up and
running quickly
38
48. Links
Dr. Michael Wilkinson:
michael.wilkinson@institute.nhs.uk
http://www.nic.nhs.uk/
Mark Birbeck
mark.birbeck@webBackplane.com
http://webBackplane.com/mark.birbeck
Tools
Exhibit: http://www.simile-widgets.org/exhibit/
GViz: http://code.google.com/apis/charttools/index.html
Solr: http://lucene.apache.org/solr/
Nutch: http://nutch.apache.org/
4Store: http://4store.org/
Drupal: http://drupal.org/
39