SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Downloaden Sie, um offline zu lesen
US Government
   Linked Data
          Bernadette Hyland, CEO
 co-chair W3C Government Linked Data WG
        bhyland@3roundstones.com
               @BernHyland

NARA II - College Park MD   07 February 2013

                                               1
Agenda

• Intros ...
• Trends in data management
• Government data publication
 • Update on new Linked Data Services

                                        2
3 Round Stones produces the leading platform for
the publication of data on the Web. Our
commercially supported Open Source platform is
used by the Fortune 2000 and US Government
agencies to collect, publish and reuse data, both on
the public Internet and behind institutional firewalls.




                                                         3
Our Partners




                                Callimachus



                                                                                            4

Our partners ...
Our customers - 50% US Gov’t and 50% private sector, focused on pharma & health delivery,
and business publishing.
5

Headlines and agency memos about government transparency with open data and various government Web
sites.
... innovation challenges based on open government data

... High energy datapalooza’s are emerging with awards ranging from a couple thousand to $100k+. These
challenges open the doors to innovation for better healthcare solutions and more efficient use of energy, to
name but a few. They all require access to and re-use of HIGH QUALITY DATA.

In 2012, we read many headlines about big data and world’s search engines and social media sites.
6
7

Who is sharing their data as Linked Data? Small and large commercial and government organizations, NGOs,
Non-profits ... plus many universities.
Governments in the last few years have been responding to Open Government initiatives that mandate publishing
open government data.
Some are careful, slow-moving entities who simply needed to find real solutions to real problems.
Governments
Goals: Governmental transparency and/or improved
       internal efficiencies (data warehouses)




                                                   8
Photo credit: http://www.flickr.com/photos/glennharper/4452247708/
                                                                                               9

                                                                                                                 9

However, while there is lots of gold to be mined from public data, it is an uncomfortable time for Government
IT and business managers who are tasked with data management programs.

Most people are having a difficult time keeping up. If you feel like you are hanging on while the world changes
too fast, you are not alone.

Photo credit: http://www.flickr.com/photos/glennharper/4452247708/
10

Linked data is used extensively by the government seen to be the global leader in data
transparency -- the UK Government. This is their home page.
Big Data
                                                               Simple data
                                                               Complex data
                                                               Legacy data




                                                                                                                    11

KEY POINT: Search, discovery and data access approaches have evolved over the last decade and techniques
are beginning to come together. GoPubMed was launched in 2002 as the first semantic search portal. Later,
Microsoft’s Bing, Google’s Knowledge Graph are two of the other well known search engines employing
semantic techniques.

Big data research has grown to include the MapReduce algorithm for handling really large data sets, often
measured in terabytes or greater. This is the kind of data that people at the Large Hadron Collider at CERN
are working on to provide insights into how the universe works, including the recent discovery of the Higgs
Boson, the particle that gives mass to matter.

Under the big top tent of semantic search we’re dealing with different types of content, big, public, complex and
legacy data. Simple, complex and legacy data comes in small, medium and large sizes.

Many government agencies by contrast have lots of small to medium data sets in structured databases. These
databases (and the systems that depend upon them) are not going away however fewer new data warehouse
projects are likely to be started. Data warehouses are widely recognized to be costly to create and maintain,
and change SLOWLY.

The biggest win for governments worldwide who adopt a Web architecture for data publishing is combining data
sets to discover new or previously uncontemplated relationships.
“Big Data Is Important, but
         Open Data Is More Valuable”
        As change agents, enterprise architects can help
           their organizations become richer through
                  strategies such as open data.
                                                David Newman, VP Research, Gartner




                                                                                                                   12

Open data refers to the idea that certain data should be freely available to everyone to use and republish as they
wish, without restrictions from copyright, patents or other forms of control.

The term “open data” has gained popularity with open data initiatives including data.gov.uk, data.gov and other
government data catalog sites.

Enterprise architects are playing an important role in fostering information-sharing practices. Access to, and use
of, open data will be particularly critical for a business that operate using the Web; organizations should focus on
using open data to enhance business practices that generate growth and innovation.
13

A sound government information management strategy requires providing CONTEXT and CONFIDENCE to
those accessing and potentially re-using your data.

Giving people have timely access to information, for disaster preparedness, scientific research, policy and
research, the network effect of people helping people is our greatest hope.

On the heels of the recent East Coast hurricane that devastated parts of New York and New Jersey, government
executives suggested that fear of cyber-doom scenarios may be taking too much of our thinking & planning.
According to Secretary Panetta, it may be driving us to unrealistic and potentially dangerous responses to threats
that don’t exist.

The reality is that when disaster strike, people come together and help one another. We don’t see paralysis,
panic and social collapse.

During today’s session, I’ll describe how several agencies and private sector organizations are using Web
technologies and semantics to improve information access and discovery. Simply put, semantic technologies
provide CONTEXT.
Open Government Data




                       14
Growing chorus ...
        “We’re moving from managing
        documents to managing discrete pieces of
        open data and content which can be
        tagged, shared, secured, mashed up and
        presented in the way that is most useful
        for the consumer of that information.”
                        -- Report on Digital Government: Building a 21st Century Platform to
                                                          Better Serve the American People




                                                                                                              15

The Digital Government Strategy sets out to accomplish three things: Access to high quality digital information
& services; procure and manage devices, applications, and data in smart, secure and affordable ways; and unlock
the power of government data to spur innovation.

Governments around the world are defining detailed digital services plans based on open data, open APIs and
open source data platforms. They are defining how governments are publishing data with an eye towards
improving access and re-use. Administrators and program managers are committing to delivery of digital services
using semantic technologies broadly, and Linked Data specifically.
Open data + open standards +
          open platforms
        Highly scalable computing &
        hosting via the       Cloud
        International Data Exchange
        Standards
        5 Star Data (Linked Data)
        Open Source tools
                                                                                                                16

A Web-oriented approach to information sharing has impacted how scientists, researchers, regulators and the
public interacts with government.

Linked data lowers the barriers to re-use and interoperability among multiple, distributed and heterogeneous
data sources.

Access to high-quality Linked Open Data via the Web means millions of researchers and developers will be able
to shorten the time-consuming research process involving data cleansing and modeling.
17

How do we get a loose coupling of shared data over Web architectures? By using the structured data model for
the Web: RDF.

There is a project to create freely available data on the Web in this way, which is known as the Linked Open
Data project.

W3C sees Linked Data as the set of best practices and technologies to support worldwide data access,
integration and creative re-use of authoritative data.
18

September 2011: 295 datasets that meet the LOD Cloud criteria, consisting of over 31 billion
RDF triples and are interlinked by around 504 million links.
Callimachus
                     http://callimachusproject.org
                       http://3roundstones.com



                                                                                             19

Callimachus is that platform. It is available via 3roundstones.com or its Open Source site
callimachusproject.org.
CONTENT                                   LINKED DATA
   MANAGEMENT                                  MANAGEMENT
      SYSTEM                                      SYSTEM


                        DATA




                                                                       TEXT
                      UNSTRUCTURED




                               Callimachus

                                                                     STRUCTURED
                                                                        DATA
                          TEXT




                                                                                               20

Callimachus may be compared to a distributed CMS. CMS’s manage mostly unstructured
information. Callimachus, by contrast to a CMS, manages primarily structured Linked Data. We
call this a Linked Data management system.
Data driven Web apps using Callimachus
  US Legislation +
  enterprise data




                                                                          Clinical Trials +
       DBpedia +                                                         enterprise linked
   enterprise datasets                                                          data




                                                                                        21

                                                                                              21

Callimachus integrates (very) well with other enterprise systems as well as Web content. It
can form an entire application or part of one.
NB: Mention Documentum, Oracle via HTTP
22

•   US HHS committed to making a vast array of open data more readily available to improve health care delivery
    & reduce costs in 2013 and beyond.

•   In 2012, Sentara created a Web application that integrates authoritative data from 5 different sources including
    content from NLM, NOAA, EPA and DBpedia

•   This application utilizes open data, open standards and an open source data platform
User




          US EPA                 US EPA
 NOAA
          AirNow                SunWise




                    National
DBpedia            Library of
                   Medicine



                                          23
US EPA Linked Data
• Cloud-based Linked Data provision of 3 core
programs:

 • 2.9M Facilities
 • 100K substances
 • 25 years of toxic pollution reports
• FISMA compliant
• 16 Callimachus templates
• Official launch March 2013
                                                24
25

Envirofacts, EPA’s older system.
26

EPA’s new Linked Data system. Cooperation without coordination. Data reuse breaks the back of API gridlock.
Clay Shirky stole that from me :)
27

This data is exactly the same data used to create the interface. Unlike traditional database-driven applications,
the data is immediately accessible for reuse by third parties. This prevents data duplication, allows for tracking of
provenance and avoids reinventing the wheel.
We’ve Seen This Before




                                                                                          28

Like HTML and RDF, credit cards have a human-readable side and a machine-readable side.
Linked Data management system
                                                                                located at a Tier 1 Cloud Provider
                                                                                       (FISMA compliant)

               RDF Database


                                 Resource URIs     REST API          SPARQL endpoint




      Public


                              Web Browser




                                                          Application, Script or automated client




                                                                      Registered developer



                                                                                                                     29

Introduce Callimachus, an open source, open data platform based on open standards.
3 Round Stones provides commercial support for Callimachus and is a major contributor to the OS project.

Users of Callimachus see a generated Web interface, but can also directly access the data via REST or SPARQL.

SPARQL Named Queries (like stored procedures) allow for automated conversion to different formats for reuse in
non-RDF environments.
From EPA
                              From Wikipedia




                               Open Street Map

                                                               30

Data may be easily combined from several sources.
US GPO
• Cloud-based Linked Data provision of persistent
URLs for US Government documents:

 • 33K documents
 • Used by 1,240 Federal Depository Libraries and
 public

• In 3rd year of operation

• Deemed an    Essential service supporting US
Congress


                                                    31
Real World
                 Linked Data

                                                                 32

Now let’s look at the same workflow in the Linked Data Service.
Finding Hanson Permanente




                                                                                                  33

By keeping the application simple - and letting the results be viewed either as a table or a
map - the user can adjust their search as they see fit without extra navigation. Also, by
having the data in a table that can searched or sorted however the user sees fit, finding a
specific facility is as easy as typing the name in or sorting on relevant criteria. This is made
possible by exposing the data, rather than containing it in a standard HTML table.

I fully recognize that Envirofacts could offer identical functionality by tweaking their
application, but the key underlying point is that this application was created very cheaply and
quickly *because* the data is modeled as Linked Data. When the developing environment is a
Web Browser, and the data is described and Linked, an application can be a simple XHMTL
page with JavaScript, instead of a heavy-weight dedicated application.
Finding Mercury Released in 2004
                                                        1




                                                                     2




                                                                                                 34

There are two very important things to note on this page. 1 is that on any facility’s page,
there is always an option to download the data. This data is available in two formats (RDF/
XML and Turtle). With the click of a button a user can have all of the data that was used to
drive the creation of the current page, which means he or she can repurpose that data into
any new application. Note here that this download is not an extract, summary, or recreation
of the data - it is literally the *same* data that was used to drive that page.

2 is that because this page is “data-driven”, navigation relies on exploring the data, not the
system that contains it. On the same page where we get information like it’s latitude and
longitude, we can also find a link to a report detailing exactly how much mercury was
released in 2004. We could easily do an in-page search for 2004 or Mercury to identify the
releases associated with those terms.
TRI Report




                                                                                             35

Rather than aggregating the data for presentation, the actual report is presented with the raw
data continuously available in the top right of the page.

A subtle difference to be pointed out here is the difference in the name of the facility.
Previously it was identified as Hanson Permanente, but now it is known as Lehigh Southwest
Cement Co. During the modeling phase, the Linked Data was created to implicitly include this
relationship (which is known via the mapping of EPA FRS identifiers). On the other hand,
pulling down the CSV files would not give the user any obvious way of understanding this
relationship.
Data Reuse




                                                                                             36

Developers can grab the data off any page, at any time during navigation. The site facilitates
the reuse of data. These graphs are not natively embedded in the webpage of a given facility.
Rather, by downloading the data the user can quickly and easily make new and different
visualizations for a report or presentation in 10 minutes.

For example, this history of air stack pollution reports was made with a single parameterized
SPARQL query and a single JavaScript pattern. This could very easily be applied to any number
of facilities, changed to a bar graph, or altered in any number of other ways with very little
effort thanks to the fact it was modeled using Linked Data.
Potential Audience
✔
• Middle school student doing a science project

✔
• Concerned citizen worried about local pollution

✔Environmental Science PhD from EPA
•

✔
• Doctor from NIH writing a research paper




                                                                                                 37

Linked Data allowed us to reach all the members of our potential audience by giving the user
options, aggregating based on relevance rather than data source, and by exposing the data
that drives the service for reuse.

The middle school student or concerned citizen that want to know the location of a facility,
the amount of a particular chemical it released, and the year it was released in never have to
click any of the options in the Linked Data box. They can simply use the interface, explore
the data, and find what they need in a read-only experience.

The Environmental Science PhD is still able to find what he is looking for with Linked Data but
can do so in a much more intuitive way. The doctor from NIH is now able to find the data
they’re interested in and if they choose to take the next step, download the actual data
behind the page. By quickly and easily obtaining the raw data, anyone from scientists to
journalists can generate their own applications without any knowledge of the Linked Data
Service itself.
http://www.manning.com/dwood/




http://3roundstones.com/linking-government-data/




http://3roundstones.com/linking-enterprise-data/


                                                   38
39
The mission of the Government Linked
          Data (GLD) Working Group is to
          provide standards and other information
          which help governments around the
          world publish their data as effective and
          usable Linked Data using Semantic Web
          technologies.




                                                                                     40

We are 16 months into the Government Linked Data Working group’s two year charter.
Credits


                           Gartner: “Innovation Insight: Linked Data Drives Innovation Through Information-
      David Newman
                           Sharing Network Effects” Published: 15 December 2011

                           Linking Government Data, Springer (2011)
      David Wood, ed.
                           http://3roundstones.com/linking-government-data/

                           Digital Government Strategy: Building a 21st Century Platform to Better Serve the
                           American People,
    US Executive Branch
                           http://www.whitehouse.gov/sites/default/files/omb/egov/digital-government/digital-
                           government.html


W3C Linked Data Cookbook http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook




All other photos and images © 2010-2013 3 Round Stones, Inc. and released under a CC-by-sa license




                                                                                                               41
This work is Copyright © 2011-2012 3 Round Stones Inc.
                  It is licensed under the Creative Commons Attribution 3.0 Unported License
                  Full details at: http://creativecommons.org/licenses/by/3.0/

                  You are free:

                          to Share — to copy, distribute and transmit the work



                          to Remix — to adapt the work



                  Under the following conditions:
                          Attribution. You must attribute the work in the manner specified by the
                          author or licensor (but not in any way that suggests that they endorse
                          you or your use of the work).

                          Share Alike. If you alter, transform, or build upon this work, you may
                          distribute the resulting work only under the same or similar license to this
                          one.




                                                                                                         42

This presentation is licensed under a Creative Commons BY-SA license, allowing you to share
and remix its contents as long as you give us attribution and share alike.

Weitere ähnliche Inhalte

Was ist angesagt?

The Challenge - and Opportunity - of 'Big Data'
The Challenge - and Opportunity - of 'Big Data'The Challenge - and Opportunity - of 'Big Data'
The Challenge - and Opportunity - of 'Big Data'
Serge Milman
 
Big dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondBig dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyond
Patrick Bouillaud
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
EMC
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
parry prabhu
 

Was ist angesagt? (19)

Transparency Plus!
Transparency Plus!Transparency Plus!
Transparency Plus!
 
Whitepaper - The need self service data tools, not scientists
Whitepaper - The need  self service data tools, not scientistsWhitepaper - The need  self service data tools, not scientists
Whitepaper - The need self service data tools, not scientists
 
Future of Privacy - The Emerging View 11 06 15
Future of Privacy - The Emerging View 11 06 15 Future of Privacy - The Emerging View 11 06 15
Future of Privacy - The Emerging View 11 06 15
 
Big data for the next generation of event companies
Big data for the next generation of event companiesBig data for the next generation of event companies
Big data for the next generation of event companies
 
Big data's impact on online marketing
Big data's impact on online marketingBig data's impact on online marketing
Big data's impact on online marketing
 
Smart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart dataSmart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart data
 
The Challenge - and Opportunity - of 'Big Data'
The Challenge - and Opportunity - of 'Big Data'The Challenge - and Opportunity - of 'Big Data'
The Challenge - and Opportunity - of 'Big Data'
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
 
Big dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondBig dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyond
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
 
Modern data integration | Diyotta
Modern data integration | Diyotta Modern data integration | Diyotta
Modern data integration | Diyotta
 
Documaster – The true value of documents
Documaster – The true value of documentsDocumaster – The true value of documents
Documaster – The true value of documents
 
The state of the Big Data market
The state of the Big Data marketThe state of the Big Data market
The state of the Big Data market
 
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
 
Big data basics
Big data basicsBig data basics
Big data basics
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
Enabling Big Data with Data-Level Security:The Cloud Analytics Reference Arch...
Enabling Big Data with Data-Level Security:The Cloud Analytics Reference Arch...Enabling Big Data with Data-Level Security:The Cloud Analytics Reference Arch...
Enabling Big Data with Data-Level Security:The Cloud Analytics Reference Arch...
 
Exploiting Data for the Good of All
Exploiting Data for the Good of AllExploiting Data for the Good of All
Exploiting Data for the Good of All
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
 

Andere mochten auch

Linked Data and the Future of Publishing
Linked Data and the Future of PublishingLinked Data and the Future of Publishing
Linked Data and the Future of Publishing
3 Round Stones
 
Altos montes
Altos montesAltos montes
Altos montes
ibr-bh
 

Andere mochten auch (17)

Vespertino
VespertinoVespertino
Vespertino
 
Form part1
Form part1Form part1
Form part1
 
Glory to god forever
Glory to god foreverGlory to god forever
Glory to god forever
 
EPA OEI Linked Data Process
EPA OEI Linked Data ProcessEPA OEI Linked Data Process
EPA OEI Linked Data Process
 
Shiv sales-corporation
Shiv sales-corporationShiv sales-corporation
Shiv sales-corporation
 
Linked Data: The Jargon-free Primer on Integrating Data on the Web
Linked Data: The Jargon-free Primer on Integrating Data on the WebLinked Data: The Jargon-free Primer on Integrating Data on the Web
Linked Data: The Jargon-free Primer on Integrating Data on the Web
 
Debe de ser el imc un criterio para la cirugía metabólica
Debe de ser el imc un criterio para la cirugía metabólicaDebe de ser el imc un criterio para la cirugía metabólica
Debe de ser el imc un criterio para la cirugía metabólica
 
Callimachus introduction 20111021
Callimachus introduction 20111021Callimachus introduction 20111021
Callimachus introduction 20111021
 
Linked Data and the Future of Publishing
Linked Data and the Future of PublishingLinked Data and the Future of Publishing
Linked Data and the Future of Publishing
 
Samlerhuset diverse mynter
Samlerhuset diverse mynterSamlerhuset diverse mynter
Samlerhuset diverse mynter
 
Altos montes
Altos montesAltos montes
Altos montes
 
Matutino
MatutinoMatutino
Matutino
 
Sharing Data on the Web
Sharing Data on the WebSharing Data on the Web
Sharing Data on the Web
 
Why Your Next Product Should be Semantic by Dr. David Wood
Why Your Next Product Should be Semantic by Dr. David WoodWhy Your Next Product Should be Semantic by Dr. David Wood
Why Your Next Product Should be Semantic by Dr. David Wood
 
Praise your name
Praise your namePraise your name
Praise your name
 
Role of Linked Data for Scholarly Publishers
Role of Linked Data for Scholarly PublishersRole of Linked Data for Scholarly Publishers
Role of Linked Data for Scholarly Publishers
 
Words
WordsWords
Words
 

Ähnlich wie US National Archives & Open Government Data

W3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked DataW3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked Data
3 Round Stones
 
Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea
Haklae Kim
 
Module 10 Open Government and Data
Module 10 Open Government and DataModule 10 Open Government and Data
Module 10 Open Government and Data
IPAC-IAPC
 
Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do
Haklae Kim
 
An intro to linked and open local gov data
An intro to linked and open local gov dataAn intro to linked and open local gov data
An intro to linked and open local gov data
Ingrid Koehler
 
Big Data - CRM's Promise Land
Big Data - CRM's Promise LandBig Data - CRM's Promise Land
Big Data - CRM's Promise Land
Danny Camprubi Douglas
 
141900791 big-data
141900791 big-data141900791 big-data
141900791 big-data
glittaz
 
Unleashing government’s ‘innovation mojo’ an interview with the us chief tec...
Unleashing government’s ‘innovation mojo’  an interview with the us chief tec...Unleashing government’s ‘innovation mojo’  an interview with the us chief tec...
Unleashing government’s ‘innovation mojo’ an interview with the us chief tec...
Mondher Ben-Hamida
 
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Katie Whipkey
 
UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdf
vvpadhu
 

Ähnlich wie US National Archives & Open Government Data (20)

W3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked DataW3C TPAC 2012 Breakout Session on Government Linked Data
W3C TPAC 2012 Breakout Session on Government Linked Data
 
Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea Open Government Data, Linked Data, and the Missing Blocks in Korea
Open Government Data, Linked Data, and the Missing Blocks in Korea
 
Module 10 Open Government and Data
Module 10 Open Government and DataModule 10 Open Government and Data
Module 10 Open Government and Data
 
Sentara Linked Data Workshop - Sept 10, 2012
Sentara Linked Data Workshop - Sept 10, 2012Sentara Linked Data Workshop - Sept 10, 2012
Sentara Linked Data Workshop - Sept 10, 2012
 
Semantic Search: We're Living in a Golden Age for Information
Semantic Search: We're Living in a Golden Age for InformationSemantic Search: We're Living in a Golden Age for Information
Semantic Search: We're Living in a Golden Age for Information
 
Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do Big Data on the Web – What We Will Do
Big Data on the Web – What We Will Do
 
An intro to linked and open local gov data
An intro to linked and open local gov dataAn intro to linked and open local gov data
An intro to linked and open local gov data
 
Big Data - CRM's Promise Land
Big Data - CRM's Promise LandBig Data - CRM's Promise Land
Big Data - CRM's Promise Land
 
Big Data
Big DataBig Data
Big Data
 
141900791 big-data
141900791 big-data141900791 big-data
141900791 big-data
 
Big Data Commission Report
Big Data Commission ReportBig Data Commission Report
Big Data Commission Report
 
ISWC 2012 Keynote
ISWC 2012 KeynoteISWC 2012 Keynote
ISWC 2012 Keynote
 
Complying with the EC Open Data Directive
Complying with the EC Open Data DirectiveComplying with the EC Open Data Directive
Complying with the EC Open Data Directive
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach
 
Big Data Analytics (1).ppt
Big Data Analytics (1).pptBig Data Analytics (1).ppt
Big Data Analytics (1).ppt
 
Unleashing government’s ‘innovation mojo’ an interview with the us chief tec...
Unleashing government’s ‘innovation mojo’  an interview with the us chief tec...Unleashing government’s ‘innovation mojo’  an interview with the us chief tec...
Unleashing government’s ‘innovation mojo’ an interview with the us chief tec...
 
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
 
Government Trends 2021
Government Trends 2021Government Trends 2021
Government Trends 2021
 
Overview of Open Data, Linked Data and Web Science
Overview of Open Data, Linked Data and Web ScienceOverview of Open Data, Linked Data and Web Science
Overview of Open Data, Linked Data and Web Science
 
UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdf
 

Mehr von 3 Round Stones

Data Transparency 2013 - OrgPedia by 3 Round Stones
Data Transparency 2013 - OrgPedia by 3 Round StonesData Transparency 2013 - OrgPedia by 3 Round Stones
Data Transparency 2013 - OrgPedia by 3 Round Stones
3 Round Stones
 
Linked Data Book: DC Semantic Web Meetup 20130129
Linked Data Book: DC Semantic Web Meetup 20130129Linked Data Book: DC Semantic Web Meetup 20130129
Linked Data Book: DC Semantic Web Meetup 20130129
3 Round Stones
 
US EPA Linked Data Success Story - 2013
US EPA Linked Data Success Story - 2013US EPA Linked Data Success Story - 2013
US EPA Linked Data Success Story - 2013
3 Round Stones
 

Mehr von 3 Round Stones (20)

Brief on Linked Data for U.S. EPA's Chief Data Scientist
Brief on Linked Data for U.S. EPA's Chief Data ScientistBrief on Linked Data for U.S. EPA's Chief Data Scientist
Brief on Linked Data for U.S. EPA's Chief Data Scientist
 
US EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open DataUS EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open Data
 
W3C Data Shapes Working Group 2014
W3C Data Shapes Working Group 2014W3C Data Shapes Working Group 2014
W3C Data Shapes Working Group 2014
 
Open by Default
Open by DefaultOpen by Default
Open by Default
 
Lightning Talk SLIDES for Callimachus Enterprise by 3 Round Stones
Lightning Talk SLIDES for Callimachus Enterprise by 3 Round StonesLightning Talk SLIDES for Callimachus Enterprise by 3 Round Stones
Lightning Talk SLIDES for Callimachus Enterprise by 3 Round Stones
 
Celebrating 10 years of the Semantic Technology Conference 2014
Celebrating 10 years of the Semantic Technology Conference 2014Celebrating 10 years of the Semantic Technology Conference 2014
Celebrating 10 years of the Semantic Technology Conference 2014
 
Enterprise & Scientific Data Interoperability Using Linked Data at the Health...
Enterprise & Scientific Data Interoperability Using Linked Data at the Health...Enterprise & Scientific Data Interoperability Using Linked Data at the Health...
Enterprise & Scientific Data Interoperability Using Linked Data at the Health...
 
Publising Data on the Web
Publising Data on the WebPublising Data on the Web
Publising Data on the Web
 
Callimachus Enterprise 1.3 Tutorial
Callimachus Enterprise 1.3 TutorialCallimachus Enterprise 1.3 Tutorial
Callimachus Enterprise 1.3 Tutorial
 
Improving Scientific Information Sharing by Fostering Reuse - Presentation at...
Improving Scientific Information Sharing by Fostering Reuse - Presentation at...Improving Scientific Information Sharing by Fostering Reuse - Presentation at...
Improving Scientific Information Sharing by Fostering Reuse - Presentation at...
 
Linked Data Overview - structured data on the web for US EPA 20140203
Linked Data Overview - structured data on the web for US EPA 20140203Linked Data Overview - structured data on the web for US EPA 20140203
Linked Data Overview - structured data on the web for US EPA 20140203
 
Data Transparency 2013 - OrgPedia by 3 Round Stones
Data Transparency 2013 - OrgPedia by 3 Round StonesData Transparency 2013 - OrgPedia by 3 Round Stones
Data Transparency 2013 - OrgPedia by 3 Round Stones
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs
 
ORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data ProjectORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data Project
 
The Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information IntegrationThe Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information Integration
 
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
 
Sharing data on the web (2013)
Sharing data on the web (2013)Sharing data on the web (2013)
Sharing data on the web (2013)
 
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
 
Linked Data Book: DC Semantic Web Meetup 20130129
Linked Data Book: DC Semantic Web Meetup 20130129Linked Data Book: DC Semantic Web Meetup 20130129
Linked Data Book: DC Semantic Web Meetup 20130129
 
US EPA Linked Data Success Story - 2013
US EPA Linked Data Success Story - 2013US EPA Linked Data Success Story - 2013
US EPA Linked Data Success Story - 2013
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

US National Archives & Open Government Data

  • 1. US Government Linked Data Bernadette Hyland, CEO co-chair W3C Government Linked Data WG bhyland@3roundstones.com @BernHyland NARA II - College Park MD 07 February 2013 1
  • 2. Agenda • Intros ... • Trends in data management • Government data publication • Update on new Linked Data Services 2
  • 3. 3 Round Stones produces the leading platform for the publication of data on the Web. Our commercially supported Open Source platform is used by the Fortune 2000 and US Government agencies to collect, publish and reuse data, both on the public Internet and behind institutional firewalls. 3
  • 4. Our Partners Callimachus 4 Our partners ... Our customers - 50% US Gov’t and 50% private sector, focused on pharma & health delivery, and business publishing.
  • 5. 5 Headlines and agency memos about government transparency with open data and various government Web sites. ... innovation challenges based on open government data ... High energy datapalooza’s are emerging with awards ranging from a couple thousand to $100k+. These challenges open the doors to innovation for better healthcare solutions and more efficient use of energy, to name but a few. They all require access to and re-use of HIGH QUALITY DATA. In 2012, we read many headlines about big data and world’s search engines and social media sites.
  • 6. 6
  • 7. 7 Who is sharing their data as Linked Data? Small and large commercial and government organizations, NGOs, Non-profits ... plus many universities. Governments in the last few years have been responding to Open Government initiatives that mandate publishing open government data. Some are careful, slow-moving entities who simply needed to find real solutions to real problems.
  • 8. Governments Goals: Governmental transparency and/or improved internal efficiencies (data warehouses) 8
  • 9. Photo credit: http://www.flickr.com/photos/glennharper/4452247708/ 9 9 However, while there is lots of gold to be mined from public data, it is an uncomfortable time for Government IT and business managers who are tasked with data management programs. Most people are having a difficult time keeping up. If you feel like you are hanging on while the world changes too fast, you are not alone. Photo credit: http://www.flickr.com/photos/glennharper/4452247708/
  • 10. 10 Linked data is used extensively by the government seen to be the global leader in data transparency -- the UK Government. This is their home page.
  • 11. Big Data Simple data Complex data Legacy data 11 KEY POINT: Search, discovery and data access approaches have evolved over the last decade and techniques are beginning to come together. GoPubMed was launched in 2002 as the first semantic search portal. Later, Microsoft’s Bing, Google’s Knowledge Graph are two of the other well known search engines employing semantic techniques. Big data research has grown to include the MapReduce algorithm for handling really large data sets, often measured in terabytes or greater. This is the kind of data that people at the Large Hadron Collider at CERN are working on to provide insights into how the universe works, including the recent discovery of the Higgs Boson, the particle that gives mass to matter. Under the big top tent of semantic search we’re dealing with different types of content, big, public, complex and legacy data. Simple, complex and legacy data comes in small, medium and large sizes. Many government agencies by contrast have lots of small to medium data sets in structured databases. These databases (and the systems that depend upon them) are not going away however fewer new data warehouse projects are likely to be started. Data warehouses are widely recognized to be costly to create and maintain, and change SLOWLY. The biggest win for governments worldwide who adopt a Web architecture for data publishing is combining data sets to discover new or previously uncontemplated relationships.
  • 12. “Big Data Is Important, but Open Data Is More Valuable” As change agents, enterprise architects can help their organizations become richer through strategies such as open data. David Newman, VP Research, Gartner 12 Open data refers to the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other forms of control. The term “open data” has gained popularity with open data initiatives including data.gov.uk, data.gov and other government data catalog sites. Enterprise architects are playing an important role in fostering information-sharing practices. Access to, and use of, open data will be particularly critical for a business that operate using the Web; organizations should focus on using open data to enhance business practices that generate growth and innovation.
  • 13. 13 A sound government information management strategy requires providing CONTEXT and CONFIDENCE to those accessing and potentially re-using your data. Giving people have timely access to information, for disaster preparedness, scientific research, policy and research, the network effect of people helping people is our greatest hope. On the heels of the recent East Coast hurricane that devastated parts of New York and New Jersey, government executives suggested that fear of cyber-doom scenarios may be taking too much of our thinking & planning. According to Secretary Panetta, it may be driving us to unrealistic and potentially dangerous responses to threats that don’t exist. The reality is that when disaster strike, people come together and help one another. We don’t see paralysis, panic and social collapse. During today’s session, I’ll describe how several agencies and private sector organizations are using Web technologies and semantics to improve information access and discovery. Simply put, semantic technologies provide CONTEXT.
  • 15. Growing chorus ... “We’re moving from managing documents to managing discrete pieces of open data and content which can be tagged, shared, secured, mashed up and presented in the way that is most useful for the consumer of that information.” -- Report on Digital Government: Building a 21st Century Platform to Better Serve the American People 15 The Digital Government Strategy sets out to accomplish three things: Access to high quality digital information & services; procure and manage devices, applications, and data in smart, secure and affordable ways; and unlock the power of government data to spur innovation. Governments around the world are defining detailed digital services plans based on open data, open APIs and open source data platforms. They are defining how governments are publishing data with an eye towards improving access and re-use. Administrators and program managers are committing to delivery of digital services using semantic technologies broadly, and Linked Data specifically.
  • 16. Open data + open standards + open platforms Highly scalable computing & hosting via the Cloud International Data Exchange Standards 5 Star Data (Linked Data) Open Source tools 16 A Web-oriented approach to information sharing has impacted how scientists, researchers, regulators and the public interacts with government. Linked data lowers the barriers to re-use and interoperability among multiple, distributed and heterogeneous data sources. Access to high-quality Linked Open Data via the Web means millions of researchers and developers will be able to shorten the time-consuming research process involving data cleansing and modeling.
  • 17. 17 How do we get a loose coupling of shared data over Web architectures? By using the structured data model for the Web: RDF. There is a project to create freely available data on the Web in this way, which is known as the Linked Open Data project. W3C sees Linked Data as the set of best practices and technologies to support worldwide data access, integration and creative re-use of authoritative data.
  • 18. 18 September 2011: 295 datasets that meet the LOD Cloud criteria, consisting of over 31 billion RDF triples and are interlinked by around 504 million links.
  • 19. Callimachus http://callimachusproject.org http://3roundstones.com 19 Callimachus is that platform. It is available via 3roundstones.com or its Open Source site callimachusproject.org.
  • 20. CONTENT LINKED DATA MANAGEMENT MANAGEMENT SYSTEM SYSTEM DATA TEXT UNSTRUCTURED Callimachus STRUCTURED DATA TEXT 20 Callimachus may be compared to a distributed CMS. CMS’s manage mostly unstructured information. Callimachus, by contrast to a CMS, manages primarily structured Linked Data. We call this a Linked Data management system.
  • 21. Data driven Web apps using Callimachus US Legislation + enterprise data Clinical Trials + DBpedia + enterprise linked enterprise datasets data 21 21 Callimachus integrates (very) well with other enterprise systems as well as Web content. It can form an entire application or part of one. NB: Mention Documentum, Oracle via HTTP
  • 22. 22 • US HHS committed to making a vast array of open data more readily available to improve health care delivery & reduce costs in 2013 and beyond. • In 2012, Sentara created a Web application that integrates authoritative data from 5 different sources including content from NLM, NOAA, EPA and DBpedia • This application utilizes open data, open standards and an open source data platform
  • 23. User US EPA US EPA NOAA AirNow SunWise National DBpedia Library of Medicine 23
  • 24. US EPA Linked Data • Cloud-based Linked Data provision of 3 core programs: • 2.9M Facilities • 100K substances • 25 years of toxic pollution reports • FISMA compliant • 16 Callimachus templates • Official launch March 2013 24
  • 26. 26 EPA’s new Linked Data system. Cooperation without coordination. Data reuse breaks the back of API gridlock. Clay Shirky stole that from me :)
  • 27. 27 This data is exactly the same data used to create the interface. Unlike traditional database-driven applications, the data is immediately accessible for reuse by third parties. This prevents data duplication, allows for tracking of provenance and avoids reinventing the wheel.
  • 28. We’ve Seen This Before 28 Like HTML and RDF, credit cards have a human-readable side and a machine-readable side.
  • 29. Linked Data management system located at a Tier 1 Cloud Provider (FISMA compliant) RDF Database Resource URIs REST API SPARQL endpoint Public Web Browser Application, Script or automated client Registered developer 29 Introduce Callimachus, an open source, open data platform based on open standards. 3 Round Stones provides commercial support for Callimachus and is a major contributor to the OS project. Users of Callimachus see a generated Web interface, but can also directly access the data via REST or SPARQL. SPARQL Named Queries (like stored procedures) allow for automated conversion to different formats for reuse in non-RDF environments.
  • 30. From EPA From Wikipedia Open Street Map 30 Data may be easily combined from several sources.
  • 31. US GPO • Cloud-based Linked Data provision of persistent URLs for US Government documents: • 33K documents • Used by 1,240 Federal Depository Libraries and public • In 3rd year of operation • Deemed an Essential service supporting US Congress 31
  • 32. Real World Linked Data 32 Now let’s look at the same workflow in the Linked Data Service.
  • 33. Finding Hanson Permanente 33 By keeping the application simple - and letting the results be viewed either as a table or a map - the user can adjust their search as they see fit without extra navigation. Also, by having the data in a table that can searched or sorted however the user sees fit, finding a specific facility is as easy as typing the name in or sorting on relevant criteria. This is made possible by exposing the data, rather than containing it in a standard HTML table. I fully recognize that Envirofacts could offer identical functionality by tweaking their application, but the key underlying point is that this application was created very cheaply and quickly *because* the data is modeled as Linked Data. When the developing environment is a Web Browser, and the data is described and Linked, an application can be a simple XHMTL page with JavaScript, instead of a heavy-weight dedicated application.
  • 34. Finding Mercury Released in 2004 1 2 34 There are two very important things to note on this page. 1 is that on any facility’s page, there is always an option to download the data. This data is available in two formats (RDF/ XML and Turtle). With the click of a button a user can have all of the data that was used to drive the creation of the current page, which means he or she can repurpose that data into any new application. Note here that this download is not an extract, summary, or recreation of the data - it is literally the *same* data that was used to drive that page. 2 is that because this page is “data-driven”, navigation relies on exploring the data, not the system that contains it. On the same page where we get information like it’s latitude and longitude, we can also find a link to a report detailing exactly how much mercury was released in 2004. We could easily do an in-page search for 2004 or Mercury to identify the releases associated with those terms.
  • 35. TRI Report 35 Rather than aggregating the data for presentation, the actual report is presented with the raw data continuously available in the top right of the page. A subtle difference to be pointed out here is the difference in the name of the facility. Previously it was identified as Hanson Permanente, but now it is known as Lehigh Southwest Cement Co. During the modeling phase, the Linked Data was created to implicitly include this relationship (which is known via the mapping of EPA FRS identifiers). On the other hand, pulling down the CSV files would not give the user any obvious way of understanding this relationship.
  • 36. Data Reuse 36 Developers can grab the data off any page, at any time during navigation. The site facilitates the reuse of data. These graphs are not natively embedded in the webpage of a given facility. Rather, by downloading the data the user can quickly and easily make new and different visualizations for a report or presentation in 10 minutes. For example, this history of air stack pollution reports was made with a single parameterized SPARQL query and a single JavaScript pattern. This could very easily be applied to any number of facilities, changed to a bar graph, or altered in any number of other ways with very little effort thanks to the fact it was modeled using Linked Data.
  • 37. Potential Audience ✔ • Middle school student doing a science project ✔ • Concerned citizen worried about local pollution ✔Environmental Science PhD from EPA • ✔ • Doctor from NIH writing a research paper 37 Linked Data allowed us to reach all the members of our potential audience by giving the user options, aggregating based on relevance rather than data source, and by exposing the data that drives the service for reuse. The middle school student or concerned citizen that want to know the location of a facility, the amount of a particular chemical it released, and the year it was released in never have to click any of the options in the Linked Data box. They can simply use the interface, explore the data, and find what they need in a read-only experience. The Environmental Science PhD is still able to find what he is looking for with Linked Data but can do so in a much more intuitive way. The doctor from NIH is now able to find the data they’re interested in and if they choose to take the next step, download the actual data behind the page. By quickly and easily obtaining the raw data, anyone from scientists to journalists can generate their own applications without any knowledge of the Linked Data Service itself.
  • 39. 39
  • 40. The mission of the Government Linked Data (GLD) Working Group is to provide standards and other information which help governments around the world publish their data as effective and usable Linked Data using Semantic Web technologies. 40 We are 16 months into the Government Linked Data Working group’s two year charter.
  • 41. Credits Gartner: “Innovation Insight: Linked Data Drives Innovation Through Information- David Newman Sharing Network Effects” Published: 15 December 2011 Linking Government Data, Springer (2011) David Wood, ed. http://3roundstones.com/linking-government-data/ Digital Government Strategy: Building a 21st Century Platform to Better Serve the American People, US Executive Branch http://www.whitehouse.gov/sites/default/files/omb/egov/digital-government/digital- government.html W3C Linked Data Cookbook http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook All other photos and images © 2010-2013 3 Round Stones, Inc. and released under a CC-by-sa license 41
  • 42. This work is Copyright © 2011-2012 3 Round Stones Inc. It is licensed under the Creative Commons Attribution 3.0 Unported License Full details at: http://creativecommons.org/licenses/by/3.0/ You are free: to Share — to copy, distribute and transmit the work to Remix — to adapt the work Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one. 42 This presentation is licensed under a Creative Commons BY-SA license, allowing you to share and remix its contents as long as you give us attribution and share alike.