SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Downloaden Sie, um offline zu lesen
Bernadette Hyland
CEO & co-founder
11911 Freedom Drive, Suite 850
Reston,VA 20190
Tel. +1-571-331-3758
bhyland@3RoundStones.com
@BernHyland
info@3RoundStones.com
@3RoundStones
ExtendYourReach.
Linked Data for
Smarter Decisions.
Follow up information prepared for
RobinThottungal, Chief Data Scientist / Director of Analytics
US Environmental Protection Agency - Feb 26, 2016
Today’s reality at EPA
»Tens of thousands of sources
»Many formats - JSON, XML,
CSV, PDF, PPT, SHP, SHX, text,
binary…
»Thousands of data silos
»No single source of truth
»Varied interpretations
»Brittle interfaces - lack of
interoperability
Image Credit: Smart Data Collective
WideVariety of Data at EPA
3
Image Credit: MarkLogic, see http://www.marklogic.com/resources/marklogic-semantics-datasheet/
resource_download/datasheets/
Credit: Frederick Giasson, Data Scientist & Software Developer,
http://fgiasson.com/blog/index.php/2014/07/23/big-structures-where-the-semantic-web-meets-artificial-intelligence/
Potential at EPA …
• Findable data
• Accessible data
• Interoperable
data
• Re-usable data
• Shared context
• Data Platforms
(HDFS, NoSQL)
Linked Data is helping to extend & augment
EPA’s significant investment in enterprise relational technologies
How?
By leveraging NoSQL Data Platforms that rigorously adhere to
international data interoperability standards. *
* Relevant international data exchange standards are published by the
W3C, OGC, IEEE
Image Credit: MarkLogic
Graph databases, as a subset of NoSQL databases, are the
most efficient way to look at the relationships between data
items, patterns of relationships and interactions.
Image Credit: Cray, see http://www.cray.com/blog/graph-databases-101/
Graph Databases 101
Hadoop Integration
»While over 90% of the world’s data has been created in the last two
years, EPA has tremendous variety of data requires the “right tool for
the job”
»Historic data (“short, wide, complex data”) vs.
»Granular sensor & GIS data (“long skinny data”)
»Core mission-based systems with robust historic data, includes:
»Toxics Release Inventory (TRI)
»Facilities Registry (FRS)
»RCRA Handler
»EPA’s enterprise information architecture should include a data
platform that leverages Hadoop: HDFS and MapReduce, and
accommodates EPA’s robust data landscape.
»Must support modern, open source tools for application development,
visualizations, crowdsourcing, and deployment on the Web
8
One option - MarkLogic
Integrates Hadoop Ecosystem &
EPA’s Robust Data Landscape
Image Credit: http://www.marklogic.com/what-is-marklogic/features/hadoop-integration/
EPA Robust Data Ecosystem is adaptable
using a Linked Data Approach
» Makes data integration faster and easier
» By using a global addressing scheme, HTTP URIs.
» Uses semantics to “glue” together data faster.
» Common semantic definitions link traditional relational
models.
» No more out of data documentation using standard
vocabularies.
» Robust search and discovery by leveraging the semantic graph.
» Scales to the Web!
9
All modern data platforms
deployed at EPA should
»Support options for data modeling - Linked Data (JSON-LD, RDF), SQL (JSON, XML)
»Native store and query of documents, blobs and structured data.
»Standards-based query interface across documents and data, e.g., Full support for
SPARQL 1.1
»Offer enterprise functionality including high availability & disaster recovery, scalability &
elasticity, ACID transactions
»Be deployable on FedRamp certified cloud provider certifying controls for security, high
availability, disaster recovery
»Scale to billions of statements, triples, etc.
»Store unstructured data across clusters like Hadoop, making it easy to move data
partitions.
»Much but not all of
EPA’s data is well
suited for a Linked
Data approach.
»Linked Data is based
on 20+ year old idea,
a system of linked
information systems
M A N N I N G
David Wood
Marsha Zaidman
Luke Ruth
WITH Michael Hausenblas
FOREWORD BY Tim Berners-Lee
Structured data on the Web
Linked Data
Goals: Governmental transparency and/or improved internal efficiencies
Governments Worldwide are
using a Linked Data Approach
Linked Data Apps
use data from many
EPA programs and other
Open Data Sources
Linked Data Management System
For government open data publishing
Funded by
Linked Data
Platform is in QA
now! https://usepa.
3roundstones.net
Anticipated to
move to production
in 2016.
shared innovation™
Search for facilities
where we live. Unlike
many EPA Web portals,
linked data is human
AND machine readable
data. No screen
scraping is required.
Encourages re-use
(discourages data silos)
The EPA Linked
Data service
CONNECTS data
silos, and provides
familiar map and
table data views
Click to drill down
to pollution reports
that combine data
from 5 previously
unconnected data
silos.
Click through to
the source of the
pollution data via
the source reports
(TRI).
EPA collects
granular pollution
data. Linked Data
opens up the data
to a much wider
audience in a
human readable
format.
Previously, only people
who employed complex
screen scraping
techniques could get at
this data. Now, EPA open
data is available using an
international data
standard, with one click!
Good news story!
Pollution graphs
created in one
week using Open
Source Software &
EPA Linked Data
Use of shared
vocabularies, e.g.
Places, Geographis,
Dublin Core, Geo,
FOAF, ORG, Vcard
are the “lingua franca”
of data interoperability
Case Study
Using EPA Linked Data to assist
chronic asthma/COPD patients
with timely weather alerts
Funded by
User
NOAA
US EPA
AirNow
DBpedia
National
Library of
Medicine
US EPA
SunWise
Case Study: Orgpedia
An open organizational data project
on public & private companies
Funded by
Using the
Callimachus Open
Source Data Platform,
we rapidly built a
crowdsourcing
platform.
3 Round Stones
provides commercial application
support on the cloud or behind the
enterprise firewall using
@3RoundStones http://3RoundStones.com
CONTENT
MANAGEMENT
SYSTEM
LINKED DATA
MANAGEMENT
SYSTEM
Callimachus
UNSTRUCTURED
TEXT
TEXT
STRUCTURED
DATA
DATA
Callimachus
supports
in-browser
development
Callimachus Enterprise customers are creating data-
driven applications with data from leading graph
databases:
Callimachus is a scalable Web application server for
publishing and consuming open data
Who uses it?
• Government, international publishers, healthcare / life sciences


What pain does Callimachus address?
• Integration of data silos where a graph approach is needed
• Rapid creation of visualizations, dashboards (mashups) & info graphics
• Less expensive solution to a data warehouse


Example apps?
• Collaborative knowledge management
• Publishing workflow
• Drug discovery / clinical trials
• Predictive Analytics
data interoperability & portability
Supports:
• HTML5, XHTML5, CSS3, JavaScript
• XQuery, XProc, XPath, XSLT
• SPARQL 1.1 Query, Update, Federated Query,
Service Description, Property Paths, Graph Store
HTTP Protocol
• RDF/XML, RDF/Turtle, JSON-LD, SPARQL XML,
SPARQL JSON
Callimachus is fanatical about
Public
Application, Script or automated client
Web Browser
SPARQL endpointREST APIResource URIs
Linked Data management system
located at a Tier 1 Cloud Provider
(FISMA compliant)
RDF Database
Registered developer
<HTML>
Enterprise Data Documents
Read/
Write
Point to,
include
Callimachus
Enterprise
“Big Data Is Important, but
Open Data Is More Valuable”
As change agents, enterprise architects can help
their organizations become richer through
strategies such as open data.
David Newman, VP Research, Gartner
Open Source Enterprise License
Community supported Commercial support
in-browser development, deployment, backups
Linked Data publication
User profiles, social sharing
Document, app management
OpenAnnotation support
External datasources
Shared deployments
Realms (virtual hosts)
Enterprise management
Cloud deployments
Callimachus
Callimachus™, the Callimachus logo, Callimachus Enterprise™, the
Callimachus Enterprise logo and tagline, are trademarks of 3 Round
Stones, Inc. and are registered in the United States and abroad.
Copyright © 2011-2016 3 Round Stones, Inc. All rights reserved.
Callimachus
Enterprise

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
 
The State of Linked Government Data
The State of Linked Government DataThe State of Linked Government Data
The State of Linked Government Data
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data Challenges
 
Keynote Presentation at MTSR07
Keynote Presentation at MTSR07Keynote Presentation at MTSR07
Keynote Presentation at MTSR07
 
introduction to big data frameworks
introduction to big data frameworksintroduction to big data frameworks
introduction to big data frameworks
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big data frameworks
Big data frameworksBig data frameworks
Big data frameworks
 
Search Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureSearch Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited Lecture
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zeneca
 
Big Data
Big DataBig Data
Big Data
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Hadoop
HadoopHadoop
Hadoop
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
 
Open, FAIR data and RDM
Open, FAIR data and RDMOpen, FAIR data and RDM
Open, FAIR data and RDM
 
Research data management & planning: an introduction
Research data management & planning: an introductionResearch data management & planning: an introduction
Research data management & planning: an introduction
 
Big data
Big dataBig data
Big data
 
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital ForensicsBig Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
 
A Survey on Geographically Distributed Big-Data Processing using Map Reduce
A Survey on Geographically Distributed Big-Data Processing using Map ReduceA Survey on Geographically Distributed Big-Data Processing using Map Reduce
A Survey on Geographically Distributed Big-Data Processing using Map Reduce
 

Ähnlich wie 3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data

Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
EMC
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
eswcsummerschool
 

Ähnlich wie 3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data (20)

Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big DataCombine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
 
Linked Data and Semantic Web Application Development by Peter Haase
Linked Data and Semantic Web Application Development by Peter HaaseLinked Data and Semantic Web Application Development by Peter Haase
Linked Data and Semantic Web Application Development by Peter Haase
 
Enabling Low-cost Open Data Publishing and Reuse
Enabling Low-cost Open Data Publishing and ReuseEnabling Low-cost Open Data Publishing and Reuse
Enabling Low-cost Open Data Publishing and Reuse
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Linked Open Government Data: What’s Next?
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?
 
Briefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxLesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptx
 
Unit 1
Unit 1Unit 1
Unit 1
 
Top 10 data science technologies
Top 10 data science technologiesTop 10 data science technologies
Top 10 data science technologies
 
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabati
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs
 
EPA OEI Linked Data Process
EPA OEI Linked Data ProcessEPA OEI Linked Data Process
EPA OEI Linked Data Process
 
Big Data & Data Mining
Big Data & Data MiningBig Data & Data Mining
Big Data & Data Mining
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
 
ODSC and iRODS
ODSC and iRODSODSC and iRODS
ODSC and iRODS
 

Mehr von Bernadette Hyland-Wood

20111114 b hyland government data and publishers
20111114   b hyland government data and publishers20111114   b hyland government data and publishers
20111114 b hyland government data and publishers
Bernadette Hyland-Wood
 
20111120 warsaw learning curve by b hyland notes
20111120 warsaw   learning curve by b hyland notes20111120 warsaw   learning curve by b hyland notes
20111120 warsaw learning curve by b hyland notes
Bernadette Hyland-Wood
 
Rapid Semantic Web Application Development
Rapid Semantic Web Application DevelopmentRapid Semantic Web Application Development
Rapid Semantic Web Application Development
Bernadette Hyland-Wood
 

Mehr von Bernadette Hyland-Wood (20)

ChangeMakeHer Talk on STEM Careers in Australia & beyond
ChangeMakeHer Talk on STEM Careers in Australia & beyondChangeMakeHer Talk on STEM Careers in Australia & beyond
ChangeMakeHer Talk on STEM Careers in Australia & beyond
 
Women in IT - Empowering a Healthier Future
Women in IT - Empowering a Healthier FutureWomen in IT - Empowering a Healthier Future
Women in IT - Empowering a Healthier Future
 
Why Consider Software Engineering as a Career
Why Consider Software Engineering as a CareerWhy Consider Software Engineering as a Career
Why Consider Software Engineering as a Career
 
Diversity & Inclusion in the Workplace - CTO School Brisbane AU
Diversity & Inclusion in the Workplace - CTO School Brisbane AUDiversity & Inclusion in the Workplace - CTO School Brisbane AU
Diversity & Inclusion in the Workplace - CTO School Brisbane AU
 
Being Prepared for Life & a Career in the 21st Century
Being Prepared for Life & a Career in the 21st CenturyBeing Prepared for Life & a Career in the 21st Century
Being Prepared for Life & a Career in the 21st Century
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
2015 ESRI Health and Human Services Presentation on GeoHealth.us
2015 ESRI Health and Human Services Presentation on GeoHealth.us2015 ESRI Health and Human Services Presentation on GeoHealth.us
2015 ESRI Health and Human Services Presentation on GeoHealth.us
 
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
 
Government Linked Data Projects in the Wild
Government Linked Data Projects in the WildGovernment Linked Data Projects in the Wild
Government Linked Data Projects in the Wild
 
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
 
20111114 b hyland government data and publishers
20111114   b hyland government data and publishers20111114   b hyland government data and publishers
20111114 b hyland government data and publishers
 
CENDI Presentation on What's going on with Government Linked Data
CENDI Presentation on What's going on with Government Linked DataCENDI Presentation on What's going on with Government Linked Data
CENDI Presentation on What's going on with Government Linked Data
 
20111101 b hyland-w3-c-tpac-egov
20111101 b hyland-w3-c-tpac-egov20111101 b hyland-w3-c-tpac-egov
20111101 b hyland-w3-c-tpac-egov
 
20111120 warsaw learning curve by b hyland notes
20111120 warsaw   learning curve by b hyland notes20111120 warsaw   learning curve by b hyland notes
20111120 warsaw learning curve by b hyland notes
 
Warsaw Poland 20-Oct-2011 on Open Government Linked Data
Warsaw Poland 20-Oct-2011 on Open Government Linked Data Warsaw Poland 20-Oct-2011 on Open Government Linked Data
Warsaw Poland 20-Oct-2011 on Open Government Linked Data
 
Rapid Web Application Development for Linked Data
Rapid Web Application Development for Linked DataRapid Web Application Development for Linked Data
Rapid Web Application Development for Linked Data
 
Rapid Semantic Web Application Development
Rapid Semantic Web Application DevelopmentRapid Semantic Web Application Development
Rapid Semantic Web Application Development
 
Rapid semantic web app dev using Callimachus
Rapid semantic web app dev using CallimachusRapid semantic web app dev using Callimachus
Rapid semantic web app dev using Callimachus
 
Brief for W3C Government Linked Data Working Group 29-June 2011
Brief for W3C Government Linked Data Working Group 29-June 2011Brief for W3C Government Linked Data Working Group 29-June 2011
Brief for W3C Government Linked Data Working Group 29-June 2011
 
Bernadette Hyland SemTech 2011 West - Linked Data Cookbook
Bernadette Hyland SemTech 2011 West - Linked Data CookbookBernadette Hyland SemTech 2011 West - Linked Data Cookbook
Bernadette Hyland SemTech 2011 West - Linked Data Cookbook
 

Kürzlich hochgeladen

Call Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCeCall Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Kürzlich hochgeladen (20)

Booking open Available Pune Call Girls Budhwar Peth 6297143586 Call Hot Indi...
Booking open Available Pune Call Girls Budhwar Peth  6297143586 Call Hot Indi...Booking open Available Pune Call Girls Budhwar Peth  6297143586 Call Hot Indi...
Booking open Available Pune Call Girls Budhwar Peth 6297143586 Call Hot Indi...
 
$ Love Spells 💎 (310) 882-6330 in Pennsylvania, PA | Psychic Reading Best Bla...
$ Love Spells 💎 (310) 882-6330 in Pennsylvania, PA | Psychic Reading Best Bla...$ Love Spells 💎 (310) 882-6330 in Pennsylvania, PA | Psychic Reading Best Bla...
$ Love Spells 💎 (310) 882-6330 in Pennsylvania, PA | Psychic Reading Best Bla...
 
(NEHA) Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts 24x7
(NEHA) Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts 24x7(NEHA) Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts 24x7
(NEHA) Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts 24x7
 
DENR EPR Law Compliance Updates April 2024
DENR EPR Law Compliance Updates April 2024DENR EPR Law Compliance Updates April 2024
DENR EPR Law Compliance Updates April 2024
 
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur EscortsCall Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
 
Hot Call Girls |Delhi |Preet Vihar ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Preet Vihar ☎ 9711199171 Book Your One night StandHot Call Girls |Delhi |Preet Vihar ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Preet Vihar ☎ 9711199171 Book Your One night Stand
 
NO1 Verified kala jadu karne wale ka contact number kala jadu karne wale baba...
NO1 Verified kala jadu karne wale ka contact number kala jadu karne wale baba...NO1 Verified kala jadu karne wale ka contact number kala jadu karne wale baba...
NO1 Verified kala jadu karne wale ka contact number kala jadu karne wale baba...
 
Proposed Amendments to Chapter 15, Article X: Wetland Conservation Areas
Proposed Amendments to Chapter 15, Article X: Wetland Conservation AreasProposed Amendments to Chapter 15, Article X: Wetland Conservation Areas
Proposed Amendments to Chapter 15, Article X: Wetland Conservation Areas
 
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
VVIP Pune Call Girls Moshi WhatSapp Number 8005736733 With Elite Staff And Re...
 
VVIP Pune Call Girls Koregaon Park (7001035870) Pune Escorts Nearby with Comp...
VVIP Pune Call Girls Koregaon Park (7001035870) Pune Escorts Nearby with Comp...VVIP Pune Call Girls Koregaon Park (7001035870) Pune Escorts Nearby with Comp...
VVIP Pune Call Girls Koregaon Park (7001035870) Pune Escorts Nearby with Comp...
 
Kondhwa ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Kondhwa ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Kondhwa ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Kondhwa ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
VIP Model Call Girls Viman Nagar ( Pune ) Call ON 8005736733 Starting From 5K...
VIP Model Call Girls Viman Nagar ( Pune ) Call ON 8005736733 Starting From 5K...VIP Model Call Girls Viman Nagar ( Pune ) Call ON 8005736733 Starting From 5K...
VIP Model Call Girls Viman Nagar ( Pune ) Call ON 8005736733 Starting From 5K...
 
Get Premium Hoskote Call Girls (8005736733) 24x7 Rate 15999 with A/c Room Cas...
Get Premium Hoskote Call Girls (8005736733) 24x7 Rate 15999 with A/c Room Cas...Get Premium Hoskote Call Girls (8005736733) 24x7 Rate 15999 with A/c Room Cas...
Get Premium Hoskote Call Girls (8005736733) 24x7 Rate 15999 with A/c Room Cas...
 
Booking open Available Pune Call Girls Parvati Darshan 6297143586 Call Hot I...
Booking open Available Pune Call Girls Parvati Darshan  6297143586 Call Hot I...Booking open Available Pune Call Girls Parvati Darshan  6297143586 Call Hot I...
Booking open Available Pune Call Girls Parvati Darshan 6297143586 Call Hot I...
 
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Moshi Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Girls Magarpatta Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Magarpatta Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Magarpatta Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Magarpatta Call Me 7737669865 Budget Friendly No Advance Booking
 
BOOK Call Girls in (Dwarka) CALL | 8377087607 Delhi Escorts Services
BOOK Call Girls in (Dwarka) CALL | 8377087607 Delhi Escorts ServicesBOOK Call Girls in (Dwarka) CALL | 8377087607 Delhi Escorts Services
BOOK Call Girls in (Dwarka) CALL | 8377087607 Delhi Escorts Services
 
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
VIP Model Call Girls Chakan ( Pune ) Call ON 8005736733 Starting From 5K to 2...
 
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Budhwar Peth Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCeCall Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In Yamuna Vihar꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
 

3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data

  • 1. Bernadette Hyland CEO & co-founder 11911 Freedom Drive, Suite 850 Reston,VA 20190 Tel. +1-571-331-3758 bhyland@3RoundStones.com @BernHyland info@3RoundStones.com @3RoundStones ExtendYourReach. Linked Data for Smarter Decisions. Follow up information prepared for RobinThottungal, Chief Data Scientist / Director of Analytics US Environmental Protection Agency - Feb 26, 2016
  • 2. Today’s reality at EPA »Tens of thousands of sources »Many formats - JSON, XML, CSV, PDF, PPT, SHP, SHX, text, binary… »Thousands of data silos »No single source of truth »Varied interpretations »Brittle interfaces - lack of interoperability Image Credit: Smart Data Collective
  • 3. WideVariety of Data at EPA 3 Image Credit: MarkLogic, see http://www.marklogic.com/resources/marklogic-semantics-datasheet/ resource_download/datasheets/
  • 4. Credit: Frederick Giasson, Data Scientist & Software Developer, http://fgiasson.com/blog/index.php/2014/07/23/big-structures-where-the-semantic-web-meets-artificial-intelligence/ Potential at EPA … • Findable data • Accessible data • Interoperable data • Re-usable data • Shared context • Data Platforms (HDFS, NoSQL)
  • 5. Linked Data is helping to extend & augment EPA’s significant investment in enterprise relational technologies How? By leveraging NoSQL Data Platforms that rigorously adhere to international data interoperability standards. * * Relevant international data exchange standards are published by the W3C, OGC, IEEE Image Credit: MarkLogic
  • 6. Graph databases, as a subset of NoSQL databases, are the most efficient way to look at the relationships between data items, patterns of relationships and interactions. Image Credit: Cray, see http://www.cray.com/blog/graph-databases-101/ Graph Databases 101
  • 7. Hadoop Integration »While over 90% of the world’s data has been created in the last two years, EPA has tremendous variety of data requires the “right tool for the job” »Historic data (“short, wide, complex data”) vs. »Granular sensor & GIS data (“long skinny data”) »Core mission-based systems with robust historic data, includes: »Toxics Release Inventory (TRI) »Facilities Registry (FRS) »RCRA Handler »EPA’s enterprise information architecture should include a data platform that leverages Hadoop: HDFS and MapReduce, and accommodates EPA’s robust data landscape. »Must support modern, open source tools for application development, visualizations, crowdsourcing, and deployment on the Web
  • 8. 8 One option - MarkLogic Integrates Hadoop Ecosystem & EPA’s Robust Data Landscape Image Credit: http://www.marklogic.com/what-is-marklogic/features/hadoop-integration/
  • 9. EPA Robust Data Ecosystem is adaptable using a Linked Data Approach » Makes data integration faster and easier » By using a global addressing scheme, HTTP URIs. » Uses semantics to “glue” together data faster. » Common semantic definitions link traditional relational models. » No more out of data documentation using standard vocabularies. » Robust search and discovery by leveraging the semantic graph. » Scales to the Web! 9
  • 10. All modern data platforms deployed at EPA should »Support options for data modeling - Linked Data (JSON-LD, RDF), SQL (JSON, XML) »Native store and query of documents, blobs and structured data. »Standards-based query interface across documents and data, e.g., Full support for SPARQL 1.1 »Offer enterprise functionality including high availability & disaster recovery, scalability & elasticity, ACID transactions »Be deployable on FedRamp certified cloud provider certifying controls for security, high availability, disaster recovery »Scale to billions of statements, triples, etc. »Store unstructured data across clusters like Hadoop, making it easy to move data partitions.
  • 11. »Much but not all of EPA’s data is well suited for a Linked Data approach. »Linked Data is based on 20+ year old idea, a system of linked information systems M A N N I N G David Wood Marsha Zaidman Luke Ruth WITH Michael Hausenblas FOREWORD BY Tim Berners-Lee Structured data on the Web Linked Data
  • 12. Goals: Governmental transparency and/or improved internal efficiencies Governments Worldwide are using a Linked Data Approach
  • 13.
  • 14. Linked Data Apps use data from many EPA programs and other Open Data Sources
  • 15. Linked Data Management System For government open data publishing Funded by
  • 16. Linked Data Platform is in QA now! https://usepa. 3roundstones.net Anticipated to move to production in 2016.
  • 17. shared innovation™ Search for facilities where we live. Unlike many EPA Web portals, linked data is human AND machine readable data. No screen scraping is required. Encourages re-use (discourages data silos)
  • 18. The EPA Linked Data service CONNECTS data silos, and provides familiar map and table data views
  • 19. Click to drill down to pollution reports that combine data from 5 previously unconnected data silos.
  • 20. Click through to the source of the pollution data via the source reports (TRI).
  • 21. EPA collects granular pollution data. Linked Data opens up the data to a much wider audience in a human readable format.
  • 22. Previously, only people who employed complex screen scraping techniques could get at this data. Now, EPA open data is available using an international data standard, with one click!
  • 23.
  • 24.
  • 25. Good news story! Pollution graphs created in one week using Open Source Software & EPA Linked Data
  • 26. Use of shared vocabularies, e.g. Places, Geographis, Dublin Core, Geo, FOAF, ORG, Vcard are the “lingua franca” of data interoperability
  • 27. Case Study Using EPA Linked Data to assist chronic asthma/COPD patients with timely weather alerts Funded by
  • 28.
  • 30. Case Study: Orgpedia An open organizational data project on public & private companies Funded by
  • 31.
  • 32. Using the Callimachus Open Source Data Platform, we rapidly built a crowdsourcing platform.
  • 33. 3 Round Stones provides commercial application support on the cloud or behind the enterprise firewall using @3RoundStones http://3RoundStones.com
  • 36. Callimachus Enterprise customers are creating data- driven applications with data from leading graph databases:
  • 37. Callimachus is a scalable Web application server for publishing and consuming open data Who uses it? • Government, international publishers, healthcare / life sciences 
 What pain does Callimachus address? • Integration of data silos where a graph approach is needed • Rapid creation of visualizations, dashboards (mashups) & info graphics • Less expensive solution to a data warehouse 
 Example apps? • Collaborative knowledge management • Publishing workflow • Drug discovery / clinical trials • Predictive Analytics
  • 38. data interoperability & portability Supports: • HTML5, XHTML5, CSS3, JavaScript • XQuery, XProc, XPath, XSLT • SPARQL 1.1 Query, Update, Federated Query, Service Description, Property Paths, Graph Store HTTP Protocol • RDF/XML, RDF/Turtle, JSON-LD, SPARQL XML, SPARQL JSON Callimachus is fanatical about
  • 39. Public Application, Script or automated client Web Browser SPARQL endpointREST APIResource URIs Linked Data management system located at a Tier 1 Cloud Provider (FISMA compliant) RDF Database Registered developer
  • 40. <HTML> Enterprise Data Documents Read/ Write Point to, include Callimachus Enterprise
  • 41. “Big Data Is Important, but Open Data Is More Valuable” As change agents, enterprise architects can help their organizations become richer through strategies such as open data. David Newman, VP Research, Gartner
  • 42. Open Source Enterprise License Community supported Commercial support in-browser development, deployment, backups Linked Data publication User profiles, social sharing Document, app management OpenAnnotation support External datasources Shared deployments Realms (virtual hosts) Enterprise management Cloud deployments Callimachus Callimachus™, the Callimachus logo, Callimachus Enterprise™, the Callimachus Enterprise logo and tagline, are trademarks of 3 Round Stones, Inc. and are registered in the United States and abroad. Copyright © 2011-2016 3 Round Stones, Inc. All rights reserved. Callimachus Enterprise