Panel: The Art & Science of Data Visualization
#IOGDC

First they have to find it:
Getting Government Data Discovered and Used

John S. Erickson, Ph.D.
Tetherless World Constellation
Rensselaer Polytechnic Institute
Troy, New York, USA

Twitter: @olyerickson #TWCRPI

Open Government Data Around the World

Starting with efforts in the US and UK, governments around the world have recognized the need to publish their critical data

[Chart: Percent of total collection (from 1M+ datasets)]

Diverse Approaches to Open Gov't Data

• Government data initiatives have taken many forms
• GovData portals are widely varied in how they help users discover and use relevant datasets

[Chart: Percent of total catalogs (from 192 catalogs)]

Federated Discovery of Government Data

Stakeholders have seen the need for federated discovery across catalogs, especially from within major search engines including Bing, Google, Yahoo! and Yandex

Linked Data is Not Enough...

• Publishing open government data as Linked Data is not enough
• For OGD to be useful, datasets must be published using metadata, markup standards and presentation that aid discovery and use

Dataset Metadata for Discovery and Use

Recent work at TWC RPI demonstrates the value of applying emerging standards for uniformly describing government datasets and catalogs

International Open Government Dataset Search

TWC's IOGDS application is an aggregated catalog of more than 1M datasets from over 192 dataset catalogs from governments at every level around the world

See: http://logd.tw.rpi.edu

International Open Government Dataset Search

• Anticipates the W3C DCAT RDF vocabulary
• Demonstrates what a comprehensive federated catalog based on DCAT and an aggregation API might look like

International Open Government Dataset Search

IOGDS is a multi-year effort based on downloading, scraping or accessing APIs, converting metadata to a proto-DCAT model, and publishing via endpoint and download

[Diagram: IOGDS Workflow. Catalogs (API, download, Web) -> ad hoc code / per-site scraper code -> IOGDS CSV -> csv2rdf4lod automation]

See: http://logd.tw.rpi.edu

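The conversion step in this workflow can be sketched roughly as follows. This is a minimal illustration, not the IOGDS or csv2rdf4lod implementation: the CSV column names and the example.org URIs are made up, and the terms come from the DCAT vocabulary as later published by W3C rather than the proto-DCAT model the slides refer to.

```python
import csv
import io

# Namespaces of the published W3C DCAT and Dublin Core Terms vocabularies.
DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def literal(text):
    """Escape a string as an N-Triples plain literal."""
    escaped = (text.replace("\\", "\\\\").replace('"', '\\"')
                   .replace("\n", "\\n").replace("\r", "\\r"))
    return f'"{escaped}"'


def row_to_triples(row):
    """Turn one harvested metadata row into DCAT-style N-Triples lines."""
    dataset = f"<{row['dataset_uri']}>"
    catalog = f"<{row['catalog_uri']}>"
    triples = [
        f"{dataset} <{RDF_TYPE}> <{DCAT}Dataset> .",
        f"{catalog} <{DCAT}dataset> {dataset} .",
        f"{dataset} <{DCT}title> {literal(row['title'])} .",
    ]
    if row.get("landing_page"):
        triples.append(f"{dataset} <{DCAT}landingPage> <{row['landing_page']}> .")
    # Keywords are assumed to be semicolon-separated in the hypothetical CSV.
    for keyword in (row.get("keywords") or "").split(";"):
        if keyword.strip():
            triples.append(f"{dataset} <{DCAT}keyword> {literal(keyword.strip())} .")
    return triples


def csv_to_ntriples(csv_text):
    """Convert a whole CSV export into a flat list of N-Triples lines."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [triple for row in reader for triple in row_to_triples(row)]


# Illustrative input only; not real IOGDS data.
SAMPLE_CSV = """dataset_uri,catalog_uri,title,landing_page,keywords
http://example.org/dataset/1,http://example.org/catalog/a,Air Quality 2011,http://example.org/aq,air; environment
"""

if __name__ == "__main__":
    print("\n".join(csv_to_ntriples(SAMPLE_CSV)))
```

Keeping the intermediate form as plain CSV means each per-site scraper only has to emit rows, while the RDF mapping lives in one place.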
Schema.org: Semantic Markup for Discovery

TWC RPI has published dataset listings based on IOGDS using emerging microdata standards, especially the schema.org model endorsed by Bing, Google, Yahoo!, Yandex...

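As a rough illustration of microdata markup for a dataset listing, the sketch below renders one record as an HTML fragment. It assumes the `Dataset` type and the general `name`, `url`, `description` and `keywords` properties as they exist on schema.org today; the exact shape of the 2012 proposal is not given in these slides, and the record fields are hypothetical.

```python
from html import escape


def dataset_microdata(record):
    """Render one dataset record as an HTML fragment with schema.org microdata."""
    lines = ['<div itemscope itemtype="https://schema.org/Dataset">']
    lines.append(
        f'  <a itemprop="url" href="{escape(record["url"], quote=True)}">'
        f'<span itemprop="name">{escape(record["name"])}</span></a>'
    )
    if record.get("description"):
        lines.append(f'  <p itemprop="description">{escape(record["description"])}</p>')
    for keyword in record.get("keywords", []):
        lines.append(f'  <span itemprop="keywords">{escape(keyword)}</span>')
    lines.append("</div>")
    return "\n".join(lines)


if __name__ == "__main__":
    # Illustrative record only; not taken from a real catalog.
    print(dataset_microdata({
        "name": "Air Quality 2011",
        "url": "http://example.org/dataset/1",
        "description": "Hourly readings from monitoring stations.",
        "keywords": ["air", "environment"],
    }))
```

Because the markup sits in the same HTML page that users see, a catalog can add it to existing listing templates without publishing a separate feed.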
Schema.org datasets extension

• TWC RPI's schema.org dataset extension will enable government dataset catalogs to be parsed and indexed more easily by the major search engines...
• ...which will help users find relevant datasets!
• TWC's dataset extension entered public discussion in June 2012

Schema.org datasets extension

The schema.org datasets extension enables relevant datasets to be more easily discovered by a range of stakeholders including researchers, data journalists, bloggers and developers

Schema.org datasets extension

“...we've reviewed the current datasets schema proposal in draft, and we are comfortable with the current state of things...”

“...At this point, if the group would solidify on the dataset proposal, then Data.gov would support and use it.”

---Chris Musialek

CKAN Data Catalog Scheme & Protocol

• API-based catalog federation is also possible
• CKAN announced a DCAT-based query/federation API
• This enables OAI-PMH-like harvesting and more

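The idea of OAI-PMH-like harvesting can be sketched as an incremental pull: ask each catalog only for records changed since the last visit and merge them into one federated index. The sketch below is an assumption-laden illustration, not the CKAN API: `InMemoryCatalog`, `list_records` and the record fields are hypothetical stand-ins for whatever a real catalog endpoint exposes.

```python
from datetime import date


class InMemoryCatalog:
    """Hypothetical stand-in for a remote catalog API."""

    def __init__(self, name, records):
        self.name = name
        self._records = records  # dicts with "id", "title", "modified"

    def list_records(self, modified_since=None):
        """Return records changed on or after modified_since (all if None)."""
        return [r for r in self._records
                if modified_since is None or r["modified"] >= modified_since]


def harvest(catalogs, index, last_harvest):
    """Merge changed records from every catalog into one federated index.

    index maps (catalog name, record id) to the record; last_harvest maps
    catalog name to the newest modification date seen so far.
    """
    for catalog in catalogs:
        since = last_harvest.get(catalog.name)
        for record in catalog.list_records(modified_since=since):
            # Upsert, so re-fetching an unchanged record is harmless.
            index[(catalog.name, record["id"])] = record
            newest = last_harvest.get(catalog.name)
            if newest is None or record["modified"] > newest:
                last_harvest[catalog.name] = record["modified"]
    return index


if __name__ == "__main__":
    # Illustrative records only.
    roads = InMemoryCatalog("a", [{"id": "1", "title": "Roads", "modified": date(2012, 5, 1)}])
    index, seen = {}, {}
    harvest([roads], index, seen)
    print(len(index), seen)
```

The "since" bound is inclusive, as with OAI-PMH selective harvesting, which is why the index is keyed by catalog and record id rather than appended to.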
Other Thoughts...

Geo-based discovery:
What data is available by geo-selection?

Provenance-based discovery:
How do I get the data that someone else used? “Get the Data”

Community/social-based discovery:
Dude, check out this data! (Linked Data is perfect for this...)

Other Thoughts...


    Geo-based discovery:
    What data is available by geo-selection?


    DATA.GOV Geo Viewer
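One simple way to answer "what data is available by geo-selection?" is to store a bounding box with each dataset and test it against the box the user selects. The following is a minimal sketch under that assumption; it is not how the DATA.GOV Geo Viewer works, and the dataset titles and coordinates are made up for illustration.

```python
from typing import NamedTuple


class BBox(NamedTuple):
    """Bounding box in decimal degrees."""
    west: float
    south: float
    east: float
    north: float


def intersects(a, b):
    """True when two boxes overlap or touch (no antimeridian handling)."""
    return (a.west <= b.east and b.west <= a.east
            and a.south <= b.north and b.south <= a.north)


def datasets_in_selection(datasets, selection):
    """Return titles of datasets whose coverage intersects the selected box."""
    return [d["title"] for d in datasets if intersects(d["bbox"], selection)]


if __name__ == "__main__":
    # Approximate, illustrative coverage boxes.
    datasets = [
        {"title": "NY traffic counts", "bbox": BBox(-79.8, 40.5, -71.8, 45.0)},
        {"title": "UK flood zones", "bbox": BBox(-8.6, 49.9, 1.8, 60.9)},
    ]
    print(datasets_in_selection(datasets, BBox(-73.8, 42.6, -73.6, 42.8)))
```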
Other Thoughts...


    Community/social-based discovery:
    Dude, check out this data!


    OPENEI.org
Choose your own medicine...
but do expose your metadata
and get your catalogs discovered!
