RDFizing PubMed Central with Biotea

•Als PPTX, PDF herunterladen•

0 gefällt mir•558 views

Biotea is a semantic dataset that RDFizes (converts to RDF) the open-access subset of PubMed Central. It makes scholarly documents and their metadata interconnected by extensively using existing ontologies and semantic enrichment services. This allows the generation of machine-readable scholarly documents that are self-describing. The Biotea dataset and tools provide a flexible and adaptable way to semantically enrich and process biomedical documents into a highly interconnected and semantically rich dataset.

Technologie

$Biotea: RDFizing PubMed Central in Support for the Paper as an Interface to the Web of Data Alexander Garcia, Casey Mclaughlin, Institute for Digital Information, Florida State University. Tallahassee Leyla Garcia Castro Departamento de Leguajes y Sistemas Informáticos Universitat Jaumé I Corresponding author: alexgarciac@gmail.com Scholarly data and documents are of most value when they are interconnected rather than independent Christine L. Borgman In a nutshell, Biotea at http://biotea.idiginfo.org • Is a semantic dataset for full-text, open-access subset of PubMed Central • Makes extensive use of existing ontologies and semantic enrichment services • Supports the generation of self-describing machine- readable scholarly documents. • Comprises a flexible and adaptable set of tools for metadata enrichment and semantic processing of biomedical documents. • Provides semantically rich and highly interconnected dataset with self-describing content. RDF4PMC, our workflow 1. Metadata & content Metadata Content RDFized article Provenance NXML 2. Semantic content enrichment Enriched content RDFization Annotation RDF4PMC and Bio2RDF Consuming the dataset, a first prototype 3. Navigating the neighborhood 2. Enriched content  facts-based reading 1. Retrieval: Metadata + Cloud of annotations Contextual reading Graphical tools Interactive zone Search and retrieval based on human gene names: the term is resolved with GeneWiki, and the associated UniProt accession is used in the query Enriched content based on annotations is displayed in the interactive zone Graph-based retrieval for the terms “catalase”; only shared terms with more than 30 associated biological terms are included in the results. Consuming the dataset, SPARQL and API Retrieval Service A list of terms and their related topics SELECT distinct ?pmid WHERE { ?article a bibo:AcademicArticle ; bibo:pmid ?pmid . ?annotation a aot:ExactQualifier ; ao:annotatesResource ?article ; ao:hasTopic <http://purl.obolibrary.org/obo/CHEBI_60004> . } have been semantically annotated with the biological  entity CHEBI:60004. The semantic annotation comes from the occurrence of the term “mixture” in any paragraph of the retrieved articles. e.g., http://biotea.idiginfo.org/api/topics?term=cancer e.g., http://biotea.idiginfo.org/api/vocabularies?term=cancer All terms that start with a specific string (for autocompletion) e.g.,http://biotea.idiginfo.org/api/terms?prefix=canc All topics related to a vocabulary e.g., http://biotea.idiginfo.org/api/topics?vocabulary=po RDF of articles that include a term e.g., http://biotea.idiginfo.org/api/articles?term=cancer Count of RDF of articles that include a term e.g., http://biotea.idiginfo.org/api/articles?term=cancer&count=true A list of vocabularies and their prefixes http://biotea.idiginfo.org/vocabularies RDF of articles that include a vocabulary Retrieving PubMed identifier for those articles that http://biotea.idiginfo.org/api/topics All vocabularies related to a term Query expressed in natural language A list of topics and their related vocabularies All topics related to a term SPARQL query http://biotea.idiginfo.org/api/terms e.g., http://biotea.idiginfo.org/api/articles?vocabulary=po AGC and CM have been funded by US DoD Grant MOMRP w81xwh-10-2-0181.$

Weitere ähnliche Inhalte

Was ist angesagt?

Journal TOCs Presentation at EUROCRISazami

Presentation from Code Camp 2017Mitch Miller

Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...Trish Whetzel

2016 bmdid-mappingsMichel Dumontier

New member Crossref

Cedar Overviewjbgraybeal

Chemspider Presentation at the ACS Meeting in New orleansUS Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure

Webtools For Reference Searchwiser pku

Funding data & the Funder RegistryCrossref

Talk_linked_data_for_hcls_at_iswc2009Jun Zhao

Getting started with Reference LinkingCrossref

crossref Cited byCrossref

Chemical Abstracts to Scifinder ScholarBruce Slutsky

How an Online Resource for Chemistry Can Change Our WorldUS Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure

Funding data for researchCrossref

Crossmark Update WebinarCrossref

Crossref Metadata and Metadata ServicesCrossref

MENGGUNAKAN METADATA PADA CROSSREFRelawan Jurnal Indonesia

CrossRef Branding UpdateCrossref

bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...dkNET

Was ist angesagt? (20)

Journal TOCs Presentation at EUROCRIS

Presentation from Code Camp 2017

Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...

2016 bmdid-mappings

New member

Cedar Overview

Chemspider Presentation at the ACS Meeting in New orleans

Webtools For Reference Search

Funding data & the Funder Registry

Talk_linked_data_for_hcls_at_iswc2009

Getting started with Reference Linking

crossref Cited by

Chemical Abstracts to Scifinder Scholar

How an Online Resource for Chemistry Can Change Our World

Funding data for research

Crossmark Update Webinar

Crossref Metadata and Metadata Services

MENGGUNAKAN METADATA PADA CROSSREF

CrossRef Branding Update

bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...

Andere mochten auch

Amy siegel resumeAmy Siegel

1937germanlee_ncku

Final interviewChristian Sario

국빈카지노 ''SX797．ＣＯＭ'' 생중계경마sdhfisjuh

CPR CertCourtney Bergk

Doing Business Naked: Research & Communication in the Age of Full TransparencyJoseph Stabb, ABD

라이브바카라[[SX797。CΟＭ]]모바일카지노 jertgerh

Mobile Web MarketingJoseph Stabb, ABD

스포츠북하는법 ''SX797．ＣＯＭ'' 실전마작jdpofgk

Resume_sales augGeorge Smead

동네영웅『SX797』『СＯM』생중계바카라jdpofgk

吾主上帝，涯大君王gaanchurch

Building regualations-for-noida 0Praveen chandra Chaurasia

Presenting OhSoLocal at Podcamp TorontoJoseph Stabb, ABD

Migracion de rational a sql server 2014nelson rodriguez huallpa

TCC - RELAÇÃO FAMÍLIA E ESCOLAJJOAOPAULO7

Andere mochten auch (16)

Amy siegel resume

1937

Final interview

국빈카지노 ''SX797．ＣＯＭ'' 생중계경마

CPR Cert

Doing Business Naked: Research & Communication in the Age of Full Transparency

라이브바카라[[SX797。CΟＭ]]모바일카지노

Mobile Web Marketing

스포츠북하는법 ''SX797．ＣＯＭ'' 실전마작

Resume_sales aug

동네영웅『SX797』『СＯM』생중계바카라

吾主上帝，涯大君王

Building regualations-for-noida 0

Presenting OhSoLocal at Podcamp Toronto

Migracion de rational a sql server 2014

TCC - RELAÇÃO FAMÍLIA E ESCOLA

Ähnlich wie RDFizing PubMed Central with Biotea

Information Searching SkillsAnn Celestine

Ontology Web Services for Semantic Applications Trish Whetzel

Boston SciVerse "Brunch & Learn"colleeflower22

Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Stuart Chalk

Implementing chemistry platform for OpenPHACTSValery Tkachenko

Freedom for bibliographic references: OpenCitations ariseUniversity of Bologna

Health Datapalooza 2013: Open Government Data - Natasha NoyHealth Data Consortium

An Open Annotation Ontology For Science On Web 3.0Natasha Grant

Literature Based Framework for Semantic Descriptions of e-Science resourcesHammad Afzal

Building an integrated system for chemistry markup and online publishing inte...US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure

Semantic citationDeepak K

OpenCitationsUniversity of Bologna

SciVerse @ TJUrachelmccullough

ScienceDirect Presentation: Seton Hallrachelmccullough

SpotlightStefano Lariccia

Holy Cross Lunch and Learnrachelmccullough

Cataloger 3.0: Competencies and Education for the BIBFRAME CatalogAllison Jai O'Dell

The benefits of using Crossref metadata for libraries and scientists - Crossr...Crossref

Open Access NBIC Workshop April 19, 2011Philip Bourne

The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble

Ähnlich wie RDFizing PubMed Central with Biotea (20)

Information Searching Skills

Ontology Web Services for Semantic Applications

Boston SciVerse "Brunch & Learn"

Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...

Implementing chemistry platform for OpenPHACTS

Freedom for bibliographic references: OpenCitations arise

Health Datapalooza 2013: Open Government Data - Natasha Noy

An Open Annotation Ontology For Science On Web 3.0

Literature Based Framework for Semantic Descriptions of e-Science resources

Building an integrated system for chemistry markup and online publishing inte...

Semantic citation

OpenCitations

SciVerse @ TJU

ScienceDirect Presentation: Seton Hall

Spotlight

Holy Cross Lunch and Learn

Cataloger 3.0: Competencies and Education for the BIBFRAME Catalog

The benefits of using Crossref metadata for libraries and scientists - Crossr...

Open Access NBIC Workshop April 19, 2011

The swings and roundabouts of a decade of fun and games with Research Objects

Mehr von alexander garcia

Pptx4landing pagealexander garcia

literature based discoveryalexander garcia

Knowledge Driven User Interfaces for Complex Biological Queriesalexander garcia

Nanotweetsalexander garcia

Paper as a Research Objectalexander garcia

RDF for PubMedCentral alexander garcia

Monday presentation 1336-may23alexander garcia

Mehr von alexander garcia (7)

Pptx4landing page

literature based discovery

Knowledge Driven User Interfaces for Complex Biological Queries

Nanotweets

Paper as a Research Object

RDF for PubMedCentral

Monday presentation 1336-may23

Kürzlich hochgeladen

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan

SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521

Commit 2024 - Secret Management made easyAlfredo García Lavilla

DevEX - reference for building teams, processes, and platformsSergiu Bodiu

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

WordPress Websites for Engineers: Elevate Your Brandgvaughan

Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc

Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed

Gen AI in Business - Global Trends Report 2024.pdfAddepto

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely

DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell

Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB

Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro

Kürzlich hochgeladen (20)

What's New in Teams Calling, Meetings and Devices March 2024

Generative AI for Technical Writer or Information Developers

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES

Commit 2024 - Secret Management made easy

DevEX - reference for building teams, processes, and platforms

Are Multi-Cloud and Serverless Good or Bad?

WordPress Websites for Engineers: Elevate Your Brand

Unleash Your Potential - Namagunga Girls Coding Club

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy

Scanning the Internet for External Cloud Exposures via SSL Certs

Gen AI in Business - Global Trends Report 2024.pdf

SIP trunking in Janus @ Kamailio World 2024

Streamlining Python Development: A Guide to a Modern Project Setup

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf

DSPy a system for AI to Write Prompts and Do Fine Tuning

Developer Data Modeling Mistakes: From Postgres to NoSQL

Unraveling Multimodality with Large Language Models.pdf

RDFizing PubMed Central with Biotea

1. Biotea: RDFizing PubMed Central in Support for the Paper as an Interface to the Web of Data Alexander Garcia, Casey Mclaughlin, Institute for Digital Information, Florida State University. Tallahassee Leyla Garcia Castro Departamento de Leguajes y Sistemas Informáticos Universitat Jaumé I Corresponding author: alexgarciac@gmail.com Scholarly data and documents are of most value when they are interconnected rather than independent Christine L. Borgman In a nutshell, Biotea at http://biotea.idiginfo.org • Is a semantic dataset for full-text, open-access subset of PubMed Central • Makes extensive use of existing ontologies and semantic enrichment services • Supports the generation of self-describing machine- readable scholarly documents. • Comprises a flexible and adaptable set of tools for metadata enrichment and semantic processing of biomedical documents. • Provides semantically rich and highly interconnected dataset with self-describing content. RDF4PMC, our workflow 1. Metadata & content Metadata Content RDFized article Provenance NXML 2. Semantic content enrichment Enriched content RDFization Annotation RDF4PMC and Bio2RDF Consuming the dataset, a first prototype 3. Navigating the neighborhood 2. Enriched content  facts-based reading 1. Retrieval: Metadata + Cloud of annotations Contextual reading Graphical tools Interactive zone Search and retrieval based on human gene names: the term is resolved with GeneWiki, and the associated UniProt accession is used in the query Enriched content based on annotations is displayed in the interactive zone Graph-based retrieval for the terms “catalase”; only shared terms with more than 30 associated biological terms are included in the results. Consuming the dataset, SPARQL and API Retrieval Service A list of terms and their related topics SELECT distinct ?pmid WHERE { ?article a bibo:AcademicArticle ; bibo:pmid ?pmid . ?annotation a aot:ExactQualifier ; ao:annotatesResource ?article ; ao:hasTopic <http://purl.obolibrary.org/obo/CHEBI_60004> . } have been semantically annotated with the biological  entity CHEBI:60004. The semantic annotation comes from the occurrence of the term “mixture” in any paragraph of the retrieved articles. e.g., http://biotea.idiginfo.org/api/topics?term=cancer e.g., http://biotea.idiginfo.org/api/vocabularies?term=cancer All terms that start with a specific string (for autocompletion) e.g.,http://biotea.idiginfo.org/api/terms?prefix=canc All topics related to a vocabulary e.g., http://biotea.idiginfo.org/api/topics?vocabulary=po RDF of articles that include a term e.g., http://biotea.idiginfo.org/api/articles?term=cancer Count of RDF of articles that include a term e.g., http://biotea.idiginfo.org/api/articles?term=cancer&count=true A list of vocabularies and their prefixes http://biotea.idiginfo.org/vocabularies RDF of articles that include a vocabulary Retrieving PubMed identifier for those articles that http://biotea.idiginfo.org/api/topics All vocabularies related to a term Query expressed in natural language A list of topics and their related vocabularies All topics related to a term SPARQL query http://biotea.idiginfo.org/api/terms e.g., http://biotea.idiginfo.org/api/articles?vocabulary=po AGC and CM have been funded by US DoD Grant MOMRP w81xwh-10-2-0181.

RDFizing PubMed Central with Biotea

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (16)

Ähnlich wie RDFizing PubMed Central with Biotea

Ähnlich wie RDFizing PubMed Central with Biotea (20)

Mehr von alexander garcia

Mehr von alexander garcia (7)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

RDFizing PubMed Central with Biotea