How do we know what we don’t know: Using the Neuroscience
Information Framework to reveal knowledge gaps
Maryann E. Martone, Ph. D.
University of California, San Diego
Tools for Integrating and Planning Experiments in
Neuroscience-UCLA March 11, 2014
We say this to each other all the time,
but we set up systems for scholarly
advancement and communication that
are the antithesis of integration
Whole brain data
(20 µm
microscopic MRI)
Mosaic LM
images (1 GB+)
Conventional LM
images
Individual cell
morphologies
EM volumes &
reconstructions
Solved molecular
structures
No single technology serves
these all equally well.
Multiple data types;
multiple scales; multiple
databases
A data integration problem
• NIF is an initiative of the NIH Blueprint consortium of institutes
– What types of resources (data, tools, materials, services) are available to the
neuroscience community?
– How many are there?
– What domains do they cover? What domains do they not cover?
– Where are they?
• Web sites
• Databases
• Literature
• Supplementary material
– Who uses them?
– Who creates them?
– How can we find them?
– How can we make them better in the future?
http://neuinfo.org
• PDF files
• Desk drawers
Old Model: Single type of content; single
mode of distribution
Scholar
Library
Scholar
Publisher
Systems for cataloging, standards, and citation in place
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
The duality of modern scholarship
Observation: Those who build information systems from the
machine side don’t understand the requirements of the
human very well
Those who build information systems from the human side,
don’t understand requirements of machines very well
Scholarship requires the ability to cite and track usage of scholarly
artifacts. In our current mode of working, there is no way to track
artifacts as they move through the ecosystem; no way to incrementally
add human expertise; no way to look across the entirety
Whither neuroscience information?
∞
What is easily machine
processable and accessible
What is potentially knowable
What is known:
Literature, images, human
knowledge
Unstructured;
Natural language
processing, entity
recognition, image
processing and
analysis; paywalls
communication
Abstracts vs full
text vs tables etc
NIF: A New Type of Entity for New Modes of
Scientific Dissemination
• NIF’s mission is to maximize the awareness of, access to
and utility of research resources produced worldwide to
enable better science and promote efficient use
– NIF unites neuroscience information without respect to
domain, funding agency, institute or community
– NIF is like a “PubMed” for all biomedical resources and a “PubMed
Central” for databases
– Makes them searchable from a single interface
– Practical and cost-effective; tries to be sensible
– Learned a lot about effective data sharing
The Neuroscience Information Framework is an initiative of the
NIH Blueprint consortium of institutes http://neuinfo.org
Surveying the resource
landscape
Data Federation: Deep search
http://neuinfo.org
With the thousands of databases and other information sources
available, simple descriptive metadata will not suffice
A unified framework for neuroscience
Hippocampus OR “Cornu Ammonis” OR
“Ammon’s horn”
NIF queries > 200 databases; ~400 million records
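The expanded query shown above (Hippocampus OR “Cornu Ammonis” OR “Ammon’s horn”) can be sketched as a simple synonym expansion. This is a minimal illustration, not NIF’s implementation: the SYNONYMS table and the expand_query function are hypothetical stand-ins for a lookup against the ontology.

```python
# Hypothetical synonym table; in practice the synonyms would come from an ontology lookup.
SYNONYMS = {
    "hippocampus": ["Cornu Ammonis", "Ammon's horn"],
}


def expand_query(term, synonyms=SYNONYMS):
    """Return an OR query over the term and its known synonyms."""
    alternatives = [term] + synonyms.get(term.lower(), [])
    # Quote multi-word phrases so they are searched as a unit.
    quoted = [f'"{t}"' if " " in t else t for t in alternatives]
    return " OR ".join(quoted)


print(expand_query("Hippocampus"))
```

A term with no entry in the table is passed through unchanged, so the user never gets fewer results than with the literal search.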
NIF Semantic Framework: NIFSTD ontology
• NIF uses ontologies to help navigate across and unify neuroscience
resources
• Ontologies are built from community ontologies → cross integration with
other domains
NIFSTD
Organism
NS Function
Molecule
Investigation
Subcellular structure
Macromolecule
Gene
Molecule Descriptors
Techniques
Reagent
Protocols
Cell
Resource
Instrument
Dysfunction
Quality
Anatomical Structure
Purkinje
Cell
Axon
Terminal
Axon
Dendritic
Tree
Dendritic
Spine
Dendrite
Cell body
Cerebellar
cortex
Bringing knowledge to data: Ontologies as framework
There is little obvious connection between
data sets taken at different scales using
different microscopies without an explicit
representation of the biological objects that
the data represent
Neurolex: > 1 million triples
Dr. Yi Zeng: Chinese neural knowledge base
NIF Cell Graph
This is your brain on
computers
Ontologies as a data integration framework
•NIF Connectivity: 7 databases containing connectivity primary data or claims
from literature on connectivity between brain regions
•Brain Architecture Management System (rodent)
•Temporal lobe.com (rodent)
•Connectome Wiki (human)
•Brain Maps (various)
•CoCoMac (primate cortex)
•UCLA Multimodal database (Human fMRI)
•Avian Brain Connectivity Database (Bird)
•Total: 1800 unique brain terms (excluding Avian)
•Number of exact terms used in > 1 database: 42
•Number of synonym matches: 99
•Number of 1st order partonomy matches: 385
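The counts above distinguish exact term matches from matches that only appear once synonyms are resolved. A minimal sketch of that comparison follows; the database names, term lists, and synonym table are made up for illustration and are not the data behind the slide.

```python
from collections import defaultdict

# Hypothetical term lists for two connectivity databases.
databases = {
    "db_a": ["Hippocampus", "Substantia nigra pars compacta"],
    "db_b": ["hippocampus", "SNpc", "Area 17"],
}

# Hypothetical synonym table: lower-cased label -> preferred label.
preferred = {
    "snpc": "substantia nigra pars compacta",
}


def canonical(term, use_synonyms):
    """Normalize case and whitespace, optionally resolving synonyms."""
    key = term.strip().lower()
    return preferred.get(key, key) if use_synonyms else key


def terms_in_multiple(dbs, use_synonyms=False):
    """Return canonical terms that occur in more than one database."""
    sources = defaultdict(set)
    for name, terms in dbs.items():
        for term in terms:
            sources[canonical(term, use_synonyms)].add(name)
    return {t for t, names in sources.items() if len(names) > 1}


exact = terms_in_multiple(databases)
with_synonyms = terms_in_multiple(databases, use_synonyms=True)
print(sorted(exact))                  # shared by exact label
print(sorted(with_synonyms - exact))  # shared only after synonym resolution
```

Partonomy matches would need a further step that walks part-of relations in the ontology, which this sketch leaves out.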
0
1-10
11-100
>101
Open World-Closed World: Mapping the knowledge - data space
Data Sources
NIF lets us ask: where isn’t there data? What isn’t studied? Why?
Forebrain
Midbrain
Hindbrain
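The region-by-source map can be sketched as a table of record counts binned into the legend categories. The counts and source names below are invented for illustration, and the top bin is taken here to mean more than 100 records.

```python
def bucket(count):
    """Map a record count to a legend bin."""
    if count == 0:
        return "0"
    if count <= 10:
        return "1-10"
    if count <= 100:
        return "11-100"
    return ">100"


# Hypothetical record counts: rows are brain regions, columns are data sources.
counts = {
    "Forebrain": {"source_a": 250, "source_b": 12},
    "Midbrain": {"source_a": 7, "source_b": 0},
    "Hindbrain": {"source_a": 0, "source_b": 0},
}

# Binned view of the table, one legend label per cell.
heatmap = {
    region: {src: bucket(n) for src, n in row.items()}
    for region, row in counts.items()
}

# Regions with no records in any source are candidate knowledge gaps.
gaps = [region for region, row in counts.items() if all(n == 0 for n in row.values())]
print(heatmap)
print(gaps)
```

An empty row does not say why the data are missing; it only marks where to ask the question.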
0
1-10
11-100
>101
Neuroimaging Data-Knowledge Space?
Data Sources
“The Data Homunculus”
Funding drives representation in the data space
Neurolex.org: A computable
lexicon for neuroscience
http://neurolex.org Larson et al, Frontiers in Neuroinformatics, 2013
•Semantic MediaWiki
•Provide a simple interface
for defining the concepts
required
•Light weight semantics
•Community based:
•Anyone can contribute their
terms, concepts, things
•Anyone can edit
•Anyone can link
•Accessible: searched by Google
•Growing into a significant
knowledge base for
neuroscience
•25,000 concepts
Demo D03
200,000
edits
150
contributors
200,000
edits
150
contributors
Neurolex Structural Lexicon: Defining brainNeurolex Structural Lexicon: Defining brain
partsparts
Structural LexiconStructural Lexicon
The scourge of neuroanatomical nomenclatureThe scourge of neuroanatomical nomenclature
• Problem: Neuroscientists have
myriad ways to
parcellate the brain
– Brains are made up of networks
that do not respect gross
anatomical boundaries
– Partonomies are generally along
multiple axes:
• Volumetric (species
dependent): NeuroNames
• Functional (Swanson)
• Developmental
• Cytoarchitectural
– Partonomies are often weak
• Arbitrary but defensible
Program on Ontologies for Neural Structures, INCF:
creating a computable lexicon for neural structures
Neuroanatomy without borders
Brainmaps.org
Structural Lexicon in Neurolex
Brain
Region
Brain
Parcel
•Trans-species
•“Stateless”, i.e. no universal defining
criteria
•General structures and partonomies
based on Neuroanatomy 101
Partially overlaps
e.g., Hippocampus, Dentate gyrus
•Species specific
•Specific reference
•Defining criteria
•Sometimes partonomy;
sometimes not
e.g., Hippocampus of ABA2009
“When I use a word...it means what I choose it
to mean”
Neurolex Neuron
• Led by Dr. Gordon
Shepherd
• > 30 worldwide
experts
• Simple set of
properties
• Consistent naming
scheme
• Integrated with
Structural Lexicon
• Used for annotation in
other resources, e.g.,
NeuroElectro
“You have broken links”
Red Links: Information is missing (or misspelled)
Location of Cell Soma
Location of dendrites
Location of local axon
arbor
Analysis of Red Links in the Neuron Registry
• Analysis of red links
tells us where
instructions aren’t
clear, the information
isn’t available, or the
model is insufficient
– Conceptualization not
clear
• What is the most important
thing about local axon
terminals?
– Tool doesn’t capture
all details
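A red-link analysis of this kind can be sketched as a scan for property values that point at pages that do not exist, tallied per property. The page titles and the data layout below are hypothetical; Neurolex itself is a Semantic MediaWiki, and this is not its API.

```python
from collections import Counter

# Hypothetical wiki: page title -> {property name: [linked page titles]}.
pages = {
    "Cerebellum Purkinje cell": {
        "Location of cell soma": ["Purkinje cell layer of cerebellar cortex"],
        "Location of dendrites": ["Molecular layer of cerebellar cortex"],
        "Location of local axon arbor": ["Granule cell layer of cerebellar cortex"],
    },
    "Purkinje cell layer of cerebellar cortex": {},
    "Molecular layer of cerebellar cortex": {},
}


def red_links(wiki):
    """Return (page, property, target) for every link to a page that does not exist."""
    missing = []
    for page, properties in wiki.items():
        for prop, targets in properties.items():
            for target in targets:
                if target not in wiki:
                    missing.append((page, prop, target))
    return missing


# Tally red links per property to see which fields contributors struggle with.
by_property = Counter(prop for _, prop, _ in red_links(pages))
print(red_links(pages))
print(by_property.most_common())
```

A property that collects many red links across neurons is a hint that the instructions or the model for that property need attention, rather than that individual contributors made mistakes.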
Social networks and community sites let us learn things from the
collective behavior of contributors → INCF/HBP Knowledge Space
Re-inventing Narrative: Do I have to write in
triples?
• Not all entities are well-enough specified that they
lend themselves to deep annotation
– And, as we’ve seen in the previous example, we probably
don’t want to pretend that they are
• But…sometimes they are
– Semantic annotation of research papers to make them
“machine-interpretable” has been a goal of many
– Can we update the way that authors produce manuscripts
so that they are easier to process?
• NIF pilot project: Semantic annotation of entities
that researchers would understand
The problem: How many papers were
published that used my: antibody
Paz et al,
J Neurosci, 2010
Now, go find the antibody
http://www.millipore.com/searchsummary.do?tabValue=&q=gfap Nov 12,
Jan 15, 2014
A catalog number is not a persistent identifier
The
problem is
general
across
multiple
resource
types and
disciplines
Vasilevsky et al, Peer J 2013
If we can’t do it,
neither can the robot
• Automated text mining tools were not
deployed on this problem, because too few
antibodies could be automatically
identified
• We are asking authors to change their ways,
instead!
• Almost all antibodies were identified with the
company name, city and state, but the
information is useless if the goal is to identify
the antibody used
The Resource Identification Initiative
• NIF, FORCE11 and
partners
– Led by Anita
Bandrowski and
Melissa Haendel
• Identify 3 types of
research resources
– Antibodies
– Genetically
modified animals
– Software
http://force11.org/Resource_identification_initiative
Musings: You can’t do that!
• Two powerful trends in the 21st century:
– Networking machines and networking people
– Moving science into a machine-accessible platform has been a challenge
• Mechanistically
• Culturally
• Sociologically
• “A foolish consistency is the hobgoblin of little minds”
– When you have a lot of data and information in an accessible form, we
can start to look at actual practices and trends
– Focusing on the “negative space”, i.e., what we don’t know, reveals
glimpses into sources of bias and confusion
• When we scratch the surface of science, we find uncertainty and confusion
– Not a failure, but an opportunity
• Sometimes we can be precise, i.e., which reagents we used
• Sometimes we can’t → so we should set up systems so we can learn from that
Next Steps: Neurolex to Knowledge Space
Data Space
Laboratory
Space
Knowledge
Space
BAMS
Lexicon
Encyclopedia
Anatomist
Informaticist
What is the “completeness” of our knowledge?
Neocortex
Olfactory bulb
Neostriatum
Cochlear nucleus
All neurons with cell bodies in the same brain region are grouped
together
Properties in Neurolex
•Simple set of
properties that can
be reasonably
supplied with a
minimal amount of
effort
The case of the meanest journal in the world,
coincidentally having the lowest retraction rate
The landscape is messy, diverse and evolving: Data to
Knowledge – Knowledge to Data
NIF favors a hybrid, tiered,
federated system
• Domain knowledge
– Ontologies
• Claims, models and
observations
– Virtuoso RDF triples
– Model repositories
• Data
– Data federation
– Spatial data
– Workflows
• Narrative
– Full text access
Neuron
Brain part
Disease
Organism
Gene
Caudate projects to
SNpc
Grm1 is upregulated
in chronic cocaine
Betz cells
degenerate in ALS
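The three example claims above can be written as subject, predicate, object triples. The sketch below uses plain Python tuples rather than an RDF store; the Triple type, the predicate wording, and the match helper are assumptions for illustration only.

```python
from collections import namedtuple

Triple = namedtuple("Triple", "subject predicate object")

# The example claims from the slide, split into three parts each.
claims = [
    Triple("Caudate", "projects to", "SNpc"),
    Triple("Grm1", "is upregulated in", "chronic cocaine"),
    Triple("Betz cells", "degenerate in", "ALS"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return the triples that agree with every field that is not None."""
    return [
        t for t in triples
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.object == obj)
    ]


print(match(claims, predicate="projects to"))
print(match(claims, obj="ALS"))
```

Once claims share this shape, a neuron, a brain part, a gene, and a disease can all be queried through the same pattern, which is the point of linking them through a common framework.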
NIF provides the tentacles that connect the pieces: a
new type of entity for 21st century science
Technique
People
Data about the subthalamus
http://neuinfo.org

Weitere ähnliche Inhalte

Was ist angesagt?

Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...
Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...
Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...Bryan Heidorn
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNADaniel S. Katz
 
Data Landscapes: The Neuroscience Information Framework
Data Landscapes:  The Neuroscience Information FrameworkData Landscapes:  The Neuroscience Information Framework
Data Landscapes: The Neuroscience Information FrameworkMaryann Martone
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation HeidornBryan Heidorn
 
Phyloinformatics and the Semantic Web
Phyloinformatics and the Semantic WebPhyloinformatics and the Semantic Web
Phyloinformatics and the Semantic WebRutger Vos
 
Looking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebLooking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebValentina Presutti
 
ICDMWorkshopProposal.doc
ICDMWorkshopProposal.docICDMWorkshopProposal.doc
ICDMWorkshopProposal.docbutest
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
 
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...Artificial Intelligence Institute at UofSC
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceJian Qin
 
BCIs and DNA Nanotechnology
BCIs and DNA NanotechnologyBCIs and DNA Nanotechnology
BCIs and DNA NanotechnologyMelanie Swan
 
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...summersocialwebshop
 

Was ist angesagt? (18)

Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...
Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...
Heidorn The Path to Enlightened Solutions for Biodiversity's Dark DataViBRANT...
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNA
 
Data Landscapes: The Neuroscience Information Framework
Data Landscapes:  The Neuroscience Information FrameworkData Landscapes:  The Neuroscience Information Framework
Data Landscapes: The Neuroscience Information Framework
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation Heidorn
 
Cyberistructure
CyberistructureCyberistructure
Cyberistructure
 
Phyloinformatics and the Semantic Web
Phyloinformatics and the Semantic WebPhyloinformatics and the Semantic Web
Phyloinformatics and the Semantic Web
 
Looking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebLooking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic Web
 
ICDMWorkshopProposal.doc
ICDMWorkshopProposal.docICDMWorkshopProposal.doc
ICDMWorkshopProposal.doc
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...
 
3234150
32341503234150
3234150
 
Summary of 3DPAS
Summary of 3DPASSummary of 3DPAS
Summary of 3DPAS
 
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
Inauguration Function - Ohio Center of Excellence in Knowledge-Enabled Comput...
 
[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni
[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni
[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information Science
 
BCIs and DNA Nanotechnology
BCIs and DNA NanotechnologyBCIs and DNA Nanotechnology
BCIs and DNA Nanotechnology
 
GSU-RF-2013-Reddy-3
GSU-RF-2013-Reddy-3GSU-RF-2013-Reddy-3
GSU-RF-2013-Reddy-3
 
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
 
Irt
IrtIrt
Irt
 

Ähnlich wie How do we know what we don’t know: Using the Neuroscience Information Framework to reveal knowledge gaps

RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkRDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkASIS&T
 
The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...Maryann Martone
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?Maryann Martone
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemMaryann Martone
 
The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...Neuroscience Information Framework
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...Neuroscience Information Framework
 
EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...
EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...
EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...Maryann Martone
 
Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...Neuroscience Information Framework
 
The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...Neuroscience Information Framework
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...Maryann Martone
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Nolan Nichols
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Maryann Martone
 
Biological Foundations for Deep Learning: Towards Decision Networks
 Biological Foundations for Deep Learning: Towards Decision Networks Biological Foundations for Deep Learning: Towards Decision Networks
Biological Foundations for Deep Learning: Towards Decision Networksdiannepatricia
 

Ähnlich wie How do we know what we don’t know: Using the Neuroscience Information Framework to reveal knowledge gaps (20)

RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkRDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
 
The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 
Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
 
The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...
 
Navigating the Neuroscience Data Landscape
Navigating the Neuroscience Data LandscapeNavigating the Neuroscience Data Landscape
Navigating the Neuroscience Data Landscape
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...
 
Martone acs presentation
Martone acs presentationMartone acs presentation
Martone acs presentation
 
EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...
EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...
EcsiNeurosciences Information Framework (NIF): An example of community Cyberi...
 
Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...
 
Martone grethe
Martone gretheMartone grethe
Martone grethe
 
The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
 
A Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource LandscapeA Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource Landscape
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11
 
Paul Groth
Paul GrothPaul Groth
Paul Groth
 
Biological Foundations for Deep Learning: Towards Decision Networks
 Biological Foundations for Deep Learning: Towards Decision Networks Biological Foundations for Deep Learning: Towards Decision Networks
Biological Foundations for Deep Learning: Towards Decision Networks
 

How do we know what we don’t know: Using the Neuroscience Information Framework to reveal knowledge gaps

  • 1. How do we know what we don’t know: Using the Neuroscience Information Framework to reveal knowledge gaps. Maryann E. Martone, Ph.D., University of California, San Diego. Tools for Integrating and Planning Experiments in Neuroscience, UCLA, March 11, 2014
  • 2. We say this to each other all the time, but we set up systems for scholarly advancement and communication that are the antithesis of integration. Whole brain data (20 µm microscopic MRI); mosaic LM images (1 GB+); conventional LM images; individual cell morphologies; EM volumes & reconstructions; solved molecular structures. No single technology serves these all equally well. Multiple data types; multiple scales; multiple databases. A data integration problem
  • 3. NIF is an initiative of the NIH Blueprint consortium of institutes – What types of resources (data, tools, materials, services) are available to the neuroscience community? – How many are there? – What domains do they cover? What domains do they not cover? – Where are they? • Web sites • Databases • Literature • Supplementary material • PDF files • Desk drawers – Who uses them? – Who creates them? – How can we find them? – How can we make them better in the future? http://neuinfo.org
  • 4. Old Model: Single type of content; single mode of distribution. Scholar; Library; Scholar; Publisher. Systems for cataloging, standards, and citation in place
  • 6. The duality of modern scholarship. Observation: Those who build information systems from the machine side don’t understand the requirements of the human very well. Those who build information systems from the human side don’t understand the requirements of machines very well. Scholarship requires the ability to cite and track usage of scholarly artifacts. In our current mode of working, there is no way to track artifacts as they move through the ecosystem; no way to incrementally add human expertise; no way to look across the entirety
  • 7. Whither neuroscience information? ∞ What is easily machine processable and accessible. What is potentially knowable. What is known: literature, images, human knowledge. Unstructured; natural language processing, entity recognition, image processing and analysis; paywalls communication. Abstracts vs full text vs tables etc.
  • 8. NIF: A New Type of Entity for New Modes of Scientific Dissemination • NIF’s mission is to maximize the awareness of, access to and utility of research resources produced worldwide to enable better science and promote efficient use – NIF unites neuroscience information without respect to domain, funding agency, institute or community – NIF is like a “PubMed” for all biomedical resources and a “PubMed Central” for databases – Makes them searchable from a single interface – Practical and cost-effective; tries to be sensible – Learned a lot about effective data sharing. The Neuroscience Information Framework is an initiative of the NIH Blueprint consortium of institutes http://neuinfo.org
  • 9. Surveying the resource landscape
  • 10. Data Federation: Deep search http://neuinfo.org With the thousands of databases and other information sources available, simple descriptive metadata will not suffice
  • 11. A unified framework for neuroscience. Hippocampus OR “Cornu Ammonis” OR “Ammon’s horn”. NIF queries > 200 databases; ~400 million records
  • 12. NIF Semantic Framework: NIFSTD ontology • NIF uses ontologies to help navigate across and unify neuroscience resources • Ontologies are built from community ontologies → cross integration with other domains. NIFSTD: Organism; NS Function; Molecule; Investigation; Subcellular structure; Macromolecule; Gene; Molecule Descriptors; Techniques; Reagent; Protocols; Cell; Resource; Instrument; Dysfunction; Quality; Anatomical Structure
  • 13. Purkinje Cell; Axon Terminal; Axon; Dendritic Tree; Dendritic Spine; Dendrite; Cell body; Cerebellar cortex. Bringing knowledge to data: Ontologies as framework. There is little obvious connection between data sets taken at different scales using different microscopies without an explicit representation of the biological objects that the data represent
  • 14. Neurolex: > 1 million triples. Dr. Yi Zeng: Chinese neural knowledge base. NIF Cell Graph. This is your brain on computers
  • 15. Ontologies as a data integration framework • NIF Connectivity: 7 databases containing connectivity primary data or claims from literature on connectivity between brain regions • Brain Architecture Management System (rodent) • Temporal lobe.com (rodent) • Connectome Wiki (human) • Brain Maps (various) • CoCoMac (primate cortex) • UCLA Multimodal database (Human fMRI) • Avian Brain Connectivity Database (Bird) • Total: 1800 unique brain terms (excluding Avian) • Number of exact terms used in > 1 database: 42 • Number of synonym matches: 99 • Number of 1st order partonomy matches: 385
  • 16. Open World-Closed World: Mapping the knowledge-data space. Data Sources (legend bins): 0; 1-10; 11-100; >101. NIF lets us ask: where isn’t there data? What isn’t studied? Why?
  • 18. “The Data Homunculus”: Funding drives representation in the data space
  • 19. Neurolex.org: A computable lexicon for neuroscience http://neurolex.org Larson et al, Frontiers in Neuroinformatics, 2013 • Semantic MediaWiki • Provide a simple interface for defining the concepts required • Lightweight semantics • Community based: • Anyone can contribute their terms, concepts, things • Anyone can edit • Anyone can link • Accessible: searched by Google • Growing into a significant knowledge base for neuroscience • 25,000 concepts. Demo D03. 200,000 edits; 150 contributors
  • 20. Neurolex Structural Lexicon: Defining brain parts
  • 21. Structural Lexicon: The scourge of neuroanatomical nomenclature • Problem: Neuroscientists have myriad ways to parcellate the brain – Brains are made up of networks that do not respect gross anatomical boundaries – Partonomies are generally along multiple axes: • Volumetric (species dependent): NeuroNames • Functional (Swanson) • Developmental • Cytoarchitectural – Partonomies are often weak • Arbitrary but defensible. Program on Ontologies for Neural Structures, INCF: creating a computable lexicon for neural structures
  • 22. Neuroanatomy without borders. Brainmaps.org
  • 23. Structural Lexicon in Neurolex. Brain Region: • Trans-species • “Stateless”, i.e. no universal defining criteria • General structures and partonomies based on Neuroanatomy 101; e.g., Hippocampus, Dentate gyrus. Partially overlaps. Brain Parcel: • Species specific • Specific reference • Defining criteria • Sometimes partonomy; sometimes not; e.g., Hippocampus of ABA2009
  • 24. “When I use a word...it means what I choose it to mean”
  • 25. Neurolex Neuron • Led by Dr. Gordon Shepherd • > 30 worldwide experts • Simple set of properties • Consistent naming scheme • Integrated with Structural Lexicon • Used for annotation in other resources, e.g., NeuroElectro
  • 26. “You have broken links”. Red Links: Information is missing (or misspelled)
  • 27. Location of cell soma; location of dendrites; location of local axon arbor
  • 28. Analysis of Red Links in the Neuron Registry • Analysis of red links tells us where instructions aren’t clear, the information isn’t available, or the model is insufficient – Conceptualization not clear • What is the most important thing about local axon terminals? – Tool doesn’t capture all details. Social networks and community sites let us learn things from the collective behavior of contributors → INCF/HBP Knowledge Space
  • 29. Re-inventing Narrative: Do I have to write in triples? • Not all entities are well-enough specified that they lend themselves to deep annotation – And, as we’ve seen in the previous example, we probably don’t want to pretend that they are • But…sometimes they are – Semantic annotation of research papers to make them “machine-interpretable” has been a goal of many – Can we update the way that authors produce manuscripts so that they are easier to process? • → NIF pilot project: Semantic annotation of entities that researchers would understand
  • 30. The problem: How many papers were published that used my: antibody. Paz et al, J Neurosci, 2010
  • 31. Now, go find the antibody http://www.millipore.com/searchsummary.do?tabValue=&q=gfap Nov 12,
  • 32. Jan 15, 2014. A catalog number is not a persistent identifier
  • 33. The problem is general across multiple resource types and disciplines. Vasilevsky et al, PeerJ 2013
  • 34. If we can’t do it, neither can the robot • Automated text mining tools were not deployed on this problem, because too few antibodies could be identified automatically • We are asking authors to change their ways, instead! • Almost all antibodies were identified with the company name, city and state, but that information is useless if the goal is to identify the antibody used
  • 35. The Resource Identification Initiative • NIF, FORCE11 and partners – Led by Anita Bandrowski and Melissa Haendel • Identify 3 types of research resources – Antibodies – Genetically modified animals – Software http://force11.org/Resource_identification_initiative
  • 36. Musings: You can’t do that! • Two powerful trends in the 21st century: – Networking machines and networking people – Moving science into a machine-accessible platform has been a challenge • Mechanistically • Culturally • Sociologically • “A foolish consistency is the hobgoblin of little minds” – When you have a lot of data and information in an accessible form, we can start to look at actual practices and trends – Focusing on the “negative space”, i.e., what we don’t know, reveals glimpses into sources of bias and confusion • When we scratch the surface of science, we find uncertainty and confusion – Not a failure, but an opportunity • Sometimes we can be precise, i.e., which reagents we used • Sometimes, we can’t → so we should set up systems so we can learn from that
  • 37. Next Steps: Neurolex to Knowledge Space. Data Space; Laboratory Space; Knowledge Space; BAMS; Lexicon; Encyclopedia
  • 39. What is the “completeness” of our knowledge? Neocortex; Olfactory bulb; Neostriatum; Cochlear nucleus. All neurons with cell bodies in the same brain region are grouped together. Properties in Neurolex • Simple set of properties that can be reasonably supplied with a minimal amount of effort
  • 40. The case of the meanest journal in the world, coincidentally having the lowest retraction rate
  • 41. The landscape is messy, diverse and evolving: Data to Knowledge – Knowledge to Data. NIF favors a hybrid, tiered, federated system • Domain knowledge – Ontologies • Claims, models and observations – Virtuoso RDF triples – Model repositories • Data – Data federation – Spatial data – Workflows • Narrative – Full text access. Neuron; Brain part; Disease; Organism; Gene; Technique; People. Caudate projects to Snpc. Grm1 is upregulated in chronic cocaine. Betz cells degenerate in ALS. NIF provides the tentacles that connect the pieces: a new type of entity for 21st century science
  • 42. Data about the subthalamus http://neuinfo.org
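
The synonym-expanded search shown on slide 11 (Hippocampus OR “Cornu Ammonis” OR “Ammon’s horn”) can be sketched roughly as follows. This is only an illustration under stated assumptions, not NIF's actual implementation: the `SYNONYMS` table and the `quote` and `expand_query` helpers are hypothetical names, and a real system would draw its synonyms from an ontology such as NIFSTD rather than a hand-written dictionary.

```python
from typing import Dict, List, Optional

# Hypothetical synonym table for illustration only; a real deployment
# would load synonyms from an ontology rather than hard-code them.
SYNONYMS: Dict[str, List[str]] = {
    "hippocampus": ["Cornu Ammonis", "Ammon's horn"],
}


def quote(term: str) -> str:
    """Wrap multi-word terms in double quotes so they are searched as phrases."""
    return f'"{term}"' if " " in term else term


def expand_query(term: str, synonyms: Optional[Dict[str, List[str]]] = None) -> str:
    """Return the search term OR-ed together with any known synonyms."""
    table = SYNONYMS if synonyms is None else synonyms
    variants = [term] + table.get(term.lower(), [])
    return " OR ".join(quote(v) for v in variants)


if __name__ == "__main__":
    print(expand_query("Hippocampus"))
```

A term with no entry in the table is passed through unchanged, so the expansion never narrows a search; it only widens it when the lexicon knows alternative names.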

Editor's notes

  1. Cue movie after this. Would be nice to visually pull this together with an animated view.
  2. Current model: Scholars are producing multiple types of research objects; each goes to its own infrastructure with little coordination among them. The consumer is no longer exclusively a scholar: the general public wants access to what they pay for; automated agents are accessing first and mining the content.