SlideShare ist ein Scribd-Unternehmen logo
1 von 25
A Knowledge Framework for Smart
Discovery of Planetary Defense Resources
NASA Goddard (NNG16PU001) and NSF (IIP-1338925)
Yongyao Jiang, Chaowei Phil Yang, Han Qin, Yun Li, Manzhu Yu, Mengchao Xu
George Mason University
Myra Bambacus, Ronald Y. Leung, Brent Barbee, Joseph A. Nuth, and Bernie Seery
NASA Goddard Space Flight Center
David S. P. Dearborn
Lawrence Livermore National Laboratory
Catherine Plesko
Los Alamos National Laboratory
• Background
• Framework architecture
• Current results
• Ongoing research
• Conclusions
Planetary Defense (PD)
• Near Earth object (NEO) observation
• Design reference asteroids
• Impact modelling
• Decision support
• Mitigation action
• In this U.S., the NASA Planetary Defense Coordination Office
(PDCO) was established in 2016 to study the mitigation of
potential Near-Earth Object (NEO) impacts to our home
planet.
Image source: http://www.universetoday.com/128347/nasa-discovers-72-new-never-seen-neos/
Motivation for an
Information Framework
• Information about detecting, characterizing and mitigating NEO threats is
dispersed (e.g. publications, briefings.)
• An overall architecture to facilitate the collaborations and integrate the different
capabilities to achieve the most sensible, executable options for mitigation
• A cyberinfrastructure to capture mitigation trades, analyses, model output, risk
projections, and mitigation mission design concepts
• Discovery and easy access to knowledge and expert opinion within the project
team, as well as factoring in related information from other research and
analysis activities
Why Another Resource
Discovery Engine?
• Domain-specific vs. general-purpose
• Indexed content
– Google searches from nearly the entire Internet
– The framework is PD-specific
• Knowledge base
– Google’s Knowledge Graph is based on generic sources such as Wikipedia
– The framework will create a PD ontology aided by domain experts, combined with
machine learning and Natural Language Processing (NLP) results
• Decision makers can have easy access to required information and quality
knowledge
Planetary Defense
related info
Design
Reference
Missions
Project Organizational Collaboration
NEO Impact
Modeling
Physics Based Models
& Variation Analyses
Decision Support
Mission Design and
Assessment & Risk Analysis
Design Reference
Asteroids
Orbit and Physical Structure
Design
Standard
Interface
Complete
Characterization
Hybrid Cloud Computing
Hybrid Cloud
Big Data Discovery, Simulation, Analytics, and Access
Analytics
Standard
Interface
Computing Foundation
Big Data Processing
NEO
Mitigation
Planetary Defense
Design
Reference
Missions
ArchitecturalFramework
Listisnotexclusive
NEO Observation
Radar and Space Detection
Trajectory Analysis/NEO
Obitz Estimation
NEO Characterization NEO Impact Modeling
Physics Based Models &
Variation Analyses
Mitigation Action
Decision Support
Mission Design and
Assessment & Risk Analysis
Design Reference
Asteroids
Orbit and Physical Structure
Design
Standard
Interface
Complete
Characterization
Complete
Description
Playbook
Hybrid Cloud Computing
Hybrid Cloud
Big Data Management, Simulation, Analytics
Analytics
Standard
Interface
Standard
Interface
Standard
Interface
Information Flow
Application
Crawling
Sources
DocumentsWeb pages Access logs
Gateway
Upload Generate
Data
System
Content
Index
Special
Index
• Data management
• Data access
• Security
• OperationRepositories
Analytics • Name entity recognition (NER)
• Relation extraction (RE)
• Summarization
• Topic modeling
• Profile mining
• Link analysis
• Ranking
• Recommendation
• Semantic reasoning
Knowledge
Base
QueryKnowledgeDiscovery&
UsageFramework
Domain-specific
knowledge base
Planetary defense (PD)
Framework Gateway
• Web Portal: http://pd.cloud.gmu.edu/
• User management, document archiving, vocabulary editing
web crawling, search engine
User management
• User roles: Administer, authenticated user, anonymous
user
• Manage access control with permissions and user roles
• Assign permissions and roles to users
• Ban an IP address - The Ban module allows administrators
to ban visits to their site from individual or a range of IP
addresses.
FileDepot Module:
File/document Management
• Create folders or upload new files
• More actions:
• Set permissions of specific folder for different user roles
• Text mining will be preformed to find the relations between docs (&keywords)
Vocabulary editing module
• 130+ concepts
• Create and edit landing
page for each concept
• Different user roles have
different permissions
Web crawling module
• Nutch: Open Source web
crawler
• Store them in Elasticsearch
(full-text search engine)
• 5 seed URLs
• Similarity between page vs.
vocab list
• Baseline
Ongoing research
• Domain specific crawling
• Knowledge extraction from
plain text
Domain specific crawling
Simplest approach: filter web pages using a keyword list (e.g.
NEO, asteroid, Bennu, …) composed by domain experts.
Problems:
• Expensive
• Difficult to exhaust
• Difficult to assign weights to different
keywords
• Treat all web pages equally (a page on NASA website and
a random one)
Distribution of
relevant pages
in blue
Image source: http://www.seminarsonly.com/computer%20science/focused-web-crawling-for-e-learning-content.php
Domain specific crawling
Existing tools in Open Source crawler (e.g. Nutch):
• Link-based
– Scoring links (OPIC, PageRank scoring)
– Breadth first or Depth first crawl
• Content-based
– URL, mimetype filter
– Cosine Similarity scoring filter (what we are using)
– Naive Bayes parse filter
Image source: https://en.wikipedia.org/wiki/PageRank
Seed
URLs
Fetch pages
Extract content&
links
Vocab
List
Relevance Classifier
(cosine similarity +
Page rank)
Store/Index pages,
links
Keywords extractor
(e.g. title)
URL list
Keywords
Yes
Update
Frequency
weighting
Update
• Combine content and
link-based scoring to
boost the authoritative
and relevant web pages
• Dynamically update/grow
the vocab using info (e.g.
title) from the web pages
• Weight keywords based
on frequency clustering
(i.e. more frequently seen
terms have more weights)
Proposed method
Engage the community to help with the evaluation
Knowledge extraction from
plain text
• Goal: Extract structured information from unstructured web
pages and user uploaded documents
• Relation extraction in NLP: finding semantic triples (SPO)
from sentences
• Pattern-based, supervised, semi-supervised, and open
information extraction
The UV Index is a measure of the intensity of UV rays from the Sun.
Subject
Predicate
Object
Relation extraction
Hand-written patterns
• “Y such as X”
• “such Y as X”
• “X or other Y”
• “Y including X”
• + Tend to be high-precision
• + Tailored to specific domains
• - Human patterns are often low-
recall
• - Hard to be exhaustive
Open Information Extraction
• Recently published by Univ. of Washington
• Extract relations from the sentences with no training data, no list of
relations (unsupervised)
• Self-learning process, syntactic and lexical/semantic patterns
Gabor Angeli, Melvin Johnson Premkumar, and Christopher D. Manning. Leveraging Linguistic Structure For Open Domain Information Extraction. In
Proceedings of the Association of Computational Linguistics (ACL), 2015.
Open Information Extraction
• Some are reasonable, some are noise
• Working on reducing noise/identifying reasonable results
Conclusion and Next Steps
• The proposed architecture framework benefits the PD community by
– Providing discovery and easy access to the knowledge and expert opinion
within the project team
– Maximizing the linkage between different organizations, scientists, engineers,
decision makers, and citizens
• Next steps
– Develop a knowledge base & search ranking for NEO mitigation resources
– Investigate a knowledge reasoning model for potential mitigation by
assimilating existing scenarios
– Build a 4D visualization tool based on new datasets and existing tools
• Agichtein, E., Brill, E. and Dumais, S., 2006. Improving web search ranking by incorporating user behavior information. ed. Proceedings of the 29th
annual international ACM SIGIR conference on Research and development in information retrieval, 19-26.
• AlJadda, K., et al., 2014. Crowdsourced query augmentation through semantic discovery of domain-specific jargon. ed. Big Data (Big Data), 2014
IEEE International Conference on, 808-815.
• Auer, S., et al., 2007. Dbpedia: A nucleus for a web of open data. The semantic web. Springer, 722-735.
• Bach, N. and Badaskar, S. 2007. A survey on relation extraction. Language Technologies Institute, Carnegie Mellon University.
• Banko, M., et al., 2007. Open Information Extraction from the Web. ed. IJCAI, 2670-2676.
• Bargellini, P., et al., 2013. Big Data from Space: Event Report. European Space Agency Publication.
• Battle, R. and Kolas, D. 2012. Enabling the geospatial semantic web with parliament and geosparql. Semantic Web, 3(4), 355-370.
• Cook, S., et al. 2011. Assessing Google flu trends performance in the United States during the 2009 influenza virus A (H1N1) pandemic. PloS one,
6(8), e23610.
• Egenhofer, M. J., 2002. Toward the semantic geospatial web. ed. Proceedings of the 10th ACM international symposium on Advances in geographic
information systems, 1-4.
• Graybeal, J., 2015. Community Ontology Repository Prototype: Development [online]. ESIP. Available from:
http://testbed.esipfed.org/cor_prototype [Accessed 12/5 2016].
• Jiang, Y., et al. 2016a. A Comprehensive Approach to Discovering and Determining the Semantic Relationship among Geospatial Vocabularies - An
Example with Oceanographic Data Discovery. International Journal of Geographical Information Science.
• Jiang, Y., et al. 2016b. Reconstructing Sessions from Data Discovery and Access Logs to Build a Semantic Knowledge Base for Improving Data
Discovery. ISPRS International Journal of Geo-Information, 5(5), 54.
• Joachims, T., 2002. Optimizing search engines using clickthrough data. ed. Proceedings of the eighth ACM SIGKDD international conference on
Knowledge discovery and data mining, 133-142.
• Krisnadhi, A., et al., 2015. The GeoLink modular oceanography ontology. ed. International Semantic Web Conference, 301-309.
• Lee, J.-G. and Kang, M. 2015. Geospatial big data: challenges and opportunities. Big Data Research, 2(2), 74-81.
• Li, W., Goodchild, M. F. and Raskin, R. 2014. Towards geospatial semantic search: exploiting latent semantic relations in geospatial data.
International Journal of Digital Earth, 7(1), 17-37.
References
• Manning, C. D., et al., 2014. The Stanford CoreNLP Natural Language Processing Toolkit. ed. ACL (System Demonstrations), 55-60.
• Miller, G. A. 1995. WordNet: a lexical database for English. Communications of the ACM, 38(11), 39-41.
• Nativi, S., et al. 2015. Big data challenges in building the global earth observation system of systems. Environmental Modelling & Software,
68, 1-26.
• Noy, N. F. and Musen, M. A., 2000. Algorithm and tool for automated ontology merging and alignment. ed. Proceedings of the 17th
National Conference on Artificial Intelligence (AAAI-00). Available as SMI technical report SMI-2000-0831.
• Pouchard, L., 2013. ESIP Semantic Portal [online]. ESIP. Available from: http://testbed.esipfed.org/node/1243 [Accessed 12/5 2016].
• Raskin, R. and Pan, M., 2003. Semantic web for earth and environmental terminology (sweet). ed. Proc. of the Workshop on Semantic Web
Technologies for Searching and Retrieving Scientific Data.
• Singhal, A. 2012. Introducing the knowledge graph: things, not strings. Official google blog.
• Soderland, S., et al. 2010. Adapting open information extraction to domain-specific relations. AI magazine, 31(3), 93-102.
• Srivastava, J., et al. 2000. Web usage mining: Discovery and applications of usage patterns from web data. ACM SIGKDD Explorations
Newsletter, 1(2), 12-23.
• Sun, A., Grishman, R. and Sekine, S., 2011. Semi-supervised relation extraction with large-scale word clustering. ed. Proceedings of the 49th
Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, 521-529.
• Wu, L. and Brynjolfsson, E. 2013. The future of prediction: How Google searches foreshadow housing prices and sales. Available at SSRN
2022293.
• Yang C., H. Q., Li Z., Liu K., Hu F., 2016 2016a. Big Data and cloud computing: innovation opportunities and challenges. International Journal
of Digital Earth.
• Yang C., Y. M., Hu F., Jiang Y., Li Y. 2016b. Utilizing Cloud Computing to Address Big Geospatial Data Challenges. Computers, Environment,
and Urban Systems.
• Zelenko, D., Aone, C. and Richardella, A. 2003. Kernel methods for relation extraction. Journal of machine learning research, 3(Feb), 1083-
1106.
• Zhou, Y., et al., 2008. Large-scale parallel collaborative filtering for the netflix prize. Algorithmic Aspects in Information and Management.
Springer, 337-348.

Weitere ähnliche Inhalte

Was ist angesagt?

Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersSymeon Papadopoulos
 
kantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptkantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptbutest
 
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...Amit Sheth
 
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...Ben Blaiszik
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Nolan Nichols
 
Insights from Knowledge Graphs
Insights from Knowledge GraphsInsights from Knowledge Graphs
Insights from Knowledge GraphsAnirudh Prabhu
 
Stack_Overflow-Network_Graph
Stack_Overflow-Network_GraphStack_Overflow-Network_Graph
Stack_Overflow-Network_GraphYaopeng (Gyoho) Wu
 
DuraMat Data Management and Analytics
DuraMat Data Management and AnalyticsDuraMat Data Management and Analytics
DuraMat Data Management and AnalyticsAnubhav Jain
 
AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016Manjula Ambur
 
Himansu sahoo resume-ds
Himansu sahoo resume-dsHimansu sahoo resume-ds
Himansu sahoo resume-dsHimansu Sahoo
 
Defensa.V11
Defensa.V11Defensa.V11
Defensa.V11promanas
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Anubhav Jain
 
Time-aware Approaches to Information Retrieval
Time-aware Approaches to Information RetrievalTime-aware Approaches to Information Retrieval
Time-aware Approaches to Information RetrievalNattiya Kanhabua
 
Sciunits: Reusable Research Objects
Sciunits: Reusable Research Objects Sciunits: Reusable Research Objects
Sciunits: Reusable Research Objects Globus
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost
 
Linked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and LuzzuLinked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and Luzzujerdeb
 
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...Hakka Labs
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsDominik Kowald
 

Was ist angesagt? (19)

Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
kantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptkantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.ppt
 
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
 
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
 
Insights from Knowledge Graphs
Insights from Knowledge GraphsInsights from Knowledge Graphs
Insights from Knowledge Graphs
 
Stack_Overflow-Network_Graph
Stack_Overflow-Network_GraphStack_Overflow-Network_Graph
Stack_Overflow-Network_Graph
 
DuraMat Data Management and Analytics
DuraMat Data Management and AnalyticsDuraMat Data Management and Analytics
DuraMat Data Management and Analytics
 
AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016
 
Himansu sahoo resume-ds
Himansu sahoo resume-dsHimansu sahoo resume-ds
Himansu sahoo resume-ds
 
Defensa.V11
Defensa.V11Defensa.V11
Defensa.V11
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
 
Time-aware Approaches to Information Retrieval
Time-aware Approaches to Information RetrievalTime-aware Approaches to Information Retrieval
Time-aware Approaches to Information Retrieval
 
Sciunits: Reusable Research Objects
Sciunits: Reusable Research Objects Sciunits: Reusable Research Objects
Sciunits: Reusable Research Objects
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
Linked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and LuzzuLinked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and Luzzu
 
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag Recommendations
 

Andere mochten auch

3Com WDBBXV0010BBK
3Com WDBBXV0010BBK3Com WDBBXV0010BBK
3Com WDBBXV0010BBKsavomir
 
224485355 amigo-de-palo
224485355 amigo-de-palo224485355 amigo-de-palo
224485355 amigo-de-paloJuan Opazo
 
Planificación estratégica educ. amb.
Planificación estratégica educ. amb.Planificación estratégica educ. amb.
Planificación estratégica educ. amb.Leonela Villarreal
 
La ira
La iraLa ira
La irapriapismo
 
The First 3 Steps To Taking Your Church Online
The First 3 Steps To Taking Your Church OnlineThe First 3 Steps To Taking Your Church Online
The First 3 Steps To Taking Your Church OnlineVibrant Faith
 
Blog 17 Unad Psicofisiologia
Blog 17 Unad Psicofisiologia Blog 17 Unad Psicofisiologia
Blog 17 Unad Psicofisiologia Dantalion Seddim
 
Proceso de la planeaciĂłn
Proceso de la planeaciĂłnProceso de la planeaciĂłn
Proceso de la planeaciĂłnEnrique Sanchez
 
1 jesus hacia el bienn (1)
1 jesus hacia el bienn (1)1 jesus hacia el bienn (1)
1 jesus hacia el bienn (1)viviana alvarez
 
3Com 80-001958-02
3Com 80-001958-023Com 80-001958-02
3Com 80-001958-02savomir
 
Performance Management
Performance ManagementPerformance Management
Performance ManagementMin Jeong Lee
 
3Com 3C1210-0
3Com 3C1210-03Com 3C1210-0
3Com 3C1210-0savomir
 
FIBRODISPLASIA OSIFICANTE PROGRESIVA
FIBRODISPLASIA OSIFICANTE PROGRESIVAFIBRODISPLASIA OSIFICANTE PROGRESIVA
FIBRODISPLASIA OSIFICANTE PROGRESIVAAlvaro Benavides
 
Fotoreportaje
FotoreportajeFotoreportaje
Fotoreportajeeliiasmalav
 
RevisiĂłn fĂ­sica del equipo de computĂł
RevisiĂłn fĂ­sica del equipo de computĂłRevisiĂłn fĂ­sica del equipo de computĂł
RevisiĂłn fĂ­sica del equipo de computĂłJoshua Ramos
 

Andere mochten auch (18)

La ira
La iraLa ira
La ira
 
3Com WDBBXV0010BBK
3Com WDBBXV0010BBK3Com WDBBXV0010BBK
3Com WDBBXV0010BBK
 
Resolucion 448 ica
Resolucion 448 icaResolucion 448 ica
Resolucion 448 ica
 
Cromosomas
CromosomasCromosomas
Cromosomas
 
224485355 amigo-de-palo
224485355 amigo-de-palo224485355 amigo-de-palo
224485355 amigo-de-palo
 
TMK
TMKTMK
TMK
 
Planificación estratégica educ. amb.
Planificación estratégica educ. amb.Planificación estratégica educ. amb.
Planificación estratégica educ. amb.
 
La ira
La iraLa ira
La ira
 
The First 3 Steps To Taking Your Church Online
The First 3 Steps To Taking Your Church OnlineThe First 3 Steps To Taking Your Church Online
The First 3 Steps To Taking Your Church Online
 
Blog 17 Unad Psicofisiologia
Blog 17 Unad Psicofisiologia Blog 17 Unad Psicofisiologia
Blog 17 Unad Psicofisiologia
 
Proceso de la planeaciĂłn
Proceso de la planeaciĂłnProceso de la planeaciĂłn
Proceso de la planeaciĂłn
 
1 jesus hacia el bienn (1)
1 jesus hacia el bienn (1)1 jesus hacia el bienn (1)
1 jesus hacia el bienn (1)
 
3Com 80-001958-02
3Com 80-001958-023Com 80-001958-02
3Com 80-001958-02
 
Performance Management
Performance ManagementPerformance Management
Performance Management
 
3Com 3C1210-0
3Com 3C1210-03Com 3C1210-0
3Com 3C1210-0
 
FIBRODISPLASIA OSIFICANTE PROGRESIVA
FIBRODISPLASIA OSIFICANTE PROGRESIVAFIBRODISPLASIA OSIFICANTE PROGRESIVA
FIBRODISPLASIA OSIFICANTE PROGRESIVA
 
Fotoreportaje
FotoreportajeFotoreportaje
Fotoreportaje
 
RevisiĂłn fĂ­sica del equipo de computĂł
RevisiĂłn fĂ­sica del equipo de computĂłRevisiĂłn fĂ­sica del equipo de computĂł
RevisiĂłn fĂ­sica del equipo de computĂł
 

Ă„hnlich wie Smart Discovery Framework for Planetary Defense Resources

EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...Maryann Martone
 
IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014iedadata
 
2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavis2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavisSean Davis
 
MS-Word.doc
MS-Word.docMS-Word.doc
MS-Word.docbutest
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...Neuroscience Information Framework
 
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...Yongyao Jiang
 
Epistemic networks for Epistemic Commitments
Epistemic networks for Epistemic CommitmentsEpistemic networks for Epistemic Commitments
Epistemic networks for Epistemic CommitmentsSimon Knight
 
Building AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsBuilding AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsAndre Freitas
 
Lightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsLightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsEarthCube
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelKrzysztof Gorgolewski
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...Craig Knoblock
 
Keller geo edu
Keller geo eduKeller geo edu
Keller geo eduhasnakhan12
 
How do we know what we don’t know: Using the Neuroscience Information Framew...
How do we know what we don’t know:  Using the Neuroscience Information Framew...How do we know what we don’t know:  Using the Neuroscience Information Framew...
How do we know what we don’t know: Using the Neuroscience Information Framew...Maryann Martone
 
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)TimelessFuture
 

Ă„hnlich wie Smart Discovery Framework for Planetary Defense Resources (20)

EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
 
A Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource LandscapeA Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource Landscape
 
IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014
 
Sharing data
Sharing dataSharing data
Sharing data
 
2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavis2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavis
 
MS-Word.doc
MS-Word.docMS-Word.doc
MS-Word.doc
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...
 
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
 
Epistemic networks for Epistemic Commitments
Epistemic networks for Epistemic CommitmentsEpistemic networks for Epistemic Commitments
Epistemic networks for Epistemic Commitments
 
Building AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsBuilding AI Applications using Knowledge Graphs
Building AI Applications using Knowledge Graphs
 
Lightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded ProjectsLightning Talks: All EartCube Funded Projects
Lightning Talks: All EartCube Funded Projects
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next level
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...
 
Keller geo edu
Keller geo eduKeller geo edu
Keller geo edu
 
How do we know what we don’t know: Using the Neuroscience Information Framew...
How do we know what we don’t know:  Using the Neuroscience Information Framew...How do we know what we don’t know:  Using the Neuroscience Information Framew...
How do we know what we don’t know: Using the Neuroscience Information Framew...
 
Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16
 
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)Towards Research Engines: Supporting Search Stages in Web Archives (2015)
Towards Research Engines: Supporting Search Stages in Web Archives (2015)
 

KĂĽrzlich hochgeladen

Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 

KĂĽrzlich hochgeladen (20)

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call đź‘— 7737669865 đź‘— Top Class Call Girl Service Ba...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 

Smart Discovery Framework for Planetary Defense Resources

  • 1. A Knowledge Framework for Smart Discovery of Planetary Defense Resources NASA Goddard (NNG16PU001) and NSF (IIP-1338925) Yongyao Jiang, Chaowei Phil Yang, Han Qin, Yun Li, Manzhu Yu, Mengchao Xu George Mason University Myra Bambacus, Ronald Y. Leung, Brent Barbee, Joseph A. Nuth, and Bernie Seery NASA Goddard Space Flight Center David S. P. Dearborn Lawrence Livermore National Laboratory Catherine Plesko Los Alamos National Laboratory
  • 2. • Background • Framework architecture • Current results • Ongoing research • Conclusions
  • 3. Planetary Defense (PD) • Near Earth object (NEO) observation • Design reference asteroids • Impact modelling • Decision support • Mitigation action • In this U.S., the NASA Planetary Defense Coordination Office (PDCO) was established in 2016 to study the mitigation of potential Near-Earth Object (NEO) impacts to our home planet. Image source: http://www.universetoday.com/128347/nasa-discovers-72-new-never-seen-neos/
  • 4. Motivation for an Information Framework • Information about detecting, characterizing and mitigating NEO threats is dispersed (e.g. publications, briefings.) • An overall architecture to facilitate the collaborations and integrate the different capabilities to achieve the most sensible, executable options for mitigation • A cyberinfrastructure to capture mitigation trades, analyses, model output, risk projections, and mitigation mission design concepts • Discovery and easy access to knowledge and expert opinion within the project team, as well as factoring in related information from other research and analysis activities
  • 5. Why Another Resource Discovery Engine? • Domain-specific vs. general-purpose • Indexed content – Google searches from nearly the entire Internet – The framework is PD-specific • Knowledge base – Google’s Knowledge Graph is based on generic sources such as Wikipedia – The framework will create a PD ontology aided by domain experts, combined with machine learning and Natural Language Processing (NLP) results • Decision makers can have easy access to required information and quality knowledge Planetary Defense related info
  • 6. Design Reference Missions Project Organizational Collaboration NEO Impact Modeling Physics Based Models & Variation Analyses Decision Support Mission Design and Assessment & Risk Analysis Design Reference Asteroids Orbit and Physical Structure Design Standard Interface Complete Characterization Hybrid Cloud Computing Hybrid Cloud Big Data Discovery, Simulation, Analytics, and Access Analytics Standard Interface Computing Foundation Big Data Processing NEO Mitigation
  • 7. Planetary Defense Design Reference Missions ArchitecturalFramework Listisnotexclusive NEO Observation Radar and Space Detection Trajectory Analysis/NEO Obitz Estimation NEO Characterization NEO Impact Modeling Physics Based Models & Variation Analyses Mitigation Action Decision Support Mission Design and Assessment & Risk Analysis Design Reference Asteroids Orbit and Physical Structure Design Standard Interface Complete Characterization Complete Description Playbook Hybrid Cloud Computing Hybrid Cloud Big Data Management, Simulation, Analytics Analytics Standard Interface Standard Interface Standard Interface
  • 9. Application Crawling Sources DocumentsWeb pages Access logs Gateway Upload Generate Data System Content Index Special Index • Data management • Data access • Security • OperationRepositories Analytics • Name entity recognition (NER) • Relation extraction (RE) • Summarization • Topic modeling • Profile mining • Link analysis • Ranking • Recommendation • Semantic reasoning Knowledge Base QueryKnowledgeDiscovery& UsageFramework Domain-specific knowledge base
  • 10. Planetary defense (PD) Framework Gateway • Web Portal: http://pd.cloud.gmu.edu/ • User management, document archiving, vocabulary editing web crawling, search engine
  • 11. User management • User roles: Administer, authenticated user, anonymous user • Manage access control with permissions and user roles • Assign permissions and roles to users • Ban an IP address - The Ban module allows administrators to ban visits to their site from individual or a range of IP addresses.
  • 12. FileDepot Module: File/document Management • Create folders or upload new files • More actions: • Set permissions of specific folder for different user roles • Text mining will be preformed to find the relations between docs (&keywords)
  • 13. Vocabulary editing module • 130+ concepts • Create and edit landing page for each concept • Different user roles have different permissions
  • 14. Web crawling module • Nutch: Open Source web crawler • Store them in Elasticsearch (full-text search engine) • 5 seed URLs • Similarity between page vs. vocab list • Baseline
  • 15. Ongoing research • Domain specific crawling • Knowledge extraction from plain text
  • 16. Domain specific crawling Simplest approach: filter web pages using a keyword list (e.g. NEO, asteroid, Bennu, …) composed by domain experts. Problems: • Expensive • Difficult to exhaust • Difficult to assign weights to different keywords • Treat all web pages equally (a page on NASA website and a random one) Distribution of relevant pages in blue Image source: http://www.seminarsonly.com/computer%20science/focused-web-crawling-for-e-learning-content.php
  • 17. Domain specific crawling Existing tools in Open Source crawler (e.g. Nutch): • Link-based – Scoring links (OPIC, PageRank scoring) – Breadth first or Depth first crawl • Content-based – URL, mimetype filter – Cosine Similarity scoring filter (what we are using) – Naive Bayes parse filter Image source: https://en.wikipedia.org/wiki/PageRank
  • 18. Seed URLs Fetch pages Extract content& links Vocab List Relevance Classifier (cosine similarity + Page rank) Store/Index pages, links Keywords extractor (e.g. title) URL list Keywords Yes Update Frequency weighting Update • Combine content and link-based scoring to boost the authoritative and relevant web pages • Dynamically update/grow the vocab using info (e.g. title) from the web pages • Weight keywords based on frequency clustering (i.e. more frequently seen terms have more weights) Proposed method Engage the community to help with the evaluation
  • 19. Knowledge extraction from plain text • Goal: Extract structured information from unstructured web pages and user uploaded documents • Relation extraction in NLP: finding semantic triples (SPO) from sentences • Pattern-based, supervised, semi-supervised, and open information extraction The UV Index is a measure of the intensity of UV rays from the Sun. Subject Predicate Object
  • 20. Relation extraction Hand-written patterns • “Y such as X” • “such Y as X” • “X or other Y” • “Y including X” • + Tend to be high-precision • + Tailored to specific domains • - Human patterns are often low- recall • - Hard to be exhaustive
  • 21. Open Information Extraction • Recently published by Univ. of Washington • Extract relations from the sentences with no training data, no list of relations (unsupervised) • Self-learning process, syntactic and lexical/semantic patterns Gabor Angeli, Melvin Johnson Premkumar, and Christopher D. Manning. Leveraging Linguistic Structure For Open Domain Information Extraction. In Proceedings of the Association of Computational Linguistics (ACL), 2015.
  • 22. Open Information Extraction • Some are reasonable, some are noise • Working on reducing noise/identifying reasonable results
  • 23. Conclusion and Next Steps • The proposed architecture framework benefits the PD community by – Providing discovery and easy access to the knowledge and expert opinion within the project team – Maximizing the linkage between different organizations, scientists, engineers, decision makers, and citizens • Next steps – Develop a knowledge base & search ranking for NEO mitigation resources – Investigate a knowledge reasoning model for potential mitigation by assimilating existing scenarios – Build a 4D visualization tool based on new datasets and existing tools
  • 24. • Agichtein, E., Brill, E. and Dumais, S., 2006. Improving web search ranking by incorporating user behavior information. ed. Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, 19-26. • AlJadda, K., et al., 2014. Crowdsourced query augmentation through semantic discovery of domain-specific jargon. ed. Big Data (Big Data), 2014 IEEE International Conference on, 808-815. • Auer, S., et al., 2007. Dbpedia: A nucleus for a web of open data. The semantic web. Springer, 722-735. • Bach, N. and Badaskar, S. 2007. A survey on relation extraction. Language Technologies Institute, Carnegie Mellon University. • Banko, M., et al., 2007. Open Information Extraction from the Web. ed. IJCAI, 2670-2676. • Bargellini, P., et al., 2013. Big Data from Space: Event Report. European Space Agency Publication. • Battle, R. and Kolas, D. 2012. Enabling the geospatial semantic web with parliament and geosparql. Semantic Web, 3(4), 355-370. • Cook, S., et al. 2011. Assessing Google flu trends performance in the United States during the 2009 influenza virus A (H1N1) pandemic. PloS one, 6(8), e23610. • Egenhofer, M. J., 2002. Toward the semantic geospatial web. ed. Proceedings of the 10th ACM international symposium on Advances in geographic information systems, 1-4. • Graybeal, J., 2015. Community Ontology Repository Prototype: Development [online]. ESIP. Available from: http://testbed.esipfed.org/cor_prototype [Accessed 12/5 2016]. • Jiang, Y., et al. 2016a. A Comprehensive Approach to Discovering and Determining the Semantic Relationship among Geospatial Vocabularies - An Example with Oceanographic Data Discovery. International Journal of Geographical Information Science. • Jiang, Y., et al. 2016b. Reconstructing Sessions from Data Discovery and Access Logs to Build a Semantic Knowledge Base for Improving Data Discovery. ISPRS International Journal of Geo-Information, 5(5), 54. • Joachims, T., 2002. Optimizing search engines using clickthrough data. ed. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 133-142. • Krisnadhi, A., et al., 2015. The GeoLink modular oceanography ontology. ed. International Semantic Web Conference, 301-309. • Lee, J.-G. and Kang, M. 2015. Geospatial big data: challenges and opportunities. Big Data Research, 2(2), 74-81. • Li, W., Goodchild, M. F. and Raskin, R. 2014. Towards geospatial semantic search: exploiting latent semantic relations in geospatial data. International Journal of Digital Earth, 7(1), 17-37. References
  • 25. • Manning, C. D., et al., 2014. The Stanford CoreNLP Natural Language Processing Toolkit. ed. ACL (System Demonstrations), 55-60. • Miller, G. A. 1995. WordNet: a lexical database for English. Communications of the ACM, 38(11), 39-41. • Nativi, S., et al. 2015. Big data challenges in building the global earth observation system of systems. Environmental Modelling & Software, 68, 1-26. • Noy, N. F. and Musen, M. A., 2000. Algorithm and tool for automated ontology merging and alignment. ed. Proceedings of the 17th National Conference on Artificial Intelligence (AAAI-00). Available as SMI technical report SMI-2000-0831. • Pouchard, L., 2013. ESIP Semantic Portal [online]. ESIP. Available from: http://testbed.esipfed.org/node/1243 [Accessed 12/5 2016]. • Raskin, R. and Pan, M., 2003. Semantic web for earth and environmental terminology (sweet). ed. Proc. of the Workshop on Semantic Web Technologies for Searching and Retrieving Scientific Data. • Singhal, A. 2012. Introducing the knowledge graph: things, not strings. Official google blog. • Soderland, S., et al. 2010. Adapting open information extraction to domain-specific relations. AI magazine, 31(3), 93-102. • Srivastava, J., et al. 2000. Web usage mining: Discovery and applications of usage patterns from web data. ACM SIGKDD Explorations Newsletter, 1(2), 12-23. • Sun, A., Grishman, R. and Sekine, S., 2011. Semi-supervised relation extraction with large-scale word clustering. ed. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, 521-529. • Wu, L. and Brynjolfsson, E. 2013. The future of prediction: How Google searches foreshadow housing prices and sales. Available at SSRN 2022293. • Yang C., H. Q., Li Z., Liu K., Hu F., 2016 2016a. Big Data and cloud computing: innovation opportunities and challenges. International Journal of Digital Earth. • Yang C., Y. M., Hu F., Jiang Y., Li Y. 2016b. Utilizing Cloud Computing to Address Big Geospatial Data Challenges. Computers, Environment, and Urban Systems. • Zelenko, D., Aone, C. and Richardella, A. 2003. Kernel methods for relation extraction. Journal of machine learning research, 3(Feb), 1083- 1106. • Zhou, Y., et al., 2008. Large-scale parallel collaborative filtering for the netflix prize. Algorithmic Aspects in Information and Management. Springer, 337-348.

Hinweis der Redaktion

  1. Although the number of existing PD pages is considerable, they are still a very small portion compared to the information volume on the Web. The search results will be flooded and hidden in a long list of search results
  2. This is the NEO mitigation structure, which has multiple organizations and progresses in 3 steps. Namely, 1, 2, 3
  3. The organization framework can be expanded into a PD architecture framework, which lays out the roles of different organizations. Big data is the key here and also what we are trying to address in the research
  4. This is the information workflow. It lays out how the data is transferred from one component to another….
  5. To address that challenge of dispersed information, we propose a knowledge discovery framework……
  6. As a starting point
  7. The difference is that. This is just a starting point,
  8. NASA NEO website, Europe Space Agency just a beginning, …
  9. Two ongoing research is being carried out
  10. Triple is the atomic data entity in semantics
  11. The technique we are focusing on Use parsed data to train a “trustworthy tuple” classifier Single-pass extract all relations between NPs, keep if trustworthy Assessor ranks relations based on text redundancy