Creating Knowledge out of Interlinked 
Data 
LOD2 Webinar . 24.06.2014 . Page 1 http://lod2.eu
LOD2 is a large-scale integrating project co-funded by the European 
Commission within the FP7 Information and Communication Technologies 
Work Programme. This 4-year project comprises leading Linked Open 
Data technology researchers, companies, and service providers. Coming 
from 12 countries, the partners are coordinated by the Agile 
Knowledge Engineering and Semantic Web Research Group at the 
University of Leipzig, Germany. 
LOD2 will integrate and syndicate Linked Data with existing large-scale 
applications. The project shows the benefits in the scenarios of Media and 
Publishing, Corporate Data intranets and eGovernment. 
Once per month, the LOD2 webinar series offers a free webinar about 
tools and services along the Linked Open Data Life Cycle. 
Stay with us and learn more about acquisition, editing, composing, 
connected applications – and finally publishing Linked Open Data. 
Agenda 
• Nested Data Model 
• SIREn Overview 
• Getting Started with the SIREn Elasticsearch Plugin 
• Demo 
Schema-Less Nested Data Model 
• Model becoming prevalent: JSON, XML, Avro, … 
– Can be arbitrarily nested and large 
– No strict schema / structure enforced 
• Schema-less brings 
– Flexibility 
– Ease of development 
• Developers do not have to invest significant modelling 
effort upfront 
Introducing SIREn 
• Lucene, Solr and Elasticsearch plugin for indexing and 
searching JSON 
• Rich data model (JSON) 
– Nested objects, nested arrays, datatypes 
– Generic architecture compatible with various nested data models: JSON, 
JSON-LD, XML, Avro, ... 
• Schema-agnostic 
– SIREn does not require any schema definition to index and search data 
– Schema definition can change across records 
• Designed from the ground up for high performance and 
scalability 
Elasticsearch - Overview 
• Document-oriented search and analytics engine 
– JSON Document 
– Based on Apache Lucene 
• Distributed, Replication 
– High Performance and Availability 
• REST API 
Elasticsearch – Basic Concepts 
• Index = Collection of Documents 
– Can have multiple shards and replicas 
• Type = A set of documents sharing the same schema 
– Similar to a DB table 
• Document = JSON object 
– Uniquely identified (index/type/id) 
– Similar to a DB record 
SIREn – Basic Concepts 
• JSON object = Tree 
– Different mapping available 
• Node = An element of the tree 
– Can have a parent and one or more children 
– Contains data: text, numeric, boolean 
SIREn – Basic Concepts 
JSON Object 
{ 
  "name" : "Elasticsearch", 
  "category_code" : "search", 
  "funding_rounds" : [ 
    { 
      "round_code" : "a", 
      "raised_amount" : 10000000, 
      "funded_year" : 2012, 
      "investments" : [ 
        { 
          "name" : "Data Collective", 
          "type" : "financial-org" 
        }, 
        … 
      ] 
    }, 
    … 
  ] 
} 
Tree 
name : Elasticsearch 
category_code : search 
funding_rounds 
  round_code : a 
  raised_amount : 10000000 
  funded_year : 2012 
  investments 
    name : Data Collective 
    type : financial-org 
  investments 
    name : ... 
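The JSON-to-tree view on this slide can be sketched in a few lines of Python. This is only an illustration of the idea, not SIREn's actual mapping: the function name `tree_lines` is hypothetical, and arrays nested directly inside arrays are not handled.

```python
def tree_lines(obj, depth=0):
    """Render a JSON object as indented 'attribute : value' lines."""
    lines = []
    indent = "  " * depth
    for attribute, value in obj.items():
        # An array contributes one node per element under the same attribute.
        items = value if isinstance(value, list) else [value]
        for item in items:
            if isinstance(item, dict):
                # A nested object becomes a node with its own children.
                lines.append(f"{indent}{attribute}")
                lines.extend(tree_lines(item, depth + 1))
            else:
                lines.append(f"{indent}{attribute} : {item}")
    return lines


document = {
    "name": "Elasticsearch",
    "category_code": "search",
    "funding_rounds": [
        {
            "round_code": "a",
            "raised_amount": 10000000,
            "funded_year": 2012,
            "investments": [
                {"name": "Data Collective", "type": "financial-org"}
            ],
        }
    ],
}

print("\n".join(tree_lines(document)))
```

Each array element becomes its own node under the attribute that holds the array, which is why "investments" appears once per investment in the tree view.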
Getting Started with Elasticsearch & SIREn 
http://sirendb.com/downloads/ 
(Elasticsearch Distribution Coming Soon) 
SIREn Elasticsearch Distribution 
├── dist                        <- SIREn Libraries 
│   ├── siren-core-1.3.jar 
│   └── siren-qparser-1.3.jar 
├── docs                        <- SIREn Javadoc 
│   ├── apidoc 
│   └── siren-1.3-javadoc.jar 
├── example                     <- Elasticsearch Distribution 
├── CHANGES.txt 
├── LICENSE.md 
├── README.md 
└── THIRD-PARTY.txt 
SIREn Elasticsearch Distribution 
├── bin                         <- Executables 
│   ├── elasticsearch 
│   └── ... 
├── config                      <- Configuration 
│   ├── elasticsearch.yml 
│   └── logging.yml 
├── lib 
│   ├── elasticsearch-1.2.0.jar 
│   └── ... 
├── plugins                     <- SIREn Plugin 
│   └── siren-plugin 
│       ├── siren-elasticsearch-1.3.jar 
│       └── ... 
├── datasets 
├── NOTICE.txt 
└── README.md 
Starting Elasticsearch 
$ ./bin/elasticsearch 
[INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], … 
[INFO ][node ] [Valerie Cooper] initializing ... 
[INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [] 
[INFO ][node ] [Valerie Cooper] initialized 
[INFO ][node ] [Valerie Cooper] starting ... 
… 
[INFO ][node ] [Valerie Cooper] started 
Create an Index and Map SIREn’s Field 
$ curl -XPUT 'http://localhost:9200/test/' -d ' 
{ 
  "mappings" : { 
    "companies" : { 
      "properties" : { 
        "_siren_source" : { 
          "index" : "analyzed", 
          "analyzer" : "hybrid-json", 
          "postings_format" : "Siren10AFor", 
          "store" : "no", 
          "type" : "string" 
        } 
      }, 
      "_siren" : {} 
    } 
  } 
}' 
SIREn’s Field: "_siren_source" 
SIREn’s Analyzer: "hybrid-json" 
SIREn’s Posting Format: "Siren10AFor" 
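As a rough sketch, the same mapping body can be built as a Python dictionary and serialised with the standard library, which makes it easy to reuse for other types. The helper name `siren_mapping` is hypothetical; the field names and values are the ones shown on the slide.

```python
import json


def siren_mapping(type_name):
    """Build the index-creation body that maps SIREn's field for one type."""
    return {
        "mappings": {
            type_name: {
                "properties": {
                    "_siren_source": {
                        "index": "analyzed",
                        "analyzer": "hybrid-json",          # SIREn's analyzer
                        "postings_format": "Siren10AFor",   # SIREn's posting format
                        "store": "no",
                        "type": "string",
                    }
                },
                "_siren": {},
            }
        }
    }


# Serialise to the JSON text that would be sent as the request body.
body = json.dumps(siren_mapping("companies"), indent=2)
print(body)
```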
Indexing a Document 
$ curl -XPUT 'http://localhost:9200/test/companies/1' -d ' 
{ 
  "name" : "Elasticsearch", 
  "category_code" : "search", 
  "funding_rounds" : [ 
    { 
      "round_code" : "a", 
      "raised_amount" : 10000000, 
      "funded_year" : 2012, 
      "investments" : [ 
        { 
          "name" : "Data Collective", 
          "type" : "financial-org" 
        } 
      ] 
    } 
  ] 
}' 
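The indexing call above addresses a document by index, type and id. A minimal Python sketch of the same request, using only the standard library, might look as follows. The helper name `index_request` and the `Content-Type` header are assumptions added for illustration; the request is only built here, and sending it requires a running node.

```python
import json
from urllib.request import Request


def index_request(index, doc_type, doc_id, document,
                  host="http://localhost:9200"):
    """Build (but do not send) a PUT request for index/type/id."""
    url = f"{host}/{index}/{doc_type}/{doc_id}"
    data = json.dumps(document).encode("utf-8")
    return Request(url, data=data, method="PUT",
                   headers={"Content-Type": "application/json"})


document = {
    "name": "Elasticsearch",
    "category_code": "search",
    "funding_rounds": [
        {
            "round_code": "a",
            "raised_amount": 10000000,
            "funded_year": 2012,
            "investments": [
                {"name": "Data Collective", "type": "financial-org"}
            ],
        }
    ],
}

request = index_request("test", "companies", 1, document)
print(request.get_method(), request.full_url)
# To send it against a running node: urllib.request.urlopen(request)
```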
Customising Datatype Analysis 
• SIREn distinguishes 5 types of values in the JSON model: 
– Field, String, Long, Double, Boolean 
{ 
  "name" : "Elasticsearch", 
  "category_code" : "search", 
  "funding_rounds" : [ 
    { 
      "round_code" : "a", 
      "raised_amount" : 10000000, 
      "funded_year" : 2012, 
      "investments" : [ 
        { 
          "name" : "Data Collective", 
          "type" : "financial-org" 
        } 
      ] 
    } 
  ] 
} 
json:field   e.g. the attribute name "name" 
xsd:long     e.g. 10000000 
xsd:string   e.g. "Data Collective" 
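The five value types can be illustrated with a small classifier that walks a JSON object and tags every attribute name and value. This is a sketch of the idea only, not SIREn's code: the function names are hypothetical, and the `double` and `boolean` datatype URIs are assumed by analogy with the `string` and `long` URIs used in SIREn's configuration.

```python
FIELD = "http://json.org/field"
XSD = "http://www.w3.org/2001/XMLSchema#"


def datatype_of(value):
    """Return the datatype URI for a JSON leaf value."""
    # bool must be tested before int, because bool is a subclass of int.
    if isinstance(value, bool):
        return XSD + "boolean"   # assumed URI
    if isinstance(value, int):
        return XSD + "long"
    if isinstance(value, float):
        return XSD + "double"    # assumed URI
    if isinstance(value, str):
        return XSD + "string"
    raise TypeError(f"unsupported JSON value: {value!r}")


def typed_values(obj):
    """Yield (token, datatype) pairs for attribute names and leaf values."""
    for attribute, value in obj.items():
        yield attribute, FIELD
        items = value if isinstance(value, list) else [value]
        for item in items:
            if isinstance(item, dict):
                yield from typed_values(item)
            else:
                yield item, datatype_of(item)


funding_round = {"round_code": "a", "raised_amount": 10000000, "funded_year": 2012}
for token, datatype in typed_values(funding_round):
    print(token, "->", datatype)
```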
Customising Datatype Analysis 
# config/elasticsearch.yml 
siren.analysis.datatype: 
  http://json.org/field: 
    index_analyzer: keyword 
  http://www.w3.org/2001/XMLSchema#string: 
    index_analyzer: simple 
    search_analyzer: simpleFolding 
  http://www.w3.org/2001/XMLSchema#long: 
    index_analyzer: long 
index.analysis.analyzer: 
  simpleFolding: 
    type: custom 
    tokenizer: letter 
    filter: [lowercase, asciifolding] 
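To get a feel for what the custom simpleFolding analyzer does to a search string, here is an approximate emulation in Python. It is a sketch under stated assumptions, not Lucene's implementation: the letter tokenizer is approximated by a regular expression over runs of letters, and asciifolding is approximated by stripping combining marks after Unicode decomposition, which covers fewer characters than the real filter.

```python
import re
import unicodedata


def simple_folding(text):
    """Approximate letter tokenizer + lowercase + asciifolding."""
    # Letter tokenizer: split the text on anything that is not a letter.
    tokens = re.findall(r"[^\W\d_]+", text)
    folded = []
    for token in tokens:
        # Lowercase, then decompose accented letters and drop the accents.
        decomposed = unicodedata.normalize("NFKD", token.lower())
        folded.append("".join(ch for ch in decomposed
                              if not unicodedata.combining(ch)))
    return folded


print(simple_folding("Martin Kaltenböck, 2014"))
```

With this configuration a search for "Kaltenbock" and a search for "Kaltenböck" produce the same query token, while digits and punctuation are dropped at tokenisation time.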
Searching: Node Query 
$ curl -XGET 'localhost:9200/test/companies/_search' -d ' 
{ 
  "query" : { 
    "hybrid-json" : { 
      "node" : { 
        "query" : "search" 
      } 
    } 
  } 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
  round_code : a 
  raised_amount : 10000000 
  funded_year : 2012 
  investments 
    name : Data Collective 
    type : financial-org 
  investments 
    name : ... 
SIREn’s Query Type: "hybrid-json" 
SIREn’s Query: "node" 
Full-Text Query: 
• Boolean, Phrase, Range, Fuzzy, Regexp, Proximity, ... 
Searching: Node Query 
$ curl -XGET 'localhost:9200/test/companies/_search' -d ' 
{ 
  "query" : { 
    "hybrid-json" : { 
      "node" : { 
        "query" : "search", 
        "attribute" : "category_code" 
      } 
    } 
  } 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
  round_code : a 
  raised_amount : 10000000 
  funded_year : 2012 
  investments 
    name : Data Collective 
    type : financial-org 
  investments 
    name : ... 
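Node queries with and without an attribute differ only in one key, so a small builder keeps the request bodies well-formed. The following Python sketch assumes nothing beyond the JSON shape shown on the slides; the helper names `node_query` and `search_body` are hypothetical.

```python
import json


def node_query(query, attribute=None):
    """Build a SIREn node query, optionally restricted to one attribute."""
    node = {"query": query}
    if attribute is not None:
        node["attribute"] = attribute
    return {"node": node}


def search_body(siren_query):
    """Wrap a SIREn query in the 'hybrid-json' query type."""
    return {"query": {"hybrid-json": siren_query}}


anywhere = search_body(node_query("search"))
in_category = search_body(node_query("search", attribute="category_code"))

# json.dumps guarantees balanced braces and no trailing commas.
print(json.dumps(in_category, indent=2))
```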
Searching: Twig Query 
• Query operators for Ancestor-Descendant and Parent-Child relationships 
• Consists of a root query and one or more child and descendant queries 
• Can be nested to form complex tree structures 
[Figure: example twig query tree. Node labels: Boolean; Phrase (MUST); Twig (NOT); Range (MUST); Boolean (SHOULD)] 
Searching: Twig Query 
$ curl -XGET 'localhost:9200/test/companies/_search' -d ' 
{ 
  "query" : { 
    "hybrid-json" : { 
      "twig" : { 
        "root" : "funding_rounds", 
        "child" : [{ 
          "occur" : "must", 
          "node" : { 
            "query" : "2012", 
            "attribute" : "funded_year" 
          } 
        }] 
      } 
    } 
  } 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
  round_code : a 
  raised_amount : 10000000 
  funded_year : 2012 
  investments 
    name : Data Collective 
    type : financial-org 
  investments 
    name : ... 
Twig Query Operator: "twig" 
Root Query: "root" 
Child Query: "child" 
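The twig query on this slide has a fixed shape: a root, plus a list of child clauses that each carry an occurrence constraint and a query. A small Python sketch of a builder for that shape is given below. The helper names are hypothetical and only the keys visible on the slide ("root", "child", "occur", "node") are used.

```python
import json


def node_query(query, attribute=None):
    """Build a node query, optionally restricted to one attribute."""
    node = {"query": query}
    if attribute is not None:
        node["attribute"] = attribute
    return {"node": node}


def child_clause(occur, query):
    """Attach an occurrence constraint (e.g. 'must') to a query."""
    return {"occur": occur, **query}


def twig_query(root, children):
    """Build a twig query with a root and a list of child clauses."""
    return {"twig": {"root": root, "child": list(children)}}


body = {
    "query": {
        "hybrid-json": twig_query(
            "funding_rounds",
            [child_clause("must", node_query("2012", attribute="funded_year"))],
        )
    }
}
print(json.dumps(body, indent=2))
```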
Searching: Proximity Query 
$ curl -XGET 'localhost:9200/test/companies/_search' -d '{ 
  "query" : { "hybrid-json" : { 
    "boolean" : { 
      "inOrder" : true, 
      "slop" : 0, 
      "clause" : [{ 
        "twig" : { 
          "root" : "funding_rounds", 
          "descendant" : [{ 
            "level" : 2, 
            "node" : { "query" : "Data AND Collective" } 
          }] 
        } 
      },{ 
        "twig" : { 
          "root" : "funding_rounds", 
          "descendant" : [{ 
            "level" : 2, 
            "node" : { "query" : "Index AND Ventures" } 
          }] 
        } 
      }] 
}}}}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
  investments 
    name : Data Collective 
    type : financial-org 
funding_rounds 
  investments 
    name : Index Ventures 
    type : financial-org 
Order Constraint: "inOrder" 
Proximity Constraint: "slop" 
Boolean Clauses: "clause" 
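The proximity query combines several twig clauses under one boolean query with an order constraint and a slop. The Python sketch below rebuilds the request body from the slide. The helper names are hypothetical, and only keys shown on the slide ("inOrder", "slop", "clause", "descendant", "level") are used.

```python
import json


def descendant_twig(root, level, query):
    """Twig whose descendant at the given level must match the query."""
    return {
        "twig": {
            "root": root,
            "descendant": [{"level": level, "node": {"query": query}}],
        }
    }


def proximity_query(clauses, in_order=True, slop=0):
    """Boolean query with an order constraint and a proximity constraint."""
    return {
        "boolean": {
            "inOrder": in_order,
            "slop": slop,
            "clause": list(clauses),
        }
    }


body = {
    "query": {
        "hybrid-json": proximity_query([
            descendant_twig("funding_rounds", 2, "Data AND Collective"),
            descendant_twig("funding_rounds", 2, "Index AND Ventures"),
        ])
    }
}
print(json.dumps(body, indent=2))
```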
Demo 
Conclusion 
• SIREn’s Key Features 
– Dynamic/Schema-less Data Management 
– Nested Data 
– High performance and scalability 
– Powerful search operators 
– Elasticsearch/Solr integration 
• Contact 
– http://sirendb.com 
– SindiceTech 
– info@sindicetech.com 
Credits 
Jingle: R.E.M., Martin Kaltenböck, Florian Kondert 
Coordination: Thomas Thurner, Martin Kaltenböck 
Moderation: Martin Kaltenböck 
Presented by: Renaud Delbru, Harish Kumar 
Hope you enjoyed staying with us – if you need more detailed 
information, visit us at www.lod2.eu and let us know how we can 
improve to meet your expectations! 
Don’t forget to register for our next webinar 
20.12. 2011 - Virtuoso (Open Link Software) 
24.01. 2012 - OntoWiki (University of Leipzig, Germany) 
Have a great day and don’t forget ... 
LOD2 Webinar . 24.06.2014 . Page 41 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
http://lod2.eu 
LOD2 Webinar . 24.06.2014 . Page 42 http://lod2.eu

Weitere ähnliche Inhalte

Was ist angesagt?

Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research datavty
 
Standardizing for Open Data
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open DataIvan Herman
 
Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org sopekmir
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009LeeFeigenbaum
 
Technical integration of data repositories status and challenges
Technical integration of data repositories status and challengesTechnical integration of data repositories status and challenges
Technical integration of data repositories status and challengesvty
 
The world of Docker and Kubernetes
The world of Docker and Kubernetes The world of Docker and Kubernetes
The world of Docker and Kubernetes vty
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataversevty
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataversevty
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse vty
 
GraphChain
GraphChainGraphChain
GraphChainsopekmir
 

Was ist angesagt? (20)

LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and AuthoringLOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Lod2
Lod2Lod2
Lod2
 
LOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink SoftwareLOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink Software
 
Standardizing for Open Data
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open Data
 
Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009
 
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and RepairLOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
 
Free Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st releaseFree Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st release
 
Technical integration of data repositories status and challenges
Technical integration of data repositories status and challengesTechnical integration of data repositories status and challenges
Technical integration of data repositories status and challenges
 
The world of Docker and Kubernetes
The world of Docker and Kubernetes The world of Docker and Kubernetes
The world of Docker and Kubernetes
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse
 
KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
GraphChain
GraphChainGraphChain
GraphChain
 
LOD2: State of Play WP1: Requirements, Design & LOD2 Stack Prototype
LOD2: State of Play WP1: Requirements, Design & LOD2 Stack PrototypeLOD2: State of Play WP1: Requirements, Design & LOD2 Stack Prototype
LOD2: State of Play WP1: Requirements, Design & LOD2 Stack Prototype
 

LOD2 Webinar: SIREn

  • 1. Creating Knowledge out of Interlinked Data LOD2 Webinar . 24.06.2014 . Page 1 http://lod2.eu
  • 2. LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme. This 4-year project comprises leading Linked Open Data technology researchers, companies, and service providers. Coming from 12 countries, the partners are coordinated by the Agile Knowledge Engineering and Semantic Web Research Group at the University of Leipzig, Germany. LOD2 will integrate and syndicate Linked Data with existing large-scale applications. The project shows the benefits in the scenarios of Media and Publishing, Corporate Data intranets and eGovernment.
  • 3. Once a month, the LOD2 webinar series offers a free webinar about tools and services along the Linked Open Data Life Cycle. Stay with us and learn more about acquisition, editing, composing, connected applications, and finally publishing Linked Open Data.
  • 4. Agenda
      • Nested Data Model
      • SIREn Overview
      • Getting Started with the SIREn Elasticsearch Plugin
      • Demo
  • 5. Schema-Less Nested Data Model
      • Model becoming prevalent: JSON, XML, Avro, …
          – Can be arbitrarily nested and large
          – No strict schema / structure enforced
      • Schema-less brings
          – Flexibility
          – Ease of development
      • Developers do not have to invest significant modelling effort upfront
  • 6. Introducing SIREn
      • Lucene, Solr and Elasticsearch plugin for indexing and searching JSON
      • Rich data model (JSON)
          – Nested objects, nested arrays, datatypes
          – Generic architecture compatible with various nested data models: JSON, JSON-LD, XML, Avro, ...
      • Schema-agnostic
          – SIREn does not require any schema definition to index and search data
          – Schema definition can change across records
      • Designed from the ground up for high performance and scalability
  • 7. Introducing SIREn
  • 8. Elasticsearch - Overview
      • Document-oriented search and analytics engine
          – JSON Document
          – Based on Apache Lucene
      • Distributed, Replication
          – High Performance and Availability
      • REST API
  • 9. Elasticsearch – Basic Concepts
      • Index = Collection of Documents
          – Can have multiple shards and replicas
      • Type = A set of documents sharing the same schema
          – Similar to a DB table
      • Document = JSON object
          – Uniquely identified (index/type/id)
          – Similar to a DB record
  • 10. SIREn – Basic Concepts
      • JSON object = Tree
          – Different mapping available
      • Node = An element of the tree
          – Can have a parent and one or more children
          – Contains data: text, numeric, boolean
  • 11. SIREn – Basic Concepts
      JSON Object:
        {
          "name" : "Elasticsearch",
          "category_code" : "search",
          "funding_rounds" : [
            {
              "round_code" : "a",
              "raised_amount" : 10000000,
              "funded_year" : 2012,
              "investments" : [
                {
                  "name" : "Data Collective",
                  "type" : "financial-org"
                }, …
              ]
            }, …
          ]
        }
      Tree:
        name : Elasticsearch
        category_code : search
        funding_rounds
          round_code : a
          raised_amount : 10000000
          funded_year : 2012
          investments
            name : Data Collective
            type : financial-org
          investments
            name : ...
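The object-to-tree view on slide 11 can be illustrated with a small sketch in plain Python. It walks a parsed JSON value and lists each leaf together with the path of attribute names and array positions leading to it. The function name `iter_nodes` is hypothetical, and this is only an illustration of the tree view, not SIREn's internal node encoding.

```python
import json

def iter_nodes(value, path=()):
    """Yield (path, leaf_value) pairs for every leaf of a parsed JSON value."""
    if isinstance(value, dict):
        # An object contributes one branch per attribute name.
        for key, child in value.items():
            yield from iter_nodes(child, path + (key,))
    elif isinstance(value, list):
        # An array contributes one branch per position.
        for index, child in enumerate(value):
            yield from iter_nodes(child, path + (index,))
    else:
        # Strings, numbers and booleans are the leaves of the tree.
        yield path, value

doc = json.loads("""
{
  "name": "Elasticsearch",
  "category_code": "search",
  "funding_rounds": [
    {
      "round_code": "a",
      "raised_amount": 10000000,
      "funded_year": 2012,
      "investments": [
        {"name": "Data Collective", "type": "financial-org"}
      ]
    }
  ]
}
""")

for path, leaf in iter_nodes(doc):
    print("/".join(str(step) for step in path), ":", leaf)
```

Each printed line corresponds to one leaf of the tree drawn on the slide; the path prefix shows which parents the leaf hangs under.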
  • 12. Getting Started with Elasticsearch & SIREn
      http://sirendb.com/downloads/ (Elasticsearch Distribution Coming Soon)
  • 13. SIREn Elasticsearch Distribution
        ├── dist                          (SIREn Libraries)
        │   ├── siren-core-1.3.jar
        │   └── siren-qparser-1.3.jar
        ├── docs                          (SIREn Javadoc)
        │   ├── apidoc
        │   └── siren-1.3-javadoc.jar
        ├── example                       (Elasticsearch Distribution)
        ├── CHANGES.txt
        ├── LICENSE.md
        ├── README.md
        └── THIRD-PARTY.txt
  • 14. SIREn Elasticsearch Distribution
        ├── bin                           (Executables)
        │   ├── elasticsearch
        │   └── ...
        ├── config                        (Configuration)
        │   ├── elasticsearch.yml
        │   └── logging.yml
        ├── lib
        │   ├── elasticsearch-1.2.0.jar
        │   └── ...
        ├── plugins
        │   └── siren-plugin              (SIREn Plugin)
        │       ├── siren-elasticsearch-1.3.jar
        │       └── ...
        ├── datasets
        ├── NOTICE.txt
        └── README.md
  • 15. Starting Elasticsearch
        $ ./bin/elasticsearch
        [INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], …
        [INFO ][node ] [Valerie Cooper] initializing ...
        [INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites []
        [INFO ][node ] [Valerie Cooper] initialized
        [INFO ][node ] [Valerie Cooper] starting ...
        …
        [INFO ][node ] [Valerie Cooper] started
  • 16. Create an Index and Map SIREn’s Field
        $ curl -XPUT 'http://localhost:9200/test/' -d '
        {
          "mappings" : {
            "companies" : {
              "properties" : {
                "_siren_source" : {
                  "index" : "analyzed",
                  "analyzer" : "hybrid-json",
                  "postings_format" : "Siren10AFor",
                  "store" : "no",
                  "type" : "string"
                }
              },
              "_siren" : {}
            }
          }
        }'
      Callouts: SIREn’s Field ("_siren_source"), SIREn’s Analyzer ("hybrid-json"), SIREn’s Posting Format ("Siren10AFor")
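As a rough sketch, the same index-creation call can be prepared from Python with only the standard library. The mapping values are copied from the slide; the helper names `siren_mapping` and `create_index_request` are hypothetical, and the request is only constructed here, not sent, so no running Elasticsearch node is assumed.

```python
import json
import urllib.request

def siren_mapping(doc_type):
    """Return the mapping body from the slide for the given document type."""
    return {
        "mappings": {
            doc_type: {
                "properties": {
                    "_siren_source": {
                        "index": "analyzed",
                        "analyzer": "hybrid-json",
                        "postings_format": "Siren10AFor",
                        "store": "no",
                        "type": "string",
                    }
                },
                "_siren": {},
            }
        }
    }

def create_index_request(base_url, index, doc_type):
    """Build (but do not send) the PUT request that creates the index."""
    body = json.dumps(siren_mapping(doc_type)).encode("utf-8")
    return urllib.request.Request(f"{base_url}/{index}/", data=body, method="PUT")

request = create_index_request("http://localhost:9200", "test", "companies")
print(request.get_method(), request.full_url)
# Sending requires a running node with the plugin installed:
# urllib.request.urlopen(request)
```

Building the body with `json.dumps` guarantees balanced braces and valid JSON, which is easy to get wrong when typing the mapping by hand in a shell.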
  • 17. Indexing a Document
        $ curl -XPUT 'http://localhost:9200/test/companies/1' -d '
        {
          "name" : "Elasticsearch",
          "category_code" : "search",
          "funding_rounds" : [
            {
              "round_code" : "a",
              "raised_amount" : 10000000,
              "funded_year" : 2012,
              "investments" : [
                {
                  "name" : "Data Collective",
                  "type" : "financial-org"
                }
              ]
            }
          ]
        }'
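The indexing call addresses the document by index, type and id, as described under the Elasticsearch basic concepts. A minimal sketch of that addressing in Python follows; `document_url` and `index_request` are hypothetical helper names, and the request is built without being sent.

```python
import json
import urllib.request

def document_url(base_url, index, doc_type, doc_id):
    """A document is uniquely identified by index/type/id."""
    return f"{base_url}/{index}/{doc_type}/{doc_id}"

def index_request(base_url, index, doc_type, doc_id, document):
    """Build (but do not send) the PUT request that indexes one document."""
    body = json.dumps(document).encode("utf-8")
    url = document_url(base_url, index, doc_type, doc_id)
    return urllib.request.Request(url, data=body, method="PUT")

company = {
    "name": "Elasticsearch",
    "category_code": "search",
    "funding_rounds": [
        {
            "round_code": "a",
            "raised_amount": 10000000,
            "funded_year": 2012,
            "investments": [
                {"name": "Data Collective", "type": "financial-org"}
            ],
        }
    ],
}

request = index_request("http://localhost:9200", "test", "companies", 1, company)
print(request.get_method(), request.full_url)
```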
  • 18. Customising Datatype Analysis
      • SIREn distinguishes 5 types of values in the JSON model:
          – Field, String, Long, Double, Boolean
      [Diagram: the example document from slide 17, annotated with the datatype callouts json:field, xsd:long and xsd:string]
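To make the five value types concrete, here is a small sketch that labels every attribute name and leaf value of a parsed JSON document with a datatype URI. The field, string and long URIs are the ones shown on the configuration slide; the double and boolean URIs are the standard XML Schema names and are an assumption about what SIREn uses. The functions `datatype_of` and `typed_tokens` are hypothetical, and skipping JSON `null` is a simplification of this sketch.

```python
FIELD = "http://json.org/field"
XSD = "http://www.w3.org/2001/XMLSchema#"

def datatype_of(value):
    """Map a JSON leaf value to a datatype URI (double/boolean URIs are assumed)."""
    # bool must be tested before int because bool is a subclass of int in Python.
    if isinstance(value, bool):
        return XSD + "boolean"
    if isinstance(value, int):
        return XSD + "long"
    if isinstance(value, float):
        return XSD + "double"
    if isinstance(value, str):
        return XSD + "string"
    raise TypeError(f"unsupported JSON value: {value!r}")

def typed_tokens(value):
    """Yield (datatype, token) for every attribute name and every leaf value."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield FIELD, key
            yield from typed_tokens(child)
    elif isinstance(value, list):
        for child in value:
            yield from typed_tokens(child)
    elif value is not None:  # JSON null is skipped in this sketch
        yield datatype_of(value), value

for datatype, token in typed_tokens({"funded_year": 2012, "name": "Data Collective"}):
    print(datatype, "->", token)
```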
  • 19. Customising Datatype Analysis
        # config/elasticsearch.yml
        siren.analysis.datatype:
          http://json.org/field:
            index_analyzer: keyword
          http://www.w3.org/2001/XMLSchema#string:
            index_analyzer: simple
            search_analyzer: simpleFolding
          http://www.w3.org/2001/XMLSchema#long:
            index_analyzer: long
        index.analysis.analyzer:
          simpleFolding:
            type: custom
            tokenizer: letter
            filter: [lowercase, asciifolding]
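The custom `simpleFolding` analyzer chains a letter tokenizer with lowercase and ASCII-folding filters. The sketch below approximates that chain in plain Python to show what kind of tokens come out. It is only an approximation: Lucene's ASCII folding covers more characters than the Unicode decomposition used here, and the function names are hypothetical.

```python
import re
import unicodedata

def letter_tokenize(text):
    """Split text into maximal runs of letters; digits and punctuation separate tokens."""
    return re.findall(r"[^\W\d_]+", unicodedata.normalize("NFC", text))

def ascii_fold(token):
    """Strip diacritics by decomposing characters and dropping non-ASCII code points."""
    decomposed = unicodedata.normalize("NFKD", token)
    return decomposed.encode("ascii", "ignore").decode("ascii")

def simple_folding(text):
    """Apply letter tokenizer, then lowercase, then ASCII folding, in that order."""
    tokens = (ascii_fold(token.lower()) for token in letter_tokenize(text))
    # Tokens with no ASCII equivalent fold to "" here and are dropped (a sketch-only choice).
    return [token for token in tokens if token]

print(simple_folding("S\u00f6ren Auer's Caf\u00e9 2014"))
```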
  • 20-21. Searching: Node Query
        $ curl -XGET 'localhost:9200/test/companies/_search' -d '
        {
          "query" : {
            "hybrid-json" : {
              "node" : {
                "query" : "search"
              }
            }
          }
        }'
      Callouts: SIREn’s Query Type; SIREn’s Query
      Full-Text Query:
          • Boolean, Phrase, Range, Fuzzy, Regexp, Proximity, ...
      [Diagram: the document tree from slide 11]
  • 22. Creating Knowledge out of Interlinked Data Searching: Node Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "node" : { "node" : { "query" : "search", "attribute" : "category_code", "query" : "search", "attribute" : "category_code", } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 22 http://lod2.eu
  • 28. Searching: Twig Query
    • Query operators for Ancestor-Descendant and Parent-Child relationships
    • Consists of a root query and one or more child and descendant queries
    • Can be nested to form complex tree structure
    [Diagram labels: Boolean (root); Phrase, MUST; Twig, NOT; Range, MUST; Boolean, SHOULD]
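The difference between the Parent-Child and Ancestor-Descendant relationships can be illustrated with a toy in-memory matcher. This is not how SIREn evaluates twig queries; it is a sketch under stated assumptions: the sample company is modelled as nested Python dicts and lists, values are compared by exact string match instead of analysed full-text search, and all function names are hypothetical.

```python
company = {
    "name": "Elasticsearch",
    "category_code": "search",
    "funding_rounds": [{
        "round_code": "a",
        "raised_amount": 10000000,
        "funded_year": 2012,
        "investments": [{"name": "Data Collective", "type": "financial-org"}],
    }],
}


def objects_under(attribute, value):
    """Yield every object stored under `attribute`, at any depth of `value`."""
    if isinstance(value, dict):
        for key, sub in value.items():
            if key == attribute:
                for item in (sub if isinstance(sub, list) else [sub]):
                    if isinstance(item, dict):
                        yield item
            yield from objects_under(attribute, sub)
    elif isinstance(value, list):
        for item in value:
            yield from objects_under(attribute, item)


def has_child(obj, attribute, text):
    """Parent-Child: the attribute must sit directly on the object."""
    return attribute in obj and str(obj[attribute]) == text


def has_descendant(node, attribute, text):
    """Ancestor-Descendant: the attribute may sit at any depth below the node."""
    if isinstance(node, dict):
        return any(
            (key == attribute and str(sub) == text) or has_descendant(sub, attribute, text)
            for key, sub in node.items()
        )
    if isinstance(node, list):
        return any(has_descendant(item, attribute, text) for item in node)
    return False


rounds = list(objects_under("funding_rounds", company))
year_is_child = has_child(rounds[0], "funded_year", "2012")
investor_is_child = has_child(rounds[0], "name", "Data Collective")
investor_is_descendant = has_descendant(rounds[0], "name", "Data Collective")
print(year_is_child, investor_is_child, investor_is_descendant)
```

Here the investor name is not a direct child of the funding round, because it sits one level deeper under investments, so only the descendant check finds it.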
  • 29. Searching: Twig Query
    $ curl -XGET 'localhost:9200/test/companies/_search' -d '
    {
      "query" : {
        "hybrid-json" : {
          "twig" : {
            "root" : "funding_rounds",
            "child" : [{
              "occur" : "must",
              "node" : {
                "query" : "2012",
                "attribute" : "funded_year"
              }
            }]
          }
        }
      }
    }'
    Callouts on the slide:
    • Twig Query Operator
    • Root Query
    • Child Query
    Example document:
    name : Elasticsearch
    category_code : search
    funding_rounds
      round_code : a
      raised_amount : 10000000
      funded_year : 2012
      investments
        name : Data Collective
        type : financial-org
      investments
        name : ...
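A twig query body with the same shape can be assembled from a root and a list of child clauses. As before, this is a sketch: `twig_query` is a hypothetical helper, and the key names ("twig", "root", "child", "occur", "node") are taken from the slide and are assumed, not verified, to match the plugin's query syntax.

```python
import json


def twig_query(root, children):
    """Build a twig query body from a root and (occur, query, attribute) child clauses."""
    child_clauses = [
        {"occur": occur, "node": {"query": query, "attribute": attribute}}
        for occur, query, attribute in children
    ]
    return {"query": {"hybrid-json": {"twig": {"root": root, "child": child_clauses}}}}


# Funding rounds that must have a funded_year child matching "2012".
twig_body = json.dumps(twig_query("funding_rounds", [("must", "2012", "funded_year")]), indent=2)
print(twig_body)
```

Generating the body with `json.dumps` guarantees that separators between fields such as "root" and "child" are always present.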
  • 34. Searching: Proximity Query
    $ curl -XGET 'localhost:9200/test/companies/_search' -d '{
      "query" : { "hybrid-json" : {
        "boolean" : {
          "inOrder" : true,
          "slop" : 0,
          "clause" : [{
            "twig" : {
              "root" : "funding_rounds",
              "descendant" : [{
                "level" : 2,
                "node" : { "query" : "Data AND Collective" }
              }]
            }
          },{
            "twig" : {
              "root" : "funding_rounds",
              "descendant" : [{
                "level" : 2,
                "node" : { "query" : "Index AND Ventures" }
              }]
            }
          }]
    }}}}'
    Callouts on the slide:
    • Proximity Constraint
    • Order Constraint
    • Boolean Clauses
    Example document:
    name : Elasticsearch
    category_code : search
    funding_rounds
      investments
        name : Data Collective
        type : financial-org
    funding_rounds
      investments
        name : Index Ventures
        type : financial-org
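The two twig clauses of the proximity query differ only in their query string, so the body lends itself to being generated. The following sketch assumes the key names shown on the slide ("boolean", "inOrder", "slop", "clause", "descendant", "level"); the helper names `descendant_twig` and `proximity_query` are hypothetical.

```python
import json


def descendant_twig(root, level, query):
    """One twig clause: a root with a single descendant query at a fixed level."""
    return {"twig": {"root": root, "descendant": [{"level": level, "node": {"query": query}}]}}


def proximity_query(clauses, in_order=True, slop=0):
    """Wrap twig clauses in a boolean query with order and proximity constraints."""
    return {"query": {"hybrid-json": {"boolean": {
        "inOrder": in_order,
        "slop": slop,
        "clause": clauses,
    }}}}


investors = ["Data AND Collective", "Index AND Ventures"]
clauses = [descendant_twig("funding_rounds", 2, name) for name in investors]
proximity_body = json.dumps(proximity_query(clauses), indent=2)
print(proximity_body)
```

Relaxing the constraints is then a matter of passing a larger `slop` or `in_order=False`, assuming the plugin interprets those fields as the slide's callouts suggest.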
  • 38. Demo
  • 39. Conclusion
    • SIREn's Key Features
      – Dynamic/Schema-less Data Management
      – Nested Data
      – High performance and scalability
      – Powerful search operators
      – Elasticsearch/Solr integration
    • Contact
      – http://sirendb.com
      – SindiceTech
      – info@sindicetech.com
  • 40. Credits
    Jingle: R.E.M., Martin Kaltenböck, Florian Kondert
    Coordination: Thomas Thurner, Martin Kaltenböck
    Moderation: Martin Kaltenböck
    Presented by: Renaud Delbru, Harish Kumar
  • 41. Hope you enjoyed staying with us. If you need more detailed information, visit us at www.lod2.eu and let us know how we can improve to meet your expectations!
    Don't forget to register for our next webinar:
    20.12.2011 - Virtuoso (Open Link Software)
    24.01.2012 - OntoWiki (University of Leipzig, Germany)
    Have a great day and don't forget ...

Editor's Notes

  1. Binary including a full distribution of Elasticsearch, with SIREn pre-installed.
  2. The full JSON document will be indexed in both Elasticsearch and SIREn. Need to give an example here.