SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Using Substitutive Itemset Mining Framework for
Finding Synonymous Properties in Linked Data
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski
Poznan University of Technology, Poland
August 3rd, 2015
RuleML 2015
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Outline
Motivating Scenario
Substitutive Sets Mining
Finding Synonymous Properties with Substitutive Sets Mining
Summary
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Motivating scenario 1/3
Wiki	
  	
  
(mappings	
  	
  
Wikipedia	
  infoboxes	
  -­‐>	
  
DBpedia	
  ontology)	
  
Norah_Jones	
  
	
  
Denton,_Texas	
  
dbpedia-­‐prop:origin	
  
Mark_Knopfler	
   Gosforth	
  
dbpedia-­‐owl:hometown	
  
Peter_Gabriel	
   Godalming	
  
dbpedia-­‐owl:hometown	
  
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Motivating scenario 2/3
DBpedia 2014 ontology has 1310 object and 1725 data properties
Many large Linked Data use relatively lightweight schemas with a
high number of object properties
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Motivating scenario 3/3
Wiki	
  	
  
(mappings	
  	
  
Wikipedia	
  infoboxes	
  -­‐>	
  
DBpedia	
  ontology)	
  
dbpedia-­‐owl:MusicalArAst	
  	
   dbpedia-­‐owl:PopulatedPlace	
  	
  
Norah_Jones	
  
	
  
Denton,_Texas	
  
dbpedia-­‐prop:origin	
  
Mark_Knopfler	
   Gosforth	
  
dbpedia-­‐owl:hometown	
  
Peter_Gabriel	
   Godalming	
  
dbpedia-­‐owl:hometown	
  
dbpedia-­‐owl:MusicalArAst	
  	
   dbpedia-­‐owl:PopulatedPlace	
  	
  
dbpedia-­‐owl:MusicalArAst	
  	
   dbpedia-­‐owl:PopulatedPlace	
  	
  
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Substitutive Sets Mining Framework
Frequent(
Itemset(Mining(
Subs1tu1ve(Set(
Genera1on(
Transac1on(DB(
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Frequent Itemsets
I = {i1,i2,...,im} - a set of items
DT = {t1,t2,...,tn}, where ∀iti ⊆ I -a database of transactions
support(X) =
{t∈DT X⊆t}
DT
ID Items
1 Nachos, Pepsi, Salsa
2 Nachos, Coca-Cola, Salsa
3 Nachos, Coca-Cola
4 Nachos, Pepsi, Salsa
5 Milk, Bread
Frequent Itemset Support
{Nachos} 80%
{Salsa} 60%
{Coca-Cola} 40%
{Pepsi} 40%
{Nachos, Salsa} 60%
{Nachos, Coca-Cola} 40%
{Nachos, Pepsi} 40%
{Salsa, Pepsi} 40%
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Covering Set
CS(i L) = {X ∈ L {i} ∪ X ∈ L}
coverage(i L) = CS(i L)
Frequent Itemset
{Nachos}
{Salsa}
{Coca-Cola}
{Pepsi}
{Nachos, Salsa}
{Nachos, Coca-Cola}
{Nachos, Pepsi}
{Salsa, Pepsi}
i CS(i) coverage
{Nachos} {{Salsa}, {Coca-Cola}, {Pepsi}} 3
{Salsa} {{Nachos}} 1
{Coca-Cola} {{Nachos}} 1
{Pepsi} {{Nachos}, {Salsa}} 2
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Substitutive Sets
A two-element itemset {x,y} is a substitutive itemset, if:
x ∈ L1,
y ∈ L1,
support({x} ∪ {y}) < ε, where ε is a user-defined threshold
representing the highest amount of noise in the data allowed,
CS(x L)∩CS(y L)
max{ CS L(x) , CS(y L) } ⩾ mincommon.
i CS(i) coverage
{Nachos} {{Salsa}, {Coca-Cola}, {Pepsi}} 3
{Salsa} {{Nachos}} 1
{Coca-Cola} {{Nachos}} 1
{Pepsi} {{Nachos}, {Salsa}} 2
CS(Pepsi)∩CS(Coca−Cola)
max{ CS(Pepsi) , CS(Coca−Cola) }
= 0.5
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Create Substitutive Sets RapidMiner operator
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Use Case: DBpedia
DBpedia knowledge base version 2014
sets of 3–item transactions {c1,p,c2}, where c1 and c2 classes of
subject and object of RDF triple, and p property connecting s and o
SELECT ?c1 ?p ?c2
WHERE {
?s rdf:type dbpedia-owl:Organization .
?s ?p ?o .
?s rdf:type ?c1 .
?o rdf:type ?c2 .
FILTER(?p != dbpedia-owl:wikiPageWikiLink) .
FILTER(?p != rdf:type) .
FILTER(?p != dbpedia-owl:wikiPageExternalLink) .
FILTER(?p != dbpedia-owl:wikiPageID) .
FILTER(?p != dbpedia-owl:wikiPageInterLanguageLink) .
FILTER(?p != dbpedia-owl:wikiPageLength) .
FILTER(?p != dbpedia-owl:wikiPageOutDegree) .
FILTER(?p != dbpedia-owl:wikiPageRedirects) .
FILTER(?p != dbpedia-owl:wikiPageRevisionID)
}
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Transaction generation
dbpedia'owl:MusicalAr2st44 dbpedia'owl:PopulatedPlace44
Norah_Jones4
4
Denton,_Texas4
dbpedia'prop:origin4
Mark_Knopfler4 Gosforth4
dbpedia'owl:hometown4
dbpedia'owl:MusicalAr2st44 dbpedia'owl:PopulatedPlace44
s4
s4 p4
p4 o4
o4
c14
c14
c24
c24
Transactions
{c1 dbpedia-owl:MusicalArtist, dbpedia-owl:hometown, c2 dbpedia-owl:PopulatedPlace }
{c1 dbpedia-owl:MusicalArtist, dbpedia-prop:origin , c2 dbpedia-owl:PopulatedPlace }
...
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Experimental Setup
FP-Growth: min number of itemsets = 500, max number of retries =
15, min support: 1.0E-4),
Create Substitutive Sets: min support = 1.0E-4, min common
= 0.7, epsilon =1.0E-5,
a sample of 100k results per each query,
desktop computer with 12GB RAM and CPU Intel(R) Core(TM)
i5-4570 3.20GHz,
a single run of mining substitutive sets (for a single class and 100k
transactions) took several seconds on average (ranging from 2s to
12s)
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Sample substitutive properties for the class Organisation
Item X Item Y Common
dbpprop:parentOrganization dbo:parentOrganisation 1.000
dbpprop:owner dbo:owner 1.000
dbpprop:origin dbo:hometown 1.000
dbpprop:headquarters dbpprop:parentOrganization 1.000
dbpprop:formerAffiliations dbo:formerBroadcastNetwork 1.000
dbo:product dbpprop:products 1.000
dbpprop:keyPeople dbo:keyPerson 0.910
dbpprop:commandStructure dbpprop:branch 0.857
dbo:schoolPatron dbo:foundedBy 0.835
dbpprop:notableCommanders dbo:notableCommander 0.824
dbo:recordLabel dbpprop:label 0.803
dbo:headquarter dbo:locationCountry 0.803
dbpprop:country dbo:state 0.753
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Summary
Introduced a model for substitutive itemsets mining
Preliminary tests of this model within the task of deduplication of
object properties in an RDF dataset (DBpedia)
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15
Acknowledgements
Foundation for Polish Science under the POMOST programme,
cofinanced from European Union, Regional Development Fund (No
POMOST/2013-7/8) (2013-2015)
EU FP7 ICT-2007.4.4 (No 231519) ”e-LICO: An e-Laboratory for
Interdisciplinary Collaborative Research in Data Mining and
Data-Intensive Science” (2009-2012)
Thanks to Ewa Kowalczuk for debugging the RapidMiner plugin
Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked
August 3rd, 2015 RuleML 20
/ 15

Weitere ähnliche Inhalte

Was ist angesagt?

Learning to assess Linked Data relationships using Genetic Programming
Learning to assess Linked Data relationships using Genetic ProgrammingLearning to assess Linked Data relationships using Genetic Programming
Learning to assess Linked Data relationships using Genetic ProgrammingVrije Universiteit Amsterdam
 
LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013Luis Daniel Ibáñez
 
The lifecycle of reproducible science data and what provenance has got to do ...
The lifecycle of reproducible science data and what provenance has got to do ...The lifecycle of reproducible science data and what provenance has got to do ...
The lifecycle of reproducible science data and what provenance has got to do ...Paolo Missier
 
Scalable Whole-Exome Sequence Data Processing Using Workflow On A Cloud
Scalable Whole-Exome Sequence Data Processing Using Workflow On A CloudScalable Whole-Exome Sequence Data Processing Using Workflow On A Cloud
Scalable Whole-Exome Sequence Data Processing Using Workflow On A Cloud Paolo Missier
 

Was ist angesagt? (6)

Oshs_9_11_2015
Oshs_9_11_2015Oshs_9_11_2015
Oshs_9_11_2015
 
Learning to assess Linked Data relationships using Genetic Programming
Learning to assess Linked Data relationships using Genetic ProgrammingLearning to assess Linked Data relationships using Genetic Programming
Learning to assess Linked Data relationships using Genetic Programming
 
LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013
 
The lifecycle of reproducible science data and what provenance has got to do ...
The lifecycle of reproducible science data and what provenance has got to do ...The lifecycle of reproducible science data and what provenance has got to do ...
The lifecycle of reproducible science data and what provenance has got to do ...
 
Scalable Whole-Exome Sequence Data Processing Using Workflow On A Cloud
Scalable Whole-Exome Sequence Data Processing Using Workflow On A CloudScalable Whole-Exome Sequence Data Processing Using Workflow On A Cloud
Scalable Whole-Exome Sequence Data Processing Using Workflow On A Cloud
 
Streams&amp;io
Streams&amp;ioStreams&amp;io
Streams&amp;io
 

Andere mochten auch

RuleML2015: Rule-Based Exploration of Structured Data in the Browser
RuleML2015: Rule-Based Exploration of Structured Data in the BrowserRuleML2015: Rule-Based Exploration of Structured Data in the Browser
RuleML2015: Rule-Based Exploration of Structured Data in the BrowserRuleML
 
Frequent Itemset Mining(FIM) on BigData
Frequent Itemset Mining(FIM) on BigDataFrequent Itemset Mining(FIM) on BigData
Frequent Itemset Mining(FIM) on BigDataRaju Gupta
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodShani729
 
Frequent Pattern growth algorithm
Frequent Pattern growth algorithmFrequent Pattern growth algorithm
Frequent Pattern growth algorithmAshis Kumar Chanda
 
Frequent itemset mining methods
Frequent itemset mining methodsFrequent itemset mining methods
Frequent itemset mining methodsProf.Nilesh Magar
 

Andere mochten auch (6)

RuleML2015: Rule-Based Exploration of Structured Data in the Browser
RuleML2015: Rule-Based Exploration of Structured Data in the BrowserRuleML2015: Rule-Based Exploration of Structured Data in the Browser
RuleML2015: Rule-Based Exploration of Structured Data in the Browser
 
Frequent Itemset Mining(FIM) on BigData
Frequent Itemset Mining(FIM) on BigDataFrequent Itemset Mining(FIM) on BigData
Frequent Itemset Mining(FIM) on BigData
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth method
 
Frequent Pattern growth algorithm
Frequent Pattern growth algorithmFrequent Pattern growth algorithm
Frequent Pattern growth algorithm
 
Frequent itemset mining methods
Frequent itemset mining methodsFrequent itemset mining methods
Frequent itemset mining methods
 
Data mining
Data miningData mining
Data mining
 

Ähnlich wie RuleML2015: Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked Data

Semantic data mining: an ontology based approach
Semantic data mining: an ontology based approachSemantic data mining: an ontology based approach
Semantic data mining: an ontology based approachAgnieszka Ławrynowicz
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and KnowledgeIan Foster
 
Opportunities for X-Ray science in future computing architectures
Opportunities for X-Ray science in future computing architecturesOpportunities for X-Ray science in future computing architectures
Opportunities for X-Ray science in future computing architecturesIan Foster
 
2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinalDeborah McGuinness
 
Making Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresMaking Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresThomas Gottron
 
Inductive Entity Typing Alignment
Inductive Entity Typing AlignmentInductive Entity Typing Alignment
Inductive Entity Typing AlignmentGiuseppe Rizzo
 
Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05Joanne Luciano
 
Ontology based clustering algorithms
Ontology based clustering algorithmsOntology based clustering algorithms
Ontology based clustering algorithmsIkutwa
 
Finding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic WebFinding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic Webebiquity
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of DataRinke Hoekstra
 
Kid171 chap02 english version
Kid171 chap02 english versionKid171 chap02 english version
Kid171 chap02 english versionFrank S.C. Tseng
 
La résolution de problèmes à l'aide de graphes
La résolution de problèmes à l'aide de graphesLa résolution de problèmes à l'aide de graphes
La résolution de problèmes à l'aide de graphesData2B
 
Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1IPLODProject
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolLaura Po
 
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...Nikolaos Konstantinou
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Maulik Kamdar
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)Frank van Harmelen
 
Online Relation Alignment for Linked Datasets
Online Relation Alignment for Linked DatasetsOnline Relation Alignment for Linked Datasets
Online Relation Alignment for Linked DatasetsMaria Koutraki
 

Ähnlich wie RuleML2015: Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked Data (20)

Semantic data mining: an ontology based approach
Semantic data mining: an ontology based approachSemantic data mining: an ontology based approach
Semantic data mining: an ontology based approach
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and Knowledge
 
Opportunities for X-Ray science in future computing architectures
Opportunities for X-Ray science in future computing architecturesOpportunities for X-Ray science in future computing architectures
Opportunities for X-Ray science in future computing architectures
 
2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal
 
Making Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresMaking Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index Structures
 
Inductive Entity Typing Alignment
Inductive Entity Typing AlignmentInductive Entity Typing Alignment
Inductive Entity Typing Alignment
 
Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05
 
Ontology based clustering algorithms
Ontology based clustering algorithmsOntology based clustering algorithms
Ontology based clustering algorithms
 
Finding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic WebFinding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic Web
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
Kid171 chap02 english version
Kid171 chap02 english versionKid171 chap02 english version
Kid171 chap02 english version
 
La résolution de problèmes à l'aide de graphes
La résolution de problèmes à l'aide de graphesLa résolution de problèmes à l'aide de graphes
La résolution de problèmes à l'aide de graphes
 
Presentation at MTSR 2012
Presentation at MTSR 2012Presentation at MTSR 2012
Presentation at MTSR 2012
 
Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX tool
 
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...
 
ConQueSt
ConQueStConQueSt
ConQueSt
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)
 
Online Relation Alignment for Linked Datasets
Online Relation Alignment for Linked DatasetsOnline Relation Alignment for Linked Datasets
Online Relation Alignment for Linked Datasets
 

Mehr von RuleML

Aggregates in Recursion: Issues and Solutions
Aggregates in Recursion: Issues and SolutionsAggregates in Recursion: Issues and Solutions
Aggregates in Recursion: Issues and SolutionsRuleML
 
A software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasksA software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasksRuleML
 
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...RuleML
 
RuleML 2015: When Processes Rule Events
RuleML 2015: When Processes Rule EventsRuleML 2015: When Processes Rule Events
RuleML 2015: When Processes Rule EventsRuleML
 
RuleML 2015: Ontology Reasoning using Rules in an eHealth Context
RuleML 2015: Ontology Reasoning using Rules in an eHealth ContextRuleML 2015: Ontology Reasoning using Rules in an eHealth Context
RuleML 2015: Ontology Reasoning using Rules in an eHealth ContextRuleML
 
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...RuleML
 
Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...
Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...
Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...RuleML
 
Rule Generalization Strategies in Incremental Learning of Disjunctive Concepts
Rule Generalization Strategies in Incremental Learning of Disjunctive ConceptsRule Generalization Strategies in Incremental Learning of Disjunctive Concepts
Rule Generalization Strategies in Incremental Learning of Disjunctive ConceptsRuleML
 
RuleML 2015 Constraint Handling Rules - What Else?
RuleML 2015 Constraint Handling Rules - What Else?RuleML 2015 Constraint Handling Rules - What Else?
RuleML 2015 Constraint Handling Rules - What Else?RuleML
 
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box RuleML
 
RuleML2015 PSOA RuleML: Integrated Object-Relational Data and Rules
RuleML2015 PSOA RuleML: Integrated Object-Relational Data and RulesRuleML2015 PSOA RuleML: Integrated Object-Relational Data and Rules
RuleML2015 PSOA RuleML: Integrated Object-Relational Data and RulesRuleML
 
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...RuleML
 
A Service for Improving the Assignments of Common Agriculture Policy Funds to...
A Service for Improving the Assignments of Common Agriculture Policy Funds to...A Service for Improving the Assignments of Common Agriculture Policy Funds to...
A Service for Improving the Assignments of Common Agriculture Policy Funds to...RuleML
 
Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-
Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-
Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-RuleML
 
RuleML2015: Binary Frontier-guarded ASP with Function Symbols
RuleML2015: Binary Frontier-guarded ASP with Function SymbolsRuleML2015: Binary Frontier-guarded ASP with Function Symbols
RuleML2015: Binary Frontier-guarded ASP with Function SymbolsRuleML
 
RuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge Platforms
RuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge PlatformsRuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge Platforms
RuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge PlatformsRuleML
 
RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...
RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...
RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...RuleML
 
RuleML2015: Compact representation of conditional probability for rule-based...
RuleML2015:  Compact representation of conditional probability for rule-based...RuleML2015:  Compact representation of conditional probability for rule-based...
RuleML2015: Compact representation of conditional probability for rule-based...RuleML
 
RuleML2015: Learning Characteristic Rules in Geographic Information Systems
RuleML2015: Learning Characteristic Rules in Geographic Information SystemsRuleML2015: Learning Characteristic Rules in Geographic Information Systems
RuleML2015: Learning Characteristic Rules in Geographic Information SystemsRuleML
 
RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...
RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...
RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...RuleML
 

Mehr von RuleML (20)

Aggregates in Recursion: Issues and Solutions
Aggregates in Recursion: Issues and SolutionsAggregates in Recursion: Issues and Solutions
Aggregates in Recursion: Issues and Solutions
 
A software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasksA software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasks
 
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
 
RuleML 2015: When Processes Rule Events
RuleML 2015: When Processes Rule EventsRuleML 2015: When Processes Rule Events
RuleML 2015: When Processes Rule Events
 
RuleML 2015: Ontology Reasoning using Rules in an eHealth Context
RuleML 2015: Ontology Reasoning using Rules in an eHealth ContextRuleML 2015: Ontology Reasoning using Rules in an eHealth Context
RuleML 2015: Ontology Reasoning using Rules in an eHealth Context
 
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
RuleML 2015: Semantics of Notation3 Logic: A Solution for Implicit Quantifica...
 
Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...
Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...
Challenge@RuleML2015 Developing Situation-Aware Applications for Disaster Man...
 
Rule Generalization Strategies in Incremental Learning of Disjunctive Concepts
Rule Generalization Strategies in Incremental Learning of Disjunctive ConceptsRule Generalization Strategies in Incremental Learning of Disjunctive Concepts
Rule Generalization Strategies in Incremental Learning of Disjunctive Concepts
 
RuleML 2015 Constraint Handling Rules - What Else?
RuleML 2015 Constraint Handling Rules - What Else?RuleML 2015 Constraint Handling Rules - What Else?
RuleML 2015 Constraint Handling Rules - What Else?
 
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
 
RuleML2015 PSOA RuleML: Integrated Object-Relational Data and Rules
RuleML2015 PSOA RuleML: Integrated Object-Relational Data and RulesRuleML2015 PSOA RuleML: Integrated Object-Relational Data and Rules
RuleML2015 PSOA RuleML: Integrated Object-Relational Data and Rules
 
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
 
A Service for Improving the Assignments of Common Agriculture Policy Funds to...
A Service for Improving the Assignments of Common Agriculture Policy Funds to...A Service for Improving the Assignments of Common Agriculture Policy Funds to...
A Service for Improving the Assignments of Common Agriculture Policy Funds to...
 
Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-
Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-
Datalog+-Track Introduction & Reasoning on UML Class Diagrams via Datalog+-
 
RuleML2015: Binary Frontier-guarded ASP with Function Symbols
RuleML2015: Binary Frontier-guarded ASP with Function SymbolsRuleML2015: Binary Frontier-guarded ASP with Function Symbols
RuleML2015: Binary Frontier-guarded ASP with Function Symbols
 
RuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge Platforms
RuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge PlatformsRuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge Platforms
RuleML2015: API4KP Metamodel: A Meta-API for Heterogeneous Knowledge Platforms
 
RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...
RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...
RuleML2015: Ontology-Based Multidimensional Contexts with Applications to Qua...
 
RuleML2015: Compact representation of conditional probability for rule-based...
RuleML2015:  Compact representation of conditional probability for rule-based...RuleML2015:  Compact representation of conditional probability for rule-based...
RuleML2015: Compact representation of conditional probability for rule-based...
 
RuleML2015: Learning Characteristic Rules in Geographic Information Systems
RuleML2015: Learning Characteristic Rules in Geographic Information SystemsRuleML2015: Learning Characteristic Rules in Geographic Information Systems
RuleML2015: Learning Characteristic Rules in Geographic Information Systems
 
RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...
RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...
RuleML2015: User Extensible System to Identify Problems in OWL Ontologies and...
 

Kürzlich hochgeladen

basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXDole Philippines School
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Tamer Koksalan, PhD
 

Kürzlich hochgeladen (20)

basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)
 

RuleML2015: Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked Data

  • 1. Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked Data Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski Poznan University of Technology, Poland August 3rd, 2015 RuleML 2015 Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 2. Outline Motivating Scenario Substitutive Sets Mining Finding Synonymous Properties with Substitutive Sets Mining Summary Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 3. Motivating scenario 1/3 Wiki     (mappings     Wikipedia  infoboxes  -­‐>   DBpedia  ontology)   Norah_Jones     Denton,_Texas   dbpedia-­‐prop:origin   Mark_Knopfler   Gosforth   dbpedia-­‐owl:hometown   Peter_Gabriel   Godalming   dbpedia-­‐owl:hometown   Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 4. Motivating scenario 2/3 DBpedia 2014 ontology has 1310 object and 1725 data properties Many large Linked Data use relatively lightweight schemas with a high number of object properties Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 5. Motivating scenario 3/3 Wiki     (mappings     Wikipedia  infoboxes  -­‐>   DBpedia  ontology)   dbpedia-­‐owl:MusicalArAst     dbpedia-­‐owl:PopulatedPlace     Norah_Jones     Denton,_Texas   dbpedia-­‐prop:origin   Mark_Knopfler   Gosforth   dbpedia-­‐owl:hometown   Peter_Gabriel   Godalming   dbpedia-­‐owl:hometown   dbpedia-­‐owl:MusicalArAst     dbpedia-­‐owl:PopulatedPlace     dbpedia-­‐owl:MusicalArAst     dbpedia-­‐owl:PopulatedPlace     Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 6. Substitutive Sets Mining Framework Frequent( Itemset(Mining( Subs1tu1ve(Set( Genera1on( Transac1on(DB( Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 7. Frequent Itemsets I = {i1,i2,...,im} - a set of items DT = {t1,t2,...,tn}, where ∀iti ⊆ I -a database of transactions support(X) = {t∈DT X⊆t} DT ID Items 1 Nachos, Pepsi, Salsa 2 Nachos, Coca-Cola, Salsa 3 Nachos, Coca-Cola 4 Nachos, Pepsi, Salsa 5 Milk, Bread Frequent Itemset Support {Nachos} 80% {Salsa} 60% {Coca-Cola} 40% {Pepsi} 40% {Nachos, Salsa} 60% {Nachos, Coca-Cola} 40% {Nachos, Pepsi} 40% {Salsa, Pepsi} 40% Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 8. Covering Set CS(i L) = {X ∈ L {i} ∪ X ∈ L} coverage(i L) = CS(i L) Frequent Itemset {Nachos} {Salsa} {Coca-Cola} {Pepsi} {Nachos, Salsa} {Nachos, Coca-Cola} {Nachos, Pepsi} {Salsa, Pepsi} i CS(i) coverage {Nachos} {{Salsa}, {Coca-Cola}, {Pepsi}} 3 {Salsa} {{Nachos}} 1 {Coca-Cola} {{Nachos}} 1 {Pepsi} {{Nachos}, {Salsa}} 2 Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 9. Substitutive Sets A two-element itemset {x,y} is a substitutive itemset, if: x ∈ L1, y ∈ L1, support({x} ∪ {y}) < ε, where ε is a user-defined threshold representing the highest amount of noise in the data allowed, CS(x L)∩CS(y L) max{ CS L(x) , CS(y L) } ⩾ mincommon. i CS(i) coverage {Nachos} {{Salsa}, {Coca-Cola}, {Pepsi}} 3 {Salsa} {{Nachos}} 1 {Coca-Cola} {{Nachos}} 1 {Pepsi} {{Nachos}, {Salsa}} 2 CS(Pepsi)∩CS(Coca−Cola) max{ CS(Pepsi) , CS(Coca−Cola) } = 0.5 Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 10. Create Substitutive Sets RapidMiner operator Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 11. Use Case: DBpedia DBpedia knowledge base version 2014 sets of 3–item transactions {c1,p,c2}, where c1 and c2 classes of subject and object of RDF triple, and p property connecting s and o SELECT ?c1 ?p ?c2 WHERE { ?s rdf:type dbpedia-owl:Organization . ?s ?p ?o . ?s rdf:type ?c1 . ?o rdf:type ?c2 . FILTER(?p != dbpedia-owl:wikiPageWikiLink) . FILTER(?p != rdf:type) . FILTER(?p != dbpedia-owl:wikiPageExternalLink) . FILTER(?p != dbpedia-owl:wikiPageID) . FILTER(?p != dbpedia-owl:wikiPageInterLanguageLink) . FILTER(?p != dbpedia-owl:wikiPageLength) . FILTER(?p != dbpedia-owl:wikiPageOutDegree) . FILTER(?p != dbpedia-owl:wikiPageRedirects) . FILTER(?p != dbpedia-owl:wikiPageRevisionID) } Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 12. Transaction generation dbpedia'owl:MusicalAr2st44 dbpedia'owl:PopulatedPlace44 Norah_Jones4 4 Denton,_Texas4 dbpedia'prop:origin4 Mark_Knopfler4 Gosforth4 dbpedia'owl:hometown4 dbpedia'owl:MusicalAr2st44 dbpedia'owl:PopulatedPlace44 s4 s4 p4 p4 o4 o4 c14 c14 c24 c24 Transactions {c1 dbpedia-owl:MusicalArtist, dbpedia-owl:hometown, c2 dbpedia-owl:PopulatedPlace } {c1 dbpedia-owl:MusicalArtist, dbpedia-prop:origin , c2 dbpedia-owl:PopulatedPlace } ... Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 13. Experimental Setup FP-Growth: min number of itemsets = 500, max number of retries = 15, min support: 1.0E-4), Create Substitutive Sets: min support = 1.0E-4, min common = 0.7, epsilon =1.0E-5, a sample of 100k results per each query, desktop computer with 12GB RAM and CPU Intel(R) Core(TM) i5-4570 3.20GHz, a single run of mining substitutive sets (for a single class and 100k transactions) took several seconds on average (ranging from 2s to 12s) Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 14. Sample substitutive properties for the class Organisation Item X Item Y Common dbpprop:parentOrganization dbo:parentOrganisation 1.000 dbpprop:owner dbo:owner 1.000 dbpprop:origin dbo:hometown 1.000 dbpprop:headquarters dbpprop:parentOrganization 1.000 dbpprop:formerAffiliations dbo:formerBroadcastNetwork 1.000 dbo:product dbpprop:products 1.000 dbpprop:keyPeople dbo:keyPerson 0.910 dbpprop:commandStructure dbpprop:branch 0.857 dbo:schoolPatron dbo:foundedBy 0.835 dbpprop:notableCommanders dbo:notableCommander 0.824 dbo:recordLabel dbpprop:label 0.803 dbo:headquarter dbo:locationCountry 0.803 dbpprop:country dbo:state 0.753 Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 15. Summary Introduced a model for substitutive itemsets mining Preliminary tests of this model within the task of deduplication of object properties in an RDF dataset (DBpedia) Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15
  • 16. Acknowledgements Foundation for Polish Science under the POMOST programme, cofinanced from European Union, Regional Development Fund (No POMOST/2013-7/8) (2013-2015) EU FP7 ICT-2007.4.4 (No 231519) ”e-LICO: An e-Laboratory for Interdisciplinary Collaborative Research in Data Mining and Data-Intensive Science” (2009-2012) Thanks to Ewa Kowalczuk for debugging the RapidMiner plugin Mikolaj Morzy, Agnieszka Lawrynowicz, Mateusz Zozulinski ( Poznan University of Technology, Poland )Using Substitutive Itemset Mining Framework for Finding Synonymous Properties in Linked August 3rd, 2015 RuleML 20 / 15