How much Semantic Data on Small Devices?


Mathieu d'Aquin

Short paper presentation at the EKAW 2010 conference on benchmarking RDF triple stores on small devices.




1. How much semantic data on small devices?
   Mathieu d’Aquin, Andriy Nikolov and Enrico Motta
   Knowledge Media Institute, The Open University, UK
   m.daquin@open.ac.uk, @mdaquin

2. Semantic Data on Small Devices?

3. Benchmarking Semantic Data Tools
   Large-scale benchmarks
   LUBM(1,0): 103,397 triples

4. Extracting sets of small-scale ontologies
   Clusters of ontologies having similar characteristics, except for size

5. Extracting sets of small-scale ontologies
   Characteristics of ontologies
      Size (triples): varies from very small scale to medium scale
      Ratio class/prop.: allowing 50% variance
      Ratio class/inst.: allowing 50% variance
      DL expressivity: complexity of the language
   99 automatically created clusters
   Manual selection of 10
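
The grouping criteria on this slide can be illustrated with a small sketch. This is not the authors' procedure: the slide does not say how "50% variance" is computed or which clustering method was used. The code below assumes that two ratios are compatible when they differ by at most 50% of the larger one, and uses a simple greedy grouping. `OntologyProfile`, `ratio`, `within` and `cluster` are hypothetical names, and the sample figures are made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OntologyProfile:
    """Hypothetical summary of one ontology (field names are assumptions)."""
    name: str
    triples: int
    classes: int
    properties: int
    instances: int
    expressivity: str  # DL expressivity label, e.g. "ALC"


def ratio(numerator: int, denominator: int) -> float:
    # Guard against division by zero by treating an empty denominator as 1.
    return numerator / max(denominator, 1)


def within(a: float, b: float, tolerance: float = 0.5) -> bool:
    # Assumed reading of "50% variance": the two ratios differ by at most
    # `tolerance` times the larger of the two.
    return abs(a - b) <= tolerance * max(a, b)


def cluster(profiles, tolerance: float = 0.5):
    """Greedy grouping: size is ignored, every other characteristic must match."""
    clusters = []
    for p in sorted(profiles, key=lambda q: q.triples):
        for members in clusters:
            seed = members[0]  # compare against the smallest member
            if (
                p.expressivity == seed.expressivity
                and within(ratio(p.classes, p.properties),
                           ratio(seed.classes, seed.properties), tolerance)
                and within(ratio(p.classes, p.instances),
                           ratio(seed.classes, seed.instances), tolerance)
            ):
                members.append(p)
                break
        else:
            # No compatible cluster found: start a new one.
            clusters.append([p])
    return clusters


# Made-up profiles, for illustration only.
profiles = [
    OntologyProfile("a", 100, 10, 5, 20, "ALC"),
    OntologyProfile("b", 1000, 20, 10, 50, "ALC"),
    OntologyProfile("c", 500, 10, 100, 20, "ALC"),
]
for members in cluster(profiles):
    print([p.name for p in members])
```

Each resulting cluster is ordered by size, so it reads as a series of comparable ontologies with a growing number of triples, which is what a small-scale benchmark needs.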

6. Results

7. Queries
   Using real-life ontologies requires domain-independent queries
   A set of 8 generic queries of varying complexity, whose results might depend on inference
      Select all instances of all classes
      Select all comments
      Select all labels and comments
      Select all labels
      Select all classes (RDFS/OWL/DAML)
      Select all properties by their domain
      Select all RDFS classes
      Select all properties applied to instances of all classes
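
The slides list the eight queries only by description. As a rough illustration, they could be written in SPARQL along the following lines. The query text, the dictionary keys and the `build_query` helper are assumptions, not the formulations used in the benchmark. Only the standard RDF, RDFS, OWL and DAML+OIL namespaces are used.

```python
# Standard vocabulary namespaces shared by all queries.
PREFIXES = """\
PREFIX rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX owl:  <http://www.w3.org/2002/07/owl#>
PREFIX daml: <http://www.daml.org/2001/03/daml+oil#>
"""

# One assumed SPARQL formulation per generic query on the slide.
QUERY_BODIES = {
    "instances_of_all_classes":
        "SELECT ?i ?c WHERE { ?i rdf:type ?c }",
    "all_comments":
        "SELECT ?s ?comment WHERE { ?s rdfs:comment ?comment }",
    "all_labels_and_comments":
        "SELECT ?s ?label ?comment WHERE "
        "{ ?s rdfs:label ?label . ?s rdfs:comment ?comment }",
    "all_labels":
        "SELECT ?s ?label WHERE { ?s rdfs:label ?label }",
    "all_classes":
        "SELECT DISTINCT ?c WHERE { "
        "{ ?c rdf:type rdfs:Class } UNION "
        "{ ?c rdf:type owl:Class } UNION "
        "{ ?c rdf:type daml:Class } }",
    "properties_by_domain":
        "SELECT ?p ?d WHERE { ?p rdfs:domain ?d }",
    "all_rdfs_classes":
        "SELECT ?c WHERE { ?c rdf:type rdfs:Class }",
    "properties_applied_to_instances":
        "SELECT DISTINCT ?p ?c WHERE { ?i rdf:type ?c . ?i ?p ?o }",
}


def build_query(name: str) -> str:
    """Return the full query text (prefixes plus body) for one query name."""
    return PREFIXES + QUERY_BODIES[name]


print(build_query("all_rdfs_classes"))
```

Queries such as `all_rdfs_classes` show why results may depend on inference: with RDFS reasoning enabled, a store can report classes that are never explicitly typed as `rdfs:Class` in the data.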

8. Running the benchmarks – Triple Stores
   Jena with TDB persistent storage
      R: as above + RDFS reasoning
   Sesame with persistent storage
      R: as above + RDFS reasoning
   Mulgara with default configuration

9. Running the benchmarks – Device
   Asus EEE PC 700 (2G)

10. Running the benchmarks – Measures
   Loading time: for each ontology, in an empty, re-initialized store
   Disk space: of the persistent store right after loading
   Memory consumption: of the triple store process right after loading the ontology
   Query time: for each ontology, averaged over the 8 queries
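
A minimal sketch of how three of these measures could be collected is shown below. It is not the harness used in the study. `load` and `run_query` are hypothetical callables standing in for whatever API a given triple store exposes, and `store_dir` is assumed to be the directory holding the persistent store. Memory consumption is left out because reading another process's memory is platform-specific.

```python
import os
import time


def timed(action):
    """Run a zero-argument callable and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = action()
    return result, time.perf_counter() - start


def directory_size(path: str) -> int:
    """Total size in bytes of all files under `path` (the persistent store)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            total += os.path.getsize(os.path.join(root, name))
    return total


def average_query_time(run_query, queries) -> float:
    """Mean wall-clock time of `run_query` over the given queries."""
    elapsed = [timed(lambda q=q: run_query(q))[1] for q in queries]
    return sum(elapsed) / len(elapsed)


def benchmark(load, run_query, queries, store_dir):
    """Measure one ontology: load it, then size the store, then query it."""
    _, loading_time = timed(load)
    return {
        "loading_time_s": loading_time,
        "disk_space_bytes": directory_size(store_dir),
        "avg_query_time_s": average_query_time(run_query, queries),
    }
```

To follow the slide, `load` would be called on an empty, re-initialized store for each ontology, and `queries` would hold the 8 generic queries.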

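11. Results – Loading time [chart]

12. Results – Loading time [chart with comparison annotations]

13. Results – Disk space [chart]

14. Results – Disk space [chart with comparison annotations]

15. Results – Memory consumption [chart]

16. Results – Memory consumption [chart with comparison annotations]

17. Results – Query time [chart]

18. Results – Query time [chart with comparison annotations]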

19. Conclusion – on tests
   Sesame performs best in almost all aspects, even when including reasoning
   Reasoning has a big impact on Jena TDB at query time
   Mulgara is clearly not adequate in a small-scale scenario

20. Conclusion – on small-scale benchmarking
   Validates our assumption that small-scale benchmarks give different results than large-scale benchmarks
   Points out the need for more work to tackle small-scale scenarios
   Results are not always clear-cut in every aspect: benchmarks serve as support to decide which tool to use, depending on the application constraints
