Path-based storytelling with Linked Data on the Web lets users discover concepts in an entertaining and educational way. Given a query context, many state-of-the-art pathfinding approaches aim to tell a story that coincides with the user's expectations by investigating paths over Linked Data on the Web. By taking serendipity in storytelling into account, we aim to improve and tailor existing approaches to better fit user expectations, so that users are able to discover interesting knowledge without feeling unsure or even lost in the story facts. To this end, we propose to optimize the link estimation between, and the selection of, facts in a story by increasing the consistency and relevancy of links between facts through additional domain-delineation and refinement steps. To address multiple aspects of serendipity, we propose and investigate combinations of weights and heuristics in the paths forming the essential building blocks of each story. Our experimental findings with stories based on DBpedia indicate the improvements achieved when applying the optimized algorithm.
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
1. Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Laurens De Vocht
Christian Beecks, Ruben Verborgh, Erik Mannens, Thomas Seidl, Rik Van de Walle
5. How can we consistently improve and tailor existing pathfinding approaches? [pathfinding]
How do heuristics affect user expectations, so that users are able to discover while feeling confident about the relevance of the story facts? [serendipity]
Is semantic distance between facts a good criterion for optimizing the paths forming a story? [user judgments]
22. Serendipity – User Judgments
Least agreement (high standard deviation):
Carl Linnaeus and Albert Einstein [JCWJaccard]
Carl Linnaeus and Baruch Spinoza are Expert, Intellectual and Scholar; Baruch Spinoza and Albert Einstein are both Pantheists, Intellectuals and Jewish Philosophers.
Most relevant and consistent:
Charles Darwin and Carl Linnaeus [CNDJaccard]
The Copley Medal is an award of Alfred Russel Wallace and Charles Darwin; Alfred Russel Wallace's and Charles Darwin's awards are the Royal Medal and the Copley Medal; Alfred Russel Wallace and Charles Darwin are known for Natural selection; Carl Linnaeus and Alfred Russel Wallace have as subject "Fellows of the Royal Society"; Carl Linnaeus and Alfred Russel Wallace are Biologists and Colleagues.
24. Conclusions
- Reducing the number of arbitrary resources/facts revealed for a story.
- DBpedia example: telling a story with better link estimation, in cases where the original algorithm did not make optimal choices of links.
- The most consistent output was generated with the Jaccard distance used both as weight and heuristic, or as heuristic in combination with the Jiang-Conrath distance as weight.
- The most arbitrary facts occur in a story when using the combined node degree as weight with the Jaccard distance as heuristic, both in the optimized and the original algorithm.
- User judgments confirm the findings for the Jiang-Conrath weight with the original algorithm, and for the Jaccard distance used as weight and heuristic in terms of discovery.
25. Next Steps
- Validate the correlation between the effect of the link estimation on the arbitrariness as perceived by users and computational semantic relatedness measures such as SemRank.
- Measure the scalability of the approach by implementing the algorithms:
  (i) solely on the client,
  (ii) completely on the server, and
  (iii) in a distributed client/server architecture.
Algorithmic storytelling can be seen as a particular kind of querying data. Given a set of keywords or entities, which are typically, but not necessarily, dissimilar, it aims at generating a story by explicitly relating the query context with a path that includes semantically related resources. Storytelling is used, for example, in entertaining applications and visualizations to enrich related Linked Data resources with data from multimedia archives and social media, as well as in scientific research fields such as bio-informatics, where biologists try to relate sets of genes arising from different experiments by investigating the implicated pathways, or discovering stories through linked books.
Research questions:
Relating to the title of the paper, we want to get an indication of the effect of using different heuristics and find suitable measures to gain insight into them:
3 components: [pathfinding] [serendipity] and [user judgments]
The aspects that make a story a good story are captured in the term serendipity. The term depicts a mixture of casual, lucky, helpful and unforeseen facts, also in an information context. In fact, users want to be surprised and they want to discover, confirm, and extend knowledge - but not feel unsure while doing so. This means that users should always be able to relate presented story facts to their background knowledge.
The most frequently encountered algorithm to determine a path between multiple resources is the A* algorithm. This algorithm, which is based on a graph representation of the underlying data (i.e., resources and links between them define nodes and edges, respectively), determines an optimal solution in the form of a lowest-cost traversable path between two resources (determined using a heuristic H and a weight W).
The optimality of a path, which is guaranteed by the A* algorithm, does not necessarily comply with the users' expectations.
Our algorithm reduces the arbitrariness of a path between these resources by increasing the relevance of the links between the nodes using a domain-delineation step. The path is refined by iteratively applying the A* algorithm, with each iteration attempting to improve the overall semantic relatedness between the resources, until a fixed number of iterations or a certain similarity threshold is reached.
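The refinement loop can be sketched as follows. This is a minimal illustration, not the paper's exact implementation: the dict-based graph, the `relatedness` function, and the threshold/iteration defaults are assumptions.

```python
import heapq

def a_star(graph, start, goal, W, H):
    """A* over a dict graph {node: [neighbor, ...]}, with pluggable
    edge weight W(a, b) and heuristic H(n, goal)."""
    frontier = [(H(start, goal), 0.0, start, [start])]
    best = {}
    while frontier:
        _, cost, node, path = heapq.heappop(frontier)
        if node == goal:
            return path
        if node in best and best[node] <= cost:
            continue
        best[node] = cost
        for nxt in graph.get(node, []):
            w = W(node, nxt)
            if w == float('inf'):  # link excluded by domain delineation
                continue
            c = cost + w
            heapq.heappush(frontier, (c + H(nxt, goal), c, nxt, path + [nxt]))
    return None

def refine_path(graph, start, goal, W, H, relatedness, threshold=0.8, max_iter=5):
    """Iteratively re-run A*, each time banning the least-related
    intermediate node, until the average relatedness of the path's
    inner nodes reaches the threshold or the iterations run out."""
    banned = set()
    W_b = lambda a, b: float('inf') if b in banned else W(a, b)
    path = a_star(graph, start, goal, W_b, H)
    for _ in range(max_iter):
        if path is None or len(path) <= 2:
            break
        inner = path[1:-1]
        if sum(map(relatedness, inner)) / len(inner) >= threshold:
            break
        banned.add(min(inner, key=relatedness))
        new_path = a_star(graph, start, goal, W_b, H)
        if new_path is None:
            break  # keep the last feasible path
        path = new_path
    return path
```

With uniform weights and a zero heuristic, `refine_path` steers the path away from a weakly related hop toward a better-related alternative while still returning a valid start-to-goal path.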
Unlike in the original core algorithm, the heuristic H and the weight W are not fixed.
Jaccard
A statistical approach taking into account the relative number of common predicates of two nodes. The higher the number of common predicates, the more likely the nodes have similar properties, and thus the semantically closer the corresponding nodes are in terms of distance.
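As a sketch, the Jaccard distance over two nodes' predicate sets (how the predicate sets are extracted from DBpedia is assumed, not shown):

```python
def jaccard_distance(preds_a, preds_b):
    """Jaccard distance between two nodes' predicate sets:
    1 - |A ∩ B| / |A ∪ B|; more shared predicates -> smaller distance."""
    a, b = set(preds_a), set(preds_b)
    if not a and not b:
        return 0.0  # two empty sets: treat as identical
    return 1.0 - len(a & b) / len(a | b)
```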
NDD
Like the normalized web distance, but with f being a different function: f(n) ∈ ℕ denotes the number of DBpedia nodes linking to node n ∈ G, and f(n, m) ∈ ℕ denotes the number of DBpedia nodes linking to both nodes n and m.
Confidence
An asymmetric statistical measure that can be thought of as the probability that node a occurs, provided that node b has already occurred.
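A minimal sketch, assuming the counts come from the link counts defined above for the NDD (f(a, b) nodes linking to both, f(b) nodes linking to b):

```python
def confidence(f_ab, f_b):
    """Confidence of node a given node b, estimated from link counts:
    roughly P(a | b) = f(a, b) / f(b). Asymmetric by construction,
    since confidence(a, b) divides by f(b) but confidence(b, a) by f(a)."""
    return f_ab / f_b if f_b else 0.0
```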
Jaccard (see prev)
JCW
Looks at the classes of each of the nodes and determines the most specific common denominator of those classes in the ontology. Once this type is determined, the number of subjects that exist with this type is divided by the total number of subjects. The higher this number, the more generic the class, and thus the more different the two nodes.
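A minimal sketch of this generality measure; the `type_counts` table and `total` count stand in for the ontology statistics and are hypothetical, as is the simplification of picking the shared class with the fewest instances as the "most common denominator":

```python
def class_generality_distance(classes_a, classes_b, type_counts, total):
    """Distance from the generality of the most specific class shared by
    both nodes: instances of that class divided by all subjects. A more
    generic shared class yields a larger distance."""
    common = set(classes_a) & set(classes_b)
    if not common:
        return 1.0  # no shared class: maximally different
    most_specific = min(common, key=lambda c: type_counts.get(c, total))
    return type_counts.get(most_specific, total) / total
```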
CND
Can be used to compute a weight that encourages rarity of items in a path. It ranks rarer resources higher, thereby guaranteeing that paths between resources prefer specific relations. The main idea is to avoid paths going via generic nodes. It makes use of the node degree, i.e., the number of in- and outgoing links.
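One way to sketch such a weight; the exact combination used in the paper is not spelled out here, so the log-sum below is an assumption:

```python
import math

def combined_node_degree_weight(deg_a, deg_b):
    """Edge weight from the degrees (in- plus outgoing links) of both
    endpoints: generic, highly connected nodes make an edge expensive,
    steering the pathfinder toward rare, specific resources."""
    return math.log(1 + deg_a) + math.log(1 + deg_b)
```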
f(x) and f(y) are the number of hits for search terms x and y, respectively; and f(x, y) is the number of web pages on which both x and y occur.
Semantic relatedness is improved (distance is decreased) in all cases except when entities were already closely related.
NGD: normalized web search distance implemented on top of Bing Search API.
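The underlying distance can be computed from hit counts as follows (the standard Cilibrasi–Vitányi normalized Google distance formula; fetching the counts from the Bing Search API is left out):

```python
import math

def ngd(fx, fy, fxy, N):
    """Normalized web search distance from hit counts f(x), f(y),
    f(x, y) and index size N:
    (max(log fx, log fy) - log fxy) / (log N - min(log fx, log fy))."""
    if fxy == 0:
        return float('inf')  # terms never co-occur: maximally distant
    lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(lx, ly) - lxy) / (math.log(N) - min(lx, ly))
```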
A set of paths is not a presentable story yet. We note that even if a path comprises just the start and destination (indicating they are linked via common hops or directly to each other), the story will contain interesting facts. This is because each step in the path is separated by at least one hop from the next node. For example, to present a story about Carl Linnaeus and Charles Darwin, the story could start from a path that goes via J.W. von Goethe. The resulting statements, which are relation-object statements, serve as the basic facts that make up the story. It is up to the application or visualization engine to present it to end-users and enrich it with descriptions, media or further facts.
We applied SemRank to evaluate the paths, in particular to capture the serendipity of each path. The serendipity is measured using a factor μ to indicate the so-called "refraction": how different each new step in a path is compared to the previous one, averaged over the entire path.
Furthermore, the information gain is modulated using the same factor μ. The information gain is computed from the weakest point along the path and an average of the rest.
* The information gain indicates how much new information is added over all the possibilities that were in fact available. It is averaged over all the steps (facts). If a fact is rarer, it will contribute relatively more than others at a certain point. This involves looking at the frequency of the predicates and the resources in the path, and how likely it was that a certain resource was included at a certain point in the path. In information theory, the amount of information contained in an event is measured by the negative logarithm of the probability of occurrence of the event.
For more details, refer to the SemRank paper:
Kemafor Anyanwu, Angela Maduko, and Amit Sheth. 2005. SemRank: ranking complex relationship search results on the semantic web. In Proceedings of the 14th international conference on World Wide Web (WWW '05). ACM, New York, NY, USA, 117-127. DOI=http://dx.doi.org/10.1145/1060745.1060766
* Average refraction R(p) of a path: how different each item is from the previous one, i.e. how much it contributes to the path so far.
There are three special cases [4]:
- conventional, with μ = 0, leading to SemRank(0, p) = I(p): serendipity plays no role, so no emphasis is put on newly gained or unexpected information;
- mixed, with μ = 0.5, leading to SemRank(0.5, p) = [I(p)/2] × [1 + R(p)/2]: a balance between unexpected and newly gained information;
- discovery, with μ = 1, leading to SemRank(1, p) = I(p) × [1 + R(p)]: emphasizing unexpected and newly gained information.
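The three special cases can be sketched as below. Here I(p) and R(p) are taken as precomputed numbers; the μ = 0.5 formula is reconstructed from a partly garbled slide and should be treated as an assumption, with Anyanwu et al. (2005) as the authoritative definition:

```python
def semrank(mu, info_gain, refraction):
    """SemRank score of a path from its information gain I(p) and
    average refraction R(p), for the three special values of mu."""
    if mu == 0:  # conventional: serendipity plays no role
        return info_gain
    if mu == 1:  # discovery: emphasize unexpected information
        return info_gain * (1 + refraction)
    # mixed (mu = 0.5): balance newly gained and unexpected information
    return (info_gain / 2) * (1 + refraction / 2)
```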
On the one hand, the conventional and mixed modes for SemRank put less emphasis on novelty and focus mainly on semantic association and information content. The Jaccard distance combination used as weight and heuristic is, not entirely surprisingly, the best choice for this scenario. On the other hand, the result of the original algorithm making use of the common node degree as weight together with the Jaccard distance is confirmed by the results of the improved algorithm with the common node degree, although with a slightly lower rank in the new algorithm. Using the JCW, however, leads to even higher ranks. In terms of discovery, the original algorithm outperforms the JaccardJaccard combination. The CNDJaccard improved algorithm is able to slightly outperform all the other combinations.
The effect of the heuristics according to user judgments, compared to the overall median. The JCWJaccard combination confirms the already good results with SemRank. The CNDJaccard combination scores relatively well too.
The scores for relevancy, consistency and discovery as unexpected - but relevant - facts depend highly on the user who judges. Some users might in some cases be interested in the more trivial path as well. Nevertheless, we used the overall judgment as a baseline to compare the judgments with the same combinations of heuristics and weights as before.
The suggested stories that center around a certain via-fact are not always considered relevant by users, even though the algorithms might consider them so.
We proposed an optimized pathfinding algorithm for storytelling that reduces the number of arbitrary resources revealed in the paths contained in the story. Preliminary evaluation results using the DBpedia dataset indicate that our proposal succeeds in telling a story featuring better link estimation, especially in cases where the previous algorithm did not make seemingly optimal choices of links. By defining stories as chains of links in Linked Data, we optimized the storytelling algorithm and tested it with several heuristics and weights. The most consistent output was generated with the Jaccard distance used both as weight and heuristic, or as heuristic in combination with the Jiang-Conrath distance as weight. The most arbitrary facts occur in a story when using the combined node degree as weight with the Jaccard distance as heuristic, both in the optimized and the original algorithm. User judgments confirm the findings for the Jiang-Conrath weight with the original algorithm, and for the Jaccard distance used as weight and heuristic in terms of discovery. However, according to the users, there is no clear positive effect in terms of consistency and relevancy.
Future work will focus on validating the correlation between the effect of the link estimation on the arbitrariness as perceived by users and computational semantic relatedness measures such as SemRank. Additionally, we will measure the scalability of our approach by implementing the algorithms (i) solely on the client, (ii) completely on the server, and (iii) in a distributed client/server architecture.