This document discusses Lora Aroyo's work on using events and narratives to enhance access to cultural heritage collections. It describes early projects that linked cultural objects to events and entities to provide more context and engagement for online users. This led to work modeling historical events and extracting event properties and relationships to generate "proto-narratives". Later projects like DIVE and DIVE+ developed event-centric exploratory search tools and media suites. More recent efforts focus on crowdsourcing event tagging and curating to further engage audiences and remix archival stories. A key challenge discussed is the lack of standardized event vocabularies across cultural heritage communities.
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons with DIVE+Lora Aroyo
The document discusses the DIVE+ project which aims to create new cultural commons by developing tools to help users navigate and explore vast amounts of digital cultural heritage content in an engaging way. It presents the DIVE browser which links objects through events and collects crowd-sourced perspectives. This approach moves cultural institutions from being inventories to places that support engagement and multiple viewpoints. The goal is to make cultural content more accessible and bring users, collections, and distributed content together in a smart, open, and connected way.
Semantic Digital Humanities Workshop 2015 @OxfordLora Aroyo
Lora Aroyo presents on open, connected, and smart heritage and new cultural commons. She discusses how crowdsourcing can be used to gather diverse perspectives from users to expand expert vocabularies and gather new types of metadata. Three case studies are presented: crowdsourcing video tags at Sound and Vision, where 340,551 tags were added by 555 registered users; tagging 1,782 works of art across 11 museums, gathering 36,981 tags from 2,017 users; and the Waisda project, where user tags improved search accuracy by 53% compared to consensus tags alone.
Towards Better Media Understanding and Searchabilityoanainel
This document discusses improving media understanding and searchability. It mentions CrowdTruth.org, DIVE+, and CLARIAH, which are projects aimed at using crowdsourcing and linked open data to better analyze and organize media like images, video, and audio. DIVE+ is part of both CLARIAH and Beeldengeluid, and provides a platform for analyzing multimedia content.
Keynote at International Conference of Art Libraries 2018 @RijksmuseumLora Aroyo
Lora Aroyo presents on data science for smart cultural heritage. Some key points:
- Cultural heritage organizations are traditionally seen as inventories but aim to engage people.
- Bringing collections online increased access but interpretation was still needed for engagement.
- Data should be at the center of processes to evolve with users. There is a spectrum of truth, not just one view.
Historically Speaking, Digital Humanities, EWallis July 2012Elycia Wallis
Digital humanities combines traditional humanistic study with digital tools and methods. It values collaboration and sharing through open data. Museums and other cultural institutions are digitizing their collections, making vast amounts of data and resources available online. This allows new types of research, projects, and tools to develop. Digital humanities practitioners encourage opening data with permissive licenses to maximize reuse and partnerships.
The document discusses DIVE+, a tool that aims to address the issues of audiences feeling disconnected from the massive amounts of digital cultural content available by bringing them back into the driving seat of exploration. It does this through an event-centric browser called DIVE that allows linking objects through events and collecting diverse perspectives from crowdsourcing to support different points of view. The goal is to shift museums and archives from being mere inventories to places that engage users through event narratives and an infinity of exploration options in linked open data.
This document summarizes a study on using social and mobile technologies during a school trip to a museum. The study investigated how technologies can shape young people's learning processes and facilitate meaning making. A class visited the Museum of London and used iPhones and Twitter to communicate, share content, and reflect during their visit. Interviews after found that students engaged more and interaction was facilitated by seeing others' tweets, photos and opinions. The use of technologies made the museum experience more interactive and enjoyable compared to traditional trips.
DIVE+ @ NLeSymposium 2015: Towards New Cultural Commons with DIVE+Lora Aroyo
The document discusses the DIVE+ project which aims to create new cultural commons by developing tools to help users navigate and explore vast amounts of digital cultural heritage content in an engaging way. It presents the DIVE browser which links objects through events and collects crowd-sourced perspectives. This approach moves cultural institutions from being inventories to places that support engagement and multiple viewpoints. The goal is to make cultural content more accessible and bring users, collections, and distributed content together in a smart, open, and connected way.
Semantic Digital Humanities Workshop 2015 @OxfordLora Aroyo
Lora Aroyo presents on open, connected, and smart heritage and new cultural commons. She discusses how crowdsourcing can be used to gather diverse perspectives from users to expand expert vocabularies and gather new types of metadata. Three case studies are presented: crowdsourcing video tags at Sound and Vision, where 340,551 tags were added by 555 registered users; tagging 1,782 works of art across 11 museums, gathering 36,981 tags from 2,017 users; and the Waisda project, where user tags improved search accuracy by 53% compared to consensus tags alone.
Towards Better Media Understanding and Searchabilityoanainel
This document discusses improving media understanding and searchability. It mentions CrowdTruth.org, DIVE+, and CLARIAH, which are projects aimed at using crowdsourcing and linked open data to better analyze and organize media like images, video, and audio. DIVE+ is part of both CLARIAH and Beeldengeluid, and provides a platform for analyzing multimedia content.
Keynote at International Conference of Art Libraries 2018 @RijksmuseumLora Aroyo
Lora Aroyo presents on data science for smart cultural heritage. Some key points:
- Cultural heritage organizations are traditionally seen as inventories but aim to engage people.
- Bringing collections online increased access but interpretation was still needed for engagement.
- Data should be at the center of processes to evolve with users. There is a spectrum of truth, not just one view.
Historically Speaking, Digital Humanities, EWallis July 2012Elycia Wallis
Digital humanities combines traditional humanistic study with digital tools and methods. It values collaboration and sharing through open data. Museums and other cultural institutions are digitizing their collections, making vast amounts of data and resources available online. This allows new types of research, projects, and tools to develop. Digital humanities practitioners encourage opening data with permissive licenses to maximize reuse and partnerships.
The document discusses DIVE+, a tool that aims to address the issues of audiences feeling disconnected from the massive amounts of digital cultural content available by bringing them back into the driving seat of exploration. It does this through an event-centric browser called DIVE that allows linking objects through events and collecting diverse perspectives from crowdsourcing to support different points of view. The goal is to shift museums and archives from being mere inventories to places that engage users through event narratives and an infinity of exploration options in linked open data.
This document summarizes a study on using social and mobile technologies during a school trip to a museum. The study investigated how technologies can shape young people's learning processes and facilitate meaning making. A class visited the Museum of London and used iPhones and Twitter to communicate, share content, and reflect during their visit. Interviews after found that students engaged more and interaction was facilitated by seeing others' tweets, photos and opinions. The use of technologies made the museum experience more interactive and enjoyable compared to traditional trips.
DIVE+: Explorative Search for Digital HumanitiesJohan Oomen
DIVE+ is an event-centric linked data digital collection browser aimed to provide an integrated and interactive access to multimedia objects from various heterogeneous online collections. It enriches the structured metadata of online collections with linked open data vocabularies with focus on events, people, locations and concepts that are depicted or associated with particular collection objects. DIVE+ is result of a true inter-disciplinary collaboration between computer scientists, humanities scholars, cultural heritage professionals and interaction designers. The tool allows humanities scholars to explore unexpected relations between entities and media objects and to construct and share navigation paths to develop research narratives.
The document discusses how Danish archives are using social media and other digital technologies. It notes that most archives have a Facebook presence but with fewer than 500 likes. Archives are also experimenting with Flickr, Twitter, YouTube and other platforms. Crowdsourcing projects have indexed over 15 million records. Emerging areas include augmented reality, mobile apps, and opening up archives through APIs and web services. Overall archives are still learning how best to engage online audiences and connect digital and physical resources.
Essential Online Tools for Historical Societiesvtrural
Presentation to the 2013 League of Local Historical Societies & Museums Annual Meeting Building on a Strong Foundation - Friday, November 1, 2013 - Barre, Vermont
What online tools are must-haves for your local historical society? You are invited to attend a discussion on how local historical societies are using digital tools to spread history, recruit volunteers, document events, and archive their collections. Specific topics of discussion will include getting your historical society online, using social media (Facebook) for education, recruitment and research, and the importance of archiving your collection on the “cloud.
Presentation by: Rob Fish, Vermont Digital Economy Project and Adriene Katz, Shelburne Museum.
Presentation to the CURSO DE VERANO
Bilbao Arte eta Kultura UPV/EHU: museos, redes sociales y tecnología 2.0 (museums, social networks and 2.0 technology)
6-7 July 2010 at the invitation of the University of the Basque Country.
http://tubilbao.blogspot.com/2010/06/bak-uda-ikastaroa-curso-de-verano.html
This document discusses interactivity in museums through digital engagement and interactive installations. It describes how museums use their online presence to provide pre-visit and post-visit experiences for visitors. Examples are given of interactive installations that use technologies like Arduino, Kinect, and processing to create immersive experiences. The document also outlines three hands-on activities for groups to design an interactive museum prototype or create content for a museum website.
DIVE+ is an exploratory search tool for digital humanities research in CLARIAH Media Suite. It provides an event-centric browser for linked historical data that links objects to events and entities and builds automatic storylines. It integrates access to heterogeneous cultural heritage collections, including over 15 million triples from sources like news broadcasts, scans of radio bulletins, images, and metadata. DIVE+ allows exploring media collections, enriching metadata with historical events, and collecting crowd perspectives to support research.
EUScreen XL 2014 Conference: DIVE In Digital HermeneuticsLora Aroyo
My talk on "The Event-based Browsing Of Linked Historical Media" @EUScreenXL 2014 Conference
http://dive.beeldengeluid.nl
http://blog.euscreen.eu/archives/5607
The document discusses digital hermeneutics, which is a theory of interpreting information by bringing people and technology together. It describes using a simple event model, open annotation, and SKOS to model and link historical media like events, places, times, actors, and concepts. This helps engage users by supporting browsing and exploration of the linked historical data through event narratives. It also discusses using crowdsourcing to extract entities, events and perspectives from cultural heritage collections and linking them to improve discovery for digital humanities researchers.
Expand. Learn. Interact: Enabling Digital HumanitiesLora Aroyo
This document summarizes Lora Aroyo's presentation on enabling digital humanities. The presentation discusses expanding human cognition with technology, teaching machines through crowdsourcing diverse interpretations, and engaging users through novel interfaces like event-based browsing of linked historical media. The goals are to support interpretation, gather human semantics, and allow natural interaction. Challenges and the road ahead are also outlined.
This document discusses open, connected, and smart cultural heritage. It proposes using crowdsourcing and machine-human computation to support multiple perspectives in interpreting digital cultural content. Disagreement between human annotators is seen as essential for helping machines with semantic interpretation. The CrowdTruth platform and software is presented as a way to harness perspectives through crowdsourced annotation and analytics. Specific use cases are described for exploring audiovisual archives about historical events in the Netherlands.
How do you know what you are looking for?Shawn Day
This document discusses finding and capturing digital cultural heritage from various sources on the internet. It provides an overview of Europeana, a digital library that aims to give access to Europe's digitized cultural heritage by 2025. The document notes barriers to digitization like intellectual property and issues around metadata. It also lists some sources of data like Europeana, Project Gutenberg, and NINES as well as tools used for digitization like spiders and Getty vocabularies. Next steps discussed are analyzing digitized collections and upcoming readings on analyzing ship logs, images from the web, and comparing web archives.
The document discusses using crowdsourcing to understand events in cultural heritage collections by harnessing disagreement among annotators. It proposes a CrowdTruth framework where multiple annotators label examples to capture varying perspectives, and the disagreements are represented and can be learned from by machines. This helps address issues with traditional gold standards which assume a single correct answer and aims to scale understanding of vague concepts like events where human interpretation varies.
The document discusses the Powerhouse Museum's journey towards open access and digital engagement. It outlines the museum's revenue model, key audiences, and guiding digital principles of being findable, meaningful, responsive, usable, and available everywhere. The museum launched an open access image repository in 2005 which saw rapid growth in views and engagement from the Flickr community. This led to positive outcomes like more engagement with collections, effective delivery of education, leveraging community interest in research, and an institutional shift towards default creative commons rights.
The many unexptected joys if being "out there": examples of user participatio...Johan Oomen
Contribution as part of the SXSW 2014 panel "100 Years of Oversharing: Tools for Time Travel" - http://schedule.sxsw.com/2014/events/event_IAP21645 @johanoomen
A typed journal from WWI passed on through generations fuels a young man's dreams of time travel and allows us to explore the power of personal stories and photos. Together with archival collections, these items take us through space and time, and the magical ability of cultural memory institutions to help individuals bring these incredibly compelling dreams to life. The World Wide Web provides the cultural, technological, and legal frameworks to open the doors to innovation and imagination, and also enables libraries, archives and museums the world over to play a critical role. We explore some of the diverse efforts to bring stories and memory to life in new ways, while also fostering open data and preservation, and the pros and cons at the intersection of public domain and private enterprise.
Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...Lora Aroyo
The document discusses using both experts and crowdsourcing to enrich cultural heritage knowledge. It describes four research projects: 1) extracting knowledge from social tagging, 2) harnessing non-expert medical knowledge, 3) obtaining expert knowledge through nichesourcing, and 4) capturing event knowledge. The goal is to develop scalable and reliable methods of capturing human knowledge to improve machine systems.
Open, Connected & Smart Heritage: Towards New Cultural CommonsLora Aroyo
The document discusses open, connected, and smart approaches to cultural heritage. It argues that new technologies can help provide better access to vast amounts of digital cultural content. Specifically, it presents examples of projects that use crowdsourcing to engage users in indexing, tagging, and interpreting cultural works. This allows cultural institutions to enrich their collections by tapping into the knowledge and perspectives of online audiences. The goal is to make cultural heritage more accessible and to shape new forms of online engagement with the cultural record.
Citizen Science & Geographical Technologies: creativity, learning, and engage...Muki Haklay
These slides are from a keynote talk at the Esri Education User Conference in 2016, about citizen science and extreme citizen science, and their link to geographical technologies
The document discusses the partnership between The Children's Museum of Indianapolis and Wikimedia to improve Wikipedia articles related to the museum. It provides examples of other museums that have participated in Wikipedian in Residence programs, including the British Museum, Smithsonian Institution, and National Archives. It outlines the benefits of collaborating with Wikipedia, such as reaching wider audiences, engaging volunteers, and improving online information. Strategies discussed include hosting edit-a-thons, backstage tours, student programs, multilingual projects using QR codes, and connecting with educators. Next steps mentioned are supporting the GLAM-Wiki community and diversifying engagement.
DIVE+: Explorative Search for Digital HumanitiesJohan Oomen
DIVE+ is an event-centric linked data digital collection browser aimed to provide an integrated and interactive access to multimedia objects from various heterogeneous online collections. It enriches the structured metadata of online collections with linked open data vocabularies with focus on events, people, locations and concepts that are depicted or associated with particular collection objects. DIVE+ is result of a true inter-disciplinary collaboration between computer scientists, humanities scholars, cultural heritage professionals and interaction designers. The tool allows humanities scholars to explore unexpected relations between entities and media objects and to construct and share navigation paths to develop research narratives.
The document discusses how Danish archives are using social media and other digital technologies. It notes that most archives have a Facebook presence but with fewer than 500 likes. Archives are also experimenting with Flickr, Twitter, YouTube and other platforms. Crowdsourcing projects have indexed over 15 million records. Emerging areas include augmented reality, mobile apps, and opening up archives through APIs and web services. Overall archives are still learning how best to engage online audiences and connect digital and physical resources.
Essential Online Tools for Historical Societiesvtrural
Presentation to the 2013 League of Local Historical Societies & Museums Annual Meeting Building on a Strong Foundation - Friday, November 1, 2013 - Barre, Vermont
What online tools are must-haves for your local historical society? You are invited to attend a discussion on how local historical societies are using digital tools to spread history, recruit volunteers, document events, and archive their collections. Specific topics of discussion will include getting your historical society online, using social media (Facebook) for education, recruitment and research, and the importance of archiving your collection on the “cloud.
Presentation by: Rob Fish, Vermont Digital Economy Project and Adriene Katz, Shelburne Museum.
Presentation to the CURSO DE VERANO
Bilbao Arte eta Kultura UPV/EHU: museos, redes sociales y tecnología 2.0 (museums, social networks and 2.0 technology)
6-7 July 2010 at the invitation of the University of the Basque Country.
http://tubilbao.blogspot.com/2010/06/bak-uda-ikastaroa-curso-de-verano.html
This document discusses interactivity in museums through digital engagement and interactive installations. It describes how museums use their online presence to provide pre-visit and post-visit experiences for visitors. Examples are given of interactive installations that use technologies like Arduino, Kinect, and processing to create immersive experiences. The document also outlines three hands-on activities for groups to design an interactive museum prototype or create content for a museum website.
DIVE+ is an exploratory search tool for digital humanities research in CLARIAH Media Suite. It provides an event-centric browser for linked historical data that links objects to events and entities and builds automatic storylines. It integrates access to heterogeneous cultural heritage collections, including over 15 million triples from sources like news broadcasts, scans of radio bulletins, images, and metadata. DIVE+ allows exploring media collections, enriching metadata with historical events, and collecting crowd perspectives to support research.
EUScreen XL 2014 Conference: DIVE In Digital HermeneuticsLora Aroyo
My talk on "The Event-based Browsing Of Linked Historical Media" @EUScreenXL 2014 Conference
http://dive.beeldengeluid.nl
http://blog.euscreen.eu/archives/5607
The document discusses digital hermeneutics, which is a theory of interpreting information by bringing people and technology together. It describes using a simple event model, open annotation, and SKOS to model and link historical media like events, places, times, actors, and concepts. This helps engage users by supporting browsing and exploration of the linked historical data through event narratives. It also discusses using crowdsourcing to extract entities, events and perspectives from cultural heritage collections and linking them to improve discovery for digital humanities researchers.
Expand. Learn. Interact: Enabling Digital HumanitiesLora Aroyo
This document summarizes Lora Aroyo's presentation on enabling digital humanities. The presentation discusses expanding human cognition with technology, teaching machines through crowdsourcing diverse interpretations, and engaging users through novel interfaces like event-based browsing of linked historical media. The goals are to support interpretation, gather human semantics, and allow natural interaction. Challenges and the road ahead are also outlined.
This document discusses open, connected, and smart cultural heritage. It proposes using crowdsourcing and machine-human computation to support multiple perspectives in interpreting digital cultural content. Disagreement between human annotators is seen as essential for helping machines with semantic interpretation. The CrowdTruth platform and software is presented as a way to harness perspectives through crowdsourced annotation and analytics. Specific use cases are described for exploring audiovisual archives about historical events in the Netherlands.
How do you know what you are looking for?Shawn Day
This document discusses finding and capturing digital cultural heritage from various sources on the internet. It provides an overview of Europeana, a digital library that aims to give access to Europe's digitized cultural heritage by 2025. The document notes barriers to digitization like intellectual property and issues around metadata. It also lists some sources of data like Europeana, Project Gutenberg, and NINES as well as tools used for digitization like spiders and Getty vocabularies. Next steps discussed are analyzing digitized collections and upcoming readings on analyzing ship logs, images from the web, and comparing web archives.
The document discusses using crowdsourcing to understand events in cultural heritage collections by harnessing disagreement among annotators. It proposes a CrowdTruth framework where multiple annotators label examples to capture varying perspectives, and the disagreements are represented and can be learned from by machines. This helps address issues with traditional gold standards which assume a single correct answer and aims to scale understanding of vague concepts like events where human interpretation varies.
The document discusses the Powerhouse Museum's journey towards open access and digital engagement. It outlines the museum's revenue model, key audiences, and guiding digital principles of being findable, meaningful, responsive, usable, and available everywhere. The museum launched an open access image repository in 2005 which saw rapid growth in views and engagement from the Flickr community. This led to positive outcomes like more engagement with collections, effective delivery of education, leveraging community interest in research, and an institutional shift towards default creative commons rights.
The many unexptected joys if being "out there": examples of user participatio...Johan Oomen
Contribution as part of the SXSW 2014 panel "100 Years of Oversharing: Tools for Time Travel" - http://schedule.sxsw.com/2014/events/event_IAP21645 @johanoomen
A typed journal from WWI passed on through generations fuels a young man's dreams of time travel and allows us to explore the power of personal stories and photos. Together with archival collections, these items take us through space and time, and the magical ability of cultural memory institutions to help individuals bring these incredibly compelling dreams to life. The World Wide Web provides the cultural, technological, and legal frameworks to open the doors to innovation and imagination, and also enables libraries, archives and museums the world over to play a critical role. We explore some of the diverse efforts to bring stories and memory to life in new ways, while also fostering open data and preservation, and the pros and cons at the intersection of public domain and private enterprise.
Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...Lora Aroyo
The document discusses using both experts and crowdsourcing to enrich cultural heritage knowledge. It describes four research projects: 1) extracting knowledge from social tagging, 2) harnessing non-expert medical knowledge, 3) obtaining expert knowledge through nichesourcing, and 4) capturing event knowledge. The goal is to develop scalable and reliable methods of capturing human knowledge to improve machine systems.
Open, Connected & Smart Heritage: Towards New Cultural CommonsLora Aroyo
The document discusses open, connected, and smart approaches to cultural heritage. It argues that new technologies can help provide better access to vast amounts of digital cultural content. Specifically, it presents examples of projects that use crowdsourcing to engage users in indexing, tagging, and interpreting cultural works. This allows cultural institutions to enrich their collections by tapping into the knowledge and perspectives of online audiences. The goal is to make cultural heritage more accessible and to shape new forms of online engagement with the cultural record.
Citizen Science & Geographical Technologies: creativity, learning, and engage...Muki Haklay
These slides are from a keynote talk at the Esri Education User Conference in 2016, about citizen science and extreme citizen science, and their link to geographical technologies
The document discusses the partnership between The Children's Museum of Indianapolis and Wikimedia to improve Wikipedia articles related to the museum. It provides examples of other museums that have participated in Wikipedian in Residence programs, including the British Museum, Smithsonian Institution, and National Archives. It outlines the benefits of collaborating with Wikipedia, such as reaching wider audiences, engaging volunteers, and improving online information. Strategies discussed include hosting edit-a-thons, backstage tours, student programs, multilingual projects using QR codes, and connecting with educators. Next steps mentioned are supporting the GLAM-Wiki community and diversifying engagement.
Forever In Between : similarities and differences, opportunities and responsa...Saskia Scheltjens
Saskia Scheltjens discussed the evolving landscape of libraries, archives, and museums (LAMs) in an era of digitization and open data. She outlined both the similarities and differences between these communities, as well as opportunities for greater collaboration. While LAMs historically developed separately, there is now a push for convergence driven by the increasing availability of digital information. However, fully integrating these communities remains an ongoing challenge. Scheltjens highlighted several promising projects and principles that could help facilitate more cooperation between LAMs in the future.
This document outlines six steps for making risky participatory projects possible in museums: 1) Connect the idea to the museum's mission and goals. 2) Find the right tool for the idea. 3) Align the strategy with institutional culture. 4) Plan for evaluation of outcomes. 5) Reserve necessary resources for operation. 6) Develop a support network to help tend to the project. Examples are provided for each step to illustrate how others have successfully implemented participatory initiatives.
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningLora Aroyo
The document introduces CATS4ML, a crowdsourcing challenge to discover blindspots in machine learning models by having participants label images in the Open Images Dataset that are incorrectly labeled by AI. The goal is to crowdsource adverse test sets that can capture biases and improve evaluation of AI. The challenge runs through April 2021 and invites individuals and teams to discover interesting mislabeled images and contribute them for review and inclusion in the test sets. Winning contributions will be promoted at the next CrowdCamp conference.
Harnessing Human Semantics at Scale (updated)Lora Aroyo
The document appears to be a series of tweets and posts by Lora Aroyo discussing data science and crowdsourcing techniques. Some key points discussed include harnessing human semantics at scale through crowdsourcing and nichesourcing, measuring quality and reproducibility of crowdsourced results, and experimenting with different task designs and payment models to assess their impact. Specific examples mentioned include using crowdsourcing to add detailed annotations to museum collections and to find "blindspots" in AI models through a data challenge.
Data excellence: Better data for better AILora Aroyo
The document discusses the importance of data quality and a data lifecycle approach for artificial intelligence. Some key points made include:
- A data lifecycle is needed to guide best practices for data research and development, similar to how a software lifecycle guides software engineering.
- Data quality must be addressed through practices and standards to help avoid unintended AI behaviors that can result from low quality data.
- Disagreement in annotation tasks can provide valuable signals about ambiguity and diversity rather than just being considered noise.
- Achieving high quality, reliable data requires consideration of aspects like validity, fidelity, reproducibility and maintaining data over time - an approach toward "data excellence".
This document summarizes the CHIP project, which aims to use semantic metadata about cultural heritage objects to improve personalized access and recommendations for museum visitors. The CHIP approach involves making metadata and vocabularies available as RDF/OWL, aligning and enriching the data, and using it to build a combined user model for generating virtual and physical museum tours. Experiments show semantic relations can enhance content-based recommendations for novices and experts. Follow-up projects include Agora, deploying the techniques at the Rijksmuseum in Amsterdam.
The Rijksmuseum Collection as Linked DataLora Aroyo
Presentation at ISWC2018: http://iswc2018.semanticweb.org/sessions/the-rijksmuseum-collection-as-linked-data/ of our paper published originally in the Semantic Web Journal: http://www.semantic-web-journal.net/content/rijksmuseum-collection-linked-data-2
Many museums are currently providing online access to their collections. The state of the art research in the last decade shows that it is beneficial for institutions to provide their datasets as Linked Data in order to achieve easy cross-referencing, interlinking and integration. In this paper, we present the Rijksmuseum linked dataset (accessible at http://datahub.io/dataset/rijksmuseum), along with collection and vocabulary statistics, as well as lessons learned from the process of converting the collection to Linked Data. The version of March 2016 contains over 350,000 objects, including detailed descriptions and high-quality images released under a public domain license.
FAIRview: Responsible Video Summarization @NYCML'18Lora Aroyo
Presentation at the NYC Media Lab (NYCML2018). There is a growing demand for news videos online, with more consumers preferring to watch the news than read or listen to it. On the publisher side, there is a growing effort to use video summarization technology in order to create easy-to-consume previews (trailers) for different types of broadcast programs. How can we measure the quality of video summaries and their potential to misinform? This workshop will inform participants about automatic video summarization algorithms and how to produce more “representative” video summaries. The research presented is from the FAIRview project and is supported by the Digital News Innovation Fund (DNI Fund), which is part of the Google News Initiative.
Digital Humanities Benelux 2017: Keynote Lora AroyoLora Aroyo
This document discusses harnessing human semantics at scale through crowdsourcing and nichesourcing. It addresses making crowdsourcing efforts measurable, reproducible, engaging and sustainable. Some key points discussed are identifying crowdsourcing goals, assessing the impact of task and result designs, measuring quality and progress over time, and running continuous campaigns to reproduce and sustain results at scale.
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...Lora Aroyo
Lora Aroyo, Chiel van den Akker, Marnix van Berchum, Lodewijk
Petram, Gerard Kuys, Tommaso Caselli, Jacco van Ossenbruggen, Victor de Boer, Sabrina Sauer, Berber Hagedoorn
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Lora Aroyo
The process of gathering ground truth data through human annotation is a major bottleneck in the use of information extraction methods. Crowdsourcing-based approaches are gaining popularity in the attempt to solve the issues related to the volume of data and lack of annotators. Typically these practices use inter-annotator agreement as a measure of quality. However, this assumption often creates issues in practice. Previous experiments we performed found that inter-annotator disagreement is usually never captured, either because the number of annotators is too small to capture the full diversity of opinion, or because the crowd data is aggregated with metrics that enforce consensus, such as majority vote. These practices create artificial data that is neither general nor reflects the ambiguity inherent in the data.
To address these issues, we proposed the method for crowdsourcing ground truth by harnessing inter-annotator disagreement. We present an alternative approach for crowdsourcing ground truth data that, instead of enforcing an agreement between annotators, captures the ambiguity inherent in semantic annotation through the use of disagreement-aware metrics for aggregating crowdsourcing responses. Based on this principle, we have implemented the CrowdTruth framework for machine-human computation, that first introduced the disagreement-aware metrics and built a pipeline to process crowdsourcing data with these metrics.
In this paper, we apply the CrowdTruth methodology to collect data over a set of diverse tasks: medical relation extraction, Twitter event identification, news event extraction and sound interpretation. We prove that capturing disagreement is essential for acquiring a high-quality ground truth. We achieve this by comparing the quality of the data aggregated with CrowdTruth metrics with a majority vote, a method which enforces consensus among annotators. By applying our analysis over a set of diverse tasks we show that, even though ambiguity manifests differently depending on the task, our theory of inter-annotator disagreement as a property of ambiguity is generalizable.
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneLora Aroyo
Ambiguity in interpreting signs is not a new idea, yet the vast majority of research in machine interpretation of signals such as speech, language, images, video, audio, etc., tend to ignore ambiguity. This is evidenced by the fact that metrics for quality of machine understanding rely on a ground truth, in which each instance (a sentence, a photo, a sound clip, etc) is assigned a discrete label, or set of labels, and the machine’s prediction for that instance is compared to the label to determine if it is correct. This determination yields the familiar precision, recall, accuracy, and f-measure metrics, but clearly presupposes that this determination can be made. CrowdTruth is a form of collective intelligence based on a vector representation that accommodates diverse interpretation perspectives and encourages human annotators to disagree with each other, in order to expose latent elements such as ambiguity and worker quality. In other words, CrowdTruth assumes that when annotators disagree on how to label an example, it is because the example is ambiguous, the worker isn’t doing the right thing, or the task itself is not clear. In previous work on CrowdTruth, the focus was on how the disagreement signals from low quality workers and from unclear tasks can be isolated. Recently, we observed that disagreement can also signal ambiguity. The basic hypothesis is that, if workers disagree on the correct label for an example, then it will be more difficult for a machine to classify that example. The elaborate data analysis to determine if the source of the disagreement is ambiguity supports our intuition that low clarity signals ambiguity, while high clarity sentences quite obviously express one or more of the target relations. In this talk I will share the experiences and lessons learned on the path to understanding diversity in human interpretation and the ways to capture it as ground truth to enable machines to deal with such diversity.
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
Software systems are becoming ever more intelligent and more useful, but the way we interact with these machines too often reveals that they don’t actually understand people. Knowledge Representation and Semantic Web focus on the scientific challenges involved in providing human knowledge in machine-readable form. However, we observe that various types of human knowledge cannot yet be captured by machines, especially when dealing with wide ranges of real-world tasks and contexts. The key scientific challenge is to provide an approach to capturing human knowledge in a way that is scalable and adequate to real-world needs. Human Computation has begun to scientifically study how human intelligence at scale can be used to methodologically improve machine-based knowledge and data management. My research is focusing on understanding human computation for improving how machine-based systems can acquire, capture and harness human knowledge and thus become even more intelligent. In this talk I will show how the CrowdTruth framework (http://crowdtruth.org) facilitates data collection, processing and analytics of human computation knowledge.
Some project links:
- http://controcurator.org/
- http://crowdtruth.org/
- http://diveproject.beeldengeluid.nl/
- http://vu-amsterdam-web-media-group.github.io/linkflows/
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital AgeLora Aroyo
The document discusses harnessing crowds, niches, and professionals in the digital age. The key points are:
- Software is becoming less important as data takes center stage; cultural institutions must know their data and crowds.
- Different crowds have different expertise and abilities; nichesourcing can access specialized knowledge.
- Crowdsourcing initiatives should be part of an overall strategy and integrated into existing systems.
- Novel interactions and user-driven augmentations can empower users and align the digital and physical.
"Video Killed the Radio Star": From MTV to SnapchatLora Aroyo
The document discusses bridging the gap between people and the massive amount of online multimedia content. It proposes decomposing videos and images into smaller fragments and building a media graph to link these fragments based on semantic relationships. Both machine learning and crowdsourcing are used to analyze and enrich media with metadata at scale. The goal is to turn "mute" images and context-free videos into relationship-aware media that allows nonlinear exploration. This would provide a more engaging experience for online audiences.
This document summarizes the key details of UMAP 2022, including:
- UMAP is moving to ACM as its new sponsor and publisher after many years with other organizations.
- It features a new presentation model of short talks and posters.
- It received 123 submissions from various countries, with acceptance rates ranging from 23.9% to 27.6% across categories.
- The program committee included 132 members who conducted a rigorous review process with at least 3 reviews per submission.
- 128 people from Europe, North America, Asia, Australia, and South America registered to attend the conference.
Stitch by Stitch: Annotating Fashion at the RijksmuseumLora Aroyo
https://www.rijksmuseum.nl/en/stitch-by-stitch
http://annotate.accurator.nl/
Fashion can be found everywhere in museums. Fashion heritage collected over centuries: costumes, accessories, paintings, prints and photographs. But while some clothes and accessories are easily found and identified, others are obscure and require a trained eye to describe. What are we looking at? What kind of sleeve is this? Which materials and techniques have been used? More specific descriptions of the images facilitate better use of digital collections and enable users to wander through them in detail.
Skybuffer SAM4U tool for SAP license adoptionTatiana Kojar
Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool.
SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on integration of Salesforce with Bonterra Impact Management.
Interested in deploying an integration with Salesforce for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Ivanti’s Patch Tuesday breakdown goes beyond patching your applications and brings you the intelligence and guidance needed to prioritize where to focus your attention first. Catch early analysis on our Ivanti blog, then join industry expert Chris Goettl for the Patch Tuesday Webinar Event. There we’ll do a deep dive into each of the bulletins and give guidance on the risks associated with the newly-identified vulnerabilities.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfflufftailshop
When it comes to unit testing in the .NET ecosystem, developers have a wide range of options available. Among the most popular choices are NUnit, XUnit, and MSTest. These unit testing frameworks provide essential tools and features to help ensure the quality and reliability of code. However, understanding the differences between these frameworks is crucial for selecting the most suitable one for your projects.
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/
DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen!
Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell.
Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten.
Diese Themen werden behandelt
- Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und überflüssigen Konten
- Wie funktionieren CCB- und CCX-Lizenzen wirklich?
- Verstehen des DLAU-Tools und wie man es am besten nutzt
- Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw.
- Praxisbeispiele und Best Practices zum sofortigen Umsetzen
Trusted Execution Environment for Decentralized Process MiningLucaBarbaro3
Presentation of the paper "Trusted Execution Environment for Decentralized Process Mining" given during the CAiSE 2024 Conference in Cyprus on June 7, 2024.
4. http://lora-aroyo.org @laroyo
CULTURAL HERITAGE
4
Before the Digital Age
Lots of manual effort
Focus on internal collection
management
Focus on art historical
significance
Access targeted to
researchers & professionals
Small curated selection
online for general audiences
onsite
5. http://lora-aroyo.org @laroyo
DIGITAL HERITAGE
5
Bringing collections online
Focus on massive
digitization of heritage
collections
Getting large collections
online
Still need significant art
historical understanding to
get access
Metadata not sufficient for
the online presence
7. http://lora-aroyo.org @laroyo
Linked Data, Semantic Web, Interoperability, Standards
METADATA ENRICHMENT
Shift from metadata for internal use to metadata for online access
8. http://lora-aroyo.org @laroyo
Linked Data, Semantic Web, Interoperability, Standards
METADATA ENRICHMENT
Building community for shared knowledge creation, use & maintenance
http://www.getty.edu/research/tools/vocabularies/lod/index.html
19. http://lora-aroyo.org @laroyo
Europeana.eu
sharing cultural heritage for
enjoyment, education and research
In 2008 launched with 4.5 mil
digitised items & 1,000
contributing organisations
In 2018 it collaborates with
thousands of European
archives, libraries & museums
> 50 mil digitised items:
● Books
● Music
● Artworks
Thematic collections on:
● Art
● Fashion
● Music
● Photography
● World War I
https://www.europeana.eu/portal/en
20. http://lora-aroyo.org @laroyo
ADDRESSED THE WEB ACCESS & SCALE ISSUES ...
through using automated methods to enrich & curate metadata
André Malraux, The Imaginary Museum of World Sculpture, 1953
21. http://lora-aroyo.org @laroyo
André Malraux, The Imaginary Museum of World Sculpture, 1953
BUT THAT WASN’T ENOUGH FOR TRUE ENGAGEMENT
Still there is much more focus on information support
rather than interpretation support for online collections
22. http://lora-aroyo.org @laroyo Gravity (2013)
LOST IN CULTURAL SPACE MORE THAN EVER
The sense of disconnect was now bigger as there has never
been so much online information and so difficult to find ...
23. http://lora-aroyo.org @laroyo 23
… BECAUSE THERE WAS NO CONTEXT
Entities were not sufficient to endure engagement with online collections
31. http://lora-aroyo.org @laroyo 31
theory of interpretation of
information bringing people
and technology together to:
● model information
● offer engaging interaction
● support interpretation
DIGITAL HERMENEUTICS
Chiel van den Akker, Susan Legêne, Marieke van Erp, Lora Aroyo, Roxane Segers, Lourens van der Meij, Jacco van
Ossenbruggen, Guus Schreiber, Bob Wielinga, Johan Oomen, and Geertje Jacobs (2011).
Digital hermeneutics: Agora and the online understanding of cultural heritage. In Proceedings of the 3rd International Web
Science Conference (WebSci '11). ACM, New York, NY, USA
32. http://lora-aroyo.org @laroyo
LINKING OBJECTS THROUGH EVENTS & ENTITIES
Erp, M. van; Oomen, J.; Segers, R.; Akker, C. van de; Aroyo, L.; Jacobs, G.; Legêne, S; Meij, L. van der; O ssenbruggen, J.R. van;
Schreiber, G. Automatic Heritage Metadata Enrichment with Historic Events Museums and the Web 2011
http://diveproject.beeldengeluid.nl/
33. http://lora-aroyo.org @laroyo
Erp, M. van; Oomen, J.; Segers, R.; Akker, C. van de; Aroyo, L.; Jacobs, G.; Legêne, S; Meij, L. van der; O ssenbruggen, J.R. van;
Schreiber, G. Automatic Heritage Metadata Enrichment with Historic Events Museums and the Web 2011
http://diveproject.beeldengeluid.nl/
ENGAGING USERS THROUGH EVENT NARRATIVES
34. http://lora-aroyo.org @laroyo
AGORA PROJECT
Modeling Historical Events
Segers, R., Erp, M.V., Meij, L.V., Aroyo, L., Schreiber, G., Wielinga, B.F., Ossenbruggen, J.V., Oomen, J., & Jacobs, G. (2011).
Hacking History : Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections. In Proc. of the
6th International Conference on Knowledge Capture (K-CAP’11)
35. http://lora-aroyo.org @laroyo
AGORA PROJECT
Modeling Historical Events
Segers, R., Erp, M.V., Meij, L.V., Aroyo, L., Schreiber, G., Wielinga, B.F., Ossenbruggen, J.V., Oomen, J., & Jacobs, G. (2011).
Hacking History : Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections. In Proc. of the
6th International Conference on Knowledge Capture (K-CAP’11)
36. http://lora-aroyo.org @laroyo
AGORA PROJECT
Event Properties & Relations
Segers, R., Erp, M.V., Meij, L.V., Aroyo, L., Schreiber, G., Wielinga, B.F., Ossenbruggen, J.V., Oomen, J., & Jacobs, G. (2011).
Hacking History : Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections. In Proc. of the
6th International Conference on Knowledge Capture (K-CAP’11)
37. http://lora-aroyo.org @laroyo
AGORA PROJECT
Proto-narratives with Events
Segers, R., Erp, M.V., Meij, L.V., Aroyo, L., Schreiber, G., Wielinga, B.F., Ossenbruggen, J.V., Oomen, J., & Jacobs, G. (2011).
Hacking History : Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections. In Proc. of the
6th International Conference on Knowledge Capture (K-CAP’11)
38. http://lora-aroyo.org @laroyo
DIVE+
Event-centric Explorative Search
DIVE into the event-based browsing of linked historical media (2015)
V De Boer, J Oomen, O Inel, L Aroyo, E Van Staveren, in Journal of Web Semantics
http://diveproject.beeldengeluid.nl/
39. http://lora-aroyo.org @laroyo
DIVE+
Explorative Search
DIVE into the event-based browsing of linked historical media (2015)
V De Boer, J Oomen, O Inel, L Aroyo, E Van Staveren, in Journal of Web Semantics:
http://diveproject.beeldengeluid.nl/
40. http://lora-aroyo.org @laroyo
DIVE+
Filters for Events
DIVE into the event-based browsing of linked historical media (2015)
V De Boer, J Oomen, O Inel, L Aroyo, E Van Staveren, in Journal of Web Semantics:
http://diveproject.beeldengeluid.nl/
filter on events
41. http://lora-aroyo.org @laroyo
DIVE+
Building Exploration Narratives
DIVE into the event-based browsing of linked historical media (2015)
V De Boer, J Oomen, O Inel, L Aroyo, E Van Staveren, in Journal of Web Semantics:
http://diveproject.beeldengeluid.nl/
narrative
42. http://lora-aroyo.org @laroyo
DIVE+ MEDIA SUITE
Explorative Search for Media Collections
de Boer V., Melgar L., Inel O., Ortiz C.M., Aroyo L., Oomen J. (2017)
Enriching Media Collections for Event-Based Exploration. In Proceedings of Metadata and Semantic Research (MTSR 2017),
Communications in Computer and Information Science, vol 755. Springer
http://mediasuite.clariah.nl/
43. http://lora-aroyo.org @laroyo
DIVE+ MEDIA SUITE
Explorative Search for Media Collections
de Boer V., Melgar L., Inel O., Ortiz C.M., Aroyo L., Oomen J. (2017)
Enriching Media Collections for Event-Based Exploration. In Proceedings of Metadata and Semantic Research (MTSR 2017),
Communications in Computer and Information Science, vol 755. Springer
http://mediasuite.clariah.nl/
44. http://lora-aroyo.org @laroyo
Narratives in animated GIFs
Remixing Archival Stories with millenials
Inel O., Sauer, S., Aroyo L. (2018)
A Study of Narrative Creation by Means of Crowds and Niches
http://diveproject.beeldengeluid.nl/
47. http://lora-aroyo.org @laroyo
M.C. Escher, Day and Night , 1938
BUT THIS ONLY WORKS IF THERE ARE EVENTS ...
Event vocabularies are difficult: too many, not structured, not shared, not
standardized, lots of variations, perspectives, no agreement across communities
49. http://lora-aroyo.org @laroyo
CROWDTRUTH.ORG
a spatial representation of meaning
that harnesses disagreement
http://crowdtruth.orghttp://data.crowdtruth.org
a human computation
(crowdsourcing) approach to:
● gather diversity of
perspectives & opinions
from crowds & niches
● expand expert
vocabularies with these
● gather new type of gold
standard for machines
51. http://lora-aroyo.org @laroyo 51
One truth: knowledge acquisition and
curation assume one correct interpretation
for every object
All cases are created equal: they are all
either true or false
Disagreement bad: when people disagree,
they don’t understand the problem
Experts rule: knowledge is always
captured from domain experts
One is enough: knowledge by a single
expert is sufficient
Detailed explanations help: if cases cause
disagreement - add instructions
Once done, forever valid: knowledge is
not updated; new data not aligned with old
a set of assumptions and rules
that we rarely question
“Truth is a Lie: 7 Myths about Human Annotation”, AI Magazine 2014, L. Aroyo, C. Welty
http://crowdtruth.org
COMFORT ZONE DISRUPTED
53. http://lora-aroyo.org @laroyo
On the role of user-generated metadata in audio visual collections (2011).
R. Gligorov, M. Hildebrand, J. van Ossenbruggen, G. Schreiber, L. Aroyo K-CAP2011
VIDEO METADATA ENRICHMENT
The Netherlands Institute for Sound and Vision
http://waisda.nl
54. http://lora-aroyo.org @laroyo
On the role of user-generated metadata in audio visual collections (2011).
R. Gligorov, M. Hildebrand, J. van Ossenbruggen, G. Schreiber, L. Aroyo K-CAP2011
VIDEO METADATA ENRICHMENT
The Netherlands Institute for Sound and Vision
http://spotvogel.vroegevogels.vara.nl/
55. http://lora-aroyo.org @laroyo
L. Aroyo, C. Welty: CrowdTruth: Harnessing disagreement in crowdsourcing relex gold standard. ACM WebSci 2013.
L. Aroyo, C. Welty. The Three Sides of CrowdTruth, Journal of Human Computation, 2014
VIDEO ENRICHMENT
CrowdTruth with Amazon Mechanical Turk & Figure Eight
http://crowdtruth.org
56. http://lora-aroyo.org @laroyo
L. Aroyo, C. Welty: CrowdTruth: Harnessing disagreement in crowdsourcing relex gold standard. ACM WebSci 2013.
L. Aroyo, C. Welty. The Three Sides of CrowdTruth, Journal of Human Computation, 2014
IMAGE ENRICHMENT
CrowdTruth with Amazon Mechanical Turk & Figure Eight
http://crowdtruth.org
57. http://lora-aroyo.org @laroyo
L. Aroyo, C. Welty: CrowdTruth: Harnessing disagreement in crowdsourcing relex gold standard. ACM WebSci 2013.
L. Aroyo, C. Welty. The Three Sides of CrowdTruth, Journal of Human Computation, 2014
IMAGE ENRICHMENT
CrowdTruth with Amazon Mechanical Turk & Figure Eight
http://crowdtruth.org
58. http://lora-aroyo.org @laroyo
Nikita Galinkin, Zoltán Szlávik, Lora Aroyo and Benjamin Timmermans (2017).
Catch Them If You Can: A Simulation Study on Malicious Behavior in a Cultural Heritage
Question Answering System. The 29th Benelux Conference on Artificial Intelligence (BNAIC 2017).
IMAGE ENRICHMENT
CrowdTruth with Mauritshuis
http://crowdtruth.org
59. http://lora-aroyo.org @laroyo
Chris Dijkshoorn, Victor De Boer, Lora Aroyo, Guus Schreiber (2014).
Accurator: Nichesourcing for Cultural Heritage
NICHESOURCING: FINDING NICHES IN THE CROWD
Accurator tool: SealincMedia Project
http://sealincmedia.wordpress.com
60. http://lora-aroyo.org @laroyo
Chris Dijkshoorn, Victor De Boer, Lora Aroyo, Guus Schreiber (2014).
Accurator: Nichesourcing for Cultural Heritage
NICHESOURCING IN THE CULTURAL HERITAGE
Accurator tool
http://annotate.accurator.nl
61. http://lora-aroyo.org @laroyo
Chris Dijkshoorn, Victor De Boer, Lora Aroyo, Guus Schreiber (2014).
Accurator: Nichesourcing for Cultural Heritage
NICHESOURCING IN THE CULTURAL HERITAGE
Accurator tool
http://annotate.accurator.nl
62. http://lora-aroyo.org @laroyo
Chris Dijkshoorn, Victor De Boer, Lora Aroyo, Guus Schreiber (2014).
Accurator: Nichesourcing for Cultural Heritage
NICHESOURCING IN THE CULTURAL HERITAGE
Accurator tool
http://annotate.accurator.nl
63. http://lora-aroyo.org @laroyo
Chris Dijkshoorn, Victor De Boer, Lora Aroyo, Guus Schreiber (2014).
Accurator: Nichesourcing for Cultural Heritage
CREATING EXPERTS WITH GAMES
Accurator tool
http://annotate.accurator.nl
64. http://lora-aroyo.org @laroyo
DigiBird: on the fly collection integration supported by the crowd (2017)
Chris Dijkshoorn, Christina-Lulia Bucur, Maarten Brinkerink, Sander Pieterse and Lora Aroyo
NICHESOURCING EVENTS
Part of the SealincMedia Project
http://annotate.accurator.nl
65. http://lora-aroyo.org @laroyo
DigiBird: on the fly collection integration supported by the crowd (2017)
Chris Dijkshoorn, Christina-Lulia Bucur, Maarten Brinkerink, Sander Pieterse and Lora Aroyo
NICHESOURCING EVENTS
DigiBird Project
http://annotate.accurator.nl
66. http://lora-aroyo.org @laroyo
DigiBird: on the fly collection integration supported by the crowd (2017)
Chris Dijkshoorn, Christina-Lulia Bucur, Maarten Brinkerink, Sander Pieterse and Lora Aroyo
NICHESOURCING EVENTS
DigiBird Project
http://annotate.accurator.nl
67. http://lora-aroyo.org @laroyo
DigiBird: on the fly collection integration supported by the crowd (2017)
Chris Dijkshoorn, Christina-Lulia Bucur, Maarten Brinkerink, Sander Pieterse and Lora Aroyo
NICHESOURCING EVENTS
DigiBird Project
http://annotate.accurator.nl
80. http://lora-aroyo.org @laroyo
LESSONS LEARNED ...
Crowds are large and contribute at scale
Crowds bring natural diversity
Crowds help gathering real human semantics
There are niches of experts in the crowds
Experts and crowds are complimentary
together they encompass a multitude
of opinions and perspectives
Experts and crowds have different semantics
Experts and crowds are
interested in different stories
Experts and crowds use different vocabularies
Crowds are enthusiasts, motivated,
driven by altruism
81. http://lora-aroyo.org @laroyo
The world is full
of shades of grey
Capturing and understanding opinions,
perspectives & contexts is in the center
of understanding people
LESSONS LEARNED ...
CrowdTruth defines multi-dimensional
space to measure quality
CrowdTruth defines hyper-dimensional
space to represent ambiguity
Nichesourcing helps expanding expertise
beyond the walls of organizations
Nichesourcing needs active engagement
online and with onsite campaigns