Data sharing: solutions

•Download as PPTX, PDF•

1 like•197 views

Hughes, V. and Wormald, J. (2018) Data sharing: solutions. Paper presented at WYRED Project: Data Sharing Satellite Event, University of Huddersfield. 2 August 2018.

Education

WYRED Project: Data Sharing Satellite Event
2nd August 2018
Data Sharing: Solutions
Vincent Hughes and Jessica Wormald

Re-cap
• Increasing focus on data sharing across sciences
(‘open science’)
• Especially useful for sociolinguistics…
oexpands scale of analyses (reveal subtle effects)
oreplication
• …and forensics
ofor empirical estimation of typicality/ validation
omove away from experience-based approaches
• Importance of collaboration
2

What is data?
• Anything is useful (and better than nothing):
oknowledge
oliterature
orecordings
oraw data
oplatforms/ scripts for data extraction
ocode (statistical modelling)
3

Solutions: Quantitative
General
• Recordings / raw data sharing
• Online platforms – transcriptions + recordings
Forensics
• Different types of features
• Different types of analysis
• Forensic-friendly data collection
4

Solutions: Qualitative
General + Forensic
• Specialist knowledge
• (Published) literature
• Speed of access
5

WikiDialects
What is it?
Wiki = repository for information which is
developed collaboratively by members of a
community
6

WikiDialects
• How would it be used?
• Casework in forensics – assessing typicality
• Research by academics – useful resource for finding out
what is out there / sharing research
• But other beneficiaries / users too
• Speech and language therapy – assessing typicality
• L2 English studies – understanding variation
• Students – resource for studies
• Lay audience – general interest
9

What does this need to work?
Collaboration
10

What does this need to work?
Better communication
11

What's hot

Multiple perspectives on bibliometric dataNees Jan van Eck

Dutch Cooking with xAPI Recipes, The Good, the Bad, and the ConsistentHendrik Drachsler

20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...OpenAIRE

Managing sensitive data at the University of BristolJisc RDM

European Open Science CloudJisc RDM

Data sharing in the NetherlandsJisc RDM

Data management: The new frontier for librariesLEARN Project

20190527_David Osimo_The Open Science MonitorOpenAIRE

Presenting RISEJisc RDM

Frances Burton on sensitive dataJisc RDM

Symbiosis—Is Collaboration the New Innovation? (Part 3 of 3), Mike ConlonAllen Press

A discovery service for UK research dataJisc RDM

Evidence of OER ImpactRobert Farrow

How to improve the acceptance of AltMetricsuherb

Open by default: the challenges of research data in EuropeLEARN Project

The Future of Open SciencePhilip Bourne

Research Data Management and the brave new world, By Paul AyrisLEARN Project

LACE Project Overview and ExploitationHendrik Drachsler

LACE Flyer 2016 Hendrik Drachsler

Managing Arts and Humanities DataJisc RDM

What's hot (20)

Multiple perspectives on bibliometric data

Dutch Cooking with xAPI Recipes, The Good, the Bad, and the Consistent

20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...

Managing sensitive data at the University of Bristol

European Open Science Cloud

Data sharing in the Netherlands

Data management: The new frontier for libraries

20190527_David Osimo_The Open Science Monitor

Presenting RISE

Frances Burton on sensitive data

Symbiosis—Is Collaboration the New Innovation? (Part 3 of 3), Mike Conlon

A discovery service for UK research data

Evidence of OER Impact

How to improve the acceptance of AltMetrics

Open by default: the challenges of research data in Europe

The Future of Open Science

Research Data Management and the brave new world, By Paul Ayris

LACE Project Overview and Exploitation

LACE Flyer 2016

Managing Arts and Humanities Data

Similar to Data sharing: solutions

CODATA International Training Workshop in Big Data for Science for Researcher...Johann van Wyk

Crowdsourcing ScienceAndrea Wiggins

Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith

Being an Open Scholar in a Connected WorldStian Håklev

Delivering biodiversity knowledge in the information ageVince Smith

Data Science and Urban Science @ UWUniversity of Washington

Open Source, Open Science, & Citizen ScienceAndrea Wiggins

Research Data Lifecycle: Role of Data ServicesArhiv družboslovnih podatkov

The role of learning in citizen scienceMuki Haklay

Dataverse in the Universe of Data by Christine L. Borgmandatascienceiqss

Oess NCRM FestivalOxford Martin Centre, OII, and Computer Science at the University of Oxford

Haw GIScience lost its interdisciplinary mojo?Muki Haklay

Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Carole Goble

What is Open Science and what role does it play in Development?Leslie Chan

Open Data in a Big Data World: easy to say, but hard to do?LEARN Project

Digital Humanities and “Digital” Social SciencesChantal van Son

Getting Started with Institutional Repositories and Open AccessAbby Clobridge

Directions in Open ScienceMike Travers

Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe

FAIR for the future: embracing all things dataARDC

Similar to Data sharing: solutions (20)

CODATA International Training Workshop in Big Data for Science for Researcher...

Crowdsourcing Science

Vince smith-delivering biodiversity knowledge in the information age-notext

Being an Open Scholar in a Connected World

Delivering biodiversity knowledge in the information age

Data Science and Urban Science @ UW

Open Source, Open Science, & Citizen Science

Research Data Lifecycle: Role of Data Services

The role of learning in citizen science

Dataverse in the Universe of Data by Christine L. Borgman

Oess NCRM Festival

Haw GIScience lost its interdisciplinary mojo?

Trust and Accountability: experiences from the FAIRDOM Commons Initiative.

What is Open Science and what role does it play in Development?

Open Data in a Big Data World: easy to say, but hard to do?

Digital Humanities and “Digital” Social Sciences

Getting Started with Institutional Repositories and Open Access

Directions in Open Science

Data Science: History repeated? – The heritage of the Free and Open Source GI...

FAIR for the future: embracing all things data

Recently uploaded

Concurrency Control in Database Management systemChristalin Nelson

INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxExcellence Foundation for South Sudan

Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringSri Sairam College Of Engineering Bengaluru

Transaction Management in Database Management SystemChristalin Nelson

MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir

Mattingly "AI & Prompt Design: Large Language Models"National Information Standards Organization (NISO)

Oppenheimer Film Discussion for Philosophy and FilmStan Meyer

Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1

Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW

ClimART Action | eTwinning Projectjordimapav

Sulphonamides, mechanisms and their usesVijayaLaxmi84

Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar

Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar

4.9.24 School Desegregation in Boston.pptxmary850239

31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection

ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri

Textual Evidence in Reading and Writing of SHSMae Pangan

Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO

Scientific Writing :Research DiscourseAnita GoswamiGiri

Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW

Recently uploaded (20)

Concurrency Control in Database Management system

INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx

Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering

Transaction Management in Database Management System

MS4 level being good citizen -imperative- (1) (1).pdf

Mattingly "AI & Prompt Design: Large Language Models"

Oppenheimer Film Discussion for Philosophy and Film

Reading and Writing Skills 11 quarter 4 melc 1

Q-Factor General Quiz-7th April 2024, Quiz Club NITW

ClimART Action | eTwinning Project

Sulphonamides, mechanisms and their uses

Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...

Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx

4.9.24 School Desegregation in Boston.pptx

31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...

ICS 2208 Lecture Slide Notes for Topic 6

Textual Evidence in Reading and Writing of SHS

Daily Lesson Plan in Mathematics Quarter 4

Scientific Writing :Research Discourse

Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW

Data sharing: solutions

1. WYRED Project: Data Sharing Satellite Event 2nd August 2018 Data Sharing: Solutions Vincent Hughes and Jessica Wormald

2. Re-cap • Increasing focus on data sharing across sciences (‘open science’) • Especially useful for sociolinguistics… oexpands scale of analyses (reveal subtle effects) oreplication • …and forensics ofor empirical estimation of typicality/ validation omove away from experience-based approaches • Importance of collaboration 2

3. What is data? • Anything is useful (and better than nothing): oknowledge oliterature orecordings oraw data oplatforms/ scripts for data extraction ocode (statistical modelling) 3

4. Solutions: Quantitative General • Recordings / raw data sharing • Online platforms – transcriptions + recordings Forensics • Different types of features • Different types of analysis • Forensic-friendly data collection 4

5. Solutions: Qualitative General + Forensic • Specialist knowledge • (Published) literature • Speed of access 5

6. WikiDialects What is it? Wiki = repository for information which is developed collaboratively by members of a community 6

7. WikiDialects How would it work? 7

9. WikiDialects • How would it be used? • Casework in forensics – assessing typicality • Research by academics – useful resource for finding out what is out there / sharing research • But other beneficiaries / users too • Speech and language therapy – assessing typicality • L2 English studies – understanding variation • Students – resource for studies • Lay audience – general interest 9

10. What does this need to work? Collaboration 10

11. What does this need to work? Better communication 11

12. What does this need to work? Ethics 12

13. What does this need to work? Money 13

Editor's Notes

Platforms: SLAAP, ONZE, FAVE suite, SPADE Forced alignment Searching for internal and external sources of variation Easy extraction of large amounts of data Continually updated (longitudinal resource) Forensic-specific features: e.g. MFCCs/ LTFDs from across an entire recording means of summarising data easily incorporating large-scale analyses into case reports Forensic-specific collection methods capture more real world variation forensically realistic conditions e.g. multiple recordings per speaker, technical factors…
Platforms: SLAAP, ONZE, FAVE suite, SPADE Forced alignment Searching for internal and external sources of variation Easy extraction of large amounts of data Continually updated (longitudinal resource) Forensic-specific features: e.g. MFCCs/ LTFDs from across an entire recording means of summarising data easily incorporating large-scale analyses into case reports Forensic-specific collection methods capture more real world variation forensically realistic conditions e.g. multiple recordings per speaker, technical factors…
Combined qualitative and quantitative resource – qualitative collation of research as a ‘first port of call’ Community in this case = researchers interested in accents (sociolinguists, forensic speech scientists, speech and language therapists) Not – lay individuals with a view on language patterns (‘the youth of today can’t say th anymore – they all think it’s f !)
Academics from across disciplines (e.g. sociolinguistics, phonetics) could contribute descriptions of linguistic features for different regional and social groups Could use the repository to search about features in a given region to find out about what’s been done about that feature – e.g….
Select feature and region Pulls up resources (and summaries) on that feature – resources could be project platforms, larger platforms (e.g. SPADE), individual publications, academic website / blog Would include metadata (how many speakers, how old were they?) Provides a quick starting point to find out info available (useful for all – caseworkers / researchers / students)
Investment in the enterprise requires us to explain the importance both theoretical and practical benefits Becoming part of ‘standard practice’
General Forensics
Understandable concerns about how data is used (especially where the term forensic is used)
Difficult to get funding for tools (unless some specific call from RCs) Especially if no direct research question associated Impact funding is one potential avenue Also need funding for long-term maintenance of resources certainly for data platforms maybe less so for a wiki (?)

Data sharing: solutions

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Data sharing: solutions

Similar to Data sharing: solutions (20)

Recently uploaded

Recently uploaded (20)

Data sharing: solutions

Editor's Notes