8. 1st Generation Summary: Current practices of early adopters of tools. Characterised by researchers using tools within their particular problem area, with some re-use of tools, data and methods within the discipline. Traditional publishing is supplemented by publication of some digital artefacts such as workflows and links to data. Science is accelerated and practice is beginning to shift to emphasise in silico work.
17. [Diagram: Paul's Research Object. Paul's Pack links Workflow 16, Workflow 13, common pathways and QTL with results, logs, results metadata, the paper and slides, via "feeds into", "produces", "included in" and "published in" relationships.]
19. 2nd Generation Summary: Projects delivering now. Some institutional embedding. The key characteristic is re-use of the increasing pool of tools, data and methods across areas/disciplines. Contains some freestanding, recombinant, reproducible research objects. New scientific practices are established and opportunities arise for completely new scientific investigations. Some expert curation.
23. Structural Analysis of Large Amounts of Music Information: digital music collections, crowdsourced ground truth, community software, Linked Data repository, supercomputer.
25. 3rd Generation Summary: The solutions we'll be delivering in 5 years. Characterised by global reuse of tools, data and methods across any discipline, and by surfacing the right levels of complexity for the researcher. Routine use. The key characteristic is radical sharing. Research is significantly data driven – plundering the backlog of data, results and methods. Increasing automation and decision support for the researcher – the VRE becomes assistive. Curation is autonomic and social.
Speaker Notes
Today I’m going to talk about the trajectory of e-Science – from its conception through examples of 3 generations, and I’ll reflect on how we are moving from generation 2 to generation 3. Different disciplines and especially communities may be in different stages of evolution.
First something about words. This definition of e-Science is important – it reminds us that it isn’t just about technology but about people working together and being empowered by technology – and the emphasis on “science” reminds us that ultimately success is measured by new scientific outcome. At the turn of the decade this was a vision of the future. A programme was created called e-Science. The projects doing the innovation were labelled as “e-Science”. By the time we arrive, it’s just “science”. So “e-Science” has become the name of the journey rather than the destination. Note that the innovation that takes us to the destination isn’t solely in the custody of e-Science projects – there’s a lot of relevant work going on that doesn’t carry that label. Note also that when we say “e-Science” we actually mean “e-Research”! We sometimes forget to say that.
e-Science is often characterised as dealing with the data deluge – especially from new experimental techniques such as combinatorial chemistry, DNA microarrays, instruments, sensor networks, earth observation – even Facebook (which I see as a kind of large hadron collider – or large people collider – of social science), as well as digitisation programmes and release of existing data (e.g. open government data) or new modes of access to secure data. Researchers are working digitally. The data deluge is caused by, and needs to be handled by, automation. The trick with automation is getting the right balance of "human in the loop", so that researchers can do what they're very good at while machines do what machines are very good at. BTW note the cocktail on the form in this slide!
Scientific workflow systems are a key automation technique for systematically handling the data deluge and giving us the “workflow” as a new sharable artefact of digital science – to record, repeat, reproduce and repurpose an experiment. This is an iconic slide by Carole Goble which is much repeated, reproduced and repurposed!
As keen observers of the e-Research ecosystem (as I'm sure we all are!) it's interesting to note just how many workflow systems there are. This isn't a bad thing – each one comes prepackaged to solve particular problems for particular research communities. It's about adoption, about doing the specific before the generic. It shows co-evolution in action – successful e-Research isn't about technology impacting research, it's about technology being harnessed by researchers. Note: computer scientists in the audience may feel an urge to build a generic workflow language so that these systems can inter-operate. As it happens, workflows by their very nature plug together pretty well anyway – calling each other as services, or piping data from one to another.
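The "plug together" point can be sketched in a few lines: two hypothetical workflow steps, imagined as belonging to different workflow systems, composed simply by feeding one's output into the other. All names here are illustrative stand-ins, not the API of any real workflow engine.

```python
# Minimal sketch of workflow interoperability by "piping": the output of a
# step from one (imaginary) system becomes the input of a step from another.

def fetch_sequences(ids):
    """Stand-in for a data-fetching step in one workflow system."""
    return {i: f"SEQ-{i}" for i in ids}

def align(sequences):
    """Stand-in for an analysis step in a different system consuming that output."""
    return sorted(sequences.values())

# Composition needs no shared workflow language - just compatible data.
result = align(fetch_sequences(["P12345", "Q67890"]))
print(result)
```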
Some co-evolution in action. In CombeChem I didn’t get requirements and go away and design a system that nobody wanted. We empowered some chemists to harness the technology – in this case Semantic Web. We “went on the journey” with them. They have done cool stuff! Semantic lab books, publication at source (e-crystals then blogging the lab), semantically enhanced publications. And a neat units ontology.
This is a summary of the phase we have been describing. The text on my summary slides has evolved but was originally based on the work of the e-Laboratories group at Manchester University (cf collaboratory or Virtual Research Environment) – I believe this framework to be more generally applicable, as you’ll see in this talk.
What we didn’t see much in phase 1 was sharing and reuse, but this is essential to harnessing of the new technology. The story on this slide involves sharing in a corridor and we will go on to see how we do it digitally! But it’s an important motivation. It led to new science.
The problem with sharing is that scientists are selfish – not so much e-Science as “me-Science”!
Heard this one? :-)
So we created “myExperiment” to find out whether scientists do indeed share enough to enjoy the benefits. New Scientist called it “mySpace for Scientists” (and my daughter called it mySpace for Science homework) but alas mySpace was soon passé, so it rapidly became Facebook for scientists. But that was a deterrent to uptake, because it was perceived to imply no privacy. So it’s not facebook! Incidentally our astronomy colleagues picked up the idea to create “Spacebook” :-)
How we actually describe myExperiment of course depends on our audience, and there are things of interest to many people. It's like the blind monks and the elephant. Apologies to repository colleagues in the audience for putting them at the tail end!
myExperiment in one slide! It's a "boutique" Web site with the largest public collection of scientific workflows. For lots more information see the myExperiment wiki http://wiki.myexperiment.org/ BioCatalogue is a registry of Web Services in the life sciences and is directly based on the myExperiment experience. SysMO and MethodBox grew from the myExperiment codebase – MethodBox is an e-Social Science e-Laboratory for sharing and analysing data, and SysMO is customised to the systems biology domain. See http://www.biocatalogue.org/ http://www.methodbox.org/ http://www.sysmo-db.org/
My example screenshot page today isn't a Taverna workflow but is another example of co-evolution. This is a Nimrod workflow, and it's on the Australian instance of myExperiment. We don't mandate how people use myExperiment; we empower and watch and learn! One of the distinctives is the yellow strip – the "social metadata": licenses, credits, attribution. Without this scientists wouldn't use it.
Lots of people focus on data (after all, there is a deluge!). Another important distinctive of myExperiment is that we have focused on sharing workflows (specific first – we focus on workflows like movies on YouTube or photos on Flickr) – or more generally on methods (sharing "know-how"). If there is a data deluge then surely methods for handling and analysing it are just as important as the data?
This is reflected in a third distinctive – the pack. This is Paul Fisher's pack from the Tryps example. Some packs contain example input and output data so workflows can be checked for "decay" (they don't actually rot, but the world changes around them). While others are looking at semantically enhanced publication, we are asking "what is the shared artefact of future research?" We come at the same problem from the other side. We have it surrounded! Our approach relieves us of the paper mindset – so, for example, a Research Object could contain information for many audiences and purposes, with a commonly interpreted core (social scientists will recognise the idea of a "boundary object").
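The decay check that example data enables can be sketched as follows. Here run_workflow and the pack's contents are hypothetical stand-ins for illustration, not the real pack format or any actual workflow engine: re-run the packaged workflow on the pack's example input and compare against the stored example output.

```python
# Sketch of a "decay" check: a pack bundles a workflow with example input
# and expected output, so the workflow can be re-run and verified later.

def run_workflow(workflow, example_input):
    # Placeholder for executing the packaged workflow via its engine.
    return [x.upper() for x in example_input]

pack = {
    "workflow": "tryps-annotation",            # illustrative name only
    "example_input": ["gene_a", "gene_b"],
    "example_output": ["GENE_A", "GENE_B"],
}

actual = run_workflow(pack["workflow"], pack["example_input"])
decayed = actual != pack["example_output"]
print("workflow has decayed" if decayed else "workflow still reproduces its example")
```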
None of this would be relevant if we weren’t seeing new science coming out – and we are. This example involves a microscope – back to our earlier instruments and automation theme – and a Kepler workflow which is shared on myexperiment.org.au and is in routine use.
This is pretty much where we are now!
Now we look at myExperiment as a probe into the future behaviour of researchers. For example, these workflows by Francois Belleau show what could be described as another level of working – building on the new tooling.
Here we see bioinformaticians assembling the resources they need to answer a research question – and also demonstrating what the methods section of the future paper needs to look like. They are using Linked Data. We see the power – ease of assembly. This could be where the new computer science challenges lie in e-Research.
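The "ease of assembly" point can be illustrated with a toy model of Linked Data: two hypothetical datasets published as (subject, predicate, object) triples, merged and joined on shared identifiers to answer a research question. Real work of this kind would query live SPARQL endpoints; every URI and predicate below is made up for the sketch.

```python
# Toy Linked Data assembly: merge two triple sets and join on a shared
# gene identifier. All identifiers are illustrative, not real vocabularies.

genes = {
    ("ex:gene1", "ex:associatedWith", "ex:diseaseX"),
    ("ex:gene2", "ex:associatedWith", "ex:diseaseY"),
}
pathways = {
    ("ex:gene1", "ex:memberOf", "ex:pathwayA"),
    ("ex:gene2", "ex:memberOf", "ex:pathwayB"),
}

# "Ease of assembly": combining the datasets is just a union of triples.
graph = genes | pathways

def pathways_for(disease):
    """Which pathways contain genes associated with the given disease?"""
    linked = {s for (s, p, o) in graph if p == "ex:associatedWith" and o == disease}
    return sorted(o for (s, p, o) in graph if p == "ex:memberOf" and s in linked)

print(pathways_for("ex:diseaseX"))
```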
To show it isn’t just bioinformaticians, here are Computational Musicologists doing a similar thing. Here the “signal” is digital music recordings, and the research question relates to country music!
That example comes from a Digging into Data project with the best project acronym ever. The project is conducting a massive structural analysis of music in the Internet Archive, to support musicologists. It illustrates many of the things we are now seeing in e-Research – crowdsourcing, annotation, community software development, high performance computation, data publication. This project involves UIUC, McGill and Oxford – and the supercomputer time is donated by NCSA.
We’ve seen digital humanities, let’s look briefly at e-social science – or rather, “Digital Social Research” (the name of the destination, not the journey!). In social science we have more data than ever before, but not collected for social science research per se – it was collected for a different purpose. This brings a set of challenges, from statistics to ethics. We also have more capability than ever before, as illustrated in this talk. We believe the trick (again) is to focus on “methods” – the training and capacity building in the next generation of researchers. Social Science has another important angle – the social science study of e-Research itself. Many useful studies are now emerging.
Once the technologies are established and adopted we can realise the benefits of sharing – not just in “big science” but in everyday research. Collections like myExperiment enable new forms of analysis – of patterns of methods for example.
What we have seen throughout this talk is co-evolution or co-design in action. Or – more words – co-constitution. For computer scientists let’s just say co-* :-) A year ago I did a tour of the US with Malcolm Atkinson and we introduced two metaphors which have become “memes”:
- Intellectual access ramps, of which workflow systems and myExperiment are examples, enable incremental engagement – rather than jumping straight into the fast lane! They are for scientists but also developers and research technologists.
- Datascopes. These are the assemblies of tools that take us from signal to understanding. They are scientific instruments which equally support humanists. We hope they will change our understanding of our place in the universe.