SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Lurking in the lab: Analysis of data from molecular biology laboratory instruments
                                                              Jen Ferguson • Northeastern University • Boston MA



Introduction                                                  Results                                                                                                                  Conclusions
Science labs are filled with laboratory instruments that                                                                                                                               Experimental data files in both proprietary and open
gather and store data. These data sets are neither                Quantitative analysis of data files                                                                                  formats were found in analysis of lab equipment hard
glamorous nor polished, and very little of this data will                                                                                                                              drives.
ultimately be published. However, they represent the
reality that science faculty, staff, and students are              Eight most prevalent file formats                         62% of files were in proprietary formats                  Image files made up the bulk of the experimental data and
attempting to manage daily.
                                                                                                                                                                                       were usually in open formats.

                                                                                                                                                                                       Other experimental data files tended to be in proprietary
                                                                                                                                                                                       formats.

                                                                                                                                                                                       Several factors pose potential challenges to data use,
                                                                                                                                                                                       sharing, management, and preservation:

                                                                                                                                                                                            o Automatic overwriting and/or cloud storage of data
                                                                                                                                                                                              sets
                                                                                                                                                                                            o Data consisting of bundles of different file types
A faculty member granted access to instruments used in
                                                                                                                                                                                            o Proprietary file formats
her molecular biology teaching lab.

Experimental data files generated by these instruments
were examined in order to:

       o Gain insight into lab data management practices         Additional findings
       o Evaluate possible curation/preservation challenges                                                                                                                            Acknowledgments
         posed by this data                                      Image files in various formats were the most prevalent type of experimental data found, accounting for 65% of the
                                                                 data files on the lab instrument hard drives.                                                                         Thanks to M. Kosinski-Collins & D. Bordne of Brandeis
                                                                                                                                                                                       University for generously providing insights and access to
                                                                 Data from these instruments can be stored as packages comprised of several different file types. For
                                                                                                                                                                                       their laboratory equipment.
                                                                 example, DNA sequence data can consist of sequence text file (.txt) + curve file (.scf) + gel image file (.samp).

                                                                 One instrument (the LI-COR DNA analyzer) had very few data files stored on its hard drive. This instrument uses
                                                                 drive space as temporary storage only; the oldest data sets are continuously overwritten. Some laboratory
Materials and methods                                            instruments, including the LI-COR, can be configured to save data to cloud storage supplied by the instrument’s
                                                                 vendor.                                                                                                               Sources
Xplorer2 file management software was used to flatten
directory structures and capture files and metadata on hard
                                                                                                                                                                                       Cornell University Library/Research Department. (2003).
drives dedicated to these laboratory instruments:
                                                                                                                                                                                         Common image file formats. In M oving theory into practice:
                                                                                                                                                                                         D igital imaging tutorial. Retrieved from
   o    Agilent Technologies atomic force microscope             Faculty & staff perspectives                                                                                            http://www.library.cornell.edu/preservation/tutorial/present
   o    LI-COR NEN DNA analyzer                                                                                                                                                          ation/table7-1.html
   o    Bio-Rad Gel Doc XR
   o    Hitachi fluorescence spectrophotometer                   Teaching lab faculty and staff noted challenges in organizing files so students could find them, due to:              File-extensions.org – File extension library. (2012).
                                                                                                                                                                                          Retrieved from www.file-extensions.org
                                                                   o   Inconsistent file storage locations & naming conventions
Xplorer2 and Microsoft Excel were used to sort and analyze                                                                                                                             LI-COR, Inc. (2006). Operator’s manual: N E N model 4300
                                                                   o   Inconsistent data management practices from one lab instrument to the next
experimental data files. Formats were categorized as                                                                                                                                      D N A analyzer. Retrieved from http://biosupport.licor.com/
proprietary or open/standard.                                                                                                                                                             docs/4300_OpMan_08591.pdf
                                                                 They observed that students found working with experimental data frustrating, largely as a result of barriers
                                                                 imposed by proprietary file formats. Challenges also arose due to issues such as difficulty manipulating 32-bit vs.   Zabkat Software. (2011). Xplorer2 portable edition (Version
Informal discussions with the faculty member & lab staff         16-bit vs. 8-bit images.                                                                                                1.8.1) [Software]. Available from
during the course of this work also informed the findings.                                                                                                                               http://zabkat.com/x2port.htm

Weitere ähnliche Inhalte

Andere mochten auch

Advanced Process Control for ARC
Advanced Process Control for ARCAdvanced Process Control for ARC
Advanced Process Control for ARC
anilkc12
 
Practical Advanced Process Control for Engineers and Technicians
Practical Advanced Process Control for Engineers and TechniciansPractical Advanced Process Control for Engineers and Technicians
Practical Advanced Process Control for Engineers and Technicians
Living Online
 

Andere mochten auch (15)

IMR 7500_eng(1)
IMR 7500_eng(1)IMR 7500_eng(1)
IMR 7500_eng(1)
 
Process Gas Chromatograph for Industrial Use
Process Gas Chromatograph for Industrial UseProcess Gas Chromatograph for Industrial Use
Process Gas Chromatograph for Industrial Use
 
Suggested QC Practices for On-Line Analysis
Suggested QC Practices for On-Line AnalysisSuggested QC Practices for On-Line Analysis
Suggested QC Practices for On-Line Analysis
 
Process Solutions from Wet Chemistry to Near-Infrared Spectroscopy
Process Solutions from Wet Chemistry to Near-Infrared SpectroscopyProcess Solutions from Wet Chemistry to Near-Infrared Spectroscopy
Process Solutions from Wet Chemistry to Near-Infrared Spectroscopy
 
DELTA XRF Analyzer Labeling Feature
DELTA XRF Analyzer Labeling FeatureDELTA XRF Analyzer Labeling Feature
DELTA XRF Analyzer Labeling Feature
 
Presentation on stack monitoring for industries
Presentation on stack monitoring for industriesPresentation on stack monitoring for industries
Presentation on stack monitoring for industries
 
Linked In Presentation
Linked In PresentationLinked In Presentation
Linked In Presentation
 
Sap tao 2.0 Material
Sap tao 2.0 MaterialSap tao 2.0 Material
Sap tao 2.0 Material
 
Advanced Process Control for ARC
Advanced Process Control for ARCAdvanced Process Control for ARC
Advanced Process Control for ARC
 
The Diablo Real-Time Gas Analyzer Webinar
The Diablo Real-Time Gas Analyzer WebinarThe Diablo Real-Time Gas Analyzer Webinar
The Diablo Real-Time Gas Analyzer Webinar
 
SSAS - Other Cube Browsers
SSAS - Other Cube BrowsersSSAS - Other Cube Browsers
SSAS - Other Cube Browsers
 
Continuous Emissions Monitoring presentation by ET
Continuous Emissions Monitoring presentation by ETContinuous Emissions Monitoring presentation by ET
Continuous Emissions Monitoring presentation by ET
 
Practical Advanced Process Control for Engineers and Technicians
Practical Advanced Process Control for Engineers and TechniciansPractical Advanced Process Control for Engineers and Technicians
Practical Advanced Process Control for Engineers and Technicians
 
Flue gas analisys in industry-Practical guide for Emission and Process Measur...
Flue gas analisys in industry-Practical guide for Emission and Process Measur...Flue gas analisys in industry-Practical guide for Emission and Process Measur...
Flue gas analisys in industry-Practical guide for Emission and Process Measur...
 
Spectrum Analyzer Fundamentals/Advanced Spectrum Analysis
Spectrum Analyzer Fundamentals/Advanced Spectrum AnalysisSpectrum Analyzer Fundamentals/Advanced Spectrum Analysis
Spectrum Analyzer Fundamentals/Advanced Spectrum Analysis
 

Ähnlich wie Lurking in the lab: analysis of data from molecular biology laboratory instruments

2013 03-15- Institut Jacques Monod - bioinfoclub
2013 03-15- Institut Jacques Monod - bioinfoclub2013 03-15- Institut Jacques Monod - bioinfoclub
2013 03-15- Institut Jacques Monod - bioinfoclub
Yannick Wurm
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycle
Sherry Lake
 

Ähnlich wie Lurking in the lab: analysis of data from molecular biology laboratory instruments (20)

Description & annotation of biomedical experimental data sets: work in p...
Description & annotation of biomedical experimental data sets:  work in p...Description & annotation of biomedical experimental data sets:  work in p...
Description & annotation of biomedical experimental data sets: work in p...
 
Brain Imaging Data Structure
Brain Imaging Data StructureBrain Imaging Data Structure
Brain Imaging Data Structure
 
Research data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research MethodsResearch data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research Methods
 
Good (enough) research data management practices
Good (enough) research data management practicesGood (enough) research data management practices
Good (enough) research data management practices
 
2013 03-15- Institut Jacques Monod - bioinfoclub
2013 03-15- Institut Jacques Monod - bioinfoclub2013 03-15- Institut Jacques Monod - bioinfoclub
2013 03-15- Institut Jacques Monod - bioinfoclub
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycle
 
Eamonn Maguire: The Open Source ISA Metadata Tracking Framework: From Data Cu...
Eamonn Maguire: The Open Source ISA Metadata Tracking Framework: From Data Cu...Eamonn Maguire: The Open Source ISA Metadata Tracking Framework: From Data Cu...
Eamonn Maguire: The Open Source ISA Metadata Tracking Framework: From Data Cu...
 
Introduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate studentsIntroduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate students
 
Databases
DatabasesDatabases
Databases
 
ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides
 
Archive and Data Management Training Center
Archive and Data Management Training CenterArchive and Data Management Training Center
Archive and Data Management Training Center
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
RDAP 15 Lost in the Data Jungle: A Case Study for Organizing, Publishing, and...
RDAP 15 Lost in the Data Jungle: A Case Study for Organizing, Publishing, and...RDAP 15 Lost in the Data Jungle: A Case Study for Organizing, Publishing, and...
RDAP 15 Lost in the Data Jungle: A Case Study for Organizing, Publishing, and...
 
2013-01-17 Research Object
2013-01-17 Research Object2013-01-17 Research Object
2013-01-17 Research Object
 
Curation-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific ResearcherCuration-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific Researcher
 
NETTAB 2012
NETTAB 2012NETTAB 2012
NETTAB 2012
 
Whither Small Data?
Whither Small Data?Whither Small Data?
Whither Small Data?
 
A Guide for Reproducible Research
A Guide for Reproducible ResearchA Guide for Reproducible Research
A Guide for Reproducible Research
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05
 

Lurking in the lab: analysis of data from molecular biology laboratory instruments

  • 1. Lurking in the lab: Analysis of data from molecular biology laboratory instruments Jen Ferguson • Northeastern University • Boston MA Introduction Results Conclusions Science labs are filled with laboratory instruments that Experimental data files in both proprietary and open gather and store data. These data sets are neither Quantitative analysis of data files formats were found in analysis of lab equipment hard glamorous nor polished, and very little of this data will drives. ultimately be published. However, they represent the reality that science faculty, staff, and students are Eight most prevalent file formats 62% of files were in proprietary formats Image files made up the bulk of the experimental data and attempting to manage daily.   were usually in open formats. Other experimental data files tended to be in proprietary formats. Several factors pose potential challenges to data use, sharing, management, and preservation: o Automatic overwriting and/or cloud storage of data sets o Data consisting of bundles of different file types A faculty member granted access to instruments used in o Proprietary file formats her molecular biology teaching lab. Experimental data files generated by these instruments were examined in order to: o Gain insight into lab data management practices Additional findings o Evaluate possible curation/preservation challenges Acknowledgments posed by this data Image files in various formats were the most prevalent type of experimental data found, accounting for 65% of the data files on the lab instrument hard drives. Thanks to M. Kosinski-Collins & D. Bordne of Brandeis University for generously providing insights and access to Data from these instruments can be stored as packages comprised of several different file types. For their laboratory equipment. example, DNA sequence data can consist of sequence text file (.txt) + curve file (.scf) + gel image file (.samp). One instrument (the LI-COR DNA analyzer) had very few data files stored on its hard drive. This instrument uses drive space as temporary storage only; the oldest data sets are continuously overwritten. Some laboratory Materials and methods instruments, including the LI-COR, can be configured to save data to cloud storage supplied by the instrument’s vendor. Sources Xplorer2 file management software was used to flatten directory structures and capture files and metadata on hard Cornell University Library/Research Department. (2003). drives dedicated to these laboratory instruments: Common image file formats. In M oving theory into practice: D igital imaging tutorial. Retrieved from o Agilent Technologies atomic force microscope Faculty & staff perspectives http://www.library.cornell.edu/preservation/tutorial/present o LI-COR NEN DNA analyzer ation/table7-1.html o Bio-Rad Gel Doc XR o Hitachi fluorescence spectrophotometer Teaching lab faculty and staff noted challenges in organizing files so students could find them, due to: File-extensions.org – File extension library. (2012). Retrieved from www.file-extensions.org o Inconsistent file storage locations & naming conventions Xplorer2 and Microsoft Excel were used to sort and analyze LI-COR, Inc. (2006). Operator’s manual: N E N model 4300 o Inconsistent data management practices from one lab instrument to the next experimental data files. Formats were categorized as D N A analyzer. Retrieved from http://biosupport.licor.com/ proprietary or open/standard. docs/4300_OpMan_08591.pdf They observed that students found working with experimental data frustrating, largely as a result of barriers imposed by proprietary file formats. Challenges also arose due to issues such as difficulty manipulating 32-bit vs. Zabkat Software. (2011). Xplorer2 portable edition (Version Informal discussions with the faculty member & lab staff 16-bit vs. 8-bit images. 1.8.1) [Software]. Available from during the course of this work also informed the findings. http://zabkat.com/x2port.htm

Hinweis der Redaktion

  1. PRINTING NOTES This PowerPoint template complements of MakeSigns.com If you opened this file directly from a web browser, you’ll want to save it to your computer before adding your poster information. This template has a page size of 36”x 48” . When uploaded at MakeSigns.com, this template can be used to order posters in the following sizes: 36”x 48”, 42”x 56”, 48”x 64”, 31.5” x 42” and 27”x 36”. We recommend that you avoid changing the page size of the template. Please keep in mind, if you do change the page size it will alter the available print sizes listed above. Any changes to the template size should be done before entering your information. If you have any questions about creating a scientific poster , visit MakeSigns.com or email us at [email_address] We offer these scientific poster examples free of charge to help you create and design a poster presentation with ease.