This document summarizes an analysis of experimental data files from molecular biology lab instruments. The analysis found that 62% of files were in proprietary formats, with image files making up 65% of data but usually in open formats. Other experimental data tended to be in proprietary formats. Managing, sharing, and preserving this data poses challenges due to automatic overwriting, multiple file types per data set, and proprietary formats. Faculty noted students found working with experimental data frustrating due to barriers from proprietary formats and issues with file formats.
Lurking in the lab: analysis of data from molecular biology laboratory instruments
1. Lurking in the lab: Analysis of data from molecular biology laboratory instruments
Jen Ferguson • Northeastern University • Boston MA
Introduction Results Conclusions
Science labs are filled with laboratory instruments that Experimental data files in both proprietary and open
gather and store data. These data sets are neither Quantitative analysis of data files formats were found in analysis of lab equipment hard
glamorous nor polished, and very little of this data will drives.
ultimately be published. However, they represent the
reality that science faculty, staff, and students are Eight most prevalent file formats 62% of files were in proprietary formats Image files made up the bulk of the experimental data and
attempting to manage daily.
were usually in open formats.
Other experimental data files tended to be in proprietary
formats.
Several factors pose potential challenges to data use,
sharing, management, and preservation:
o Automatic overwriting and/or cloud storage of data
sets
o Data consisting of bundles of different file types
A faculty member granted access to instruments used in
o Proprietary file formats
her molecular biology teaching lab.
Experimental data files generated by these instruments
were examined in order to:
o Gain insight into lab data management practices Additional findings
o Evaluate possible curation/preservation challenges Acknowledgments
posed by this data Image files in various formats were the most prevalent type of experimental data found, accounting for 65% of the
data files on the lab instrument hard drives. Thanks to M. Kosinski-Collins & D. Bordne of Brandeis
University for generously providing insights and access to
Data from these instruments can be stored as packages comprised of several different file types. For
their laboratory equipment.
example, DNA sequence data can consist of sequence text file (.txt) + curve file (.scf) + gel image file (.samp).
One instrument (the LI-COR DNA analyzer) had very few data files stored on its hard drive. This instrument uses
drive space as temporary storage only; the oldest data sets are continuously overwritten. Some laboratory
Materials and methods instruments, including the LI-COR, can be configured to save data to cloud storage supplied by the instrument’s
vendor. Sources
Xplorer2 file management software was used to flatten
directory structures and capture files and metadata on hard
Cornell University Library/Research Department. (2003).
drives dedicated to these laboratory instruments:
Common image file formats. In M oving theory into practice:
D igital imaging tutorial. Retrieved from
o Agilent Technologies atomic force microscope Faculty & staff perspectives http://www.library.cornell.edu/preservation/tutorial/present
o LI-COR NEN DNA analyzer ation/table7-1.html
o Bio-Rad Gel Doc XR
o Hitachi fluorescence spectrophotometer Teaching lab faculty and staff noted challenges in organizing files so students could find them, due to: File-extensions.org – File extension library. (2012).
Retrieved from www.file-extensions.org
o Inconsistent file storage locations & naming conventions
Xplorer2 and Microsoft Excel were used to sort and analyze LI-COR, Inc. (2006). Operator’s manual: N E N model 4300
o Inconsistent data management practices from one lab instrument to the next
experimental data files. Formats were categorized as D N A analyzer. Retrieved from http://biosupport.licor.com/
proprietary or open/standard. docs/4300_OpMan_08591.pdf
They observed that students found working with experimental data frustrating, largely as a result of barriers
imposed by proprietary file formats. Challenges also arose due to issues such as difficulty manipulating 32-bit vs. Zabkat Software. (2011). Xplorer2 portable edition (Version
Informal discussions with the faculty member & lab staff 16-bit vs. 8-bit images. 1.8.1) [Software]. Available from
during the course of this work also informed the findings. http://zabkat.com/x2port.htm
Hinweis der Redaktion
PRINTING NOTES This PowerPoint template complements of MakeSigns.com If you opened this file directly from a web browser, you’ll want to save it to your computer before adding your poster information. This template has a page size of 36”x 48” . When uploaded at MakeSigns.com, this template can be used to order posters in the following sizes: 36”x 48”, 42”x 56”, 48”x 64”, 31.5” x 42” and 27”x 36”. We recommend that you avoid changing the page size of the template. Please keep in mind, if you do change the page size it will alter the available print sizes listed above. Any changes to the template size should be done before entering your information. If you have any questions about creating a scientific poster , visit MakeSigns.com or email us at [email_address] We offer these scientific poster examples free of charge to help you create and design a poster presentation with ease.