Unraveling Multimodality with Large Language Models.pdf
Large-scale digitisation options at the Natural History Museum, London.
1. “ We are drowning in information, while starving for wisdom. The world henceforth will be run by synthesizers, people able to put together the right information at the right time , think critically about it, and make important choices ” E. O. Wilson | Harvard University
8. 1mm Fossil zoomed in from imaged 500x500mm samples draw Single moth from a 500x500mm display case imaged in full at 1000 DPI
9.
10. Indexing Conveyor Inspection Module Museum Trays Camera Loading / Unloading Area Loading / Unloading Area 1.2 Mtr 1.2 Mtr 1 Mtr 700 mm All dimensions are approximate – do not scale SmartDrive TrayScan System with Infeed / Outfeed Conveyor Concept Illustration Positioning Guides (Flites) Conveyor approach for draws Digitise whole collections quickly
11. SmartDrive MicroScan System with Microscope Slide Tray Loader on Gliding Framework Concept Illustration Slide Tray Glide Mechanism Inspection Module SlideTrays with 'Locations' Fibre Optic Illuminator 500mm 1200mm All dimensions are approximate – do not scale Conveyor approach 1 for slides
17. “ We are drowning in information, while starving for wisdom. The world henceforth will be run by synthesizers, people able to put together the right information at the right time , think critically about it, and make important choices ” E. O. Wilson | Harvard University
Hinweis der Redaktion
This sums up what we (the NHM) should be about for biodiversity. Yet when it come to our thinking on digitisation of the “collection” we are way off base…
Using current approaches we cannot deliver it in a timely way. Most grants and benefactors expect a return in a much shorter period. 3,5, perhaps 10 years (e.g. EOL expects a returns in 10). 500 years is 2 orders of magnitude out, and that is just for creating the digital surrogate (incidentally the name digital surrogate isn’t marketable- we need a better name - no one understands what a digital surrogate is). We need a fresh approach…
Emphasis has been on prioritizing individual specimens. Traditionally as taxonomists we select types. But types are not that important to all but taxonomists. More importantly the act of specimen selection becomes a rate limiting step. Transaction costs are way too high. We have the dilemma of what to specimens to select, and for whom, for an audience we cannot be certain of either now, or into the future.
As part of the fresh approach we need some standards and we need to simplify. What are the standard objects, since it is around these we can simplify and standardise the process of handling. These objects are going to differ between departments, but these are what we should be digitized because they minimize the handling, and handling is the rate limiting step to creating digital surrogates. With well thought out workflows and processing digital surrogates of these objects can be created quickly. One one they are done, we don’t have to go back to a collection and pick up the parts we haven’t digitized.
As above
Barcodes can be added as required. They don’t need to be recorded. Could be picked up on the pins after the draw has been digitised.
Addresses parallax issues and squared distortion issues.
Addresses parallax issues and squared distortion issues.
Collection draws for digitising an entire collection very quickly. Only the process of putting the barcodes on specimens and putting the draws on the conveyer is manual. Everything else is automated.
Slides 1
Preferred slides option. This is quicker and gives better quality images. The system is based on that used for automatically scanning and screening histology slides. Some research is needed on adapting it to whole specimens.
Captcha is a mechanical turk process. [Explain mechanic turk]. Great way of getting help from a wider audience doing things we cannot automate. However, scientists and other interested parties can be involved too. We can design the system to work for us. A practical example is the Herbaria@home project.