TY - JOUR U1 - Zeitschriftenartikel, wissenschaftlich - begutachtet (reviewed) A1 - Kirchhoff, Agnes A1 - Bügel, Ulrich A1 - Santamaria, Eduard A1 - Reimeier, Fabian A1 - Röpert, Dominik A1 - Tebbje, Alexander A1 - Güntsch, Anton A1 - Chaves, Fernando A1 - Steinke, Karl-Heinz A1 - Berendsohn, Walter T1 - Toward a service-based workflow for automated information extraction from herbarium specimens JF - Database N2 - Over the past years, herbarium collections worldwide have started to digitize millions of specimens on an industrial scale. Although the imaging costs are steadily falling, capturing the accompanying label information is still predominantly done manually and develops into the principal cost factor. In order to streamline the process of capturing herbarium specimen metadata, we specified a formal extensible workflow integrating a wide range of automated specimen image analysis services. We implemented the workflow on the basis of OpenRefine together with a plugin for handling service calls and responses. The evolving system presently covers the generation of optical character recognition (OCR) from specimen images, the identification of regions of interest in images and the extraction of meaningful information items from OCR. These implementations were developed as part of the Deutsche Forschungsgemeinschaft funded a standardised and optimised process for data acquisition from digital images of herbarium specimens (StanDAP-Herb) Project. KW - Bildanalyse KW - Optische Zeichenerkennung KW - Metadaten KW - Herbarium Y1 - 2018 UN - https://nbn-resolving.org/urn:nbn:de:bsz:960-opus4-12860 SN - 1758-0463 SS - 1758-0463 U6 - https://doi.org/10.25968/opus-1286 DO - https://doi.org/10.25968/opus-1286 VL - 2018 SP - 1 EP - 11 ER -