skip to main content
10.1145/2467696.2467729acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

A scalable, distributed and dynamic workflow system for digitization processes

Published:22 July 2013Publication History

ABSTRACT

Creating digital representations of ancient manuscripts, prints and maps is a challenging task due to the sources' fragile and heterogeneous natures. Digitization requires a very specialized set of scanning hardware in order to cover the sources' diversity. The central task is obtaining the maximum reproduction quality while minimizing the error rate, which is difficult to achieve due to the large amounts of image data resulting from digitization, putting huge computational loads on image processing modules, error-detection and information retrieval heuristics. As digital copies initially do not contain any information about their sources' semantics, additional efforts have to be made to extract semantic metadata. This is an error-prone, time-consuming manual process, which calls for automated mechanisms to support the user. This paper introduces a decentralized, event-driven workflow system designed to overcome the above mentioned challenges. It leverages dynamic routing between workflow components, thus being able to quickly adapt to the sources' unique requirements. It provides a scalable approach to soften out high computational loads on single units by using distributed computing and provides modules for automated image pre-/post-processing, error-detection heuristics, data mining, semantic analysis, metadata augmentation, quality assurance and an export functionality to established publishing platforms or long-term storage facilites.

References

  1. J. Armstrong and T. Helen. Making reliable distributed systems in the presence of software errors, 2003.Google ScholarGoogle Scholar
  2. H. Schöneberg. Context vector classification - term classification with context evaluation. In A. L. N. Fred and J. Filipe, editors, KDIR, pages 387--391. SciTePress, 2010.Google ScholarGoogle Scholar
  3. H. Schöneberg and F. Müller. Contextual approaches for identification of toponyms in ancient documents. In A. L. N. Fred, J. Filipe, A. L. N. Fred, and J. Filipe, editors, KDIR, pages 163--168. SciTePress, 2012.Google ScholarGoogle Scholar
  4. E. D. Tiona. Efficency and Fault-Tolerance in Workflows for Early Documents (in German). Master's thesis, University of Würzburg, to appear 2013.Google ScholarGoogle Scholar

Index Terms

  1. A scalable, distributed and dynamic workflow system for digitization processes

                      Recommendations

                      Comments

                      Login options

                      Check if you have access through your login credentials or your institution to get full access on this article.

                      Sign in
                      • Published in

                        cover image ACM Conferences
                        JCDL '13: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
                        July 2013
                        480 pages
                        ISBN:9781450320771
                        DOI:10.1145/2467696

                        Copyright © 2013 ACM

                        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                        Publisher

                        Association for Computing Machinery

                        New York, NY, United States

                        Publication History

                        • Published: 22 July 2013

                        Permissions

                        Request permissions about this article.

                        Request Permissions

                        Check for updates

                        Qualifiers

                        • research-article

                        Acceptance Rates

                        JCDL '13 Paper Acceptance Rate28of95submissions,29%Overall Acceptance Rate415of1,482submissions,28%

                      PDF Format

                      View or Download as a PDF file.

                      PDF

                      eReader

                      View online with eReader.

                      eReader