skip to main content
10.1145/379437.379439acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

Integrating automatic genre analysis into digital libraries

Authors Info & Claims
Published:01 January 2001Publication History

ABSTRACT

With the number and types of documents in digital library systems incr easing, tools for automatically organizing and presenting the content have to be found. While many approaches focus on topic-based organization and structuring, hardly any system incorporates automatic structural analysis and representation. Yet, genre information (unconsciously) forms one of the most distinguishing features in conventional libraries and in information searches. In this paper we present an approach to automatically analyze the structure of documents and to integrate this information into an automatically created content-based organization. In the resulting visualization, documents on similar topics, yet representing different genres, are depicted as books in differing colors. This representation supports users intuitively in locating relevant information presented in a relevant form.

References

  1. 1.D. Biber. Variations across Speech and Writing. Cambridge University Press, UK, 1988.]]Google ScholarGoogle Scholar
  2. 2.D. Biber. A typology of english texts. Linguistics, 27:3 - 43, 1989.]]Google ScholarGoogle Scholar
  3. 3.I. Bretan, J. Dewe, A. Hallberg, N. Wolkert, and J. Karlgren. Web-specific genre visualization. In Proc of WebNet '98, Orlando, FL, November 1998. http://www.stacken.kth.se/~dewe/.]]Google ScholarGoogle Scholar
  4. 4.H. Chen, C. Schuels, and R. Orwig. Internet categorization and search: A self-organizing approach. Journal of Visual Communication and Image Representation, 7(1):88-102, 1996. http://ai.BPA.arizona.edu/papers/.]]Google ScholarGoogle ScholarCross RefCross Ref
  5. 5.H. Chernoff. The use of faces to represent points in k-dimensional space graphically. Journal American Statistical Association, 68:361-368, 1973.]]Google ScholarGoogle ScholarCross RefCross Ref
  6. 6.L. Cherra and W. Vesterman. Writing tools: The STYLE and DICTION programs. Technical Report 91, Bell Laboratories, Murray Hill, NJ, 1981. Republished as part 4.4BSD User's Supplementary Documents by O'Reilly.]]Google ScholarGoogle Scholar
  7. 7.J. Himberg. A SOM based cluster visualization and its application for false coloring. In Proc Int'l Joint Conf on Neural Networks (IJCNN 2000), Como, Italy, July 24. - 27. 2000. IEEE Computer Society.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.J. Karlgren. Stylistic experiments in information retrieval. In T. Strzalkowski, editor, Natural Language Information Retrieval. Kluwer, 1999. http://www.sics.se/~jussi/Artiklar/.]]Google ScholarGoogle ScholarCross RefCross Ref
  9. 9.J. Karlgren, I. Bretan, J. Dewe, A. Hallberg, and N. Wolkert. Iterative information retrieval using fast clustering and usage-specific genres. In Proc Eighth DELOS Workshop on User Interfaces in Digital Libraries, pages 85-92, Stockholm, Sweden, October 1998. http://www.stacken.kth.se/~dewe/.]]Google ScholarGoogle Scholar
  10. 10.J. Karlgren and D. Cutting. Recognizing text genres with simple metrics using discriminant analysis. In Proc 15. Int'l Conf on Computational Linguistics (COLING '94), Kyoto, Japan, 1994. http://www.sics.se/~jussi/Artiklar/.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.B. Kessler, G. Nunberg, and H. Schutze. Automatic detection of text genre. In Proc 8. Conf Europ. Chapter of the Association for Computational Linguistics (ACL/EACL97), pages 32-38, Madrid, Spain, 1997. http://spell.psychology.wayne.edu/~bkessler/.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.T. Kohonen. Self-organized formation of topologically correct feature maps. Biological Cybernetics, 43, 1982.]]Google ScholarGoogle Scholar
  13. 13.T. Kohonen. Self-organizing maps. Springer-Verlag, Berlin, 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.T. Kohonen, S. Kaski, K. Lagus, J. Salojarvi, J. Honkela, V. Paatero, and A. Saarela. Self-organization of a massive document collection. IEEE Transactions on Neural Networks, 11(3):574-585, May 2000. http://ieeexplore.ieee.org/.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.D. Merkl and A. Rauber. Document classification with unsupervised neural networks. In F. Crestani and G. Pasi, editors, Soft Computing in Information Retrieval, pages 102-121. Physica Verlag, 2000. http://www.ifs.tuwien.ac.at/~andi/LoP.html.]]Google ScholarGoogle ScholarCross RefCross Ref
  16. 16.A. Rauber. LabelSOM: On the labeling of self-organizing maps. In Proc Int'l Joint Conf on Neural Networks (IJCNN'99), Washington, DC, July 10 - 16. 1999. http://www.ifs.tuwien.ac.at/~andi/LoP.html.]]Google ScholarGoogle Scholar
  17. 17.A. Rauber. SOMLib: A digital library system based on neural networks. In E. Fox and N. Rowe, editors, Proc ACM Conf on Digital Libraries (ACMDL'99), pages 240-241, Berkeley, CA,August 11 - 14. 1999. ACM. http://www.acm.org/dl.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.A. Rauber and H. Bina. Visualizing electronic document repositories: Drawing books and papers in a digital library. In Advances in Visual Database Systems: Proc IFIP TC2 Working Conf on Visual Database Systems, pages95- 114, Fukuoka, Japan, May, 10.- 12. 2000. Kluwer Academic Publishers. http://www.ifs.tuwien.ac.at/~andi/LoP.html.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.A. Rauber and D. Merkl. The SOMLib Digital Library System. InProc 3.Europ. Conf on Research and Advanced Technology for Digital Libraries (ECDL99), LNCS 1696, pages 323-342, Paris, France, September 22. - 24. 1999. Springer. http://www.ifs.tuwien.ac.at/~andi/LoP.html.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20.A. Rauber, M. Dittenbach, and D. Merkl. Automatically detecting and organizing documents into topic hierarchies: A neural-network based approach to bookshelf creation and arrangement. In Proc 4. Europ. Conf on Research and Advanced Technologies for Digital Libraries (ECDL2000), LNCS 1923, pages 348-351, Lisboa, Portugal, September 18. - 20. 2000. Springer. http://www.ifs.tuwien.ac.at/~andi/LoP.html.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.K. Ries. Towards the detection and description of textual meaning indicators in spontaneous conversations. In Proc Europ. Conf on Speech Communication and Technology (EUROSPEECH99), Budapest, Hungary, September 5-9 1999.]]Google ScholarGoogle Scholar
  22. 22.G. Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Reading, MA, 1989.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Integrating automatic genre analysis into digital libraries

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        JCDL '01: Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
        January 2001
        481 pages
        ISBN:1581133456
        DOI:10.1145/379437

        Copyright © 2001 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 January 2001

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        JCDL '01 Paper Acceptance Rate76of250submissions,30%Overall Acceptance Rate415of1,482submissions,28%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader