skip to main content
10.1145/1559845.1559999acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
demonstration

Enabling enterprise mashups over unstructured text feeds with InfoSphere MashupHub and SystemT

Published:29 June 2009Publication History

ABSTRACT

Enterprise mashup scenarios often involve feeds derived from data created primarily for eye consumption, such as email, news, calendars, blogs, and web feeds. These data sources can test the capabilities of current data mashup products, as the attributes needed to perform join, aggregation, and other operations are often buried within unstructured feed text. Information extraction technology is a key enabler in such scenarios, using annotators to convert unstructured text into structured information that can facilitate mashup operations.

Our demo presents the integration of SystemT, an information extraction system from IBM Research, with IBM's InfoSphere MashupHub. We show how to build domain-specific annotators with SystemT's declarative rule language, AQL, and how to use these annotators to combine structured and unstructured information in an enterprise mashup.

References

  1. A. Jhingran, "Enterprise Information Mashups: Integrating Information, Simply", VLDB 2006: 3--4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. IBM Infosphere MashupHub, http://www-01.ibm.com/software/data/info20/how-it-works.htmlGoogle ScholarGoogle Scholar
  3. Simmen, D., Altinel, M., Markl, V., Padmanaban S., Singh, A. Damia: Data Mashups for Intranet Applications. Sigmod 2008 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Calais, http://www.opencalais.comGoogle ScholarGoogle Scholar
  5. Reiss, F., Raghavan, S., Krishnamurthy, R., Zhu, H.,Vaithyanathan, S.: An Algebraic Approach to Rule-Based Information Extraction. ICDE 2008 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. SystemT, http://www.alphaworks.ibm.com/tech/systemt/Google ScholarGoogle Scholar
  7. R. Krishnamurthy et al., "SystemT: A System for Declarative Information Extraction", to appear, SIGMOD Record. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Enabling enterprise mashups over unstructured text feeds with InfoSphere MashupHub and SystemT

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '09: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
      June 2009
      1168 pages
      ISBN:9781605585512
      DOI:10.1145/1559845

      Copyright © 2009 Copyright is held by the owner/author(s)

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 June 2009

      Check for updates

      Qualifiers

      • demonstration

      Acceptance Rates

      Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader