DOI: 10.1145/2766462.2767791
short-paper

A Test Collection for Spoken Gujarati Queries

Published: 09 August 2015

ABSTRACT

The development of a new test collection is described in which the task is to search naturally occurring spoken content using naturally occurring spoken queries. To support research on speech retrieval for low-resource settings, the collection includes terms learned by zero-resource term discovery techniques. Use of a new tool designed for exploration of spoken collections provides some additional insight into characteristics of the collection.

    Published in

      SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
      August 2015
      1198 pages
      ISBN: 9781450336215
      DOI: 10.1145/2766462

      Copyright © 2015 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      SIGIR '15 Paper Acceptance Rate: 70 of 351 submissions, 20%
      Overall Acceptance Rate: 792 of 3,983 submissions, 20%