Combining Word and Phonetic-Code Representations for Spoken Document Retrieval

Reyes-Barragán, Alejandro; Montes-y-Gómez, Manuel; Villaseñor-Pineda, Luis

doi:10.1007/978-3-642-19437-5_38

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6609)

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

Abstract

The traditional approach to spoken document retrieval (SDR) combines an automatic speech recognizer (ASR) with a word-based information retrieval method. This approach has shown only limited accuracy, partly because ASR systems tend to produce transcriptions of spontaneous speech with a significant word error rate. To overcome this limitation, we propose a method that uses word and phonetic-code representations in combination. The idea behind this combination is to reduce the impact of transcription errors on the processing of some (presumably complex) queries by representing words with similar pronunciations through the same phonetic code. Experimental results on the CLEF-CLSR-2007 corpus are encouraging: the proposed hybrid method improved the mean average precision and the number of retrieved relevant documents over the traditional word-based approach by 3% and 7%, respectively.
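To illustrate how a phonetic code can map similarly pronounced words to the same index term, the following is a minimal sketch. It assumes standard American Soundex as the phonetic code; the abstract does not name the code or say how the two representations are merged, so the `hybrid_terms` helper and its `sx:` prefix are hypothetical and only show one plausible way to index both views side by side.

```python
# Sketch only: Soundex is an assumed choice of phonetic code,
# not necessarily the one used by the authors.

# Standard American Soundex digit classes.
_CODES = {}
for digit, letters in enumerate(["BFPV", "CGJKQSXZ", "DT", "L", "MN", "R"], start=1):
    for ch in letters:
        _CODES[ch] = str(digit)


def soundex(word: str) -> str:
    """Return the 4-character Soundex code of `word` ("" if no ASCII letters)."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    code = letters[0]                    # the first letter is kept as-is
    prev = _CODES.get(letters[0], "")    # its digit still suppresses repeats
    for ch in letters[1:]:
        if ch in "HW":
            continue                     # H and W do not break a run of equal digits
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit                     # vowels reset prev, so repeats across vowels count
    return (code + "000")[:4]            # pad with zeros, truncate to 4 characters


def hybrid_terms(text: str) -> list[str]:
    """Hypothetical helper: words plus prefixed phonetic codes as index terms."""
    words = text.lower().split()
    codes = ["sx:" + soundex(w) for w in words if soundex(w)]
    return words + codes


print(soundex("Robert"), soundex("Rupert"))   # R163 R163
print(soundex("Smith"), soundex("Smyth"))     # S530 S530
print(hybrid_terms("Smyth arrived"))          # ['smyth', 'arrived', 'sx:S530', 'sx:A613']
```

In this sketch, a query containing "Smith" and a transcription containing "Smyth" share the term `sx:S530` even though their word forms differ, which is the kind of mismatch a word-only index would miss.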

Author information

Authors and Affiliations

Laboratory of Language Technologies, National Institute of Astrophysics, Optics and Electronics (INAOE), Luis Enrique Erro #1, Sta. María Tonantzintla, Puebla, Mexico
Alejandro Reyes-Barragán, Manuel Montes-y-Gómez & Luis Villaseñor-Pineda
Department of Computer and Information Sciences, The University of Alabama (UAB), 1300 University Boulevard, Birmingham, Alabama, USA
Manuel Montes-y-Gómez

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, Mexico
Alexander Gelbukh

Cite this paper

Reyes-Barragán, A., Montes-y-Gómez, M., Villaseñor-Pineda, L. (2011). Combining Word and Phonetic-Code Representations for Spoken Document Retrieval. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19437-5_38

DOI: https://doi.org/10.1007/978-3-642-19437-5_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19436-8
Online ISBN: 978-3-642-19437-5
eBook Packages: Computer Science, Computer Science (R0)
