Abstract
When using a loudspeaker triplet for virtual sound localization, the traditional conversion method will result in inaccurate localization. In this paper, we constructed a perceptual localization distortion model based on the basic principle of binaural perception sound source localization and relying on the known PKU HRTFs database. On this basic, the perceptual localization errors of virtual sources were calculated by using PKU HRTFs. After analyzing the perceptual localization errors of virtual sources reproduced by loudspeaker triplets, it was found that the main influence factor, i.e., the convergence angle of the loudspeaker triplet, could constrain the perceptual localization distortion. Simulation and subjective evaluation experiments indicate that the proposed selection method outperforms the traditional method, and that the proposed method can be successfully applied to perceptual localization of the moving virtual source.
Supported by National Nature Science Foundation of China (No. 61701194, U1736206, 61762005) and Nature Science Foundation of Hubei Province (No. 2017CFB756).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hamasaki, K., Matsui, K., Sawaya, I., Okubo, H.: The 22.2 multichannel sounds and its reproduction at home and personal enviroment. In: Audio Engineering Society Conference: 43rd International Conference: Audio for Wirelessly Networked Personal Devices. Audio Engineering Society (2011)
Sawaya, I., Oode, S., Ando, A., Hamasaki, K.: Size and shape of listening area reproduced by three-dimensional multichannel sound system with various numbers of loudspeakers. In: Audio Engineering Society Convention 131. Audio Engineering Society (2011)
Lipshitz, S.P.: Stereo microphone techniques: are the purists wrong? J. Audio Eng. Soc. 34(9), 716–744 (1986)
Recommendation ITU-R BS.775-2: Multichannel stereophonic sound system with and without accompanying picture, International Telecommunications Union, Geneva (2010)
Kirkeby, O., Nelson, P.A.: Reproduction of plane wave sound fields. J. Acoust. Soc. Am. 94(5), 2992–3000 (1993)
Comminiello, D., et al.: Intelligent acoustic interfaces with multisensor acquisition for immersive reproduction. IEEE Trans. Multimed. 17(8), 1262–1272 (2015)
Gerzon, M.A.: Ambisonics in multichannel broadcasting and video. J. Audio Eng. Soc. 33(11), 859–871 (1985)
Ward, D.B., Abhayapala, T.D.: Reproduction of a plane-wave sound field using an array of loudspeaker. IEEE Trans. Audio Speech Lang. Process. 9(6), 697–707 (2001)
Blauert, J.: Spatial Hearing. MIT Press, Cambridge (1983)
Recommendation ITU-R BS.1534-2: Method for the subjective assessment of intermediate quality level of coding systems (MUSHRA), International Telecommunications Union, Geneva, Switzerland (2014)
Clark, H.A.M., Dutton, C.F., Vanderlyn, P.B.: The “Stereosonic” recording and production system. IRE Trans. Audio 5(4), 96–111 (1957)
Pulkki, V.: Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc. 55(6), 503–516 (2007)
Ando, A.: Conversion of multichannel sound signal maintaining physical properties of sound in reproduced sound field. IEEE Trans. Audio Speech Lang. Process. 19(6), 1467–1475 (2011)
Recommendation, ITU-R BS. 1284–2: General Methods for the Subjective Assessment of Sound Quality. International Telecommunications Union (2002)
Ramos, A., Tommasini, F.: Magnitude modelling of HRTF using principal component analysis applied to complex values. Arch. Acoust. 39(4), 477–482 (2014)
Williams, E.G.: Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography. Academic Press, London (1999)
Li, D., Hu, R., Wang, X., Tu, W.: Loudspeaker triplet selection based on low distortion within head for multichannel conversion of smart 3D home theater. Concurrency Computat Pract Exper. (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Guan, D., Li, D., Cai, X., Wang, X., Hu, R. (2020). Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet. In: Ro, Y., et al. MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science(), vol 11962. Springer, Cham. https://doi.org/10.1007/978-3-030-37734-2_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-37734-2_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37733-5
Online ISBN: 978-3-030-37734-2
eBook Packages: Computer ScienceComputer Science (R0)