Abstract
Recent audio codecs enable high quality signals up to fullband (20 kHz) which is usually associated with the maximal audible bandwidth. Following previous studies on speech coding assessment, we survey in this novel study the music coding ability of two real-time codecs with fullband capability – the IETF standardized Opus codec as well as the 3 GPP specified EVS codec. We tested both codecs with vocal, instrumental and mixed music signals. For evaluation, we predicted human assessments using the instrumental POLQA method which has been primarily designed for speech assessment. Additionally, we performed two listening tests as a reference with a total of 21 young adults. Opus and EVS show a similar music coding performance. The quality assessment mainly depends on the specific music characteristics and on the tested bitrates from 16.4 to 64 kbit/s. The POLQA measure and the listening results are correlating, whereas the absolute ratings of the young listeners achieve much lower MOS values.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Listening to music several hours a day, using different playing techniques – high quality sound system, HD stereo headset etc.
References
3GPP: EVS Codec General Overview. TS 26.441 v12.1.0, 3rd Generation Partnership Project (3GPP), December 2014. http://www.3gpp.org/DynaReport/26441.htm
ETSI: Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); Performance characterization. TS 126952 v13.0.0, European Telecommunications Standards Institute (ETSI), January 2016. http://www.etsi.org/deliver/etsi_tr/126900_126999/126952/13.00.00_60/tr_126952v130000p.pdf
European Broadcasting Union: Sound Quality Assessment Material recordings for subjective tests, October 2008. https://tech.ebu.ch/publications/sqamcd
Frauenhofer IIS: The AAC-ELD Family For High Quality Communication Services. Technical paper, Frauenhofer IIS), December 2015
Google Inc.: WebRTC, September 2014. http://www.webrtc.org/
Hoene, C., Valin, J., Vos, K., Skoglund, J.: Summary of OPUS listening test results draft-ietf-codec-results-03. Internet-Draft, January 2014. http://tools.ietf.org/html/draft-ietf-codeco-results-03
ITU-T: Low-complexity, full-band audio coding for high-quality, conversational applications. REC G.719, International Telecommunication Union (Telecommunication Standardization Sector), June 2008. http://www.itu.int/rec/T-REC-G.719-200806-I/en
ITU-T: Methods for objective and subjective assessment of speech quality (POLQA): Perceptual Objective Listening Quality Assessment. REC P.863, International Telecommunication Union (Telecommunication Standardization Sector), September 2014. http://www.itu.int/rec/T-REC-P.863-201409-I/en
ITU-T: P.Imp863: Implementer’s Guide on assessment of EVS coded speech with Recommendation ITU-TP.863, January 2016. http://www.itu.int/rec/T-REC-P.Imp863-201601-I!Oth1/en
Jokisch, O., Maruschke, M.: Audio and speech coding/transcoding in web real-time communication. In: Proceedings of International Symposium of Human Life Design, HLD 2016, Kanazawa, Japan, 26–29 March 2016 (2016). http://www.jaist.ac.jp/hld/IntlSymp2016/paper/HLD2016-COM03.pdf
Jokisch, O., Maruschke, M., Meszaros, M., Iaroshenko, V.: Audio and speech quality survey of the opus codec in web real-time communication. In: Proceedings of ESSV - 27th Conference of Electronic Signal Processing, ESSV 2016, Leipzig, Germany, 2–4 March 2016, pp. 254–262 (2016). http://www1.hft-leipzig.de/ice/essv2016/files/31%20-%20JokischMaruschke-S.254-262.pdf
Maruschke, M., Jokisch, O., Meszaros, M., Iaroshenko, V.: Review of the opus codec in a WebRTC scenario for audio and speech communication. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 348–355. Springer, Heidelberg (2015)
Rämö, A., Toukomaa, H.: Subjective qualitiy evaluation of the 3Gpp EVS codec. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Brisbane, Australia, pp. 5157–5161, April 2015
Valin, J., Vos, K., Terriberry, T.: Definition of the Opus Audio Codec. RFC 6716 (Proposed Standard). http://www.ietf.org/rfc/rfc6716.txt
Acknowledgments
We would like to thank SwissQual, a Rhode & Schwarz company in Zuchwil, Switzerland for supplying the POLQA tool SQuadAnalyzer – in particular Jens Berger for the elaborate discussions. Further acknowledgments go to André Schuster for supporting our experiments in the sound insulating cabinet of HfT Leipzig, Germany and to all volunteers in the listening tests.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Maruschke, M., Jokisch, O., Meszaros, M., Trojahn, F., Hoffmann, M. (2016). Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication. In: Ronzhin, A., Potapova, R., Németh, G. (eds) Speech and Computer. SPECOM 2016. Lecture Notes in Computer Science(), vol 9811. Springer, Cham. https://doi.org/10.1007/978-3-319-43958-7_69
Download citation
DOI: https://doi.org/10.1007/978-3-319-43958-7_69
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43957-0
Online ISBN: 978-3-319-43958-7
eBook Packages: Computer ScienceComputer Science (R0)