Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication

Maruschke, M.; Jokisch, O.; Meszaros, M.; Trojahn, F.; Hoffmann, M.

doi:10.1007/978-3-319-43958-7_69

M. Maruschke¹⁶,
O. Jokisch¹⁶,
M. Meszaros¹⁶,
F. Trojahn¹⁶ &
…
M. Hoffmann¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9811))

Included in the following conference series:

International Conference on Speech and Computer

2305 Accesses
5 Citations

Abstract

Recent audio codecs enable high quality signals up to fullband (20 kHz) which is usually associated with the maximal audible bandwidth. Following previous studies on speech coding assessment, we survey in this novel study the music coding ability of two real-time codecs with fullband capability – the IETF standardized Opus codec as well as the 3 GPP specified EVS codec. We tested both codecs with vocal, instrumental and mixed music signals. For evaluation, we predicted human assessments using the instrumental POLQA method which has been primarily designed for speech assessment. Additionally, we performed two listening tests as a reference with a total of 21 young adults. Opus and EVS show a similar music coding performance. The quality assessment mainly depends on the specific music characteristics and on the tested bitrates from 16.4 to 64 kbit/s. The POLQA measure and the listening results are correlating, whereas the absolute ratings of the young listeners achieve much lower MOS values.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Listening to music several hours a day, using different playing techniques – high quality sound system, HD stereo headset etc.

References

3GPP: EVS Codec General Overview. TS 26.441 v12.1.0, 3rd Generation Partnership Project (3GPP), December 2014. http://www.3gpp.org/DynaReport/26441.htm
ETSI: Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); Performance characterization. TS 126952 v13.0.0, European Telecommunications Standards Institute (ETSI), January 2016. http://www.etsi.org/deliver/etsi_tr/126900_126999/126952/13.00.00_60/tr_126952v130000p.pdf
European Broadcasting Union: Sound Quality Assessment Material recordings for subjective tests, October 2008. https://tech.ebu.ch/publications/sqamcd
Frauenhofer IIS: The AAC-ELD Family For High Quality Communication Services. Technical paper, Frauenhofer IIS), December 2015
Google Scholar
Google Inc.: WebRTC, September 2014. http://www.webrtc.org/
Hoene, C., Valin, J., Vos, K., Skoglund, J.: Summary of OPUS listening test results draft-ietf-codec-results-03. Internet-Draft, January 2014. http://tools.ietf.org/html/draft-ietf-codeco-results-03
ITU-T: Low-complexity, full-band audio coding for high-quality, conversational applications. REC G.719, International Telecommunication Union (Telecommunication Standardization Sector), June 2008. http://www.itu.int/rec/T-REC-G.719-200806-I/en
ITU-T: Methods for objective and subjective assessment of speech quality (POLQA): Perceptual Objective Listening Quality Assessment. REC P.863, International Telecommunication Union (Telecommunication Standardization Sector), September 2014. http://www.itu.int/rec/T-REC-P.863-201409-I/en
ITU-T: P.Imp863: Implementer’s Guide on assessment of EVS coded speech with Recommendation ITU-TP.863, January 2016. http://www.itu.int/rec/T-REC-P.Imp863-201601-I!Oth1/en
Jokisch, O., Maruschke, M.: Audio and speech coding/transcoding in web real-time communication. In: Proceedings of International Symposium of Human Life Design, HLD 2016, Kanazawa, Japan, 26–29 March 2016 (2016). http://www.jaist.ac.jp/hld/IntlSymp2016/paper/HLD2016-COM03.pdf
Jokisch, O., Maruschke, M., Meszaros, M., Iaroshenko, V.: Audio and speech quality survey of the opus codec in web real-time communication. In: Proceedings of ESSV - 27th Conference of Electronic Signal Processing, ESSV 2016, Leipzig, Germany, 2–4 March 2016, pp. 254–262 (2016). http://www1.hft-leipzig.de/ice/essv2016/files/31%20-%20JokischMaruschke-S.254-262.pdf
Maruschke, M., Jokisch, O., Meszaros, M., Iaroshenko, V.: Review of the opus codec in a WebRTC scenario for audio and speech communication. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 348–355. Springer, Heidelberg (2015)
Chapter Google Scholar
Rämö, A., Toukomaa, H.: Subjective qualitiy evaluation of the 3Gpp EVS codec. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Brisbane, Australia, pp. 5157–5161, April 2015
Google Scholar
Valin, J., Vos, K., Terriberry, T.: Definition of the Opus Audio Codec. RFC 6716 (Proposed Standard). http://www.ietf.org/rfc/rfc6716.txt

Download references

Acknowledgments

We would like to thank SwissQual, a Rhode & Schwarz company in Zuchwil, Switzerland for supplying the POLQA tool SQuadAnalyzer – in particular Jens Berger for the elaborate discussions. Further acknowledgments go to André Schuster for supporting our experiments in the sound insulating cabinet of HfT Leipzig, Germany and to all volunteers in the listening tests.

Author information

Authors and Affiliations

Leipzig University of Telecommunications (HfTL), Leipzig, Germany
M. Maruschke, O. Jokisch, M. Meszaros, F. Trojahn & M. Hoffmann

Authors

M. Maruschke
View author publications
You can also search for this author in PubMed Google Scholar
O. Jokisch
View author publications
You can also search for this author in PubMed Google Scholar
M. Meszaros
View author publications
You can also search for this author in PubMed Google Scholar
F. Trojahn
View author publications
You can also search for this author in PubMed Google Scholar
M. Hoffmann
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Maruschke .

Editor information

Editors and Affiliations

SPIIRAS , Saint-Petersburg, Russia
Andrey Ronzhin
Moscow State Linguistic University , Moscow, Russia
Rodmonga Potapova
Budapest University of Technology and Economics, Budapest, Hungary
Géza Németh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Maruschke, M., Jokisch, O., Meszaros, M., Trojahn, F., Hoffmann, M. (2016). Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication. In: Ronzhin, A., Potapova, R., Németh, G. (eds) Speech and Computer. SPECOM 2016. Lecture Notes in Computer Science(), vol 9811. Springer, Cham. https://doi.org/10.1007/978-3-319-43958-7_69

Download citation

DOI: https://doi.org/10.1007/978-3-319-43958-7_69
Published: 13 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43957-0
Online ISBN: 978-3-319-43958-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics