Skip to main content

Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9811))

Included in the following conference series:

Abstract

Recent audio codecs enable high quality signals up to fullband (20 kHz) which is usually associated with the maximal audible bandwidth. Following previous studies on speech coding assessment, we survey in this novel study the music coding ability of two real-time codecs with fullband capability – the IETF standardized Opus codec as well as the 3 GPP specified EVS codec. We tested both codecs with vocal, instrumental and mixed music signals. For evaluation, we predicted human assessments using the instrumental POLQA method which has been primarily designed for speech assessment. Additionally, we performed two listening tests as a reference with a total of 21 young adults. Opus and EVS show a similar music coding performance. The quality assessment mainly depends on the specific music characteristics and on the tested bitrates from 16.4 to 64 kbit/s. The POLQA measure and the listening results are correlating, whereas the absolute ratings of the young listeners achieve much lower MOS values.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Listening to music several hours a day, using different playing techniques – high quality sound system, HD stereo headset etc.

References

  1. 3GPP: EVS Codec General Overview. TS 26.441 v12.1.0, 3rd Generation Partnership Project (3GPP), December 2014. http://www.3gpp.org/DynaReport/26441.htm

  2. ETSI: Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); Performance characterization. TS 126952 v13.0.0, European Telecommunications Standards Institute (ETSI), January 2016. http://www.etsi.org/deliver/etsi_tr/126900_126999/126952/13.00.00_60/tr_126952v130000p.pdf

  3. European Broadcasting Union: Sound Quality Assessment Material recordings for subjective tests, October 2008. https://tech.ebu.ch/publications/sqamcd

  4. Frauenhofer IIS: The AAC-ELD Family For High Quality Communication Services. Technical paper, Frauenhofer IIS), December 2015

    Google Scholar 

  5. Google Inc.: WebRTC, September 2014. http://www.webrtc.org/

  6. Hoene, C., Valin, J., Vos, K., Skoglund, J.: Summary of OPUS listening test results draft-ietf-codec-results-03. Internet-Draft, January 2014. http://tools.ietf.org/html/draft-ietf-codeco-results-03

  7. ITU-T: Low-complexity, full-band audio coding for high-quality, conversational applications. REC G.719, International Telecommunication Union (Telecommunication Standardization Sector), June 2008. http://www.itu.int/rec/T-REC-G.719-200806-I/en

  8. ITU-T: Methods for objective and subjective assessment of speech quality (POLQA): Perceptual Objective Listening Quality Assessment. REC P.863, International Telecommunication Union (Telecommunication Standardization Sector), September 2014. http://www.itu.int/rec/T-REC-P.863-201409-I/en

  9. ITU-T: P.Imp863: Implementer’s Guide on assessment of EVS coded speech with Recommendation ITU-TP.863, January 2016. http://www.itu.int/rec/T-REC-P.Imp863-201601-I!Oth1/en

  10. Jokisch, O., Maruschke, M.: Audio and speech coding/transcoding in web real-time communication. In: Proceedings of International Symposium of Human Life Design, HLD 2016, Kanazawa, Japan, 26–29 March 2016 (2016). http://www.jaist.ac.jp/hld/IntlSymp2016/paper/HLD2016-COM03.pdf

  11. Jokisch, O., Maruschke, M., Meszaros, M., Iaroshenko, V.: Audio and speech quality survey of the opus codec in web real-time communication. In: Proceedings of ESSV - 27th Conference of Electronic Signal Processing, ESSV 2016, Leipzig, Germany, 2–4 March 2016, pp. 254–262 (2016). http://www1.hft-leipzig.de/ice/essv2016/files/31%20-%20JokischMaruschke-S.254-262.pdf

  12. Maruschke, M., Jokisch, O., Meszaros, M., Iaroshenko, V.: Review of the opus codec in a WebRTC scenario for audio and speech communication. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 348–355. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  13. Rämö, A., Toukomaa, H.: Subjective qualitiy evaluation of the 3Gpp EVS codec. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Brisbane, Australia, pp. 5157–5161, April 2015

    Google Scholar 

  14. Valin, J., Vos, K., Terriberry, T.: Definition of the Opus Audio Codec. RFC 6716 (Proposed Standard). http://www.ietf.org/rfc/rfc6716.txt

Download references

Acknowledgments

We would like to thank SwissQual, a Rhode & Schwarz company in Zuchwil, Switzerland for supplying the POLQA tool SQuadAnalyzer – in particular Jens Berger for the elaborate discussions. Further acknowledgments go to André Schuster for supporting our experiments in the sound insulating cabinet of HfT Leipzig, Germany and to all volunteers in the listening tests.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. Maruschke .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Maruschke, M., Jokisch, O., Meszaros, M., Trojahn, F., Hoffmann, M. (2016). Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication. In: Ronzhin, A., Potapova, R., Németh, G. (eds) Speech and Computer. SPECOM 2016. Lecture Notes in Computer Science(), vol 9811. Springer, Cham. https://doi.org/10.1007/978-3-319-43958-7_69

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-43958-7_69

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-43957-0

  • Online ISBN: 978-3-319-43958-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics