Abstract
Audio with good quality is the essential fundament for all multi-media services. The transmission of audio signals relies on efficient encoding and decoding algorithms (codecs) that enable the reduction of the required channel capacity, but still provide an excellent audio quality, even when transmission errors occur. The most succesfull audio codecs are mp2, mp3, aac and ac3. The codecs employ sophisticated signal processing algorithms imitating properties of hearing. The processing may cause specific artifacts such as high frequency loss, narrow-band noise and pre-echoes. The final quality needs to be verified with statistically valid listening tests. Detailed procedures for conducting reliable speech and audio tests are defined in ITU Recommendations P.800, BS.1116, and BS.1534. Instrumental measurement methods such as BS.1387 replicate subjective tests allowing the estimation of the perceived quality. The ITU Recommendation P.1201 is a recently standardized method for estimating the audio quality of a transmitted signal without the need to have a reference signal available.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
EBU Document BPN 019 (1998) Report on the EBU subjective listening tests of multichannel audio codecs
EBU Document Tech3296 (2003) EBU subjective listening test on low-bitrate audio codecs
3GPP TR 26.936 (2008) Performance characterization of 3GPP audio codecs. www.3GPP.org
Bech S, Zacharov N (2006) Perceptual audio evaluation. Theory, method and application. Wiley, New York
Bosi M, Brandenburg K, Quackenbush S, Fielder L, Akagiri K, Fuchs H, Dietz M, Herre J, Davidson G, Oikawa (1996) MPEG-2 advanced audio coding. In: Proceedings of the 101st Audio Engineering Society (AES) convention
Brandenburg K, Stoll G, Dehéry YF, Johnston JD, Kerkhof Lvd, Schroeder EF (1992) The ISO/MPEG-audio codec: a generic standard for coding of high quality digital audio. In: Proceedings of the 92th Audio Engineering Society (AES) convention
Broom S (2006) VoIP: quality assessment: taking account of the edge-device. IEEE Trans ASLP 14(6):1977–1983
Campbell D, Jones E, Glavin M (2009) Audio quality assessment—a review, and recent developments. Signal Process 89:1489–1500
Clark A (2001) Modeling the effects of burst packet loss and recency on subjective voice quality internet telephony workshop (IPtel)
Clark A (2003) ITU-T Delayed Contribution COM12-D105: Description of VQMON algorithm
Egi N, Hayashi T, Takahashi A (2010) Parametric packet-layer model for evaluation audio quality in multimedia streaming services. IEICE Trans Commun E93.B:1359–1366
Erne M (2001) Perceptual audio coders: what to listen for. In: Proceedings of the 111th Audio Engineering Society (AES) convention
Feiten B (1997) Measuring the coding margin of perceptual codecs with the difference signal. In: Proceedings of the 102nd Audio Engineering Society (AES) convention
Feiten B, Raake A, Garcia MN, Wüstenhagen U, Kroll J (2009) Subjective quality evaluation of audio streaming applications on absolute and paired rating scales. In: Proceedings of the 126th Audio Engineering Society (AES) convention
Gabrielsson A, Sjogren H (1979) Perceived sound quality of sound-reproduction systems. J Acoust Soc Am 65(4):1019–1033
Garcia MN, Raake A, Feiten B (2013) Parametric audio quality model for IPTV services—ITU-T P.1201.2 audio. In: Proceedings international workshop on Quality of Multimedia Experience (QoMEX)
Graubner M et al (2010) QoE assessment for audio contribution over IP (ACIP). In: Proceedings of the 38th AES international conference on sound quality evaluation
Herre J (2007) Temporal noise shaping, quantization and coding methods in perceptual audio coding: a tutorial introduction. In: Proceedings of the AES 17th international conference on high quality audio coding, pp 1–14
Herre J, Dietz M (2008) Standards in a nutshell: MPEG-4 high-efficiency AAC coding. IEEE Signal Process Mag 25:137–142
Herre J et al (2008) MPEG surround—the ISO/MPEG standard for efficient and compatible multichannel audio coding. J AES 56:932–955
Horbach U, Boone MM (1999) Future transmission and rendering formats for multichannel sound. In: Proceedings of the AES 16th international conference on spatial sound, reproduction, pp 409–418
ISO/IEC 11172–3 (1993) Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s—part 3: audio
ISO/IEC 13818–3 (1995) Generic coding of moving pictures and associated audio: audio
ISO/IEC 13818–7 (2006) Generic coding of moving pictures and associated audio: advanced audio coding
ISO/IEC 14496–3 (2006) Information technology—coding of audio-visual objects—part 3: audio
ITU-R Rec. BS.1116-1 (1994–1997) Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems
ITU-R Rec. BS.1284 (1997–2003) General methods for the subjective assessment of sound quality
ITU-R Rec. BS.1286 (1997) Methods for the subjective assessment of audio systems with accompanying picture
ITU-R Rec. BS.1534-1 (2001–2003) Method for the subjective assessment of intermediate quality levels of coding systems
ITU-T BS.1387 (2001) Method for objective measurements of perceived audio quality
ITU-T GSTP-GVBR (2010) Performance of ITU-T G.718. Series G: transmission systems and media, digital systems and networks
ITU-T Recommendation P.1201 (2012) Parametric non-intrusive assessment of audiovisual media streaming quality
ITU-T Recommendation P.1201.1 (2012) Parametric non-intrusive assessment of audiovisual media streaming quality—lower resolution application area
ITU-T Recommendation P.1201.2 (2012) Parametric non-intrusive assessment of audiovisual media streaming quality—higher resolution application area
ITU-T Recommendation G.107 (2011) The E-model: a computational model for use in transmission planning
ITU-T Recommendation P.800 (1996) Methods for subjective determination of transmission quality
Liu C-M, Hsu H-W, Lee W-C (2008) Compression artifacts in perceptual audio coding. IEEE Trans Audio Speech Lang Process (ASLP) 16(4):681–695
Lutzky M et al (2004) A guideline to audio codec delay. In: Proceedings 116th Audio Engineering Society (AES) convention, Berlin
Mattila VV (2002) Descriptive analysis and ideal point modeling of speech quality in mobile communications. In: Proceedings of the 113th audio engineering society (AES) convention, USALos Angeles
Mattila VV (2002) Ideal point modeling of speech quality in mobile communications based on multidimensional scaling. In: Proceedings of the 112th audio engineering society (AES) convention, DMunich
Moller H (1992) Fundamentals of binaural technology. Appl Acoust 36:171–218
Möller S, Chan WY, Côté N, Falk TH, Raake A, Wältermann M (2011) Speech quality estimation, IEEE Signal Process Mag
Myakotnykh ES, Svensson UP (2010) Computational quality model for IP-based audio. In: Proceedings of the 38th AES international conference on sound quality, evaluation
Neuendorf M et al (2009) Unified speech and audio coding scheme for high quality at low bitrates.: In: Proceedings IEEE International Conference on Audio Speech and Signal Processing (ICASSP)
Painter T, Spanias A (2000) Perceptual coding of digital audio. Proc IEEE 88(4):451–515
Perkins C, Hodson O, Hardman V (1998) A survey of packet loss recovery techniques for streaming audio. IEEE Netw 12(5):40–48
Raake A (2006) Short- and long-term packet loss behaviour: towards speech quality prediction for arbitrary loss distributions, IEEE Trans ASLP 14(6):1957–1968
Raake A, Wältermann M, Wüstenhagen U, Feiten B (2012) How to talk about speech and audio quality with speech and audio people? J Audio Eng Soc 60(3):147–155
Raake A, Blauert J (2013) Comprehensive modeling of the formation process of sound-quality. In: Proceedings international workshop on Quality of Multimedia Experience (QoMEX), Klagenfurt, Austria
Reichl P, Egger S, Schatz R, D’Alconzo A (2010) The logarithmic nature of QoE and the role of the Weber-Fechner Law in QoE assessment. In: Proceedings IEEE International Conference on Communications (ICC)
Rix AW, Beerends JG, Kim D-S (2006) Objective assessment of speech and audio quality—technology and applications. IEEE Trans ASLP 14(6):1890–1901
Rumsey F (2002) Spatial quality evaluation for reproduced sound: terminology, meaning, and a scene-based paradigm. J Audio Eng Soc 50(9):651–666
Sackl A, Egger S, Schatz R (2013) Where’s the music? Comparing the QoE impact of temporal impairments between music and video streaming. In: Proceedings international workshop on Quality of Multimedia Experience (QoMEX)
Schobben D, van de Par S (2004) The effect of room acoustics on MP3 audio quality evaluation. In: Proceedings of the 117th audio engineering society (AES) convention, USASan Francisco, 28–31 Oct 2004
Schuller G, Yu B (2002) Perceptual audio coding using adaptive pre and post-filters and lossless compression. IEEE Trans Speech Audio Process 10(6):379–390
Smirnoff S (2005) Difference level. An objective audio parameter. In: 118th AES-convention
Spors S, Wierstorf H, Raake A, Melchior F, Frank M, Zotter F (2013) Spatial sound with loudspeakers and its perception: a review of the current state. Proc IEEE 101(9):1920–1938
Thiede T, Treurniet WC, Bitto R, Schmidmer C, Sporer T, Beerends JG, Colomes C, Keyhl M, Stoll G, Brandenburg K, Feiten B (2000) PEAQ—the ITU standard for objective measurement of perceived audio quality. J Audio Eng Soc (AES) 48(1/2):3–29
Toole F (2008) Sound reproduction: the acoustics and psychoacoustics of loudspeakers and rooms. Focal Press
Website sound expert. http://soundexpert.org
Wüstenhagen U, Feiten B, Hoeg W (1998) Subjective listening test of multichannel audio codecs. AES Conv 105:P4813
Zielinski S, Rumsey F, Bech S (2008) On some biases encountered in modern audio quality listening tests—a review. J Audio Eng Soc 56(6):427–451
Zwicker E, Fastl H (1999) Psychoacoustics. Facts and models, 2nd edn. Springer, Berlin
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Feiten, B., Garcia, MN., Svensson, P., Raake, A. (2014). Audio Transmission. In: Möller, S., Raake, A. (eds) Quality of Experience. T-Labs Series in Telecommunication Services. Springer, Cham. https://doi.org/10.1007/978-3-319-02681-7_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-02681-7_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02680-0
Online ISBN: 978-3-319-02681-7
eBook Packages: EngineeringEngineering (R0)