Communications - Scientific Letters of the University of Zilina 2016, 18(1):3-10 | DOI: 10.26552/com.C.2016.1.3-10
Monitoring Voip Speech Quality for Chopped and Clipped Speech
- 1 Dublin Institute of Technology, Ireland and Trinity College Dublin, Ireland
- 2 Google, Inc., Mountain View, CA, USA
- 3 Trinity College Dublin, Ireland
Real-time monitoring of speech quality for VoIP calls is a significant challenge. This paper presents early work on a no-reference objective model for quantifying perceived speech quality in VoIP. The overall approach uses a modular design that will be able to help pinpoint the reason for degradations as well as quantifying their impact on speech quality. The model is being designed to work with narrowband and wideband signals. This initial work is focused on rating amplitude clipped or chopped speech, which are common problems in VoIP. A model sensitive to each of these degradations is presented and then tested with both synthetic and real examples of chopped and clipped speech. The results were compared with predicted MOS outputs from four objective speech quality models: ViSQOL, PESQ, POLQA and P.563. The model output showed consistent relationships between this model's clip and chop detection modules and the quality predictions from the other objective speech quality models. Further work is planned to widen the range of degradation types captured by the model, such as non-stationary background noise and speaker echo. While other components (e.g. a voice activity detector) would be necessary to deploy the model for stand-alone VoIP monitoring, the results show good potential for using the model in a realtime monitoring tool.
Keywords: speech quality model; clip; chop; VoIP
Published: February 29, 2016 Show citation
| ACS | AIP | APA | ASA | Harvard | Chicago | Chicago Notes | IEEE | ISO690 | MLA | NLM | Turabian | Vancouver |
References
- MOLLER, S., CHAN, W-Y., COTE, N., FALK, T. H., RAAKE, A., WALTERMANN, M.: Speech Quality Estimation: Models and Trends, Signal Processing Magazine, IEEE, vol. 28, No. 6, pp. 18-28, 2011.
Go to original source... - ITU: Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs, Int. Telecomm. Union, Geneva, CH, ITU-T Rec. P.862, 2001.
- ITU: Perceptual Objective Listening Quality Assessment, Int. Telecomm. Union, Geneva, CH, ITU-T Rec. P.863, 2011.
- ITU: Single-ended Method for Objective Speech Quality Assessment in Narrow-band telephony Applications, Int. Telecomm. Union, Geneva, CH, ITU-T Rec. P.563, 2004.
- GRANCHAROV, V., ZHAO, D. Y., LINDBLOM, J., KLEIJN, W. B.: Low-complexity, Nonintrusive Speech Quality Assessment, IEEE Audio, Speech, Language Process., vol. 14, No. 6, pp. 1948-1956, 2006.
Go to original source... - ANSI ATIS: 0100005-2006: Auditory Non-intrusive Quality Estimation Plus (ANIQUE+): Perceptual Model for Non-intrusive Estimation of Narrowband Speech Quality, American National Standards Institute, 2006.
- FALK, T. H., CHAN, W-Y.: Single-ended Speech Quality Measurement Using Achine Learning methods, IEEE Audio, Speech, Language Process., vol. 14, No. 6, pp. 1935-1947, 2006.
Go to original source... - STELMACHOWICZ, P. G., LEWIS, D. E., HOOVER, B., KEEFE, D H.: Subjective Effects of Peak Clipping and Compression Limiting in Normal and Hearing-impaired Children and Adults, J. Acoust Soc Am, vol. 105, pp. 412, 1999.
Go to original source... - LICKLIDER, J. C.: Effects of Amplitude Distortion Upon the Intelligibility of Speech, J. Acoust Soc Am, vol. 18, No. 2, pp. 429-434, 1946.
Go to original source... - BENESTY, J., SONDHI, M. M., HUANG, Y. A. : Springer Handbook of Speech Processing, Springer, 2007.
Go to original source... - GOOGLE: WebRTC NetEQ Overview, http://www.webrtc.org/reference/architecture#TOC-NetEQ-for-Voice.
- RAAKE, A.: Speech Quality of VoIP - Assessment and Prediction, Wiley, 2006.
Go to original source... - HINES, A., SKOGLUND, J., HARTE, N., KOKARAM, A. C.: Detection of Chopped Speech, Patent, US20150199979 A1, 07 2015.
Go to original source... - IEEE: IEEE Recommended Practice for Speech Quality Measurements, Audio and Electroacoustics, IEEE Transactions on, vol. 17, No. 3, pp. 225-246, Sep 1969.
Go to original source... - HINES, A., SKOGLUND, J., KOKARAM, A. C., HARTE, N.: VISQOL: An Objective Speech Quality Model, EURASIP J. on Audio, Speech, and Music Processing, vol. 2015:13, May 2015.
Go to original source... - HINES, A, SKOGLUND, J, KOKARAM, A. C., HARTE, N.: VISQOL: The Virtual Speech Quality Objective Listener, IWAENC, 2012.
- HINES, A, SKOGLUND, J, KOKARAM, A. C., HARTE, N.: Robustness of Speech Quality Metrics to Background Noise and Network Degradations: Comparing ViSQOL, PESQ and POLQA, Acoustics, Speech, and Signal Processing, IEEE Intern. Conference on (ICASSP '13), 2013.
Go to original source...
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.

