Communications - Scientific Letters of the University of Zilina 2014, 16(1):121-126 | DOI: 10.26552/com.C.2014.1.121-126
A Quality Estimation of Synthesized Speech Transmitted over IP Networks
Department of Telecommunications and Multimedia, Faculty of Electrical Engineering, University of Zilina, Slovakia
This paper presents the design of parametric models that estimate the quality of synthesized speech transmitted over IP networks. Two machine learning techniques, genetic programming and a random neural network, were used to design the models. A set of quality-affecting parameters serves as the input to the models, which estimate the quality of synthesized speech in a VoIP environment. The performance results validate both genetic programming and the random neural network as powerful techniques that deliver good accuracy and generalization ability, which makes them promising candidates for estimating the quality of this type of speech in the corresponding environment. The developed parametric models can help network operators and service providers in the planning phase or early development stage of telecommunication services based on synthesized speech.
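To make the idea of a parametric estimator concrete, the following is a minimal sketch of how a feed-forward random neural network in Gelenbe's formulation maps input parameters to a single output activity. It is not the model developed in the paper: the network size, the weight values, the fixed output firing rate, and the two inputs (imagined here as a normalized packet loss rate and a normalized codec indicator) are all hypothetical and chosen only for illustration. Each neuron's steady-state activity is computed as q_i = N_i / (r_i + D_i), where N_i and D_i are the total excitatory and inhibitory arrival rates and r_i is the firing rate.

```python
# Illustrative feed-forward random neural network (steady-state activities).
# All weights and the 2-2-1 topology are hypothetical, not taken from the paper.

def firing_rates(w_plus, w_minus):
    """Firing rate r_j = sum over i of (w+_ji + w-_ji) for each sending neuron j."""
    return [sum(p) + sum(m) for p, m in zip(w_plus, w_minus)]

def layer_activity(q_prev, w_plus, w_minus, rates, ext_exc=None, ext_inh=None):
    """Steady-state activity q_i = N_i / (r_i + D_i) for one layer."""
    n = len(rates)
    ext_exc = ext_exc or [0.0] * n
    ext_inh = ext_inh or [0.0] * n
    q = []
    for i in range(n):
        # Excitatory and inhibitory arrival rates from outside and from the previous layer.
        excit = ext_exc[i] + sum(q_prev[j] * w_plus[j][i] for j in range(len(q_prev)))
        inhib = ext_inh[i] + sum(q_prev[j] * w_minus[j][i] for j in range(len(q_prev)))
        q.append(min(excit / (rates[i] + inhib), 1.0))  # activity is capped at 1
    return q

# Hypothetical weights: rows index the sending neuron, columns the receiving neuron.
W_PLUS_IH = [[0.2, 0.6], [0.5, 0.1]]   # input -> hidden, excitatory
W_MINUS_IH = [[0.3, 0.1], [0.1, 0.4]]  # input -> hidden, inhibitory
W_PLUS_HO = [[0.7], [0.2]]             # hidden -> output, excitatory
W_MINUS_HO = [[0.1], [0.5]]            # hidden -> output, inhibitory
R_OUT = [1.0]                          # assumed fixed firing rate of the output neuron

def estimate(x):
    """Map normalized inputs in [0, 1] to the output neuron's activity."""
    r_in = firing_rates(W_PLUS_IH, W_MINUS_IH)
    q_in = layer_activity([], [], [], r_in, ext_exc=x)  # inputs arrive as excitatory rates
    r_hid = firing_rates(W_PLUS_HO, W_MINUS_HO)
    q_hid = layer_activity(q_in, W_PLUS_IH, W_MINUS_IH, r_hid)
    return layer_activity(q_hid, W_PLUS_HO, W_MINUS_HO, R_OUT)[0]

if __name__ == "__main__":
    print(estimate([0.3, 0.8]))
```

In a trained model the weights would be fitted to reference quality scores, and the output activity would be rescaled to the quality scale in use; both steps are omitted from this sketch.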
Keywords: genetic programming, random neural network, speech quality estimation, synthesized speech, packet loss, speech codec
Published: February 28, 2014
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.