Communications - Scientific Letters of the University of Zilina 2012, 14(2):63-69 | DOI: 10.26552/com.C.2012.2.63-69
Speech Coder Identification Using Chaotic Features Based on Steganalyzer Models
- 1 Electronics Engineering Department, Faculty of Engineering, Ankara University, 06100, Besevler, Ankara, Turkey
In this study a steganalyzer model based on chaotic features is adapted to codec identification problem. As conventional steganalyzers detect suspected packages buried in bitstreams using statistical analysis, the proposed "speech coder identification" model also reveals the statistical differences of the outputs produced by different speech codecs. Essentially, the design of a coder/codec identifier is equivalent to a classifier training where the chaotic features extracted from the suspicious bitstream samples are used as inputs. During training, weight values which dissociate the codec types best are investigated. Finally, performances of the trained classifiers are evaluated by using test sets. According to the test results, for a closed group which is formed from nine different output types, polynomial SVM classifiers have identified more than 97% of the samples correctly.
Keywords: coder identification; codec recognition; suspicious bit stream
Published: June 30, 2012 Show citation
ACS | AIP | APA | ASA | Harvard | Chicago | Chicago Notes | IEEE | ISO690 | MLA | NLM | Turabian | Vancouver |
References
- KOCAL, O. H., YURUKLU, E., AVCIBAS, I.: Speech Steganalysis Using Chaotic-type Features, IEEE Transactions on Information Forensics and Security, 2008, pp. 651-661.
Go to original source...
- KENNEL, M. B., ABARBANEL, H. D. I.: False Neighbors and False Strands: A Reliable Minimum Embedding Dimension Algorithm, Physical Review E (66), 2002, 026209.
Go to original source...
- MARTINEZ, F., GUILLAMON, A., ALCARAZ, J. C., ALCARAZ, M. C.: Detection of Chaotic Behaviour in Speech Signals Using the Largest Lyapunov Exponent, 14th Int'n Conference on Digital Signal Processing (1) (2002) 317-320.
- 3GPP, TS 26.090: Adaptive Multi-Rate (AMR) Speech Transcoding, version 4.0.0, 3rd Generation Partnership Project, 3GPP, 2001-03.
- ITU-T, G.726: 40, 32, 24, 16 kbit/s Adaptive Differential Pulse Code Modulation (ADPCM)
- ITU-T, G.729.1: G.729 based Embedded Variable bit-rate coder: An 8-32 kbit/s scalable wideband coder bitstream interoperable with G.729.
- ROJAS, R.: Neural Networks - A Systematic Introduction, Springer, 1996.
Go to original source...
- BURGES, C. J. C.: A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery (2), 1998, pp. 121-167.
Go to original source...
- HAYKIN, S.: Neural Networks-A Comprehensive Foundation, second ed., Prentice-Hall, New York, 1999.
- JANKOWSKI, C., KALYANSWAMY, A., BASSON, S., SPITZ, J.: NTIMIT: A Phonetically Balanced, Continuous Speech, Telephone Bandwidth Speech Database, ICASSP-90 Intern. Conference on Acoustics, Speech, and Signal Processing (1), 1990, pp. 109-112.
- HEGGER, R., KANTZ, H., SCHREIBER, T.: TISEAN: Nonlinear Time Series Analysis, http://www.mpipks-dresden.mpg.de/~tisean/ (accessed December 2009).
- ETSI, EN 300 961: Digital Cellular Telecommunications System (Phase 2+) (GSM) Full rate speech transcoding, GSM 06.10 version 8.1.1, 1999.
- US DoD, MIL-STD-3005, Department of Defense Telecommunications Systems Standard, 1999.
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.