Communications - Scientific Letters of the University of Zilina 2013, 15(2):13-18 | DOI: 10.26552/com.C.2013.2.13-18
Fuzzy Logic Networks for Speech Recognition
- 1 Department of Info-Com Networks, Faculty of Management Science and Informatics, University of Zilina
This paper proposes a massive fuzzy logic network which can be considered as a novel model of pattern classification network. Our approach introduces fuzzy logic circuits fulfilling the function of a binary classifier at first, which are connected into fuzzy logic networks with fuzzy flip-flop circuits as memories. Genetic programming is used as a circuit designing method. In order to establish design methodology, experiments aimed at testing the suitability of fuzzy logic operation sets, fitness functions and parameters of genetic algorithm were carried out. From trained circuits a hierarchical layered structure is built, where single layers consisting of given circuits are contextually dependent. Experiments with fuzzy logic circuits and fuzzy flip-flop network show some valuable results especially in the task of audio and visual speechrecognition.
Keywords: Fuzzy logic, speech recognition, genetic programming, binary classifier, memristor, lip-reading, structure, network
Published: June 30, 2013 Show citation
References
- JUANG, B. H., RABINER, L. R.: Automatic Speech Recognition - A Brief History of the Technology Development, Georgia Institute of Technology, Atlanta, 2004
- GUBKA, R., KUBA, M.: Audio Patterns Searching and Retrieval, 21st Intern. Conference Radioelektronika, 2011
Go to original source...
- WILLIAMS, R.J., ZIPSER, D.: A Learning Algorithm for Continually Running Fully Recurrent Neural Networks. Neural Computation. 1989
Go to original source...
- DUCHNOWSKI, P., HUNKE, M., BUSCHING, D., MEIER, U., WAIBEL, A.: Toward Movement-Invariant Automatic Lip-reading and Speech Recognition, 1995
- DUCHNOWSKI, P., MEIER, U., WAIBEL, A.: See Me, Hear Me: Integrating Automatic Speech Recognition and Lipreading. Proc. of the ICSLP, 1994
Go to original source...
- BREGLER, C., OMOHUNDRO, S.: Nonlinear Manifold Learning for Visual Speech Recognition, Proc. IEEE ICCV, pp. 494-499, 1995
Go to original source...
- GOLDSCHEN, A. J.: Continuous Automatic Speech Recognition by Lipreading, Ph.D. dissertation, George Washington Univ., Washington, DC, Sept. 1993
- POTAMIANOS, G., COSATTO, E., GRAF, H.P., ROE, D. B.: Speaker Independent Audiovisual Database for Bimodal ASR. Proc. European Tutorial Workshop Audiovisual Speech Processing, Rhodes, 1997
- POTAMIANOS, G., VERMA, A., NETI, C., IYENGAR, G., BASU, S.: A Cascade Image Transform for Speaker Independent Automatic Speechreading. Proc. IEEE Int. conf. Mulitmedia, New York, 2000
- YOICHI, M., HAJIME, U. et al.: Visual Image Reconstruction from Human Brain Activity Using a Combination of Multiscale Local Image Decoders, [online] www.sciencedirect.com/science/article/ pii/S0896627308009586, 2008
- NISHIMOTO, S., VU, A. T., NASELARIS, T. et al.: Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies, [online] http://www.sciencedirect.com/science/article/pii/ S09609822 11009377, 2011
- PASLEY, B. N., DAVID, S. V., MESGARANI, N., FLINKER, A., SHAMMA, S. A., et al., Reconstructing Speech from Human Auditory Cortex. PLoS Biol 10(1): e1001251.doi: 10.1371/journal.pbio.1001251, 2012
Go to original source...
- KLIMO, M., BORON, J.: Dynamicke vlastnosti pravdepodobnych fuzzy klopnych obvodov / Dynamic Properties of Probabilistic Fuzzy Flip-flops. Proc. of ITAT (Informacne technologie - aplikacie a teoria), 2009
- KOZA, J. R.: Genetic Programming: A Paradigm for Genetically Breeding Populations of Computer Programs to Solve Problems, Stanford University Computer Science Department, 1990
- STRUKOV, D. B., SNIDER, G. S., STEWART, D. R., WILLIAMS, R. S.: The Missing Memristor Found, Nature 453, pp: 80-83, doi:10.1038/nature06932, 2008
Go to original source...
- KLIMO, M., SUCH, O.: Memristors Can Implement Fuzzy Logic, in CoRR, [online] http://arxiv.org/abs/1110.2074v1, 2011.
- BADURA, S., KLIMO,M., SKVAREK, O.: Lip Reading Using Fuzzy Logic Network with Memory, AICT, Georgia, Tbilisi, ISBN: 978-1-4673-1740-5, pp:35-38, 17-19 Oct., 2012
Go to original source...
- SUCH, O.: Phoneme Discrimination Using KS-algebra I., [online] http://arxiv.org/abs/1302.6031, 2013
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.