A Novel Approach of Speech Recognition Using System Identification

https://doi.org/10.24237/djes.2019.12309

Authors

Keywords:

System identification, ARX, FIR filter, Wiener filter, speech recognition, signal processing

Abstract

With the growing of artificial intelligence and the usage of sound commands the needs for a high accuracy speech recognition increases. Many researches are done in this area using different kinds of methods and approaches. In this research two algorithms have been introduced. The autoregressive system identification and the FIR Wiener filter. The objective of this research is to show the robustness of system identification in terms of speech recognition.Both algorithms have been implemented and tested using MATLAB where the process is done by recording full sentences from different subjects under two conditions which are clear and noisy background. For each sentence, it has been recorded two timesfor each subject; the first one was used for testing and the second sentence was used for validation. The results show that both algorithms are giving an accurate prediction when the used data are from the same subject with clear background. The advantage of system identification over the Weiner filter is shine when using noisy signals. Another advantage of using system identification for speech recognition is it can distinguish the sound difference when same sentence from different subjects is used where the Weiner filter in some cases passes them as from the same subject. This could be a huge issue if the algorithm is used for security reasons

Downloads

Download data is not yet available.

References

Joseph W. Picone, Signal Modeling Techniques in Speech, Systems and Information Sciences Laboratory, Tsukuba Research and Development Center, Tsukuba, Japan, 1993.

John G. Proakis and Dimitris G. Manolakis, Digital Signal Processing, Principles, Algorithms, and Applications, 4th edition, Pearson Education Inc., Upper Saddle River, 1996.

Adam L. Buchsbaum and Raffaele Giancarlo, Algorithmic Aspects in Speech Recognition: An Introduction,Annual Symposium on Combinatorial Pattern Matching, 2009.

HartmutTraunmuller and Anders Eriksson, The Frequency Range of the Voice Fundamental in the Speech of Male and Female Adults, Institution for linguistic, Stockholm, Sweden, 2009.

John Wiley, Sons, 2006, Inc. Statistical Signal Processing and Modeling, Monson H Hayes, Georgia I institute of Technology, 2006.

Fredric J. Harris, On the Use of Windows for Harmonic Analysis with the Discrete Fourier Transform, Proceedings of the IEEE, VOL 66, No.1, 1978.

Henrik V. Sorensen and C. Sidney Burrus, Efficient Computation of the Short-Time Fast Fourier Transform, Electrical and Computer Engineering Department, Rice University, Houston, 2012.

Heather Sobey, Literature Survey Automatic Speech Recognition, Department of Computer Science, University of Cape Town,2011.

G.P. Rao and H. Unbehauen, Identification of Continuous Time Systems,IEE Proceedings - Control Theory and Applications, Vol. 153, No. 2, 13th March 2006.

Da Xu, Kathy L. Ryan, Caroline A. Rickards, Guanqun Zhang, Victor A. Convertino and Ramakrishna Mukkamala, Robust Pulse Wave Velocity Estimation by Application of System Identification to Proximal and Distal Arterial Waveforms,Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC),4th September 2010.

Da Xu, Kathy L. Ryan, Caroline A. Rickards, Guanqun Zhang, Victor A. Convertino and Ramakrishna Mukkamala, Improved Pulse Transit Time Estimation by System Identification Analysis of Proximal and Distal Arterial Waveforms, Am J Physiol Heart Circ Physiol. Vol. 301, No. 4, PP. 1389-1395.29th July 2011.

Tingxiao Yang, The Algorithms of Speech Recognition, Programming and Simulating in MATLAB. January 2012.

Published

2019-09-01

How to Cite

[1]
I. Sadoon Fatah, “A Novel Approach of Speech Recognition Using System Identification”, DJES, vol. 12, no. 3, pp. 72–83, Sep. 2019.