DLINE Journals

Journal of Digital Information Management

Vol No. 20 ,Issue No. 3 2022

ASR Features Extraction Using MFCC and LPC: A Comparative Study

Mohammad Madian Kamal Eddin, Bassel Alkhatib
Syrian virtual University Syrian Arab Republic

Abstract: The field of Automatic Speaker Recognition (ASR) is an important and open field for researchers and scientists, especially as it has become essential in facilitating the work we do in our daily lives. Many technologies have been developed in the recognition field. Such as digital authentication and electronic transactions; consider It a secure environment to authenticate user's access to their accounts. Still, so far, there is no complete tool or method for speaker identification, the most crucial step in ASR is the extraction of voice features. Many techniques and tools can extract the speaker's vocal characteristics (voice features), identifying the user and recognising his voice spectrum through the phonetic and linguistic message. In this paper, two methods will be studied, each using a different technique MFCC, which uses a logarithmic scale, and LPC, which uses a linear scale. The method used in ASR should have a minimal error because it is a crucial authentication technology like a fingerprint, where two different people cannot have the same voice spectral range (voiceprint).

Keywords: ASR, Speech Recognition, MFCC, LPC ASR Features Extraction Using MFCC and LPC: A Comparative Study

DOI:https://doi.org/10.6025/jdim/2022/20/3/79-89

Full_Text PDF 3.33 MB Download: 105 times

References:

[1] Poddar, A., Sahidullah, M., Saha, G. (2018) Speaker verification with short utterances: A review of challenges, trends, and opportunities. IET Biometrics, 7, 91–101. [2] Aizat, K., Mohamed, O., Orken, M., Ainur, A., Zhumazhanov, B. (2020) Identification and authentication of user’s voice using DNN features and i-vector. Cogent Engineering, 7. [3] https://en.wikipedia.org/wiki/Speaker_recognition. Wikipedia. [4] https://espanol.verizon.com/articles/speech-recognition- technology/. [5] Pew Research Cntr (2017) Voice assistants topline and methodology. https://www.pewresearch.org/facttank/ 2017/12/12/nearly-half-of-americans-use-digital-voice-assistants- mostly-on-their-smartphones/. [6] Kiran, U. (2021). MFCC techniques for speech recognition. https://www.analyticsvidhya.com/blog/2021/06/ mfcctechnique-for-speech-recognition/,June13. [7] Rao, K.S., Manjunath, K.E. (2017). Speech Recognition Using Articulatory and Excitation Source Features, SpringerBriefs in Speech. Technology. [8] Benesty, J., Sondhi, M.M., Huang, Y.A. (2008). Handbook of Speech Processing. Springer: New York, USA. [9] Karpov, E. (2003). Real-Time Speaker Identification. The University of Joensuu, Department of Computer Science Master’s Thesis. [10] Niemann, H. (2013) Klassifikation von mustern. Available from : https://www.springer.com/de/book/ 9783540126423. Springer-Verlag. [11] Rabiner, L., Juang, B.-H., Yegnanarayana, B. (2008). Fundamentals of Speech Recognition. Pearson Education: London. [12] Alim, S.A., Rashid, N.K.A. (2018). From Natural to Artificial Intelligence – Algorithms and Applications. [13] Buza, O., Toderean, G., Nica, A., Caruntu, A. (2006) Voice signal processing for speech synthesis IEEE International Conference on Automation, Quality and Testing Robotics, Vol. 2, p. 360–364. [14] Shrawankar, U., Thakare, V. (2013). Techniques for Feature Extraction in Speech Recognition System: A Comparative Study, [arXiv:1305.1145v1]. [15] Kurzekar, P.K., Kurzekar, P.K., Waghmare, V.B. , Shrishrimal, P.P. (2014) A comparative study of feature extraction techniques for speech recognition system. International Journal of Innovative Research in Science, Engineering and Technology, 03. [16] Mathuranathan Viswanathan, Y.-W. (2014) Estimation and simulation in MATLAB. Found at: https:// www.gaussianwaves.com/2014/05/yule-walker-estimation.