Hybrid approach for speaker recognition based on formant and pitch extraction

S Boujnah, R Ferjaoui…�- …�on Cyberworlds (CW), 2023 - ieeexplore.ieee.org
S Boujnah, R Ferjaoui, AB Khalifa
2023 International Conference on Cyberworlds (CW), 2023ieeexplore.ieee.org
Human voice is an ideal data source for identifying people in many applications. Because of
the increasing need for security in different public places, voice biometrics may be a good
solution, as we can easily take voice records. This paper provides a brief overview of the
approaches utilized in recognizing speakers, and then presents a novel approach for
recognizing speakers in degraded smart-home conditions. The suggested approach
includes a pre-processing phase, a feature extraction phase, and a classification phase�…
Human voice is an ideal data source for identifying people in many applications. Because of the increasing need for security in different public places, voice biometrics may be a good solution, as we can easily take voice records. This paper provides a brief overview of the approaches utilized in recognizing speakers, and then presents a novel approach for recognizing speakers in degraded smart-home conditions. The suggested approach includes a pre-processing phase, a feature extraction phase, and a classification phase, where the feature extraction phase consists of formant extraction to get the spectrum energy maxima of speech audio, dynamic time warping (DTW)to find an optimal alignment between two provided temporal sequences under definite restrictions, and refinement process to improve the results of the DTW system output. The experiments are carried out on a database containing 1,248 samples in order to validate the suggested approach. The latter has good results as regards the state of the art with 94.5% accuracy.
ieeexplore.ieee.org
Showing the best result for this search. See all results