Residential College | false |
Status | 已發表Published |
Speaker identification based on PCC feature vector | |
He, Peichao1; Zuo, Yi1; Li, Tieshan1; Philip Chen, C. L.2; Ma, He1; Liu, Junxia1 | |
2019-09 | |
Conference Name | 6th International Conference on Information, Cybernetics, and Computational Social Systems, ICCSS 2019 |
Source Publication | 2019 6th International Conference on Information, Cybernetics, and Computational Social Systems, ICCSS 2019 |
Pages | 18-22 |
Conference Date | 2019/09/27-2019/09/30 |
Conference Place | Chongqing |
Country | China |
Abstract | In the research field of speaker identification, many extraction methods of speech feature have been investigated for several decades. However, several new features such as i-vector and d-vector were proposed in recent years. Mel Frequency Cepstral Coefficient (MFCC) is still wildly used in current speaker identification systems accounting for its high performance. Based on the similar generation approach of MFCC, this article proposes a novel feature extraction way based on Pearson correlation coefficient (PCC). Firstly, we also use inverse discrete cosine transform (IDCT) cepstrum coefficient as the initial speech inputs. Secondly, we employ a hierarchical clustering analysis based on PCC to merge the IDCT cepstrum coefficient until the dimension of speech inputs is reduced to 14. Finally, we output this 14-dimensional vector as speech feature named r-vector. In the experiments, Gaussian Mixture Model (GMM) was applied to compare the performance of r-vector with other speech features. According to the 630 people voice data in TIMIT database, the results of experiments claim that the r-vector could obtain higher recognition accuracy in speaker identification. |
Keyword | Clustering Analysis Feature Extraction Pearson Correlation Coefficient Speaker Identification |
DOI | 10.1109/ICCSS48103.2019.9115450 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85094858068 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Li, Tieshan |
Affiliation | 1.Dalian Maritime University, Navigation College, Dalian, China 2.University of Macau, Faculty of Science and Technology, Macao |
Recommended Citation GB/T 7714 | He, Peichao,Zuo, Yi,Li, Tieshan,et al. Speaker identification based on PCC feature vector[C], 2019, 18-22. |
APA | He, Peichao., Zuo, Yi., Li, Tieshan., Philip Chen, C. L.., Ma, He., & Liu, Junxia (2019). Speaker identification based on PCC feature vector. 2019 6th International Conference on Information, Cybernetics, and Computational Social Systems, ICCSS 2019, 18-22. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment