Status | 已發表Published |
Dual-channel speech separation using interaural time difference with Generalized Gaussian Mixture Model | |
Ding, Z.; Zhang, L.; Wang, L.; Li, W. | |
2015-09-01 | |
Source Publication | Proceedings of International Conference on Information Technology and Management Innovation (ICITMI 2015) |
Abstract | In this letter we present a novel speech separation scheme using two microphones. The proposed method utilizes the estimation of interaural time difference (ITD) statistics for the separation of mixed speech sources. The novelties of this paper consist in the use of Generalized Gaussian Mixture Model (GGMM) for speech separation frame by frame and cross-correlation coefficient for distributed parameter selection. The proposed model can be extended to audio enhancement. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed methods and show significant quality improvements over the conventional dual ITD based methods. |
Keyword | interaural time difference (ITD) statistics Generalized Gaussian Mixture Model correlation coefficient time-frequency mask |
URL | View the original |
Language | 英語English |
The Source to Article | PB_Publication |
PUB ID | 21481 |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Recommended Citation GB/T 7714 | Ding, Z.,Zhang, L.,Wang, L.,et al. Dual-channel speech separation using interaural time difference with Generalized Gaussian Mixture Model[C], 2015. |
APA | Ding, Z.., Zhang, L.., Wang, L.., & Li, W. (2015). Dual-channel speech separation using interaural time difference with Generalized Gaussian Mixture Model. Proceedings of International Conference on Information Technology and Management Innovation (ICITMI 2015). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment