This paper integrates location information of frames into conventional acoustic model (AM) and language model (LM) likelihoods, in order to distinguish potential path can- didates more precisely at decoding stage. This paper proposes an induced probability, which represents location information of frames within the whole acoustic space. By integrating the induced probability, the decoder is directed to search within the most promising regions of acoustic space. Promising paths are enhanced and unlikely paths are weakened. Experiments conducted on Chinese Putonghua show that the character error rate is reduced by 10.95% rel- atively without increasing decoding complexity significantly. Finally, pruning analysis shows that integrating location information of frames into traditional decoding framework is helpful for improving system performance.
Although the signal subspace approach has been studied extensively for speech enhancement, no good solution has been found to identify signal subspace dimension in multi- channel situation. This paper presents a signal subspace dimension estimator based on F-norm of correlation matrix, with which subspace-based multi-channel speech enhancement is robust to adverse acoustic environments such as room reverberation and low input signal to noise ratio (SNR). Experiments demonstrate the presented method leads to more noise reduction and less speech distortion comparing with traditional methods.
Based on the analysis of the shortcomings of broadband MUSIC algorithm with short-time Fourier transform (SF-MUSIC) for sound source localization, a broadband MUSIC algorithm with auditory filter (AF-MUSIC) was proposed. The proposed algorithm first em- ploys auditory filter bank to decompose the signals received on the microphone array, and then locates the sound source with MUSIC algorithm over every frequency channel. At last, by combining with the subinterval frequency estimation, the final localization result is gained. Evaluations on the proposed algorithm prove that comparing with the SF-MUSIC algorithm, the AF-MUSIC algorithm decreases the average error of the estimation results with 2.5479 de- gree in different source conditions. The accuracy of sound source DOA estimation is enhanced effectively.