公共文化服务平台

共 8 条记录，以下是 1-7

全选清除导出

排序方式：

汉语大词汇量连续语音识别系统研究进展被引量：46: 2009年; 大词汇量连续语音识别(LVCSR)技术近年来发展迅速,并在许多领域得到了广泛的应用,国内外许多大公司加大了对语音识别技术的研究,不少商业化的语音识别系统已经面世,并得到较为广泛的使用。该文综述了近年来大词汇量连续语音识别技术的研究进展,描述了汉语大词汇量连续语音识别系统,主要是基于统计方法的语音识别系统的框架与设计方法,对语音识别系统的一些关键技术和原理进行了分析,并对近年来国内外对语音识别研究发展动向进行了讨论。; 倪崇嘉刘文举徐波; 关键词：中文信息处理语音识别模型自适应搜索技术

Integrating induced probability into decoding for large vocabulary continuous speech recognition被引量：2: 2012年; This paper integrates location information of frames into conventional acoustic model （AM） and language model （LM） likelihoods, in order to distinguish potential path can- didates more precisely at decoding stage. This paper proposes an induced probability, which represents location information of frames within the whole acoustic space. By integrating the induced probability, the decoder is directed to search within the most promising regions of acoustic space. Promising paths are enhanced and unlikely paths are weakened. Experiments conducted on Chinese Putonghua show that the character error rate is reduced by 10.95% rel- atively without increasing decoding complexity significantly. Finally, pruning analysis shows that integrating location information of frames into traditional decoding framework is helpful for improving system performance.; YANG Zhanlei LIU Wenju CHAO Hao

A signal subspace dimension estimator based on F-norm with application to subspace-based multi-channel speech enhancement被引量：2: 2012年; Although the signal subspace approach has been studied extensively for speech enhancement, no good solution has been found to identify signal subspace dimension in multi- channel situation. This paper presents a signal subspace dimension estimator based on F-norm of correlation matrix, with which subspace-based multi-channel speech enhancement is robust to adverse acoustic environments such as room reverberation and low input signal to noise ratio （SNR）. Experiments demonstrate the presented method leads to more noise reduction and less speech distortion comparing with traditional methods.; LI Chao LIU Wenju

Auditory filter based broadband MUSIC algorithm for sound source localization被引量：7: 2013年; Based on the analysis of the shortcomings of broadband MUSIC algorithm with short-time Fourier transform （SF-MUSIC） for sound source localization, a broadband MUSIC algorithm with auditory filter （AF-MUSIC） was proposed. The proposed algorithm first em- ploys auditory filter bank to decompose the signals received on the microphone array, and then locates the sound source with MUSIC algorithm over every frequency channel. At last, by combining with the subinterval frequency estimation, the final localization result is gained. Evaluations on the proposed algorithm prove that comparing with the SF-MUSIC algorithm, the AF-MUSIC algorithm decreases the average error of the estimation results with 2.5479 de- gree in different source conditions. The accuracy of sound source DOA estimation is enhanced effectively.; LIAO FengchaiLI PengLIU Wenju; 关键词：MUSIC DOA

采用听觉滤波器的宽带MUSIC声源定位方法被引量：7: 2012年; 在分析了采用短时傅里叶变换的宽带MUSIC声源定位算法(SF-MUSIC)存在问题的基础上,提出了一种采用听觉滤波器的宽带MUSIC声源定位算法(AF-MUSIC)。该算法使用听觉滤波器组对传声器阵列接收到的信号进行不等带宽分解后,在各个频率通道上使用MUSIC算法进行声源定位,并结合子区间频数估计法得出最终定位结果。对算法进行的实验评估表明,在不同声源类型条件下,相比SF-MUSIC算法,AF-MUSIC算法的平均估计误差减少2.5479°,有效地提高了声源波达方向估计的精度。; 廖逢钗李鹏刘文举; 关键词：MUSIC算法声源定位宽带短时傅里叶变换

Mandarin Pitch Accent Prediction Using Hierarchical Model Based Ensemble Machine Learning: In this study, we combine the Mandarin characteristics with Mandarin acoustic attribute and text information a...; Chongjia Ni 1

汉语语音识别中基于音节的声学模型改进算法被引量：1: 2013年; 针对汉语语音识别中协同发音现象引起的语音信号的易变性,提出一种基于音节的声学建模方法。首先建立基于音节的声学模型以解决音节内部声韵母之间的音变现象,并提出以音节内双音子模型来初始化基于音节声学模型的参数以缓解训练数据稀疏的问题;然后引入音节之间的过渡模型来处理音节之间的协同发音问题。在"863-test"测试集上进行的汉语连续语音识别实验显示汉语字的相对错误率下降了12.13%,表明了基于音节的声学模型和音节间过渡模型相结合在解决汉语协同发音问题上的有效性。; 晁浩杨占磊刘文举; 关键词：语音识别协同发音音变声学建模

全选清除导出

共1页<1>

国家自然科学基金(90820011)