Based on spectrogram analysis and the individual frequency bands of speech under G-force, this paper proposes a new Mel frequency scale and adopts the corresponding MFCCs (Mel Frequency Cepstrum Coefficients) as features for recognizing stressed speech under G-force. Experiments show that the proposed method outperforms other Mel-based feature methods for stressed speech recognition.
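For orientation, the conventional Mel scale that such a proposal would modify can be sketched as follows. This is only the widely used baseline formula, mel = 2595 log10(1 + f/700); the new scale proposed in the paper is not specified in the abstract and is not reproduced here. The function names and the helper `mel_band_edges` are hypothetical and chosen for illustration.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Convert frequency in Hz to the conventional Mel scale (baseline formula)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(f_low: float, f_high: float, n_filters: int) -> list[float]:
    """Return n_filters + 2 edge frequencies (Hz), equally spaced on the Mel axis.

    Hypothetical helper: consecutive triples of edges would define the
    triangular filters of a standard MFCC filterbank.
    """
    m_low, m_high = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (m_high - m_low) / (n_filters + 1)
    return [mel_to_hz(m_low + i * step) for i in range(n_filters + 2)]


if __name__ == "__main__":
    # Example: edges for a 20-filter bank covering 0 to 4000 Hz (assumed values).
    for edge in mel_band_edges(0.0, 4000.0, 20):
        print(round(edge, 1))
```

A modified scale of the kind the abstract describes would presumably replace `hz_to_mel` and its inverse with a warping that allocates resolution to the frequency bands most affected by G-force, while the rest of the MFCC pipeline stays the same.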
Stress is an important parameter for prosody processing in speech synthesis. In this paper, we compare the acoustic features of neutral-tone syllables and strongly stressed syllables with those of moderately stressed syllables, including pitch, syllable duration, intensity, and the length of the pause after the syllable. The relations between duration and pitch, and between the Third Tone (T3) and pitch, are also studied. Three ANN-based stress prediction models, namely the acoustic model, the linguistic model, and the mixed model, are presented for predicting Chinese sentential stress. The results show that the mixed model performs better than the other two models. To address the variability of manual labeling, an evaluation index called the support ratio is proposed.