10. Analysis of speech signals in the time domain.

Lecture



When solving many problems of speech processing, the temporal characteristics of speech signals are of interest. Since speech is a non-stationary process, it is customary to analyze it in short sections (   10. Analysis of speech signals in the time domain. ms), where the spectral-correlation characteristics remain approximately constant.

One of the important parameters of a speech signal is its   energy :

  10. Analysis of speech signals in the time domain. . (one)

Another way to detect voiced and unvoiced areas of speech is based on measuring the average number of zero crossings of the speech signal. This measurement is a rough estimate of the frequency composition of the speech signal. It is known that the energy of vocalized sounds is concentrated in the range below kHz, while the energy of fricative sounds is concentrated mainly at frequencies above kHz. Therefore, if the average number of zero transitions is large, this indicates an unvoiced character of speech, and vice versa.

An important task of analyzing speech signals in the time domain is estimating the pitch period. The period of the fundamental tone can be defined as the time interval between the corresponding peaks of the voiced portion of the speech signal. However, the main difficulty here is that even at short time intervals the speech signal does not have a strict periodic structure.

Another way to determine the period of the fundamental tone in the time domain is based on the calculation of the short-term autocorrelation function :

  10. Analysis of speech signals in the time domain. ,   10. Analysis of speech signals in the time domain. , (2)

where is the maximum signal delay.

In identifying the period of the fundamental tone P by the autocorrelation function, it is necessary to take into account   10. Analysis of speech signals in the time domain. . For voiced speech plots   10. Analysis of speech signals in the time domain. traced peaks with an interval equal to the period of the fundamental tone.

In order to sharpen peaks in the graph   10. Analysis of speech signals in the time domain. signal   10. Analysis of speech signals in the time domain. clipping. The essence of this non-linear operation is shown in fig. 1, a. In fig. Figure 1b shows the autocorrelation function of the clipped signal.

  10. Analysis of speech signals in the time domain.

a) b)

Fig. 1. Clipping operation.

The disadvantage of the considered method for determining the period of the fundamental tone is that a large number of arithmetic operations are required to calculate the autocorrelation function.

function of the average number of zero crossings for speech processing

  10. Analysis of speech signals in the time domain. uh

  10. Analysis of speech signals in the time domain.


Comments


To leave a comment
If you have any suggestion, idea, thanks or comment, feel free to write. We really value feedback and are glad to hear your opinion.
To reply

Methods and means of computer information technology

Terms: Methods and means of computer information technology