Linear Prediction

x = prediction coefficients ak

H = last p samples

predicted zn+1 = Sum_of (akzn-k) k=1,..,p

errorn+1 = zn+1 - Sum_of (akzn-k)

Coefficients represent vocal tract filter

Errors represent laryngal and noise source


But can't we already do linear prediction?

Yes, but...

1. A "frame" ("window") at a time. Coefficients assumed constant over frame

2. Frames treated as independent. No way of enforcing continuity

3. Best fit tries to fit larynx pulses along with the rest.

Kalman filter can ignore points (not do an update) when the error is above a threshold.

4. Kalman confidence measure can be useful when making decisions based on pole estimates. E.g., formant tracking.