Scanning Classification Details
This method uses a segment of the entire audio signal, preferably the same length as the censored word, and feeds it into the classifier. The classifier defines the score, which is tangent to probability, for each window and each trained word. This data is stored and then the windowed is moved forward a small amount of time. In our program it moves 10 milliseconds. All these classified windows and their scores are plotted. We then use a similar threshold detection to find the censored word. If a score is above a threshold it is considered to be the desired word, and is censored. The key tools used in this iteration are windowing, a digital signal processing tool from class, and posterior probability from knn classification.
The score is determined based on the posterior probability. This considers the category of the k nearest neighboring points, but also their relative distances from each other and from the test vector. The first figure below is taken from MATLAB code documentation for “Predict”