Hi folks
I was hoping to get some guidance on how to go about detecting repetition of a phrase or gesture in a single recording using MUBU.
I want to record some audio, segment and analyze it with either MFCC or YIN, then be able to find series of segments that are the most similar. So if I play the same phrase on an instrument 3 times in a given recording, I want to detect that repetition occurred, and which segments are contained in each iteration of the phrase.
Is this something autocorrelation could be used for?
One approach might be looping over the segments and using KNN to find adjacent segments that have adjacent KNN results, although it doesn’t seem like you can exclude segments from KNN so a given segment always finds itself as the nearest neighbor in a given buffer (please correct me if I’m wrong on that!).
A friend told me to look into dynamic time warping or recurrence matrixes, but I’m not sure how to go about that in max.
Any input would be appreciated!
James