Dears,
starting to go a it deeper with mubu here.
I’d need to know a bit more how I could classify recordings.
Recording audio in a mubu track, ok.
Then, I’d like to trigger some analysis to know if the recording is
- a speech,
- a sound which is not a speech,
- a music, why not
Obviously, these are not disjoint sets…
A music could contain a speech, etc etc.
Without trying to do that on a VERY HIGH ACCURATE manner, how could I do that with pipo ?
The main question here is: what features/values should I analyze to do that ?
Any ideas/advices would be very appreciated