A library for audio feature extraction, regression, classification, segmentation

  • I was in the market for one these and ended up with yaafe [1], which is a little older, but has, IMO, a better api, more flexible output, and c as well as python bindings.

    Also, the documentation is rather good, with links to the various papers for each algorithm. The above library, in contrast, is little impenetrable for me.

    I'm using this with postgres and supercollider for more of an artistic project though, so YMMV.

    1. https://github.com/Yaafe/Yaafe

  • I used this for a personal project [1] a couple of years ago to shorten audio files without making them sound like they were cut. This is done by removing repeated sections like replacing two choruses with one. I initially wanted this to "resize" background music to match video footage I had, but it is kind of fun to just mess around with songs too (like those content aware scale picture memes, but to create the shortest possible audio).

    I think for my use case specifically, the library was kind of overkill though and something like librosa [2] would have been enough for feature extraction.

    1: https://projects.loud.red/snipsnip/

    2: https://librosa.org/doc/latest/index.html

  • Interesting, I've recently done a bit of searching in this space to find a project that would fit for an idea I had: I'd like to use a raspberry pi zero w to listen for our doorbell. If the doorbell rings, it should do something (e.g. send an sms or turn on a light).

    I couldn't really find anything, does someone know if a project like this exists? For the one listed here, I'm not sure if it is fast enough to run on a slow device like the W? Also, would it be able to detect audio in a continuous stream from say a microphone?