A little and very high level library to perform basic operations on audio files in the OCaml language
· Documentation · Report Bug · Request Feature ·
Warning
The project is still in development and is not yet ready for use.
This project uses Opam as a package manager
opam install soundml
The project is still work in progress.
- Read and Write audio
- Audio slicing (in a similar way to Owl's slicing)
- Basic onset detection algorithms
- Spectral analysis
- Generic spectrogram helper function
- Unify the spectrogram parameters inside a config module.
- Mel spectrogram
- MFCC spectrogram
- Chroma spectrogram
- Constant and linear detrend
- Time domain analysis
- RMS computation
- Zero crossing rate
- Effects module
- Pitch shifting
- Time stretching
- Filters (low-pass, high-pass, band-pass, etc...)
- Write test files for the whole library
- Natively written in OCaml for a perfect OCaml developer experience
- Easily read and write audio files in various formats (WAV, MP3, etc...)
- Audio slicing
- Feature extraction (MFCC, mel-spectrogram, ZCR, ...)
- Audio effects (pitch shifting, time stretching, filters, ...)
You should be using the OCaml compiler with a version at least equal to 5.1.0. You can install it by following the instructions on the OCaml website. This project uses the Dune build system.
This library heavily relies on the Owl and ocaml-ffmpeg libraries.
Name | Version | Description |
---|---|---|
Owl - OCaml Scientific Computing | >= 1.1 |
Library for scientific computing in OCaml. Used to make the heavy computations (FFT, IFFT, etc...) |
ocaml-ffmpeg - OCaml bindings to the FFmpeg library. | >= 1.2 |
OCaml bindings for FFmpeg. Used to read and write audio data. |
This project is heavily inspired by other amazing open-source libraries such as:
Name | Inspiration | Reference |
---|---|---|
librosa | General functionalities for audio signal processing | McFee, Brian, Colin Raffel, Dawen Liang, Daniel PW Ellis, Matt McVicar, Eric Battenberg, and Oriol Nieto. "librosa: Audio and music signal analysis in python." In Proceedings of the 14th python in science conference, pp. 18-25. 2015. |
pydub | Ease of use, audio slicing using milliseconds and manipulation | - |
Numpy | Numerous implementation of SoundML's algorithms were directly taken from Numpy | - |
Matplotlib | The implementation of their spectral helper to compute spectrogram as well as the one of the linear detrend function were took from the matplotlib.mlab module. |
- |
Don't hesitate to check the amazing work done by the authors and contributors of these libraries!
Distributed under the Apache License Version 2.0. See LICENSE for more information.
- Logo generated with DALL-E by OpenAI