In this preliminary work, we interpolate between two music samples in the embedding space of Music2Latent. We compare two classical schemes that allow for averaging two feature sequences with arbitrary lengths: symmetric averaging (by Kruskal and Liberman) and DTW Barycenter Averaging. The former method results in stable samples that are coherent with temporal alignment.
- Clone the repository
- Install the required packaged via
pip install -r requirements.txt
in your virtual or conda environment - Follow the notebooks
barycenters.ipynb
andinterpolation.ipynb
for the examples of usage