Lighthouse-Wrapper-for-Audio-Moment-Retrieval

What is this?

This repository provides the procedure to conduct experiments with Lighthouse for the paper "Language-based Audio Moment Retrieval" (Munakata et.al., ICASSP 2025). In addition, it supports the following functionalities:

Generation of Clotho-Moments from Clotho and UnAv-100
Extraction of CLAP Features
Evaluation of Zero-shot Sound Event Detection The raw audio dataset is provided in the following links:
HuggingFace
- Clotho-Moment
Zenodo
- Clotho-Moment/UnAV100-subset/TUT Sound Events 2017 The captions are available in Lighthouse:
- Clotho-Moment
- UnAV100-subset
- TUT Sound Events 2017

How to train/evaluate AMR models with Lighthouse?

Install Lighthouse
Download extracted CLAP features of Clotho-Moment/UnAV100-subset/TUT Sound Events 2017 from here
- You can also download wav files from here
Set the path to the downloaded features in "(LIGHTHOUSE_PATH)/features".
- For example, if you downloaded Clotho-Moment features, set the path to "(LIGHTHOUSE_PATH)/features/clotho-moment".

Run the following command to train the AMR model:

python training/train.py --model qd_detr --dataset clotho-moment --feature clap

Run the following command to evaluate the AMR model:

model=qd_detr
dataset=unav100-subset
feature=clap
model_path={lighthouse_dir}/results/qd_detr/clotho-moment/clap/best.ckpt
eval_split_name=val
eval_path=data/unav100-subset/unav100-subset_test_release.jsonl

python training/evaluate.py \
        --model $model \
        --dataset $dataset \
        --feature $feature \
        --model_path $model_path \
        --eval_split_name $eval_split_name \
        --eval_path $eval_path

Generation of Clotho-Moments

./clotho-moment_generetor generates Clotho-Moments from Clotho and Walking Tours. Please read the README.md in the directory for more details.

Feature Extraction using CLAP

./feature_extractor extracts CLAP features for lighthouse. Please read the README.md in the directory for more details.

Evaluation of Zero-shot Sound Event Detection

./zero-shot_sed_eval evaluates the zero-shot SED system. Please read the README.md in the directory for more details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lighthouse-Wrapper-for-Audio-Moment-Retrieval

What is this?

How to train/evaluate AMR models with Lighthouse?

Generation of Clotho-Moments

Feature Extraction using CLAP

Evaluation of Zero-shot Sound Event Detection

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Lighthouse-Wrapper-for-Audio-Moment-Retrieval

What is this?

How to train/evaluate AMR models with Lighthouse?

Generation of Clotho-Moments

Feature Extraction using CLAP

Evaluation of Zero-shot Sound Event Detection