This repository is the official implementation of the paper *When One Moment Isn’t Enough: Multi-Moment Retrieval with Cross-Moment Interactions* (NeurIPS 2025).
Zhuo Cao, Heming Du, Bingqing Zhang, Xin Yu, Xue Li, Sen Wang
The University of Queensland, Australia
Preparation | Inference | Evaluation | Citation | Acknowledgements
## Preparation

### Environment

```
conda create -n flashmmr python=3.12 -y
conda activate flashmmr
pip install -r requirements.txt
```
We recommend Python 3.12.2 (the same version as the FlashVTG environment setup).
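As a quick, optional sanity check of the new environment (a minimal sketch, assuming PyTorch is among the pinned requirements in `requirements.txt`):

```
# Verify the interpreter version and that PyTorch imports with CUDA visible
# (assumes torch is installed via requirements.txt)
python --version                                   # expect Python 3.12.x
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```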
### Features & annotation
- Download QVHighlights video and query features exactly as described in FlashVTG.
- Download QV-M2 text features at this link.
- Please find annotation files under `data/` and update any path fields that differ on your machine (a quick inspection command is shown after this list).
- We provide checkpoints here. Please download and place them under the `results/` directory. The configuration for each checkpoint can be found in its `opt.json` file.
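As one way of verifying the downloads, the commands below pretty-print the first annotation entry and a checkpoint's `opt.json`; `<experiment>` is a placeholder for a run directory, and the JSONL field names are whatever the annotation files define:

```
# Show the first annotation entry so path fields can be checked against your machine
head -n 1 data/QV-M2/test.jsonl | python -m json.tool
# Show the configuration shipped with a downloaded checkpoint
python -m json.tool results/<experiment>/opt.json
```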
## Inference

All entry points go through `FlashMMR/inference.py`. The script expects:
```
python FlashMMR/inference.py \
    data/MR.py \
    --resume results/<experiment>/model_best.ckpt \
    --eval_split_name val \
    --eval_path data/QV-M2/test.jsonl
```

**Argument quick reference**

- `data/MR.py`: the nncore config that defines strides, pyramids, and generators.
- `--resume`: path to a checkpoint (e.g., `model_best.ckpt`). Required.
- `--eval_split_name`: `val` computes metrics (requires ground truth); `test` skips evaluation.
- `--eval_path`: the dataset JSONL containing query metadata and feature IDs.
Outputs are saved under `results/<experiment>/`, including raw submissions and, when possible, a `_metrics.json` file.
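To see what a finished run produced (a minimal sketch; the metrics filename is copied from the evaluation example below and may differ with your NMS settings):

```
# List everything the run wrote, then pretty-print the metrics if present
ls results/<experiment>/
python -m json.tool results/<experiment>/best_hl_val_preds_nms_thd_0.7_metrics.json
```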
## Evaluation

If you want to evaluate the multi-moment retrieval task separately, you can use the standalone evaluation script:
```
python standalone_eval/eval.py \
    --submission_path results/<experiment>/best_hl_val_preds_nms_thd_0.7.jsonl \
    --gt_path data/QV-M2/test.jsonl \
    --save_path results/<experiment>/best_hl_val_preds_nms_thd_0.7_metrics.json
```

**Argument quick reference**

- `--submission_path`: path to the prediction file.
- `--gt_path`: path to the ground-truth file.
- `--save_path`: path to save the evaluation results.
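Before evaluating, it can help to confirm that the submission and ground-truth files cover the same queries. The one-liner below is a sketch that assumes both files are JSONL with a `qid` field per line (the QVHighlights convention); adapt the field name if your files differ:

```
# Count predictions, GT queries, and the qids they share
# (assumes a 'qid' key per JSONL line, which we have not verified)
python -c "
import json
subs = {json.loads(l)['qid'] for l in open('results/<experiment>/best_hl_val_preds_nms_thd_0.7.jsonl')}
gts  = {json.loads(l)['qid'] for l in open('data/QV-M2/test.jsonl')}
print(len(subs), 'predictions;', len(gts), 'GT queries;', len(subs & gts), 'shared qids')
"
```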
## Citation

If you find FlashMMR useful for your research, please consider citing:
```
@InProceedings{cao2025flashmmr,
    author    = {Cao, Zhuo and Du, Heming and Zhang, Bingqing and Yu, Xin and Li, Xue and Wang, Sen},
    title     = {When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions},
    booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
    year      = {2025},
}
```
## Acknowledgements

This work is supported by the Australian Research Council (ARC) Discovery Project DP230101753. The code is based on FlashVTG.
