Dataset and code of our paper Learning to Predict Advertisement Expansion Moments in Short-Form Video Platforms (ICMR 2025 Full Research Paper).
Authors: Wenxuan Hou, Kaibing Yang, and Di Hu.
Due to the data privacy policy of Tencent, we provide extracted features for each video. Specifically, we use AST to extract audio features, use Swin Transformer and Video Swin Transformer to extract visual features.
Extracted features (~6.4G): Google Drive, Quark Drive (password: 1Emm).
Label files (~826K): Google Drive, Quark Drive (password: bRuD).
python==3.9.16
torch==2.0.0
torchvision=0.15.1
pandas=2.0.1
numpy==1.23.5
sh run.sh
You can find the description of each arg in main.py.
@inproceedings{hou2025learning,
title={Learning to Predict Advertisement Expansion Moments in Short-Form Video Platforms},
author={Hou, Wenxuan and Yang, Kaibing and Hu, Di},
booktitle={Proceedings of the 2025 International Conference on Multimedia Retrieval},
pages={451--459},
year={2025}
}
We deeply thank Hao Lin, Chong Peng, and Gong Chen in the WeChat Advertisement Department of the Corporate Development Group of Tencent for their support in data collection and processing.
The source code referenced AVVP-ECCV20.
This project is released under the CC BY-NC 4.0 License.