fz-zsl/AQuA

AQuA: Learn 3D VQA Better with Active Selection and Reannotation

Shengli Zhou, Yang Liu, Feng Zheng📧

This repository is the official implementation of the ACM MM 2025 paper "Learn 3D VQA Better with Active Selection and Reannotation".

In our paper, we validate our methods with two comparative experiments (the "Lazy Oracle Experiment" and the "Diligent Oracle Experiment") and an ablation study. This repository contains the code for these experiments.

ScanQA

For ScanQA, we modify the code from the official implementation of ScanQA. Please refer to the official repository for dependency installation and data preparation.

Model Training

python scripts/train.py --use_color --tag <tag_name> --AL_mode <AL_strategy> [--AL_oracle]

Options:

  • --AL_mode selects the active learning strategy. Supported values are random, entropy, infogain, and variance.
  • Adding --AL_oracle enables the Hierarchical Reannotation Strategy; without it, the "lazy oracle" is used.
  • For more training options, please run python scripts/train.py -h.
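
As an illustration of how these options combine, the following is a minimal sketch that assembles the training command for each strategy, with and without --AL_oracle. The helper name build_train_command and the tag names it generates are hypothetical and not part of this repository; only the flags and strategy names listed above come from the README.

```python
import shlex

# Strategies listed for --AL_mode in the ScanQA section above.
SCANQA_AL_MODES = ("random", "entropy", "infogain", "variance")


def build_train_command(tag, al_mode, use_reannotation=False):
    """Return the argv list for one ScanQA training run.

    `use_reannotation=True` appends --AL_oracle; otherwise the flag is
    omitted and the "lazy oracle" applies, as described above.
    """
    if al_mode not in SCANQA_AL_MODES:
        raise ValueError(f"unknown AL_mode {al_mode!r}; expected one of {SCANQA_AL_MODES}")
    cmd = ["python", "scripts/train.py", "--use_color",
           "--tag", tag, "--AL_mode", al_mode]
    if use_reannotation:
        cmd.append("--AL_oracle")
    return cmd


if __name__ == "__main__":
    # Print one command per (strategy, oracle) pair; tag names are made up.
    for mode in SCANQA_AL_MODES:
        for use_reannotation in (False, True):
            tag = f"{mode}_{'reannot' if use_reannotation else 'lazy'}"
            print(shlex.join(build_train_command(tag, mode, use_reannotation)))
```

The printed lines can be pasted into a shell, or each list can be passed to subprocess.run(cmd, check=True) from the ScanQA directory.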

Model Evaluation

  • Evaluation of trained ScanQA models with the val dataset:

    python scripts/eval.py --folder <folder_name> --qa --force

    <folder_name> is the name of the run folder under outputs/, formed from the timestamp followed by <tag_name>.

  • Scoring with the val dataset:

    python scripts/score.py --folder <folder_name>

  • Prediction with the test dataset:

    python scripts/predict.py --folder <folder_name> --test_type <test_type>

    <test_type> is either test_w_obj or test_wo_obj.
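
The three steps above can be chained per run folder. The sketch below is only an illustration: the helper names are hypothetical, and it assumes that a run folder's name ends with the tag (compared case-insensitively), since the exact naming format is not specified here.

```python
from pathlib import Path

TEST_TYPES = ("test_w_obj", "test_wo_obj")


def find_run_folders(outputs_dir, tag):
    """List run folders under `outputs_dir` whose names end with `tag`.

    Assumption: folder names are "<timestamp>...<tag>", so a
    case-insensitive suffix match is enough to locate them.
    """
    tag = tag.lower()
    return sorted(
        p.name for p in Path(outputs_dir).iterdir()
        if p.is_dir() and p.name.lower().endswith(tag)
    )


def build_eval_commands(folder_name, test_type="test_w_obj"):
    """Return the eval, score, and predict commands for one run folder."""
    if test_type not in TEST_TYPES:
        raise ValueError(f"unknown test_type {test_type!r}; expected one of {TEST_TYPES}")
    return [
        ["python", "scripts/eval.py", "--folder", folder_name, "--qa", "--force"],
        ["python", "scripts/score.py", "--folder", folder_name],
        ["python", "scripts/predict.py", "--folder", folder_name,
         "--test_type", test_type],
    ]
```

A suffix match also picks up folders whose tag merely ends with the same characters, so check the returned names before running the commands.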

3D-VisTA

For 3D-VisTA, we modify the code from the official implementation of 3D-VisTA. Please refer to the official repository for dependency installation and data preparation. Before running the model, the path configurations on line 3 of ./dataset/path_config.py and line 5 of ./model/language/lang_encoder.py need to be updated for your environment.

Model Training

python3 run.py --config project/vista/train_scanqa_config.yml

Options (set in train_scanqa_config.yml):

  • AL_mode selects the active learning strategy. Supported values are random and variance.
  • AL_oracle controls whether the Hierarchical Reannotation Strategy is used.
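
To switch strategies without editing the file by hand, the two options can be rewritten as text. This is a rough sketch under stated assumptions: each option is taken to sit on its own `key: value` line somewhere in train_scanqa_config.yml, and AL_oracle is taken to be a YAML boolean. Neither detail is confirmed here, and the function names are hypothetical.

```python
import re

# Strategies listed for AL_mode in the 3D-VisTA section above.
VISTA_AL_MODES = ("random", "variance")


def set_yaml_option(text, key, value):
    """Replace the value on the first `key: ...` line, keeping its indentation.

    Anything after the colon on that line (including a trailing comment)
    is overwritten. Raises KeyError if no such line exists.
    """
    pattern = re.compile(rf"^([ \t]*{re.escape(key)}[ \t]*:[ \t]*).*$", re.MULTILINE)
    new_text, count = pattern.subn(lambda m: m.group(1) + value, text, count=1)
    if count == 0:
        raise KeyError(f"{key} not found in config text")
    return new_text


def set_al_options(text, al_mode, al_oracle):
    """Return config text with AL_mode and AL_oracle updated."""
    if al_mode not in VISTA_AL_MODES:
        raise ValueError(f"unknown AL_mode {al_mode!r}; expected one of {VISTA_AL_MODES}")
    text = set_yaml_option(text, "AL_mode", al_mode)
    # Assumption: AL_oracle is stored as a YAML boolean (true/false).
    return set_yaml_option(text, "AL_oracle", "true" if al_oracle else "false")
```

Reading the file with Path.read_text(), passing the text through set_al_options, and writing the result to a copy keeps the original config untouched.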

Model Evaluation

python3 run.py --config project/vista/eval_scanqa_config.yml

Acknowledgement

We would like to thank the authors of ScanQA and 3D-VisTA for their open-source release.

Citation

If you find this project useful in your research, please consider citing:

@inproceedings{10.1145/3746027.3755515,
    author = {Zhou, Shengli and Liu, Yang and Zheng, Feng},
    title = {Learn 3D VQA Better with Active Selection and Reannotation},
    year = {2025},
    isbn = {9798400720352},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3746027.3755515},
    doi = {10.1145/3746027.3755515},
    booktitle = {Proceedings of the 33rd ACM International Conference on Multimedia},
    pages = {4610–4618},
    numpages = {9},
    keywords = {3d visual question-answering, active learning, online learning},
    location = {Dublin, Ireland},
    series = {MM '25}
}
