PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer

Installation

Please see docs for setting up the venv.

Data Preparation

Please follow the link setting up gsuitls then for downloading and running pvt_ssd on waymo dataset you can run waymo_prep.sh (modify to your needs) (or if you have followed me exactly then you can go ahead and directly run it). For the Waymo dataset please also place we use the evaluation toolkits to evaluate detection results.

data
│── waymo
│   │── ImageSets/ Nuscenes, W
│   │── raw_data
│   │   │── segment-xxxxxxxx.tfrecord
│   │   │── ...
│   │── waymo_processed_data
│   │   │── segment-xxxxxxxx/
│   │   │── ...
│   │── waymo_processed_data_gt_database_train_sampled_1/
│   │── waymo_processed_data_waymo_dbinfos_train_sampled_1.pkl
│   │── waymo_processed_data_infos_test.pkl
│   │── waymo_processed_data_infos_train.pkl
│   │── waymo_processed_data_infos_val.pkl
│   │── compute_detection_metrics_main
│   │── gt.bin
│── kitti
│   │── ImageSets/
│   │── training
│   │   │── label_2/
│   │   │── velodyne/
│   │   │── ...
│   │── testing
│   │   │── velodyne/
│   │   │── ...
│   │── gt_database/
│   │── kitti_dbinfos_train.pkl
│   │── kitti_infos_test.pkl
│   │── kitti_infos_train.pkl
│   │── kitti_infos_val.pkl
│   │── kitti_infos_trainval.pkl
│── once
│   │── ImageSets/
│   │── data
│   │   │── 000000/
│   │   │── ...
│   │── gt_database/
│   │── once_dbinfos_train.pkl
│   │── once_infos_raw_large.pkl
│   │── once_infos_raw_medium.pkl
│   │── once_infos_raw_small.pkl
│   │── once_infos_train.pkl
│   │── once_infos_val.pkl
│── kitti-360
│   │── data_3d_raw
│   │   │── xxxxxxxx_sync/
│   │   │── ...
│── ckpts
│   │── pvt_ssd.pth
│   │── ...

Training & Testing

# train
bash scripts/dist_train.sh

# test
bash scripts/dist_test.sh

Results

Waymo

	Vec_L1	Vec_L2	Ped_L1	Ped_L2	Cyc_L1	Cyc_L2	Model
PVT-SSD	79.2/78.7	70.2/69.8	79.9/74.0	72.6/67.0	77.1/76.0	74.0/73.0	log
PVT-SSD_3f	80.6/80.2	71.9/71.5	83.9/80.6	75.1/72.1	77.9/77.0	74.8/74.0	log

We could not provide the above pretrained models due to Waymo Dataset License Agreement.

Citation

If you find this project useful in your research, please consider citing:

@inproceedings{yang2023pvtssd,
    author    = {Yang, Honghui and Wang, Wenxiao and Chen, Minghao and Lin, Binbin and He, Tong and Chen, Hua and He, Xiaofei and Ouyang, Wanli},
    title     = {PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {13476-13487}
}

Acknowledgement

This project is mainly based on the following codebases. Thanks for their great works!

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
docs		docs
pcdet		pcdet
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
waymo_prep.sh		waymo_prep.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer

Installation

Data Preparation

Training & Testing

Results

Waymo

Citation

Acknowledgement

About

Uh oh!

Releases

Packages

Languages

License

Veerainsood/PVT-SSD

Folders and files

Latest commit

History

Repository files navigation

PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer

Installation

Data Preparation

Training & Testing

Results

Waymo

Citation

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages