π Awesome 3D Scene Understanding in the Wild
1. LiDAR Semantic Segmentation
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
PointNet
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
CVPR 2017
-
PointNet++
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
NeurIPS 2017
-
TangentConv
Tangent Convolutions for Dense Prediction in 3D
CVPR 2018
KPConv
KPConv: Flexible and Deformable Convolution for Point Clouds
ICCV 2019
-
RandLA-Net
RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
CVPR 2020
PointASNL
PointASNL: Robust Point Clouds Processing using Nonlocal Neural Networks with Adaptive Sampling
CVPR 2020
-
PTv1
Point Transformer
CVPR 2021
-
RandLA-Net+
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling
TPAMI 2021
BAF-LAC
Backward Attentive Fusing Network With Local Aggregation Classifier for 3D Point Cloud Semantic Segmentation
TIP 2021
-
PTv2
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
NeurIPS 2022
-
WaffleIron
Using a Waffle Iron for Automotive Point Cloud Semantic Segmentation
ICCV 2023
-
PCB-RandNet
PCB-RandNet: Rethinking Random Sampling for LiDAR Semantic Segmentation in Autonomous Driving Scene
ICRA 2024
-
PTv3
Point Transformer V3: Simpler Faster Stronger
CVPR 2024
-
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
SqueezeSeg
SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud
ICRA 2018
-
SqueezeSegV2
SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud
ICRA 2019
-
RangeNet++
Rangenet++: Fast and accurate lidar semantic segmentation
IROS 2019
-
PolarNet
PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
CVPR 2020
-
SqueezeSegV3
SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation
ECCV 2020
-
SalsaNet
SalsaNet: Fast Road and Vehicle Segmentation in LiDAR Point Clouds for Autonomous Driving
IV 2020
-
SalsaNext
SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving
ISVC 2020
-
3D-MiniNet
3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation
RA-L 2020
-
KPRNet
KPRNet: Improving projection-based LiDAR semantic segmentation
arXiv 2020
-
Lite-HDSeg
Lite-HDSeg: LiDAR Semantic Segmentation Using Lite Harmonic Dense Convolutions
ICRA 2021
-
-
FIDNet
FIDNet: LiDAR Point Cloud Semantic Segmentation with Fully Interpolation Decoding
IROS 2021
-
MINet
Multi-Scale Interaction for Real-Time LiDAR Data Segmentation on an Embedded Platform
RA-L 2021
-
CENet
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving
ICME 2022
-
RangeViT
RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving
CVPR 2023
-
RangeFormer
Rethinking Range View Representation for LiDAR Segmentation
ICCV 2023
-
-
FRNet
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation
TIP 2025
RangeSAM
RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation
arXiv 2025
-
-
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
SSCN
3D Semantic Segmentation with Submanifold Sparse Convolutional Networks
CVPR 2018
-
MinkUNet
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
CVPR 2019
JS3C-Net
Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion
AAAI 2021
-
Cylinder3D
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation
CVPR 2021
-
(AF)2-S3Net
Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network
CVPR 2021
-
-
Cylinder3D+
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-based Perception
TPAMI 2021
-
PVKD
Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation
CVPR 2022
-
SDSeg3D
Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving
ECCV 2022
-
GASN
Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks
ECCV 2022
-
-
MSSNet
Point Cloud Semantic Segmentation using Multi Scale Sparse Convolution Neural Network
arXiv 2022
-
-
SphereFormer
Spherical Transformer for LiDAR-based 3D Recognition
CVPR 2023
-
LinK
LinK: Linear Kernel for LiDAR-based 3D Perception
CVPR 2023
-
SFPNet
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds
ECCV 2024
-
NUC-Net
NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation
TCSVT 2025
-
4οΈβ£ Multi-Representation
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
SPVCNN
Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
ECCV 2020
FusionNet
Deep FusionNet for Point Cloud Semantic Segmentation
ECCV 2020
-
AMVNet
AMVNet: Assertion-based Multi-View Fusion Network for LiDAR Semantic Segmentation
arXiv 2020
-
-
MPF
Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds
WACV 2021
-
-
RPVNet
RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation
ICCV 2021
-
-
PMF
Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation
ICCV 2021
-
CPGNet
CPGNet: Cascade Point-Grid Fusion Network for Real-Time LiDAR Semantic Segmentation
ICRA 2022
-
2DPASS
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds
ECCV 2022
-
GFNet
GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation
TMLR 2022
LidarMultiNet
LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception
AAAI 2023
-
-
MSeg3D
MSeg3D: Multi-Modal 3D Semantic Segmentation for Autonomous Driving
CVPR 2023
-
UniSeg
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase
ICCV 2023
-
-
M3Net
Multi-Space Alignments Towards Universal LiDAR Segmentation
CVPR 2024
-
TASeg
TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
CVPR 2024
-
EPMF
EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation
TPAMI 2024
-
PC-BEV
PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation
AAAI 2025
-
2. LiDAR Panoptic Segmentation
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
Panoptic-TrackNet
MOPT: Multi-Object Panoptic Tracking
arXiv 2020
-
-
EfficientLPS
EfficientLPS: Efficient LiDAR Panoptic Segmentation
TRO 2021
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
LPSAD
LiDAR Panoptic Segmentation for Autonomous Driving
IROS 2020
-
-
Panoptic-PolarNet
Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation
CVPR 2021
-
DS-Net
LiDAR-based Panoptic Segmentation via Dynamic Shifting Network
CVPR 2021
-
4D-PLS
4D Panoptic LiDAR Segmentation
CVPR 2021
GP-S3Net
GP-S3Net: Graph-Based Panoptic Sparse Semantic Segmentation Network
ICCV 2021
-
-
PanosterK
Panoster: End-to-end Panoptic Segmentation of LiDAR Point Clouds
RA-L 2021
-
-
CPSeg
CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point Clouds
arXiv 2021
-
-
SCAN
Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation
AAAI 2022
-
-
PC-Cluster
A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation
ICRA 2022
-
-
SMAC-Seg
SMAC-Seg: LiDAR Panoptic Segmentation via Sparse Multi-directional Attention Clustering
ICRA 2022
-
-
PVCL
Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation
ICRA 2022
-
-
Panoptic-PHNet
Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap
CVPR 2022
-
-
MaskRange
MaskRange: A Mask-classification Model for Range-view based LiDAR Segmentation
arXiv 2022
-
-
PUPS
PUPS: Point Cloud Unified Panoptic Segmentation
AAAI 2023
-
-
LCPS
LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment
ICCV 2023
-
MaskPLS
Mask-Based Panoptic LiDAR Segmentation for Autonomous Driving
RA-L 2023
-
Mask4D
Mask4D: End-to-End Mask-Based 4D Panoptic Segmentation for LiDAR Sequences
RA-L 2023
-
Mask4Former
Mask4Former: Mask Transformer for 4D Panoptic Segmentation
ICRA 2024
4D-DS-Net
Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks
TPAMI 2024
-
P3Former
Position-Guided Point Cloud Panoptic Segmentation Transformer
IJCV 2024
-
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
3DSketch
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior
CVPR 2020
-
AIC-Net
Anisotropic Convolutional Networks for 3D Semantic Scene Completion
CVPR 2020
MonoScene
MonoScene: Monocular 3D Semantic Scene Completion
CVPR 2022
TPVFormer
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
CVPR 2023
VoxFormer
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
CVPR 2023
-
OccFormer
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction
ICCV 2023
-
SurroundOcc
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
ICCV 2023
FB-Occ
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
arXiv 2023
-
MonoOcc
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
ICRA 2024
-
SparseOcc
SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction
CVPR 2024
Symphonies
Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
CVPR 2024
-
HASSC
Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation
CVPR 2024
-
COTR
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction
CVPR 2024
-
GaussianFormer
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
ECCV 2024
CGFormer
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
NeurIPS 2024
-
ReliOcc
RELIOCC: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning
arXiv 2024
-
-
VLScene
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion
AAAI 2025
-
TrackOcc
TrackOcc: Camera-based 4D Panoptic Occupancy Tracking
ICRA 2025
-
GaussianFormer-2
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction
CVPR 2025
-
SceneDINO
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
ICCV 2025
DISC
Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion
ICCV 2025
-
ALOcc
ALOcc: Adaptive Lifting-Based 3D Semantic Occupancy and Cost Volume-Based Flow Predictions
ICCV 2025
-
CausalOcc
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
ICCV 2025
-
VoxDet
VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection
NeurIPS 2025
QuadricFormer
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction
arXiv 2025
FMOcc
FMOcc: TPV-Driven Flow Matching for 3D Occupancy Prediction with Selective State Space Model
arXiv 2025
-
-
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
LMSCNet
LMSCNet: Lightweight Multiscale 3D Semantic Completion
3DV 2020
-
JS3C-Net
Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion
AAAI 2021
-
S3CNet
S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds
CoRL 2021
-
-
SSA-SC
Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds
IROS 2021
-
Local-DIFs
Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data
TPAMI 2021
-
-
SCPNet
SCPNet: Semantic Scene Completion on Point Cloud
CVPR 2023
-
SSC-RS
SSC-RS: Elevate LiDAR semantic scene completion with representation separation and BEV fusion
IROS 2023
-
PointOcc
PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction
arXiv 2023
-
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
OpenOccupancy
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
ICCV 2023
-
OccGen
OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving
ECCV 2024
TEOcc
TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
ECAI 2024
-
FusionOcc
FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction
MM 2024
-
AFOcc
AFOcc: Multimodal Semantic Occupancy Prediction With Accurate Fusion
JSEN 2024
-
-
EFFOcc
EFFOcc: Learning Efficient Occupancy Networks from Minimal Labels for Autonomous Driving
arXiv 2024
-
MR-Occ
MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation
arXiv 2024
-
-
L2COcc
L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model
arXiv 2025
4. Label-Efficient Learning
1οΈβ£ Weakly-Supervised Learning
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
W4DTS
Weakly Supervised Segmentation on Outdoor 4D point clouds with Temporal Matching and Spatial Graph Propagation
CVPR 2022
-
-
SQN
SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds
ECCV 2022
-
IGNet
2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation
WACV 2024
-
-
P4G
Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Progressive 4D Grouping
TPAMI 2025
-
-
2οΈβ£ Semi-Supervised Learning
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
GPC
Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation
ICCV 2021
LaserMix
LaserMix for Semi-Supervised LiDAR Semantic Segmentation
CVPR 2023
Lim3D
Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation
CVPR 2023
ImageTo360
360deg from a Single Camera: A Few-Shot Approach for LiDAR Segmentation
ICCVW 2023
-
-
IGNet
2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation
WACV 2024
-
-
SSMP
Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
AAAI 2024
-
DDSemi
Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling
CVPR 2024
-
-
IT2
ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation
ECCV 2024
-
BST
Bayesian Self-Training for Semi-Supervised 3D Segmentation
ECCV 2024
-
LASS3D
LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation
ECCV 2024
-
-
PLE
Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation
IROS 2024
-
AIScene
Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation
CVPR 2025
-
HiLoTs
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving
CVPR 2025
-
LaserMix++
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving
TPAMI 2025
3οΈβ£ Unsupervised Learning
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
xMUDA
xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
CVPR 2020
-
SF-UDA^{3D}
SF-UDA^{3D}: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection
3DV 2020
-
AUDA
Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning
ISPRS 2021
-
CoSMix
Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling
ECCV 2022
-
GIPSO
GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation
ECCV 2022
-
OGC
OGC: Unsupervised 3D Object Segmentation from Rigid Dynamics of Point Clouds
NeurIPS 2022
-
GrowSP
GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds
CVPR 2023
-
U3DS^3
U3DS^3: Unsupervised 3D Semantic Scene Segmentation
WACV 2024
-
-
LiOn-XA
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training
IROS 2024
-
OGC+
Unsupervised 3D Object Segmentation of Point Clouds by Geometry Consistency
TPAMI 2024
-
DAKD
Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation
ICRA 2025
-
LogoSP
LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds
CVPR 2025
-
VFMSeg
Visual foundation models boost cross-modal unsupervised domain adaptation for 3d semantic segmentation
T-ITS 2025
-
-
Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling
arXiv 2025
-
-
4οΈβ£ Self-Supervised Learning
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
PointContrast
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding
ECCV 2020
-
Info3D
Info3D: Representation Learning on 3D Objects using Mutual Information Maximization and Contrastive Learning
ECCV 2020
-
-
DepthContrast
Self-Supervised Pretraining of 3D Features on any Point-Cloud
ICCV 2021
-
OcCo
Unsupervised Point Cloud Pre-training via Occlusion Completion
ICCV 2021
-
STRL
Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds
ICCV 2021
-
PPKT
Learning from 2D: Contrastive Pixel-to-Point Knowledge Transfer for 3D Pretraining
arXiv 2021
-
-
SLidR
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data
CVPR 2022
-
Point-BERT
Point-BERT: Pre-Training 3D Point Cloud Transformers with Masked Point Modeling
CVPR 2022
MaskPoint
Masked Discrimination for Self-Supervised Learning on Point Clouds
ECCV 2022
-
Point-MAE
Masked Autoencoders for Point Cloud Self-supervised Learning
ECCV 2022
-
Also
ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation
CVPR 2023
-
ST-SLidR
Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss
CVPR 2023
-
-
TriCC
Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous Driving
CVPR 2023
-
-
Seal
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
NeurIPS 2023
BEVContrast
BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds
3DV 2024
-
ScaLR
Three Pillars improving Vision Foundation Model Distillation for Lidar
CVPR 2024
-
CSC
Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
CVPR 2024
-
SuperFlow
4D Contrastive Superflows are Dense 3D Representation Learners
ECCV 2024
-
HVDistill
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation
IJCV 2024
-
CMCR
Is Contrastive Distillation Enough for Learning Comprehensive 3D Representations?
arXiv 2024
-
-
LiMoE
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
CVPR 2025
LiMA
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations
ICCV 2025
LargeAD
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving
TPAMI 2025
SuperFlow++
Enhanced Spatiotemporal Consistency for Image-to-LiDAR Data Pretraining
TPAMI 2025
-
CleverDistiller
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation
arXiv 2025
-
-
5οΈβ£ Open Vocabulary Segmentation
β²οΈ In chronological order, from the earliest to the latest.
Model
Paper
Venue
Website
Github
OpenScene
OpenScene: 3D Scene Understanding with Open Vocabularies
CVPR 2023
CLIP2Scene
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP
CVPR 2023
-
PLA
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding
CVPR 2023
CLIP-FO3D
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
ICCV 2023
-
-
LERF
LERF: Language Embedded Radiance Fields
ICCV 2023
CNS
Towards Label-free Scene Understanding by Vision Foundation Models
NeurIPS 2023
-
OpenMask3D
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
NeurIPS 2023
OpenNeRF
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
ICLR 2024
OV3D
Open-Vocabulary 3D Semantic Segmentation with Foundation Models
CVPR 2024
-
-
RegionPLC
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
CVPR 2024
LEGaussians
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
CVPR 2024
LangSplat
LangSplat: 3D Language Gaussian Splatting
CVPR 2024
Feature 3DGS
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
CVPR 2024
Open3DIS
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
CVPR 2024
GGSD
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
ECCV 2024
-
Gaussian Grouping
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
ECCV 2024
EgoLifter
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
ECCV 2024
OpenIns3D
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
ECCV 2024
-
OpenGaussian
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
NeurIPS 2024
OWL
Lidar Panoptic Segmentation in an Open World
IJCV 2024
-
SAL
Zero-Shot 4D Lidar Panoptic Segmentation
CVPR 2025
-
-
ULOPS
Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning
IROS 2025
-
OVGaussian
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies
arXiv 2025
-
LOSC
LOSC: LiDAR Open-voc Segmentation Consolidator
arXiv 2025
-
-
β²οΈ In chronological order, from the earliest to the latest.
Datasets
Paper
Venue
Website
Github
SemanticKITTI
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
ICCV 2019
Waymo Open
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
CVPR 2020
SemanticPOSS
SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances
IV 2020
A2D2
A2D2: Audi Autonomous Driving Dataset
arXiv 2020
-
RELLIS-3D
RELLIS-3D Dataset: Data, Benchmarks and Analysis
ICRA 2021
PandaSet
PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving
ITSC 2021
SynLiDAR
Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation
AAAI 2022
-
ScribbleKITTI
Scribble-Supervised LiDAR Semantic Segmentation
CVPR 2022
Synth4D
GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation
ECCV 2022
-
Panoptic nuScenes
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking
RAL 2022
SemanticSTF
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds
CVPR 2023
-
nuScenes-Occupancy
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
ICCV 2023
-
Robo3D
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions
ICCV 2023
Occ3D
Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving
NeurIPS 2023
DAPS3D
DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds
Access 2023
-
SSCBench
SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
IROS 2024
-