😎 Awesome 3D Scene Understanding in the Wild

1. LiDAR Semantic Segmentation

1️⃣ Raw Points

Model	Paper	Venue	Website
`PointNet`	PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation	CVPR 2017	-
`PointNet++`	PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space	NeurIPS 2017	-
`TangentConv`	Tangent Convolutions for Dense Prediction in 3D	CVPR 2018
`KPConv`	KPConv: Flexible and Deformable Convolution for Point Clouds	ICCV 2019	-
`RandLA-Net`	RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds	CVPR 2020
`PointASNL`	PointASNL: Robust Point Clouds Processing using Nonlocal Neural Networks with Adaptive Sampling	CVPR 2020	-
`PTv1`	Point Transformer	CVPR 2021	-
`RandLA-Net+`	Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling	TPAMI 2021
`BAF-LAC`	Backward Attentive Fusing Network With Local Aggregation Classifier for 3D Point Cloud Semantic Segmentation	TIP 2021	-
`PTv2`	Point Transformer V2: Grouped Vector Attention and Partition-based Pooling	NeurIPS 2022	-
`WaffleIron`	Using a Waffle Iron for Automotive Point Cloud Semantic Segmentation	ICCV 2023	-
`PCB-RandNet`	PCB-RandNet: Rethinking Random Sampling for LiDAR Semantic Segmentation in Autonomous Driving Scene	ICRA 2024	-
`PTv3`	Point Transformer V3: Simpler Faster Stronger	CVPR 2024	-

2️⃣ Pseudo Images

Model	Paper	Venue	Website	Github
`SqueezeSeg`	SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud	ICRA 2018	-
`SqueezeSegV2`	SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud	ICRA 2019	-
`RangeNet++`	Rangenet++: Fast and accurate lidar semantic segmentation	IROS 2019	-
`PolarNet`	PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation	CVPR 2020	-
`SqueezeSegV3`	SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation	ECCV 2020	-
`SalsaNet`	SalsaNet: Fast Road and Vehicle Segmentation in LiDAR Point Clouds for Autonomous Driving	IV 2020	-
`SalsaNext`	SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving	ISVC 2020	-
`3D-MiniNet`	3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation	RA-L 2020	-
`KPRNet`	KPRNet: Improving projection-based LiDAR semantic segmentation	arXiv 2020	-
`Lite-HDSeg`	Lite-HDSeg: LiDAR Semantic Segmentation Using Lite Harmonic Dense Convolutions	ICRA 2021	-	-
`FIDNet`	FIDNet: LiDAR Point Cloud Semantic Segmentation with Fully Interpolation Decoding	IROS 2021	-
`MINet`	Multi-Scale Interaction for Real-Time LiDAR Data Segmentation on an Embedded Platform	RA-L 2021	-
`CENet`	CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving	ICME 2022	-
`RangeViT`	RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving	CVPR 2023	-
`RangeFormer`	Rethinking Range View Representation for LiDAR Segmentation	ICCV 2023	-	-
`FRNet`	FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation	TIP 2025
`RangeSAM`	RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation	arXiv 2025	-	-

3️⃣ Sparse Voxels

Model	Paper	Venue	Website	Github
`SSCN`	3D Semantic Segmentation with Submanifold Sparse Convolutional Networks	CVPR 2018	-
`MinkUNet`	4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks	CVPR 2019
`JS3C-Net`	Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion	AAAI 2021	-
`Cylinder3D`	Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation	CVPR 2021	-
`(AF)2-S3Net`	Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network	CVPR 2021	-	-
`Cylinder3D+`	Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-based Perception	TPAMI 2021	-
`PVKD`	Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation	CVPR 2022	-
`SDSeg3D`	Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving	ECCV 2022	-
`GASN`	Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks	ECCV 2022	-	-
`MSSNet`	Point Cloud Semantic Segmentation using Multi Scale Sparse Convolution Neural Network	arXiv 2022	-	-
`SphereFormer`	Spherical Transformer for LiDAR-based 3D Recognition	CVPR 2023	-
`LinK`	LinK: Linear Kernel for LiDAR-based 3D Perception	CVPR 2023	-
`SFPNet`	SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds	ECCV 2024	-
`NUC-Net`	NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation	TCSVT 2025	-

4️⃣ Multi-Representation

Model	Paper	Venue	Website	Github
`SPVCNN`	Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution	ECCV 2020
`FusionNet`	Deep FusionNet for Point Cloud Semantic Segmentation	ECCV 2020	-
`AMVNet`	AMVNet: Assertion-based Multi-View Fusion Network for LiDAR Semantic Segmentation	arXiv 2020	-	-
`MPF`	Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds	WACV 2021	-	-
`RPVNet`	RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation	ICCV 2021	-	-
`PMF`	Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation	ICCV 2021	-
`CPGNet`	CPGNet: Cascade Point-Grid Fusion Network for Real-Time LiDAR Semantic Segmentation	ICRA 2022	-
`2DPASS`	2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds	ECCV 2022	-
`GFNet`	GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation	TMLR 2022
`LidarMultiNet`	LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception	AAAI 2023	-	-
`MSeg3D`	MSeg3D: Multi-Modal 3D Semantic Segmentation for Autonomous Driving	CVPR 2023	-
`UniSeg`	UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase	ICCV 2023	-	-
`M3Net`	Multi-Space Alignments Towards Universal LiDAR Segmentation	CVPR 2024	-
`TASeg`	TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation	CVPR 2024	-
`EPMF`	EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation	TPAMI 2024	-
`PC-BEV`	PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation	AAAI 2025	-

2. LiDAR Panoptic Segmentation

1️⃣ Proposal-based

Model	Paper	Venue	Website	Github
`Panoptic-TrackNet`	MOPT: Multi-Object Panoptic Tracking	arXiv 2020	-	-
`EfficientLPS`	EfficientLPS: Efficient LiDAR Panoptic Segmentation	TRO 2021

2️⃣ Proposal-free

Model	Paper	Venue	Website	Github
`LPSAD`	LiDAR Panoptic Segmentation for Autonomous Driving	IROS 2020	-	-
`Panoptic-PolarNet`	Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation	CVPR 2021	-
`DS-Net`	LiDAR-based Panoptic Segmentation via Dynamic Shifting Network	CVPR 2021	-
`4D-PLS`	4D Panoptic LiDAR Segmentation	CVPR 2021
`GP-S3Net`	GP-S3Net: Graph-Based Panoptic Sparse Semantic Segmentation Network	ICCV 2021	-	-
`PanosterK`	Panoster: End-to-end Panoptic Segmentation of LiDAR Point Clouds	RA-L 2021	-	-
`CPSeg`	CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point Clouds	arXiv 2021	-	-
`SCAN`	Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation	AAAI 2022	-	-
`PC-Cluster`	A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation	ICRA 2022	-	-
`SMAC-Seg`	SMAC-Seg: LiDAR Panoptic Segmentation via Sparse Multi-directional Attention Clustering	ICRA 2022	-	-
`PVCL`	Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation	ICRA 2022	-	-
`Panoptic-PHNet`	Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap	CVPR 2022	-	-
`MaskRange`	MaskRange: A Mask-classification Model for Range-view based LiDAR Segmentation	arXiv 2022	-	-
`PUPS`	PUPS: Point Cloud Unified Panoptic Segmentation	AAAI 2023	-	-
`LCPS`	LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment	ICCV 2023	-
`MaskPLS`	Mask-Based Panoptic LiDAR Segmentation for Autonomous Driving	RA-L 2023	-
`Mask4D`	Mask4D: End-to-End Mask-Based 4D Panoptic Segmentation for LiDAR Sequences	RA-L 2023	-
`Mask4Former`	Mask4Former: Mask Transformer for 4D Panoptic Segmentation	ICRA 2024
`4D-DS-Net`	Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks	TPAMI 2024	-
`P3Former`	Position-Guided Point Cloud Panoptic Segmentation Transformer	IJCV 2024	-

3. Occupancy Prediction

1️⃣ Camera

Model	Paper	Venue	Website	Github
`3DSketch`	3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior	CVPR 2020	-
`AIC-Net`	Anisotropic Convolutional Networks for 3D Semantic Scene Completion	CVPR 2020
`MonoScene`	MonoScene: Monocular 3D Semantic Scene Completion	CVPR 2022
`TPVFormer`	Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction	CVPR 2023
`VoxFormer`	VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion	CVPR 2023	-
`OccFormer`	OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction	ICCV 2023	-
`SurroundOcc`	SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving	ICCV 2023
`FB-Occ`	FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation	arXiv 2023	-
`MonoOcc`	MonoOcc: Digging into Monocular Semantic Occupancy Prediction	ICRA 2024	-
`SparseOcc`	SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction	CVPR 2024
`Symphonies`	Symphonize 3D Semantic Scene Completion with Contextual Instance Queries	CVPR 2024	-
`HASSC`	Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation	CVPR 2024	-
`COTR`	COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction	CVPR 2024	-
`GaussianFormer`	GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction	ECCV 2024
`CGFormer`	Context and Geometry Aware Voxel Transformer for Semantic Scene Completion	NeurIPS 2024	-
`ReliOcc`	RELIOCC: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning	arXiv 2024	-	-
`VLScene`	VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion	AAAI 2025	-
`TrackOcc`	TrackOcc: Camera-based 4D Panoptic Occupancy Tracking	ICRA 2025	-
`GaussianFormer-2`	GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction	CVPR 2025	-
`SceneDINO`	Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion	ICCV 2025
`DISC`	Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion	ICCV 2025	-
`ALOcc`	ALOcc: Adaptive Lifting-Based 3D Semantic Occupancy and Cost Volume-Based Flow Predictions	ICCV 2025	-
`CausalOcc`	Semantic Causality-Aware Vision-Based 3D Occupancy Prediction	ICCV 2025	-
`VoxDet`	VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection	NeurIPS 2025
`QuadricFormer`	QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction	arXiv 2025
`FMOcc`	FMOcc: TPV-Driven Flow Matching for 3D Occupancy Prediction with Selective State Space Model	arXiv 2025	-	-

2️⃣ LiDAR

Model	Paper	Venue	Website	Github
`LMSCNet`	LMSCNet: Lightweight Multiscale 3D Semantic Completion	3DV 2020	-
`JS3C-Net`	Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion	AAAI 2021	-
`S3CNet`	S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds	CoRL 2021	-	-
`SSA-SC`	Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds	IROS 2021	-
`Local-DIFs`	Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data	TPAMI 2021	-	-
`SCPNet`	SCPNet: Semantic Scene Completion on Point Cloud	CVPR 2023	-
`SSC-RS`	SSC-RS: Elevate LiDAR semantic scene completion with representation separation and BEV fusion	IROS 2023	-
`PointOcc`	PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction	arXiv 2023	-

3️⃣ Multi-Modality

Model	Paper	Venue	Website	Github
`OpenOccupancy`	OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception	ICCV 2023	-
`OccGen`	OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving	ECCV 2024
`TEOcc`	TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement	ECAI 2024	-
`FusionOcc`	FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction	MM 2024	-
`AFOcc`	AFOcc: Multimodal Semantic Occupancy Prediction With Accurate Fusion	JSEN 2024	-	-
`EFFOcc`	EFFOcc: Learning Efficient Occupancy Networks from Minimal Labels for Autonomous Driving	arXiv 2024	-
`MR-Occ`	MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation	arXiv 2024	-	-
`L2COcc`	L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model	arXiv 2025

4. Label-Efficient Learning

1️⃣ Weakly-Supervised Learning

Model	Paper	Venue	Website	Github
`W4DTS`	Weakly Supervised Segmentation on Outdoor 4D point clouds with Temporal Matching and Spatial Graph Propagation	CVPR 2022	-	-
`SQN`	SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds	ECCV 2022	-
`IGNet`	2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation	WACV 2024	-	-
`P4G`	Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Progressive 4D Grouping	TPAMI 2025	-	-

2️⃣ Semi-Supervised Learning

Model	Paper	Venue	Website	Github
`GPC`	Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation	ICCV 2021
`LaserMix`	LaserMix for Semi-Supervised LiDAR Semantic Segmentation	CVPR 2023
`Lim3D`	Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation	CVPR 2023
`ImageTo360`	360deg from a Single Camera: A Few-Shot Approach for LiDAR Segmentation	ICCVW 2023	-	-
`IGNet`	2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation	WACV 2024	-	-
`SSMP`	Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix	AAAI 2024	-
`DDSemi`	Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling	CVPR 2024	-	-
`IT2`	ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation	ECCV 2024	-
`BST`	Bayesian Self-Training for Semi-Supervised 3D Segmentation	ECCV 2024		-
`LASS3D`	LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation	ECCV 2024	-	-
`PLE`	Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation	IROS 2024	-
`AIScene`	Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation	CVPR 2025	-
`HiLoTs`	HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving	CVPR 2025	-
`LaserMix++`	Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving	TPAMI 2025

3️⃣ Unsupervised Learning

Model	Paper	Venue	Website	Github
`xMUDA`	xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation	CVPR 2020	-
`SF-UDA^{3D}`	SF-UDA^{3D}: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection	3DV 2020	-
`AUDA`	Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning	ISPRS 2021	-
`CoSMix`	Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling	ECCV 2022	-
`GIPSO`	GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation	ECCV 2022	-
`OGC`	OGC: Unsupervised 3D Object Segmentation from Rigid Dynamics of Point Clouds	NeurIPS 2022	-
`GrowSP`	GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds	CVPR 2023	-
`U3DS^3`	U3DS^3: Unsupervised 3D Semantic Scene Segmentation	WACV 2024	-	-
`LiOn-XA`	LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training	IROS 2024	-
`OGC+`	Unsupervised 3D Object Segmentation of Point Clouds by Geometry Consistency	TPAMI 2024	-
`DAKD`	Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation	ICRA 2025	-
`LogoSP`	LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds	CVPR 2025	-
`VFMSeg`	Visual foundation models boost cross-modal unsupervised domain adaptation for 3d semantic segmentation	T-ITS 2025	-
`-`	Unsupervised Domain Adaptation for 3D LiDAR Semantic Segmentation Using Contrastive Learning and Multi-Model Pseudo Labeling	arXiv 2025	-	-

4️⃣ Self-Supervised Learning

Model	Paper	Venue	Website	Github
`PointContrast`	PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding	ECCV 2020	-
`Info3D`	Info3D: Representation Learning on 3D Objects using Mutual Information Maximization and Contrastive Learning	ECCV 2020	-	-
`DepthContrast`	Self-Supervised Pretraining of 3D Features on any Point-Cloud	ICCV 2021	-
`OcCo`	Unsupervised Point Cloud Pre-training via Occlusion Completion	ICCV 2021	-
`STRL`	Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds	ICCV 2021	-
`PPKT`	Learning from 2D: Contrastive Pixel-to-Point Knowledge Transfer for 3D Pretraining	arXiv 2021	-	-
`SLidR`	Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data	CVPR 2022	-
`Point-BERT`	Point-BERT: Pre-Training 3D Point Cloud Transformers with Masked Point Modeling	CVPR 2022
`MaskPoint`	Masked Discrimination for Self-Supervised Learning on Point Clouds	ECCV 2022	-
`Point-MAE`	Masked Autoencoders for Point Cloud Self-supervised Learning	ECCV 2022	-
`Also`	ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation	CVPR 2023	-
`ST-SLidR`	Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss	CVPR 2023	-	-
`TriCC`	Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous Driving	CVPR 2023	-	-
`Seal`	Segment Any Point Cloud Sequences by Distilling Vision Foundation Models	NeurIPS 2023
`BEVContrast`	BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds	3DV 2024	-
`ScaLR`	Three Pillars improving Vision Foundation Model Distillation for Lidar	CVPR 2024	-
`CSC`	Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception	CVPR 2024	-
`SuperFlow`	4D Contrastive Superflows are Dense 3D Representation Learners	ECCV 2024	-
`HVDistill`	HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation	IJCV 2024	-
`CMCR`	Is Contrastive Distillation Enough for Learning Comprehensive 3D Representations?	arXiv 2024	-	-
`LiMoE`	LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes	CVPR 2025
`LiMA`	Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations	ICCV 2025
`LargeAD`	LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	TPAMI 2025
`SuperFlow++`	Enhanced Spatiotemporal Consistency for Image-to-LiDAR Data Pretraining	TPAMI 2025	-
`CleverDistiller`	CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation	arXiv 2025	-	-

5️⃣ Open Vocabulary Segmentation

Model	Paper	Venue	Website	Github
`OpenScene`	OpenScene: 3D Scene Understanding with Open Vocabularies	CVPR 2023
`CLIP2Scene`	CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	CVPR 2023	-
`PLA`	PLA: Language-Driven Open-Vocabulary 3D Scene Understanding	CVPR 2023
`CLIP-FO3D`	CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP	ICCV 2023	-	-
`LERF`	LERF: Language Embedded Radiance Fields	ICCV 2023
`CNS`	Towards Label-free Scene Understanding by Vision Foundation Models	NeurIPS 2023	-
`OpenMask3D`	OpenMask3D: Open-Vocabulary 3D Instance Segmentation	NeurIPS 2023
`OpenNeRF`	OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views	ICLR 2024
`OV3D`	Open-Vocabulary 3D Semantic Segmentation with Foundation Models	CVPR 2024	-	-
`RegionPLC`	RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding	CVPR 2024
`LEGaussians`	Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding	CVPR 2024
`LangSplat`	LangSplat: 3D Language Gaussian Splatting	CVPR 2024
`Feature 3DGS`	Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields	CVPR 2024
`Open3DIS`	Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance	CVPR 2024
`GGSD`	Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation	ECCV 2024	-
`Gaussian Grouping`	Gaussian Grouping: Segment and Edit Anything in 3D Scenes	ECCV 2024
`EgoLifter`	EgoLifter: Open-world 3D Segmentation for Egocentric Perception	ECCV 2024
`OpenIns3D`	OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation	ECCV 2024	-
`OpenGaussian`	OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding	NeurIPS 2024
`OWL`	Lidar Panoptic Segmentation in an Open World	IJCV 2024	-
`SAL`	Zero-Shot 4D Lidar Panoptic Segmentation	CVPR 2025	-	-
`ULOPS`	Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning	IROS 2025		-
`OVGaussian`	OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies	arXiv 2025	-
`LOSC`	LOSC: LiDAR Open-voc Segmentation Consolidator	arXiv 2025	-	-

5. Datasets

Datasets	Paper	Venue	Website	Github
`SemanticKITTI`	SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences	ICCV 2019
`Waymo Open`	Scalability in Perception for Autonomous Driving: Waymo Open Dataset	CVPR 2020
`SemanticPOSS`	SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances	IV 2020
`A2D2`	A2D2: Audi Autonomous Driving Dataset	arXiv 2020		-
`RELLIS-3D`	RELLIS-3D Dataset: Data, Benchmarks and Analysis	ICRA 2021
`PandaSet`	PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving	ITSC 2021
`SynLiDAR`	Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation	AAAI 2022	-
`ScribbleKITTI`	Scribble-Supervised LiDAR Semantic Segmentation	CVPR 2022
`Synth4D`	GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation	ECCV 2022	-
`Panoptic nuScenes`	Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking	RAL 2022
`SemanticSTF`	3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds	CVPR 2023	-
`nuScenes-Occupancy`	OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception	ICCV 2023	-
`Robo3D`	Robo3D: Towards Robust and Reliable 3D Perception against Corruptions	ICCV 2023
`Occ3D`	Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving	NeurIPS 2023
`DAPS3D`	DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds	Access 2023	-
`SSCBench`	SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving	IROS 2024	-