Group 01 | Data Analytics Course | University of Glasgow | March 2025
[List of team members with their roles and contributions]
Indoor positioning systems require accurate distance measurements between anchors and tags. In Ultra-Wideband (UWB) systems, Non-Line-of-Sight (NLOS) conditions significantly degrade ranging accuracy, leading to positioning errors. This project focuses on solving two critical challenges:
- Classification Challenge: Accurately identifying whether a UWB measurement is from a Line-of-Sight (LOS) or Non-Line-of-Sight (NLOS) condition.
- Range Estimation Challenge: Predicting the true distance between devices, especially in NLOS conditions where measured ranges are typically biased.
Accurate LOS/NLOS classification and range estimation are essential for:
- Indoor Positioning Systems: Enabling centimeter-level accuracy in indoor environments
- Robotics and Autonomous Navigation: Providing reliable positioning in complex indoor settings
- Asset Tracking: Improving inventory management and logistics operations
- Smart Buildings: Enhancing occupancy detection and room-level positioning
- AR/VR Applications: Supporting precise spatial mapping and user positioning
This project aims to:
- Develop an accurate classifier to distinguish between LOS and NLOS UWB measurements
- Identify the most significant features that affect LOS/NLOS conditions
- Create a range estimation model to correct for biases in NLOS measurements
- Evaluate model performance using rigorous metrics and cross-validation
The UWB LOS/NLOS dataset contains:
- 42,000 samples (21,000 LOS, 21,000 NLOS)
- Data collected from 7 different indoor environments
- 15 metadata features + 1016 Channel Impulse Response (CIR) values per sample
Our exploratory analysis revealed:
- A balanced dataset (50% LOS, 50% NLOS)
- Significant differences in signal characteristics between LOS and NLOS conditions
- Strong correlations between certain features and LOS/NLOS classification
- Variations in CIR patterns indicative of multipath effects in NLOS scenarios
The main challenges identified include:
- High dimensionality due to the 1016 CIR values
- Potential redundancy in features
- Variations across different indoor environments
- Need for robust models that generalize to new environments
Our solution is novel in several ways:
- We apply feature engineering to extract key characteristics from the CIR
- We implement a two-stage pipeline: classification followed by range estimation
- We utilize ensemble learning to improve robustness and accuracy
- We incorporate domain knowledge about UWB signal propagation into our models
- Identification and handling of outliers
- Normalization of CIR values by RXPACC (received preamble count)
- Feature scaling to ensure algorithm convergence
- Statistical analysis of feature importance
- Extraction of key CIR characteristics (peak amplitude, rise time, etc.)
- Creation of new engineered features (ratios, statistical moments, etc.)
- PCA analysis to reduce CIR dimensionality
- Selection of top features based on their importance scores
- Random Forest Classifier
- Gradient Boosting Classifier
- Support Vector Machine
- Neural Network
- K-Nearest Neighbors
- Random Forest Regressor
- Gradient Boosting Regressor
- Linear Regression with polynomial features
- Grid search for optimal hyperparameters
- Cross-validation to prevent overfitting
- Accuracy, Precision, Recall, F1-score
- ROC curve and AUC
- Confusion matrix
- RMSE (Root Mean Square Error)
- MAE (Mean Absolute Error)
- R² score
[Results table comparing the performance of different classification models]
[Analysis of the most important features for classification]
[ROC curves for different classification models]
[Results table comparing the performance of different regression models]
[Analysis of error distributions in range estimation]
[Analysis of model performance across different environments]
[Correlation analysis between features and target variables]
Our results show that:
- Feature X, Y, and Z are the most significant indicators of NLOS conditions
- Ensemble methods (Random Forest, Gradient Boosting) outperform other algorithms
- CIR characteristics provide valuable information beyond metadata features
- Range estimation accuracy varies with distance and environmental complexity
The practical implications of our findings include:
- Recommendations for UWB system deployment in complex environments
- Guidelines for anchor placement to minimize NLOS conditions
- Strategies for integrating our models into real-time positioning systems
- Suggested enhancements to UWB hardware and signal processing
Our approach has the following limitations:
- Dependence on the environments represented in the training data
- Computational complexity of processing full CIR data
- Challenges in real-time implementation on resource-constrained devices
We successfully developed and evaluated models for UWB LOS/NLOS classification and range estimation. Our approaches achieved high accuracy in distinguishing between LOS and NLOS conditions and provided significant improvements in range estimation for NLOS scenarios.
Potential directions for future research include:
- Integration with sensor fusion approaches (IMU, camera, etc.)
- Online learning algorithms that adapt to new environments
- Deep learning architectures for end-to-end CIR processing
- Transfer learning to adapt models to new UWB hardware
- [List of references]
[Pseudocode for key algorithms]
[Additional plots and visualizations]
[Details on implementation and software used]