2025/02/15 Weekly Meeting Notes #134
Replies: 1 comment
-
Notes from NikReview the following functions for comprehension: Find & read literature on: From a higher-level perspective (Himanshu will research): Def find_homo(im1,im2): description The function find_homo is designed to find the homography matrix between two images using feature matching. This matrix can be used to warp one image to align with another. The function takes two images as input: im1, the image to be warped, and im2, the reference image. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
The following notes contain the progress and next steps for each sub-task:
Computer Vision Model Update
Assigned to Himanshu Naidu.
While the Deeplabv3 TensorFlow 1 model has been successfully converted to CoreML with better on-report performance, there are more issues with it than initially understood.
While the underperformance on important classes is still an important factor, more importantly, the real-time segmentation performance degrades much more rapidly than expected, to the point where the model is barely able to process 1 frame per second. Further analysis will be done on the performance (such as checks on the CPU/GPU/NE load), however, it would be prudent to focus on other tasks for now.
Computer Vision ML Pipeline
Assigned to Himanshu Naidu.
As mentioned before, the following ML pipeline is being set up to train models from scratch.
https://github.com/himanshunaidu/CoreML_Pipeline_iOSPointMapper
It would be wise to focus on this since we eventually get more control over the model we produce (preferably start with a TorchVision model).
Currently, only the data loader/pre-processor for CityScapes has been merged, but further work is expected to be merged soon.
Depth Aware Instance Segmentation
Assigned to Himanshu Naidu.
The current plan is to use a first version of depth-aware instance segmentation which is a combination of the watershed algorithm and depth-based clustering.
After further analysis, it became clear that developing a water-shed algorithm from scratch is not very practical if there are existing solutions. Fortunately, one can integrate OpenCV into Swift development, although this is not as well-documented.
Still, this seems to be the most viable way to integrate watershed algorithm.
OpenCV has now been integrated into the application with some basic methods for experimentation.
Watershed algorithm will now be integrated into the application.
Still need to see what would be the best possible way to integrate depth-based clustering with DBSCAN.
Assigned to Nik Wilson.
Nik has gone through the segment_streets code of OASIS framework on a high level.
We had further discussion on how the algorithm works, and some high-level next steps have been established, which include going further through the centroid-tracking and homography transformation code.
Note: Since the post-processing code will likely depend largely on OpenCV, the current integration of OpenCV into the application will benefit development greatly.
Beta Was this translation helpful? Give feedback.
All reactions