-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is needed
Description
Description
The clustering algorithm leaves the parts of the masked images that aren't part of the clustered class black. This makes it incredibly hard for Google to classify them. Using some clever algorithm to remove these black spots can help a lot.
To Reproduce
Steps to reproduce the behavior:
from MAGIST.Vision.UnsupervisedModels.img_cluster import RoughCluster
cluster = RoughCluster("config.json")
imgs = cluster.unsupervised_clusters(6, "Input.jpg", (150, 150), "Clusters")
from MAGIST.Utils.WebScraper.google import GoogleScraper
scraper = GoogleScraper("config.json")
labels = []
for i in imgs:
label = scraper.reverse_image_search(i)
labels.append(label)
print(labels)Expected behavior
Ideally, it should mask out nearby, similar pixels but it doesn't. This leaves massive, unstructured black gaps that cause a lot of issues when reverse searching.
Additional context
Google generally returns language or night when there is too much black. The solution would be something like this:
- Find each masked image's edge pixels.
- Compute all possible lines that can be formed from edge pixels.
- Crop image at that line if all pixels in the line are black.
This is super computationally intensive, however.
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is needed


