Is it worth trying to sort out 4k+ face clusters? #1297
AstroPhotoJay
started this conversation in
General
Replies: 1 comment 2 replies
-
|
Hey @AstroPhotoJay I'm sorry you're disappointed by the quality of the face recognition. Manually assigning and merging clusters is only going to get you so far, currently, because the model that extracts the face vectors leaves a lot to be desired. If the automatic run is already generating too much garbage, I don't think it is worthwhile to correct it. We are currently planning a rewrite of this app in python in order to be able to use newer models that will greatly improve the quality of the clusters. I recommend waiting until this effort bears fruit :) Cheers |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I've been tinkering with Nextcloud and Recognize for a while now, with a focus on using Recognize for facial recognition, and have reached a point where face clustering has turned into a disorganised mess and I'm not sure if it's worth any more effort.
In NC, I have
274,246 indexed media files in Memories
336,742 face detections
4,117 face clusters
81,482 unassigned faces
Of these 4k face clusters, there are approx 40 which contain only one image. The largest unnamed/unchanged clusters only contain about 100 images. Most of the 4k clusters have somewhere between 10 - 70 faces.
There are countless duplicate clusters for the same people, images of myself and my partner (who looks nothing like me) have been grouped into clusters together and countless shitclusters.
To be fair, the content library is messy. It spans 20+ years and the images would be challenging to work with (people getting older, images of people with and without sunglasses on, at costume parties, partially covered faces, etc...)
I don't want to dump recognize just yet, as I have spent more time than I care to admit manually adding/removing/merging clusters in the hope that accuracy would improve. But at the same time, I don't see how its feasible to sort through this mess.
Does any one have recommendations on how I might go about improving the accuracy of my clusters and sorting through this mess without throwing out all of the manual effort? Would this even be worth doing?
Any tips/tricks much appreciated.
Beta Was this translation helpful? Give feedback.
All reactions