Hi,
I'm opening this issue because I get worse scores with a DeepSORT tracker than with a simple SORT tracker. I used your DETRAC tool to create the training images for cosine metric learning. I also tried re-training my DeepSORT model, but I got equivalent results.
I compared the HOTA scores obtained on several cameras of AICity Challenge (Track 3) using the same detector (YOLO v4). I tried tweaking the max_cosine_distance parameter of nn_matching, and decreasing the distance seems to increase the score. I obtained the best result when I set it to 0, which I guess means the tracker behaves much more like SORT in this case. For instance, on one camera, I obtained these scores:
| Tracker | HOTA |
| --- | --- |
| SORT | 43.13 |
| Your DeepSORT - cosine distance 0.3 | 39.68 |
| DeepSORT - cosine distance 0.5 | 40.18 |
| DeepSORT - cosine distance 0.2 | 40.21 |
| DeepSORT - cosine distance 0.1 | 40.36 |
| DeepSORT - cosine distance 0.05 | 40.77 |
| DeepSORT - cosine distance 0 | 41.22 |
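
To make the guess about the threshold concrete, here is a minimal sketch of a cosine-distance gate. It assumes the threshold simply rejects every track/detection pair whose appearance distance exceeds it. The function names and toy embeddings are hypothetical illustrations, not the repository's code or values from my model.

```python
import numpy as np


def cosine_distance(a, b):
    """Pairwise cosine distance between the rows of a and the rows of b."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Normalise each embedding to unit length before taking dot products.
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return 1.0 - a @ b.T


def admissible_pairs(track_features, detection_features, max_cosine_distance):
    """Boolean mask of (track, detection) pairs that pass the appearance gate.

    Assumption: a pair is admissible when its distance is <= the threshold.
    """
    dist = cosine_distance(track_features, detection_features)
    return dist <= max_cosine_distance


# Toy 2-D embeddings (hypothetical): each detection is close to one track.
tracks = np.array([[1.0, 0.0], [0.0, 1.0]])
detections = np.array([[0.9, 0.1], [0.2, 0.8]])

for threshold in (0.3, 0.05, 0.0):
    mask = admissible_pairs(tracks, detections, threshold)
    print(f"threshold={threshold}: {int(mask.sum())} admissible pair(s)")
```

Under this assumption, a threshold of 0 admits no pair unless the embeddings point in exactly the same direction, so the appearance model would stop contributing and association would fall back on the motion/IoU stage, which is consistent with the scores approaching the SORT result.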
There aren't many training parameters to tweak for the cosine metric. However, I was forced to use the triplet loss, as this issue reports. Did you have to use the triplet loss as well? At first I guessed the problem was related to the dataset, but now I wonder whether this is the reason why my model does not work very well.
Thank you in advance!