Skip to content

[The testing result on a siren audio file seems not working from my end] #62

@allenhung1025

Description

@allenhung1025

Hi @RetroCirce , I want to thank you for your great work!!!!
I am playing around the model checkpoint you provided (HTSAT_AudioSet_Saved_6.ckpt).
I can successfully load the model checkpoint and make the predictions. However, the probability summation is not 1 and after I do the softmax, the resulting prediction doesn't make sense to me, since the probability is super low. Do you have any insight or ideas about why this is the case?
The siren audio file I tried on the model and I believed it should be detected as either 323,/m/04qvtq,"Police car (siren)",
or 324,/m/012n7d,"Ambulance (siren)", but it is detected as music instead.
The colab demo
Appreciate the great work again, and hope to gain insight from you!!!

Before softmax

Running prediction...
pred probability sum: 2.45
[
  [
    137,
    "Music",
    0.6176819801330566
  ],
  [
    320,
    "Ice cream truck, ice cream van",
    0.30062487721443176
  ],
  [
    0,
    "Speech",
    0.13741374015808105
  ]
]

After softmax

pred probability sum: 1.00
[
  [
    137,
    "Music",
    0.0035008378326892853
  ],
  [
    320,
    "Ice cream truck, ice cream van",
    0.002549622440710664
  ],
  [
    0,
    "Speech",
    0.0021656793542206287
  ]
]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions