Hi, thanks for your question. This sounds like an interesting use-case.


> Why is the F1 score so good, yet the AUROC so low?

In Anomalib, we report the F1 score at the optimal threshold value, i.e. the best F1 score that can be achieved by varying the threshold applied to the model's raw predictions. A low AUROC combined with a high F1 score therefore means that the model performs very well at that one optimal threshold, but poorly at most other threshold values. To get a better idea of the threshold-dependent behaviour of your model, you could visually inspect the ROC curve, which is generated automatically.
