Replies: 1 comment 1 reply
-
Hi @MMayr96 ,
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
I have some problems fine-tuning pre-trained TSTPlus models.
Problem Statement:
Given n features, i.e. sensor measurements tracked over time, I want to pretrain a general model using masked value prediction on slided windows of multidimensional data. I then want to take this pre-trained model and fine-tune it on specific tasks, e.g. forecasting, classification. Some snippets of the currently implemented pre-training approach that converges is shown in the following.
Data Preparation:
Data Loaders:
Pre-Training:
This converges well. The actual problems start when utilizing this pre-trained model. Given now this models/test_pretrain.pth, I would like to fine-tune this on the labelled data loader (i.e. incl. target sensor) . As far as I understand, one can load the pre-trained model like the following. I tried to fine-tune on a forecasting / regression task, i.e. predict y:
However the fine-tuned model does not converge at all, trainings and validation loss is through the roof. I think I may have some logical flaws somewhere in my used pipeline. Any hints are appreciated!
Beta Was this translation helpful? Give feedback.
All reactions