Replies: 1 comment 1 reply
-
Hi @MMayr96 ,
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi all,
I have some problems fine-tuning pre-trained TSTPlus models.
Problem Statement:
Given n features, i.e. sensor measurements tracked over time, I want to pretrain a general model using masked value prediction on slided windows of multidimensional data. I then want to take this pre-trained model and fine-tune it on specific tasks, e.g. forecasting, classification. Some snippets of the currently implemented pre-training approach that converges is shown in the following.
Data Preparation:
Data Loaders:
Pre-Training:
This converges well. The actual problems start when utilizing this pre-trained model. Given now this models/test_pretrain.pth, I would like to fine-tune this on the labelled data loader (i.e. incl. target sensor) . As far as I understand, one can load the pre-trained model like the following. I tried to fine-tune on a forecasting / regression task, i.e. predict y:
However the fine-tuned model does not converge at all, trainings and validation loss is through the roof. I think I may have some logical flaws somewhere in my used pipeline. Any hints are appreciated!
Beta Was this translation helpful? Give feedback.
All reactions