'ErrorMessage': 'Due to a recent data gap, training is waiting for more data to populate' #132

Open
9race opened this issue Jul 20, 2024 · 2 comments


9race commented Jul 20, 2024

I tried running the code snippet below on my data, which has 28 data points and no gaps.

    de_obj = DataExploration(freq='D', data_shift_truncate=False, is_log_transformed=False, fill_rate=0.8, sig_level=0.001)
    print(de_obj.min_ts_length)
    imputed_data, pre_prc = de_obj.profile(df)
    print(len(df))

However, I keep getting the error 'ErrorMessage': 'Due to a recent data gap, training is waiting for more data to populate' in pre_prc. From the source code online it seems that 'min_ts_length' defaults to 21 for daily data, but even if I lower it further I still get the same error message.

I also created dummy data with 208 data points and the error is the same, so I assume this isn't really about the number of data points. Any help?

@sayanchk
Collaborator

Hello @9race, it would be great if you could share some dummy data so I can replicate and debug the error on my end. Given that you have daily data, this specific error should not appear unless the proportion of missing values exceeds what the specified fill_rate allows.
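One way to check the missing-value condition before calling `profile` is to measure how much of the expected daily calendar is actually populated. The sketch below uses only pandas; `observed_fill_rate` is a hypothetical helper, not part of Luminaire, and it may not match the library's internal check exactly. The sample series is made up for illustration.

```python
import numpy as np
import pandas as pd


def observed_fill_rate(series: pd.Series, freq: str = "D") -> float:
    """Fraction of expected timestamps that carry a non-missing value."""
    # Reindex onto the full calendar between the first and last timestamp so
    # that absent dates count as missing, not just explicit NaNs.
    full_index = pd.date_range(series.index.min(), series.index.max(), freq=freq)
    aligned = series.reindex(full_index)
    return float(aligned.notna().mean())


# Hypothetical data: 28 daily points, then the same series with 7 dates dropped.
idx = pd.date_range("2024-06-01", periods=28, freq="D")
values = pd.Series(np.arange(28, dtype=float), index=idx)
gappy = values.drop(idx[5:12])

print(observed_fill_rate(values))  # 1.0
print(observed_fill_rate(gappy))   # 0.75, below a fill_rate of 0.8
```

If the observed value is comfortably above the configured `fill_rate` and the error still appears, the cause is more likely the index or column types than the data volume.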

@9race
Author

9race commented Jul 22, 2024

Hi @sayanchk, I've managed to fix the error by adjusting the types of my DataFrame index and columns. However, I've now run into a new issue: anomalies that look visually obvious to me are not being detected. Is there a minimum number of data points required for the model to perform adequately? I've attached dummy data, as well as a screenshot of the data with the obvious-seeming anomalies, for reference.

luminaire_testing.csv
Graph
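The thread does not say which types were changed, so the following is only a guess at what such a fix could look like: parsing the time column into a `DatetimeIndex` and coercing the values to a numeric dtype. `normalize_for_profiling` is a hypothetical helper, the input column names are invented, and naming the value column `raw` is an assumption about the layout Luminaire expects.

```python
import pandas as pd


def normalize_for_profiling(df: pd.DataFrame, time_col: str, value_col: str) -> pd.DataFrame:
    """Return a copy with a sorted DatetimeIndex and a numeric value column."""
    out = df[[time_col, value_col]].copy()
    # Strings such as "2024-06-01" become real timestamps.
    out[time_col] = pd.to_datetime(out[time_col])
    # Non-numeric entries become NaN instead of raising.
    out[value_col] = pd.to_numeric(out[value_col], errors="coerce")
    out = out.set_index(time_col).sort_index()
    # Assumption: the value column is expected to be called "raw".
    return out.rename(columns={value_col: "raw"})


# Hypothetical input where both columns were read in as strings.
raw_df = pd.DataFrame(
    {"date": ["2024-06-03", "2024-06-01", "2024-06-02"], "count": ["12", "10", "11"]}
)
clean = normalize_for_profiling(raw_df, "date", "count")
print(type(clean.index).__name__)  # DatetimeIndex
print(clean["raw"].tolist())       # [10, 11, 12]
```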
