It starts to predict the same value as I increase sample number to predict #35

matteoottaviani · 2019-03-25T18:03:30Z

Hi, sorry for bothering you.
I have been dealing with the following problem for 3 months, hence I decided to try to share it.

I am training kriging and xgboost on sets of increasing sample number ( say from 100 samples to 1000 samples) of 15 inputs and 1 output each sample and I use the trained functions to predict a test set (that is always the same).

Whilst I haven't ever had any problem with xgboost prediction, when I predict the test set with kriging, I have no problems up to 400ish samples training; the more I increase from say around 400 samples to train, the more the last values of the test set which I predict equal the same value.
Have you got any idea about that?
Thanks!
Matteo

capaulson · 2019-04-02T05:46:00Z

It's really tough to say based on this information. Can you share the data?

matteoottaviani · 2019-04-03T14:13:15Z

Thank you very much for your answer. Yes, I can share, of course.

samplesize=1000
testsize=100
mc=0
X,OB = pickle.load( open('BH_DATA/sample'+str(samplesize)+'OB_'+str(mc)+'.pkl', 'rb') )
Xt,OBt = pickle.load( open('BH_DATA/testsample'+str(testsize)+'OB_'+str(mc)+'.pkl', 'rb') )

I train on X (1000 input combinations) and three different model outputs OB[:,s], for s=[0,1,2], and I test on Xt with the three model outputs OBt[:,s]

Up to X=X[:400] OB=OB[:400] any values predicted is different one another and it seems to work quite well; from X=X[:500] OB=OB[:500] on it predicts increasingly more identical values at the bottom of the prediction lists of 100 test samples.

thank you.
mat

smple+test set.zip

mjoshii · 2020-10-28T11:30:51Z

Hi Matteo, Capaulson

I am trying to use a dataset with ~7000 samples having 8 Xs and 1 Y. I can train the dataset but I am not sure how to save the model. I went through the scripts but don't see any option to save the model. If I use regression kriging, I don't the plot function or save figure function. So, how do I actually plot the actual vs predicted values of Y or even the error in them?

I think I will need to do the plotting outside the standard PyKriging library by extracting the predicted Y if using regression kriging.

What I am essentially trying to understand is:

Is there a way to save the model?
Which value in the code gives Y-predicted?

Appreciate your insights.

Best Regards,
Manish

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

It starts to predict the same value as I increase sample number to predict #35

It starts to predict the same value as I increase sample number to predict #35

matteoottaviani commented Mar 25, 2019

capaulson commented Apr 2, 2019

matteoottaviani commented Apr 3, 2019 •

edited

Loading

mjoshii commented Oct 28, 2020

It starts to predict the same value as I increase sample number to predict #35

It starts to predict the same value as I increase sample number to predict #35

Comments

matteoottaviani commented Mar 25, 2019

capaulson commented Apr 2, 2019

matteoottaviani commented Apr 3, 2019 • edited Loading

mjoshii commented Oct 28, 2020

matteoottaviani commented Apr 3, 2019 •

edited

Loading