about dataset spiltting #62

raymond00000 · 2021-08-04T08:17:17Z

Hi,

Sorry I am confused on the data splitting steps and codes.
Please kindly correct me if I got it wrong.

Omniglot has 50 classes. it allocated 30 classes in its original train split, and 20 in test split. these 20 is not used during model-training.

Assume we do 10 classes 5 shot classification in MAML.
in inner loop, we will randomly picked 10 classes from that 30 classes. For each class, we sample 5 pictures.
so, we will have 50 samples as support set, then we use these samples to update the base-model, so base-model becomes meta-model.
then, we will retrieve some samples (how and how many?? I am very confused on this sampling step) as query set, then we calculate loss with meta-model using these samples but to update the base-model.

my question is about the query set.

At the very beginning, do I have to split the 30 classes into 2 groups, say, 15 for support set, 15 for query set? I think I don't have too.
within the same class, does it have to be non-overlapping between the query set and support set? In another words, for one specific sample, will it be in support set in one mini-batch and in query set in next mini-batch? I think it is yes.

Many thanks for answering.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

about dataset spiltting #62

about dataset spiltting #62

raymond00000 commented Aug 4, 2021 •

edited

Loading

about dataset spiltting #62

about dataset spiltting #62

Comments

raymond00000 commented Aug 4, 2021 • edited Loading

raymond00000 commented Aug 4, 2021 •

edited

Loading