How should I sample my validation set if I randomly sample training data?
I have:
- a training dataset of 150k samples
- a validation dataset of 19k samples
At each epoch, I randomly sample 10k data points without replacement for training, because otherwise I get out-of-memory errors.
I need to downsample my validation set too. Which of the following methods seems most appropriate?
- Randomly sample a validation subset whose size is x% of 10k once, and reuse that same subset at every epoch.
- Randomly sample a new validation subset whose size is x% of 10k at every epoch.
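To make the two options concrete, here is a minimal NumPy sketch of what I mean. It only draws indices and does not train anything. The names `VAL_FRACTION`, `sample_indices` and `make_epoch_indices` are made up for illustration, and the 20% value is just a placeholder for the unspecified x%.

```python
import numpy as np

N_TRAIN, N_VAL = 150_000, 19_000   # dataset sizes from the question
TRAIN_PER_EPOCH = 10_000           # training points drawn each epoch
VAL_FRACTION = 0.2                 # placeholder for "x%" (assumption, not a recommendation)
VAL_SIZE = round(VAL_FRACTION * TRAIN_PER_EPOCH)


def sample_indices(rng, population, size):
    """Draw `size` distinct indices from range(population), i.e. without replacement."""
    return rng.choice(population, size=size, replace=False)


def make_epoch_indices(n_epochs, seed=0):
    """Return (train_idx, fixed_val_idx, fresh_val_idx) for each epoch."""
    rng = np.random.default_rng(seed)

    # Option 1: draw the validation subset once, before the epoch loop.
    fixed_val_idx = sample_indices(rng, N_VAL, VAL_SIZE)

    epochs = []
    for _ in range(n_epochs):
        # Training subset is redrawn every epoch, as described above.
        train_idx = sample_indices(rng, N_TRAIN, TRAIN_PER_EPOCH)
        # Option 2: draw a new validation subset inside the epoch loop.
        fresh_val_idx = sample_indices(rng, N_VAL, VAL_SIZE)
        epochs.append((train_idx, fixed_val_idx, fresh_val_idx))
    return epochs


epochs = make_epoch_indices(n_epochs=3)
```

With option 1, the validation metric at every epoch is computed on the same points. With option 2, the points change from epoch to epoch.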