Now, the training data and testing data are both labeled datasets. The training data is what we’ll fit the neural network with, and the testing data is what we’ll use to validate the results. The testing data is “out of sample,” meaning the network never sees it during training; we only use it to measure the network’s accuracy.
We also have the “test” images that we downloaded. Don’t confuse these with the labeled testing data above: those images are not labeled at all, and our predictions on them are what we’ll submit to Kaggle for the competition.
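To make the labeled split concrete, here is a minimal sketch of holding out part of a labeled dataset for testing. The helper name `split_labeled`, the toy `labeled` list, and the hold-out size of 3 are all assumptions for illustration, not the code used in this tutorial.

```python
import numpy as np

def split_labeled(data, test_size, seed=0):
    """Shuffle labeled examples and hold out `test_size` of them for testing."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(data))
    # The held-out examples are never used to fit the network
    test = [data[i] for i in order[:test_size]]
    train = [data[i] for i in order[test_size:]]
    return train, test

# Hypothetical stand-in for labeled images: (pixel_array, label) pairs
labeled = [(np.full((2, 2), i), i % 2) for i in range(10)]

train, test = split_labeled(labeled, test_size=3)
print(len(train), "training examples,", len(test), "testing examples")
```

Both halves keep their labels, which is what lets us score the network on the testing half. The unlabeled Kaggle images can't be scored this way, since we have no answers to compare against.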