Learning to classify images without explicit human annotations