Why is Reinforcement Learning so Curious?

    In this simplest of all cases, we are given some data, say (image, label) pairs of cats and humans, and we want to build a cat vs. human discriminator.
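To make the setup concrete, here is a minimal sketch of learning a discriminator from labeled pairs. Everything in it is an assumption made for illustration: each "image" is replaced by a made-up two-number feature vector, the data is synthetic, and the model is plain logistic regression trained with gradient descent. The names `make_toy_data`, `train`, and `predict` are hypothetical helpers, not part of any library.

```python
import math
import random


def make_toy_data(n_per_class=50, seed=0):
    """Return (features, label) pairs; label 0 = cat, 1 = human.

    Assumption: each "image" is a 2-number feature vector drawn from a
    Gaussian blob, standing in for real pixel data.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n_per_class):
        data.append(([rng.gauss(-2.0, 1.0), rng.gauss(-2.0, 1.0)], 0))  # cat
        data.append(([rng.gauss(2.0, 1.0), rng.gauss(2.0, 1.0)], 1))    # human
    return data


def sigmoid(z):
    # Written in two branches so math.exp never overflows for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train(data, lr=0.1, epochs=200):
    """Fit logistic regression weights by full-batch gradient descent."""
    w = [0.0, 0.0]
    b = 0.0
    n = len(data)
    for _ in range(epochs):
        grad_w = [0.0, 0.0]
        grad_b = 0.0
        for x, y in data:
            p = sigmoid(w[0] * x[0] + w[1] * x[1] + b)
            err = p - y  # gradient of the log loss w.r.t. the logit
            grad_w[0] += err * x[0]
            grad_w[1] += err * x[1]
            grad_b += err
        w = [w[0] - lr * grad_w[0] / n, w[1] - lr * grad_w[1] / n]
        b -= lr * grad_b / n
    return w, b


def predict(model, x):
    """Return 1 (human) if the predicted probability is at least 0.5, else 0 (cat)."""
    w, b = model
    return 1 if sigmoid(w[0] * x[0] + w[1] * x[1] + b) >= 0.5 else 0


data = make_toy_data()
model = train(data)
accuracy = sum(predict(model, x) == y for x, y in data) / len(data)
print(f"training accuracy: {accuracy:.2f}")
```

The point of the sketch is the shape of the problem rather than the model: the labels are handed to us up front, and learning amounts to fitting a function from inputs to those labels.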

