I certainly get how this works, conceptually, for say MNIST.
I learned that NNs are universal function approximators - and the way I understand this is that, at a very high level, they model a set of functions that map inputs to outputs for a particular domain. This genuinely seems like magic to me, and it feels like I don't know how to place it in my mental model of how compuation works.