Interpreting loss functions probabilistically reveals the massive, hidden depths of the iceberg beneath the surface.
Moving beyond simple geometric distances, this perspective fundamentally broadens how we interpret data.
By treating real-world outcomes as observations drawn from a probability distribution that the model predicts, we gain a deeper, unified understanding of how our predictions connect to the ground truth.

Likelihood and its Calculation
Maximum Likelihood Estimation (MLE)