In many regression tasks, the target variable is a continuous value that is strictly greater than zero, meaning $y^{(i)} \in (0, \infty)$.
Unlike count data (which must be integers), these targets can take on any fractional value but cannot be negative. Common real-world examples include:
The Gamma distribution is a continuous probability distribution defined exclusively for positive real numbers.
Because it is bounded at zero and can flexibly model heavily right-skewed data (where lower values are common but extreme high values occasionally occur), it is highly suitable for predicting positive continuous targets in regression. Real-world data like transaction costs or waiting times rarely follow a perfectly symmetric "bell curve" and are strictly non-negative, making the Gamma distribution a much better fit than a standard Gaussian (Normal) distribution.
If a continuous random variable $Y$ follows a Gamma distribution parameterized by a shape parameter $\alpha$ and a scale parameter $\rho$, its probability density function (PDF) is given by:
$$ p(y ; k,\rho) = \frac{1}{\Gamma(k)\left(\rho\right)^{k}} \left(y\right)^{k-1} \exp\left(-\frac{y}{\rho}\right) $$
Where:
To model this probabilistically, we assume that the target variable $y$, given the input $x$, follows a Gamma distribution parameterized by a shape $k > 0$ and a scale $\rho> 0$.
$$ y^{(i)} \mid x^{(i)},\theta \sim \text{Gamma}(k^{(i)}, \rho^{(i)}) $$