Probability Distributions for Travel Demand Modelling
Probability Distributions for Travel Demand Modelling
0. Probability Distributions
Probability Density Function (PDF):
Cumulative Density Function (CDF):
Mean:
Variance:
Propositions:
-
\(\mathrm{Var}[X] = \mathrm{E} [X - \mathrm{E}[X]]^2 = \mathrm{E} [X ^2] - (\mathrm{E}[X])^2\)
-
Covariance: \(\mathrm{Cov}[X, Y] = \mathrm{E}[(X - \mathrm{E}[X])(Y - \mathrm{E}[Y])]\)
-
Correlation coefficient: \(\displaystyle \rho_{X, Y} = \frac{\mathrm{Cov}[X, Y]}{\sigma_X \, \sigma_Y}\)
1. Continuous Uniform Distribution
CDF
Mean : \(\displaystyle \mu = \mathrm{E}[X] = \frac{a+b}{2}\)
Variance : \(\displaystyle \sigma^2 = \mathrm{Var}[X] = \frac{(b-a)^2}{12}\)
2. Normal Distribution
CDF
Note : If mean \(\mu=0\) and variance \(\sigma=1\), random \(X\) ( i.e., named by a \(Z\) ) follows the standard normal distribution \(\mathcal{N}(0,1)\), the CDF is expressed by \(F(z) = \Pr(Z \leq z) = \Phi(z)\).
3. Gumbel Distribution
For a series of random variables \(T_1, T_2, \cdots, T_n\) are statistically independent and identically distributed (IID, or i.i.d.) with an exponential tail distribution, and the CDF is denoted as \(F_T(t)\). \(n\) is taken as the extreme value (very large \(n\)). Then, let random variable \(X = \max \{T_1, T_2, \cdots, T_n\}\) and \(X\) follows the Gumbel distribution:
where \(\eta\) is a location parameter, \(\mu > 0\) is a scale parameter.
CDF
Propositions
-
Mode: \(\eta\)
-
Mean: \(\displaystyle \eta + \frac{\gamma}{\mu}\), where \(\gamma\) is Euler-Mascheroni constant (\(\gamma \approx 0.577\) )
-
Variance: \(\displaystyle \frac{\pi^2}{6\mu^2}\)
-
If \(X \sim \text{Gumbel}(\eta, \mu)\), then:
\[\alpha X + V \sim \text{Gumbel} \left(\alpha \, \eta + V, \frac{\mu}{\alpha} \right) \]where \(V\) and \(\alpha\) are any scalar constants.
-
If \(X_1 \sim \text{Gumbel}(\eta_1, \mu)\), \(X_2 \sim \text{Gumbel}(\eta_2, \mu)\), and, \(X_1\) and \(X_2\) are i.i.d., then:
\[\max \{X_1, X_2\} \sim \text{Gumbel} \left( \frac{\ln \left[ \exp(\eta_1 \, \mu) + \exp(\eta_2 \,\mu)\right]}{\mu}, \mu \right) \] -
As a corollary to proposition 5: if \((X_1, X_2, \cdots, X_J)\) are \(J\) independent Gumbel distributed variables with parameters \((\eta_1, \mu), (\eta_2, \mu), \cdots, (\eta_J, \mu)\), respectively,
then \(\max \{X_1, X_2, \cdots, X_J\}\) is Gumbel distributed:\[\max \{X_1, X_2, \cdots, X_J\} \sim \text{Gumbel} \left(\frac{\ln \left[ \sum_{j=1}^J \exp(\eta_j \, \mu)\right]}{\mu}, \mu \right) \]
4. Logistic Distribution
For a series of random variables \(T_1, T_2, \cdots, T_n\) are statistically independent and identically distributed (IID, or i.i.d.) with an exponential tail distribution, and the CDF is denoted as \(F_T(t)\). \(n\) is taken as the extreme value (very large \(n\)). Then, let random variable \(X\)
and \(X\) follows the Logistic distribution:
PDF:
where \(\eta\) is a location parameter, \(\mu > 0\) is a scale parameter.
CDF:
Propositions
-
Mode: \(\eta\)
-
Mean: \(\eta\)
-
Variance: \(\displaystyle \frac{\pi^2}{3\mu}\)
-
If \(X \sim \text{Logistic}(\eta, \mu)\), then:
\[\alpha X + V \sim \text{Logistic} \left(\alpha \, \eta + V, \alpha \, \mu \right) \]where \(V\) and \(\alpha\) are any scalar constants.
-
If \(X_1 \sim \text{Gumbel}(\eta_1, \mu)\), \(X_2 \sim \text{Gumbel}(\eta_2, \mu)\), and, \(X_1\) and \(X_2\) are i.i.d., then:
\[Y = X_1 - X_2 \sim \text{Logistic} \left(\eta_1 - \eta_2, \mu \right) \]Remark : \(X_1 + Y_2 \nsim \mathrm {Logistic} (\eta_1 + \eta_2 ,\mu)\)
5. Bivariant Normal Distribution
\(X \sim \mathcal{N} \left(\mu_X, \sigma_X^2 \right)\) and \(Y \sim \mathcal{N} \left(\mu_Y, \sigma_Y^2 \right)\). \(X\) and \(Y\) are not independent, and the correlation coefficient is \(\rho\).
PDF :
where \(-\infty < x < +\infty\) and \(-\infty < y < +\infty\)
Propositions :
-
\(\mathrm{E}[X] = \mu_X\) and \(\mathrm{E}[Y] = \mu_Y\)
-
\(\mathrm{Var}[X] = \sigma_X^2\) and \(\mathrm{Var}[Y] = \sigma_Y^2\)
-
\(\mathrm{Cov}[X, Y] = \rho \, \sigma_X \, \sigma_Y\)
6. Multivariant Normal Distribution
Assume that a \(n\)-dimension random variable vector \(\boldsymbol{X}\) follows a multivariant normal distribution with a mean vector \(\boldsymbol{\mu} \in \mathbb{R}^{n}\) and a covariance matrix \(\mathbf{\Sigma} \in \mathbb{R}^{n\times n}\):
The joint PDF is:
For the bivariant normal distribution:

浙公网安备 33010602011771号