Skip to content

Commit 2e24859

Browse files
authored
Merge pull request #152 from StatProofBook/master
update to master
2 parents 04f6b05 + 6f1ee2a commit 2e24859

13 files changed

Lines changed: 80 additions & 26 deletions

File tree

D/para.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,6 @@ username: "JoramSoch"
2727
---
2828

2929

30-
**Definition:** A parameter, also "statistical parameter", is any fixed quantity, i.e. [constant](/D/const) scalar, vector or matrix, that describes a parametrized [probability distribution](/D/dist).
30+
**Definition:** A parameter, also "statistical parameter", is any fixed quantity, i.e. [constant](/D/const) scalar, vector or matrix, that describes a parametrized [probability distribution](/D/dist) by influencing its [probability mass function](/D/pmf) or [probability density function](/D/pdf).
3131

3232
Examples of parameters include the mean and variance parameters of a [normal distribution](/D/norm), covariance parameters in a [multivariate](/D/mvn) or [matrix](/D/matn)-normal distribution, shape and rate parameters of the [gamma distribution](/D/gam) or the vector of category probabilities in a [multinomial distribution](/D/mult).

I/DbA.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ title: "Definition by Author"
1313

1414
- [Sample covariance](/D/cov-samp)
1515

16-
### JoramSoch (187 definitions)
16+
### JoramSoch (188 definitions)
1717

1818
- [Akaike information criterion](/D/aic)
1919
- [Alternative hypothesis](/D/h1)
@@ -136,6 +136,7 @@ title: "Definition by Author"
136136
- [One-tailed and two-tailed test](/D/test-tail)
137137
- [One-way analysis of variance](/D/anova1)
138138
- [p-value](/D/pval)
139+
- [Parameter](/D/para)
139140
- [Point and set hypothesis](/D/hyp-point)
140141
- [Poisson distribution](/D/poiss)
141142
- [Poisson distribution with exposure values](/D/poissexp)

I/DbN.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -210,3 +210,4 @@ title: "Definition by Number"
210210
| D201 | post-pred | [Posterior predictive distribution](/D/post-pred) | aloctavodia | 2024-08-18 |
211211
| D202 | prior-pred | [Prior predictive distribution](/D/prior-pred) | aloctavodia | 2024-08-19 |
212212
| D203 | data | [Data](/D/data) | JoramSoch | 2024-09-20 |
213+
| D204 | para | [Parameter](/D/para) | JoramSoch | 2024-09-27 |

I/DbT.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -177,6 +177,7 @@ title: "Definition by Topic"
177177
### P
178178

179179
- [p-value](/D/pval)
180+
- [Parameter](/D/para)
180181
- [Point and set hypothesis](/D/hyp-point)
181182
- [Poisson distribution](/D/poiss)
182183
- [Poisson distribution with exposure values](/D/poissexp)

I/PbA.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ title: "Proof by Author"
1212

1313
- [Posterior predictive distribution is a marginal distribution of the joint likelihood](/P/postpred-jl)
1414

15-
### JoramSoch (435 proofs)
15+
### JoramSoch (436 proofs)
1616

1717
- [Accuracy and complexity for Bayesian linear regression](/P/blr-anc)
1818
- [Accuracy and complexity for Bayesian linear regression with known covariance](/P/blrkc-anc)
@@ -147,6 +147,7 @@ title: "Proof by Author"
147147
- [Gibbs' inequality](/P/gibbs-ineq)
148148
- [Independence of estimated parameters and residuals in multiple linear regression](/P/mlr-ind)
149149
- [Independence of products of multivariate normal random vector](/P/mvn-indprod)
150+
- [Independent random variables are uncorrelated](/P/corr-ind)
150151
- [Inflection points of the probability density function of the normal distribution](/P/norm-infl)
151152
- [Invariance of the covariance matrix under addition of constant vector](/P/covmat-inv)
152153
- [Invariance of the differential entropy under addition of a constant](/P/dent-inv)
@@ -475,9 +476,10 @@ title: "Proof by Author"
475476
- [Variance of the exponential distribution](/P/exp-var)
476477
- [Variance of the log-normal distribution](/P/lognorm-var)
477478

478-
### salbalkus (1 proof)
479+
### salbalkus (2 proofs)
479480

480481
- [The expected value minimizes the mean squared error](/P/mean-mse)
482+
- [The median minimizes the mean absolute error](/P/med-mae)
481483

482484
### StatProofBook (1 proof)
483485

I/PbN.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -476,3 +476,5 @@ title: "Proof by Number"
476476
| P468 | mean-wlln | [Weak law of large numbers](/P/mean-wlln) | JoramSoch | 2024-09-13 |
477477
| P469 | mean-mse | [The expected value minimizes the mean squared error](/P/mean-mse) | salbalkus | 2024-09-13 |
478478
| P470 | ind-self | [Self-independence of random event](/P/ind-self) | JoramSoch | 2024-09-20 |
479+
| P471 | med-mae | [The median minimizes the mean absolute error](/P/med-mae) | salbalkus | 2024-09-23 |
480+
| P472 | corr-ind | [Independent random variables are uncorrelated](/P/corr-ind) | JoramSoch | 2024-09-27 |

I/PbT.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -165,6 +165,7 @@ title: "Proof by Topic"
165165

166166
- [Independence of estimated parameters and residuals in multiple linear regression](/P/mlr-ind)
167167
- [Independence of products of multivariate normal random vector](/P/mvn-indprod)
168+
- [Independent random variables are uncorrelated](/P/corr-ind)
168169
- [Inflection points of the probability density function of the normal distribution](/P/norm-infl)
169170
- [Invariance of the covariance matrix under addition of constant vector](/P/covmat-inv)
170171
- [Invariance of the differential entropy under addition of a constant](/P/dent-inv)
@@ -497,6 +498,7 @@ title: "Proof by Topic"
497498
- [t-test for multiple linear regression using contrast-based inference](/P/mlr-t)
498499
- [The expected value minimizes the mean squared error](/P/mean-mse)
499500
- [The log probability scoring rule is a strictly proper scoring rule](/P/lpsr-spsr)
501+
- [The median minimizes the mean absolute error](/P/med-mae)
500502
- [The p-value follows a uniform distribution under the null hypothesis](/P/pval-h0)
501503
- [The regression line goes through the center of mass point](/P/slr-comp)
502504
- [The residuals and the covariate are uncorrelated in simple linear regression](/P/slr-rescorr)

P/anova1-repara.md

Lines changed: 42 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -90,12 +90,50 @@ y_{ij} = \mu + \delta_i + \varepsilon_{ij} &= \mu_i + \varepsilon_{ij} = y_{ij}
9090
\end{split}
9191
$$
9292

93-
2) Equation \eqref{eq:anova1-repara} is a special case of the [two-way analysis of variance](/D/anova2) with (i) just one factor $A$ and (ii) no interaction term. Thus, OLS estimates are identical to [that of two-way ANOVA](/P/anova2-ols), i.e. given by
93+
2) The [residual sum of squares](/D/rss) for the reparametrized model is
9494

95-
$$ \label{eq:anova1-repara-c2-qed}
95+
$$ \label{eq:anova1-repara-rss}
96+
\mathrm{RSS}(\mu,\delta) = \sum_{i=1}^{k} \sum_{j=1}^{n_i} \varepsilon_{ijk}^2 = \sum_{i=1}^{k} \sum_{j=1}^{n_i} (y_{ij} - \mu - \delta_i)^2
97+
$$
98+
99+
and the derivatives of $\mathrm{RSS}$ with respect to $\mu$, $\delta$ are
100+
101+
$$ \label{eq:anova1-repara-rss-der-mu}
102+
\begin{split}
103+
\frac{\mathrm{d}\mathrm{RSS}}{\mathrm{d}\mu}
104+
&= \sum_{i=1}^{k} \sum_{j=1}^{n_i} \frac{\mathrm{d}}{\mathrm{d}\mu} (y_{ij} - \mu - \delta_i)^2 \\
105+
&= \sum_{i=1}^{k} \sum_{j=1}^{n_i} -2 (y_{ij} - \mu - \delta_i) \\
106+
&= \sum_{i=1}^{k} \left( 2 n_i \mu + 2 n_i \delta_i - 2 \sum_{j=1}^{n_i} y_{ij} \right) \\
107+
&= 2 n \mu + 2 \sum_{i=1}^{k} n_i \delta_i - 2 \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij}
108+
\end{split}
109+
$$
110+
111+
$$ \label{eq:anova1-repara-rss-der-delta}
112+
\begin{split}
113+
\frac{\mathrm{d}\mathrm{RSS}}{\mathrm{d}\delta_i}
114+
&= \sum_{j=1}^{n_i} \frac{\mathrm{d}}{\mathrm{d}\delta_i} (y_{ij} - \mu - \delta_i)^2 \\
115+
&= \sum_{j=1}^{n_i} -2 (y_{ij} - \mu - \delta_i) \\
116+
&= 2 n_i \mu + 2 n_i \delta_i - 2 \sum_{j=1}^{n_i} y_{ij} \; .
117+
\end{split}
118+
$$
119+
120+
Setting these derivatives to zero, we obtain the estimates of $\mu$ and $\delta_i$:
121+
122+
$$ \label{eq:anova1-repara-rss-der-mu-zero}
123+
\begin{split}
124+
0 &= 2 n \hat{\mu} + 2 \sum_{i=1}^{k} n_i \delta_i - 2 \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij} \\
125+
\hat{\mu} &= \frac{1}{n} \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij} - \sum_{i=1}^{k} \frac{n_i}{n} \delta_i \\
126+
&\overset{\eqref{eq:anova1-constr}}{=} \frac{1}{n} \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij} \\
127+
&= \bar{y}
128+
\end{split}
129+
$$
130+
131+
$$ \label{eq:anova1-repara-rss-der-delta-zero}
96132
\begin{split}
97-
\hat{\mu} &= \bar{y}_{\bullet \bullet} = \frac{1}{n} \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij} \\
98-
\hat{\delta}_i &= \bar{y}_{i \bullet} - \bar{y}_{\bullet \bullet} = \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij} - \frac{1}{n} \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij} \; .
133+
0 &= 2 n_i \hat{\mu} + 2 n_i \hat{\delta}_i - 2 \sum_{j=1}^{n_i} y_{ij} \\
134+
\hat{\delta}_i &= \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij} - \hat{\mu} \\
135+
&\overset{\eqref{eq:anova1-repara-rss-der-mu-zero}}{=} \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij} - \frac{1}{n} \sum_{i=1}^{k} \sum_{j=1}^{n_i} y_{ij} \\
136+
&= \bar{y}_i - \bar{y} \; .
99137
\end{split}
100138
$$
101139

P/anova2-ols.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -122,7 +122,7 @@ $$ \label{eq:rss-der-gamma}
122122
\end{split}
123123
$$
124124

125-
Setting these derivatives to zero, we obtain the estimates of $\mu_i$, $\alpha_i$, $\beta_j$ and $\gamma_{ij}$:
125+
Setting these derivatives to zero, we obtain the estimates of $\mu$, $\alpha_i$, $\beta_j$ and $\gamma_{ij}$:
126126

127127
$$ \label{eq:rss-der-mu-zero}
128128
\begin{split}

P/med-mae.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -64,8 +64,8 @@ Applying Leibniz's rule, we can differentiate the objective function as follows:
6464

6565
$$ \label{eq:med-mae-s2}
6666
\begin{split}
67-
& \frac{\partial}{\partial a} \left( \int_{-\infty}^a (a - x) f(x) \, \mathrm{d}x + \int_{a}^\infty (x - a) f(x) \, \mathrm{d}x \right) \\
68-
= & (a - x) f(x) + \int_{-\infty}^a f(x) \, \mathrm{d}x - (x - a) f(x) - \int_{a}^\infty f(x) \, \mathrm{d}x \; .
67+
&\frac{\partial}{\partial a} \left( \int_{-\infty}^a (a - x) f(x) \, \mathrm{d}x + \int_{a}^\infty (x - a) f(x) \, \mathrm{d}x \right) \\
68+
=\; &(a - x) f(x) + \int_{-\infty}^a f(x) \, \mathrm{d}x - (x - a) f(x) - \int_{a}^\infty f(x) \, \mathrm{d}x \; .
6969
\end{split}
7070
$$
7171

0 commit comments

Comments
 (0)