---
layout: proof
mathjax: true

author: "Joram Soch"
affiliation: "BCCN Berlin"
e_mail: "joram.soch@bccn-berlin.de"
date: 2021-11-16 08:34:00

title: "Maximum likelihood estimation for simple linear regression"
chapter: "Statistical Models"
section: "Univariate normal data"
topic: "Simple linear regression"
theorem: "Maximum likelihood estimation"

sources:

proof_id: "P287"
shortcut: "slr-mle"
username: "JoramSoch"
---


**Theorem:** Given a [simple linear regression model](/D/slr) with independent observations

$$ \label{eq:slr}
y_i = \beta_0 + \beta_1 x_i + \varepsilon_i, \; \varepsilon_i \sim \mathcal{N}(0, \sigma^2), \; i = 1,\ldots,n \; ,
$$

the [maximum likelihood estimates](/D/mle) of $\beta_0$, $\beta_1$ and $\sigma^2$ are given by

$$ \label{eq:slr-mle}
\begin{split}
\hat{\beta}_0 &= \bar{y} - \hat{\beta}_1 \bar{x} \\
\hat{\beta}_1 &= \frac{s_{xy}}{s_x^2} \\
\hat{\sigma}^2 &= \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2
\end{split}
$$

where $\bar{x}$ and $\bar{y}$ are the [sample means](/D/mean-samp), $s_x^2$ is the [sample variance](/D/var-samp) of $x$ and $s_{xy}$ is the [sample covariance](/D/cov-samp) between $x$ and $y$.

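These closed-form estimates can be checked numerically. The following is a minimal sketch (not part of the proof) using NumPy on simulated data; the variable names and the simulated parameter values are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 1000
x = rng.normal(size=n)
y = 2.0 + 3.0 * x + rng.normal(scale=0.5, size=n)  # simulated: beta0 = 2, beta1 = 3, sigma = 0.5

s_xy = np.cov(x, y, ddof=1)[0, 1]     # sample covariance of x and y
s_x2 = np.var(x, ddof=1)              # sample variance of x
b1 = s_xy / s_x2                      # beta1-hat = s_xy / s_x^2
b0 = y.mean() - b1 * x.mean()         # beta0-hat = y-bar - beta1-hat * x-bar
s2 = np.mean((y - b0 - b1 * x) ** 2)  # sigma^2-hat uses 1/n, not 1/(n-1)

# cross-check slope and intercept against NumPy's least-squares fit,
# since the MLEs of beta0 and beta1 coincide with the OLS estimates
b1_ref, b0_ref = np.polyfit(x, y, deg=1)
assert np.allclose([b0, b1], [b0_ref, b1_ref])
```

Note that $\hat{\sigma}^2$ divides by $n$ rather than $n-1$, so it is a biased (though consistent) estimator of $\sigma^2$.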

**Proof:** With the [probability density function of the normal distribution](/P/norm-pdf) and [probability under independence](/D/ind), the linear regression equation \eqref{eq:slr} implies the following [likelihood function](/D/lf)

$$ \label{eq:slr-lf}
\begin{split}
p(y|\beta_0,\beta_1,\sigma^2) &= \prod_{i=1}^n p(y_i|\beta_0,\beta_1,\sigma^2) \\
&= \prod_{i=1}^n \mathcal{N}(y_i; \beta_0 + \beta_1 x_i, \sigma^2) \\
&= \prod_{i=1}^n \frac{1}{\sqrt{2 \pi \sigma^2}} \cdot \exp \left[ -\frac{(y_i - \beta_0 - \beta_1 x_i)^2}{2 \sigma^2} \right] \\
&= \frac{1}{\sqrt{(2 \pi \sigma^2)^n}} \cdot \exp\left[ -\frac{1}{2 \sigma^2} \sum_{i=1}^n (y_i - \beta_0 - \beta_1 x_i)^2 \right]
\end{split}
$$

and the [log-likelihood function](/D/llf)

$$ \label{eq:slr-ll}
\begin{split}
\mathrm{LL}(\beta_0,\beta_1,\sigma^2) &= \log p(y|\beta_0,\beta_1,\sigma^2) \\
&= -\frac{n}{2} \log(2\pi) - \frac{n}{2} \log (\sigma^2) -\frac{1}{2 \sigma^2} \sum_{i=1}^n (y_i - \beta_0 - \beta_1 x_i)^2 \; .
\end{split}
$$

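As a sanity check (not part of the proof), this closed-form log-likelihood can be compared against a sum of normal log-densities from SciPy; the data and the evaluation point below are arbitrary illustrative choices:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.normal(size=50)
y = 1.0 + 2.0 * x + rng.normal(size=50)
b0, b1, s2 = 1.0, 2.0, 1.0  # arbitrary evaluation point (need not be the MLE)

n = len(y)
resid = y - b0 - b1 * x
# closed-form log-likelihood from the derivation above
ll = -n/2 * np.log(2*np.pi) - n/2 * np.log(s2) - np.sum(resid**2) / (2*s2)

# reference: sum over i of log N(y_i; b0 + b1*x_i, s2)
ll_ref = stats.norm.logpdf(y, loc=b0 + b1*x, scale=np.sqrt(s2)).sum()
assert np.isclose(ll, ll_ref)
```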
<br>
The derivative of the log-likelihood function \eqref{eq:slr-ll} with respect to $\beta_0$ is

$$ \label{eq:dLL-dbeta0}
\frac{\mathrm{d}\mathrm{LL}(\beta_0,\beta_1,\sigma^2)}{\mathrm{d}\beta_0} = \frac{1}{\sigma^2} \sum_{i=1}^n (y_i - \beta_0 - \beta_1 x_i)
$$

and setting this derivative to zero gives the MLE for $\beta_0$:

$$ \label{eq:beta0-mle}
\begin{split}
\frac{\mathrm{d}\mathrm{LL}(\hat{\beta}_0,\hat{\beta}_1,\hat{\sigma}^2)}{\mathrm{d}\beta_0} &= 0 \\
0 &= \frac{1}{\hat{\sigma}^2} \sum_{i=1}^n (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i) \\
0 &= \sum_{i=1}^n y_i - n \hat{\beta}_0 - \hat{\beta}_1 \sum_{i=1}^n x_i \\
\hat{\beta}_0 &= \frac{1}{n} \sum_{i=1}^n y_i - \hat{\beta}_1 \frac{1}{n} \sum_{i=1}^n x_i \\
\hat{\beta}_0 &= \bar{y} - \hat{\beta}_1 \bar{x} \; .
\end{split}
$$


<br>
The derivative of the log-likelihood function \eqref{eq:slr-ll} at $\hat{\beta}_0$ with respect to $\beta_1$ is

$$ \label{eq:dLL-dbeta1}
\frac{\mathrm{d}\mathrm{LL}(\hat{\beta}_0,\beta_1,\sigma^2)}{\mathrm{d}\beta_1} = \frac{1}{\sigma^2} \sum_{i=1}^n (x_i y_i - \hat{\beta}_0 x_i - \beta_1 x_i^2)
$$

and setting this derivative to zero gives the MLE for $\beta_1$:

$$ \label{eq:beta1-mle}
\begin{split}
\frac{\mathrm{d}\mathrm{LL}(\hat{\beta}_0,\hat{\beta}_1,\hat{\sigma}^2)}{\mathrm{d}\beta_1} &= 0 \\
0 &= \frac{1}{\hat{\sigma}^2} \sum_{i=1}^n (x_i y_i - \hat{\beta}_0 x_i - \hat{\beta}_1 x_i^2) \\
0 &= \sum_{i=1}^n x_i y_i - \hat{\beta}_0 \sum_{i=1}^n x_i - \hat{\beta}_1 \sum_{i=1}^n x_i^2 \\
0 &\overset{\eqref{eq:beta0-mle}}{=} \sum_{i=1}^n x_i y_i - (\bar{y} - \hat{\beta}_1 \bar{x}) \sum_{i=1}^n x_i - \hat{\beta}_1 \sum_{i=1}^n x_i^2 \\
0 &= \sum_{i=1}^n x_i y_i - \bar{y} \sum_{i=1}^n x_i + \hat{\beta}_1 \bar{x} \sum_{i=1}^n x_i - \hat{\beta}_1 \sum_{i=1}^n x_i^2 \\
0 &= \sum_{i=1}^n x_i y_i - n \bar{x} \bar{y} + \hat{\beta}_1 n \bar{x}^2 - \hat{\beta}_1 \sum_{i=1}^n x_i^2 \\
\hat{\beta}_1 &= \frac{\sum_{i=1}^n x_i y_i - n \bar{x} \bar{y}}{\sum_{i=1}^n x_i^2 - n \bar{x}^2} \\
\hat{\beta}_1 &= \frac{\sum_{i=1}^n (x_i - \bar{x}) (y_i - \bar{y})}{\sum_{i=1}^n (x_i - \bar{x})^2} \\
\hat{\beta}_1 &= \frac{s_{xy}}{s_x^2} \; .
\end{split}
$$

Here, the penultimate step uses the identities $\sum_{i=1}^n (x_i - \bar{x})(y_i - \bar{y}) = \sum_{i=1}^n x_i y_i - n \bar{x} \bar{y}$ and $\sum_{i=1}^n (x_i - \bar{x})^2 = \sum_{i=1}^n x_i^2 - n \bar{x}^2$.
| 104 | + |
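The equivalence of the raw-sum, centered and sample-moment forms of $\hat{\beta}_1$ can be illustrated numerically; this is a minimal sketch on arbitrary simulated data, not part of the proof:

```python
import numpy as np

rng = np.random.default_rng(3)
x = rng.normal(size=100)
y = rng.normal(size=100)
n = len(x)
xbar, ybar = x.mean(), y.mean()

# raw-sum form: (sum x_i y_i - n xbar ybar) / (sum x_i^2 - n xbar^2)
b1_raw = (np.sum(x * y) - n * xbar * ybar) / (np.sum(x**2) - n * xbar**2)
# centered form: sum (x_i - xbar)(y_i - ybar) / sum (x_i - xbar)^2
b1_centered = np.sum((x - xbar) * (y - ybar)) / np.sum((x - xbar)**2)
# sample-moment form: s_xy / s_x^2 (the ddof in numerator and denominator cancels)
b1_moment = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)

assert np.allclose([b1_raw, b1_centered], b1_moment)
```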
<br>
The derivative of the log-likelihood function \eqref{eq:slr-ll} at $(\hat{\beta}_0,\hat{\beta}_1)$ with respect to $\sigma^2$ is

$$ \label{eq:dLL-ds2}
\frac{\mathrm{d}\mathrm{LL}(\hat{\beta}_0,\hat{\beta}_1,\sigma^2)}{\mathrm{d}\sigma^2} = - \frac{n}{2\sigma^2} + \frac{1}{2(\sigma^2)^2} \sum_{i=1}^n (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2
$$

and setting this derivative to zero gives the MLE for $\sigma^2$:

$$ \label{eq:s2-mle}
\begin{split}
\frac{\mathrm{d}\mathrm{LL}(\hat{\beta}_0,\hat{\beta}_1,\hat{\sigma}^2)}{\mathrm{d}\sigma^2} &= 0 \\
0 &= - \frac{n}{2\hat{\sigma}^2} + \frac{1}{2(\hat{\sigma}^2)^2} \sum_{i=1}^n (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2 \\
\frac{n}{2\hat{\sigma}^2} &= \frac{1}{2(\hat{\sigma}^2)^2} \sum_{i=1}^n (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2 \\
\hat{\sigma}^2 &= \frac{1}{n} \sum_{i=1}^n (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i)^2 \; .
\end{split}
$$

<br>
Together, \eqref{eq:beta0-mle}, \eqref{eq:beta1-mle} and \eqref{eq:s2-mle} constitute the MLE for simple linear regression.
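As a final numerical illustration (a sketch under illustrative simulated data, not part of the proof), a generic numerical maximizer of the log-likelihood should land on the same values as the closed-form estimates; $\sigma^2$ is parametrized as $\exp(\log \sigma^2)$ to keep it positive during optimization:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(2)
x = rng.normal(size=200)
y = 0.5 + 1.5 * x + rng.normal(scale=0.8, size=200)
n = len(y)

def neg_ll(theta):
    """Negative log-likelihood of the simple linear regression model."""
    b0, b1, log_s2 = theta  # sigma^2 = exp(log_s2) > 0 by construction
    r = y - b0 - b1 * x
    return n/2 * np.log(2*np.pi) + n/2 * log_s2 + np.sum(r**2) / (2*np.exp(log_s2))

# closed-form MLEs from the theorem
b1_hat = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0_hat = y.mean() - b1_hat * x.mean()
s2_hat = np.mean((y - b0_hat - b1_hat * x) ** 2)

# generic numerical maximization should recover the same values
res = minimize(neg_ll, x0=np.zeros(3))
assert np.allclose(res.x[:2], [b0_hat, b1_hat], atol=1e-3)
assert np.isclose(np.exp(res.x[2]), s2_hat, atol=1e-3)
```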