---
layout: proof
mathjax: true

author: "Joram Soch"
affiliation: "BCCN Berlin"
e_mail: "joram.soch@bccn-berlin.de"
date: 2021-10-27 08:56:00

title: "Ordinary least squares for simple linear regression"
chapter: "Statistical Models"
section: "Univariate normal data"
topic: "Simple linear regression"
theorem: "Ordinary least squares"

sources:
  - authors: "Penny, William"
    year: 2006
    title: "Linear regression"
    in: "Mathematics for Brain Imaging"
    pages: "ch. 1.2.2, pp. 14-16, eqs. 1.24/1.25"
    url: "https://ueapsylabs.co.uk/sites/wpenny/mbi/mbi_course.pdf"
  - authors: "Wikipedia"
    year: 2021
    title: "Proofs involving ordinary least squares"
    in: "Wikipedia, the free encyclopedia"
    pages: "retrieved on 2021-10-27"
    url: "https://en.wikipedia.org/wiki/Proofs_involving_ordinary_least_squares#Derivation_of_simple_linear_regression_estimators"

proof_id: "P271"
shortcut: "slr-ols"
username: "JoramSoch"
---


**Theorem:** Given a [simple linear regression model](/D/slr) with independent observations

$$ \label{eq:slr}
y_i = \beta_0 + \beta_1 x_i + \varepsilon_i, \; \varepsilon_i \sim \mathcal{N}(0, \sigma^2), \; i = 1,\ldots,n \; ,
$$

the parameters minimizing the [residual sum of squares](/D/rss) are given by

$$ \label{eq:slr-ols}
\begin{split}
\hat{\beta}_0 &= \bar{y} - \hat{\beta}_1 \bar{x} \\
\hat{\beta}_1 &= \frac{s_{xy}}{s_x^2}
\end{split}
$$

where $\bar{x}$ and $\bar{y}$ are the [sample means](/D/mean-samp), $s_x^2$ is the [sample variance](/D/var-samp) of $x$ and $s_{xy}$ is the [sample covariance](/D/cov-samp) between $x$ and $y$.
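As a quick numerical sketch (not part of the proof), the closed-form estimates can be checked against a generic least-squares solver; the data below are simulated with arbitrary, hypothetical parameter values and assume NumPy is available:

```python
import numpy as np

# Simulate data from a simple linear regression model (hypothetical values)
rng = np.random.default_rng(1)
n = 100
x = rng.normal(0, 2, n)
y = 3.0 + 1.5 * x + rng.normal(0, 0.5, n)

# Closed-form OLS estimates from the theorem
s_xy = np.cov(x, y, ddof=1)[0, 1]   # sample covariance s_xy
s_xx = np.var(x, ddof=1)            # sample variance s_x^2
beta1 = s_xy / s_xx                 # slope: s_xy / s_x^2
beta0 = y.mean() - beta1 * x.mean() # intercept: ybar - beta1 * xbar

# Reference solution from a generic least-squares solver
X = np.column_stack([np.ones(n), x])
beta_ref = np.linalg.lstsq(X, y, rcond=None)[0]

assert np.allclose([beta0, beta1], beta_ref)
```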


**Proof:** The [residual sum of squares](/D/rss) is defined as

$$ \label{eq:rss}
\mathrm{RSS}(\beta_0,\beta_1) = \sum_{i=1}^n \varepsilon_i^2 = \sum_{i=1}^n (y_i - \beta_0 - \beta_1 x_i)^2 \; .
$$

The derivatives of $\mathrm{RSS}(\beta_0,\beta_1)$ with respect to $\beta_0$ and $\beta_1$ are

$$ \label{eq:rss-der}
\begin{split}
\frac{\mathrm{d}\mathrm{RSS}(\beta_0,\beta_1)}{\mathrm{d}\beta_0} &= \sum_{i=1}^n 2 (y_i - \beta_0 - \beta_1 x_i) (-1) \\
&= -2 \sum_{i=1}^n (y_i - \beta_0 - \beta_1 x_i) \\
\frac{\mathrm{d}\mathrm{RSS}(\beta_0,\beta_1)}{\mathrm{d}\beta_1} &= \sum_{i=1}^n 2 (y_i - \beta_0 - \beta_1 x_i) (-x_i) \\
&= -2 \sum_{i=1}^n (x_i y_i - \beta_0 x_i - \beta_1 x_i^2)
\end{split}
$$

and setting these derivatives to zero

$$ \label{eq:rss-der-zero}
\begin{split}
0 &= -2 \sum_{i=1}^n (y_i - \hat{\beta}_0 - \hat{\beta}_1 x_i) \\
0 &= -2 \sum_{i=1}^n (x_i y_i - \hat{\beta}_0 x_i - \hat{\beta}_1 x_i^2)
\end{split}
$$

yields the following normal equations:

$$ \label{eq:slr-norm-eq}
\begin{split}
\hat{\beta}_1 \sum_{i=1}^n x_i + \hat{\beta}_0 \cdot n &= \sum_{i=1}^n y_i \\
\hat{\beta}_1 \sum_{i=1}^n x_i^2 + \hat{\beta}_0 \sum_{i=1}^n x_i &= \sum_{i=1}^n x_i y_i \; .
\end{split}
$$
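As a sanity check outside the proof itself, one can verify numerically that the closed-form estimates satisfy both equations above; the data and parameter values here are arbitrary, and NumPy is assumed:

```python
import numpy as np

# Simulated data with arbitrary, hypothetical parameters
rng = np.random.default_rng(0)
n = 50
x = rng.uniform(-1, 1, n)
y = 2.0 - 0.7 * x + rng.normal(0, 0.1, n)

# Closed-form OLS estimates
beta1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
beta0 = y.mean() - beta1 * x.mean()

# First equation: beta1 * sum(x_i) + beta0 * n = sum(y_i)
assert np.isclose(beta1 * x.sum() + beta0 * n, y.sum())
# Second equation: beta1 * sum(x_i^2) + beta0 * sum(x_i) = sum(x_i * y_i)
assert np.isclose(beta1 * (x**2).sum() + beta0 * x.sum(), (x * y).sum())
```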

From the first equation, we can derive the estimate for the intercept:

$$ \label{eq:slr-ols-int}
\begin{split}
\hat{\beta}_0 &= \frac{1}{n} \sum_{i=1}^n y_i - \hat{\beta}_1 \cdot \frac{1}{n} \sum_{i=1}^n x_i \\
&= \bar{y} - \hat{\beta}_1 \bar{x} \; .
\end{split}
$$

From the second equation, we can derive the estimate for the slope:

$$ \label{eq:slr-ols-sl}
\begin{split}
\hat{\beta}_1 \sum_{i=1}^n x_i^2 + \hat{\beta}_0 \sum_{i=1}^n x_i &= \sum_{i=1}^n x_i y_i \\
\hat{\beta}_1 \sum_{i=1}^n x_i^2 + \left( \bar{y} - \hat{\beta}_1 \bar{x} \right) \sum_{i=1}^n x_i &\overset{\eqref{eq:slr-ols-int}}{=} \sum_{i=1}^n x_i y_i \\
\hat{\beta}_1 \left( \sum_{i=1}^n x_i^2 - \bar{x} \sum_{i=1}^n x_i \right) &= \sum_{i=1}^n x_i y_i - \bar{y} \sum_{i=1}^n x_i \\
\hat{\beta}_1 &= \frac{\sum_{i=1}^n x_i y_i - \bar{y} \sum_{i=1}^n x_i}{\sum_{i=1}^n x_i^2 - \bar{x} \sum_{i=1}^n x_i} \; .
\end{split}
$$

Note that the numerator can be rewritten as

$$ \label{eq:slr-ols-sl-num}
\begin{split}
\sum_{i=1}^n x_i y_i - \bar{y} \sum_{i=1}^n x_i &= \sum_{i=1}^n x_i y_i - n \bar{x} \bar{y} \\
&= \sum_{i=1}^n x_i y_i - n \bar{x} \bar{y} - n \bar{x} \bar{y} + n \bar{x} \bar{y} \\
&= \sum_{i=1}^n x_i y_i - \bar{y} \sum_{i=1}^n x_i - \bar{x} \sum_{i=1}^n y_i + \sum_{i=1}^n \bar{x} \bar{y} \\
&= \sum_{i=1}^n \left( x_i y_i - x_i \bar{y} - \bar{x} y_i + \bar{x} \bar{y} \right) \\
&= \sum_{i=1}^n (x_i - \bar{x}) (y_i - \bar{y})
\end{split}
$$

and that the denominator can be rewritten as

$$ \label{eq:slr-ols-sl-den}
\begin{split}
\sum_{i=1}^n x_i^2 - \bar{x} \sum_{i=1}^n x_i &= \sum_{i=1}^n x_i^2 - n \bar{x}^2 \\
&= \sum_{i=1}^n x_i^2 - 2 n \bar{x} \bar{x} + n \bar{x}^2 \\
&= \sum_{i=1}^n x_i^2 - 2 \bar{x} \sum_{i=1}^n x_i + \sum_{i=1}^n \bar{x}^2 \\
&= \sum_{i=1}^n \left( x_i^2 - 2 \bar{x} x_i + \bar{x}^2 \right) \\
&= \sum_{i=1}^n (x_i - \bar{x})^2 \; .
\end{split}
$$
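These two algebraic identities hold for any data set, which a short numerical check (assuming NumPy, with arbitrary random data) can illustrate:

```python
import numpy as np

# Arbitrary data: the identities are purely algebraic, no model needed
rng = np.random.default_rng(42)
x = rng.normal(size=30)
y = rng.normal(size=30)
xbar, ybar = x.mean(), y.mean()

# Numerator identity: sum(x*y) - ybar*sum(x) == sum((x - xbar)*(y - ybar))
assert np.isclose((x * y).sum() - ybar * x.sum(),
                  ((x - xbar) * (y - ybar)).sum())
# Denominator identity: sum(x^2) - xbar*sum(x) == sum((x - xbar)^2)
assert np.isclose((x**2).sum() - xbar * x.sum(),
                  ((x - xbar)**2).sum())
```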

With \eqref{eq:slr-ols-sl-num} and \eqref{eq:slr-ols-sl-den}, the estimate from \eqref{eq:slr-ols-sl} can be simplified as follows:

$$ \label{eq:slr-ols-sl-qed}
\begin{split}
\hat{\beta}_1 &= \frac{\sum_{i=1}^n x_i y_i - \bar{y} \sum_{i=1}^n x_i}{\sum_{i=1}^n x_i^2 - \bar{x} \sum_{i=1}^n x_i} \\
&= \frac{\sum_{i=1}^n (x_i - \bar{x}) (y_i - \bar{y})}{\sum_{i=1}^n (x_i - \bar{x})^2} \\
&= \frac{\frac{1}{n-1} \sum_{i=1}^n (x_i - \bar{x}) (y_i - \bar{y})}{\frac{1}{n-1} \sum_{i=1}^n (x_i - \bar{x})^2} \\
&= \frac{s_{xy}}{s_x^2} \; .
\end{split}
$$

Together, \eqref{eq:slr-ols-int} and \eqref{eq:slr-ols-sl-qed} constitute the ordinary least squares parameter estimates for simple linear regression.