Skip to content

Commit 3949bf2

Browse files
authored
Merge pull request #281 from JoramSoch/master
added 1 definition and 1 proof
2 parents aa8ce01 + ed6a8a3 commit 3949bf2

3 files changed

Lines changed: 131 additions & 1 deletion

File tree

D/samp-size.md

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
---
2+
layout: definition
3+
mathjax: true
4+
5+
author: "Joram Soch"
6+
affiliation: "BCCN Berlin"
7+
e_mail: "joram.soch@bccn-berlin.de"
8+
date: 2024-11-22 11:33:27
9+
10+
title: "Sample size"
11+
chapter: "General Theorems"
12+
section: "Probability theory"
13+
topic: "Random experiments"
14+
definition: "Sample size"
15+
16+
sources:
17+
- authors: "Wikipedia"
18+
year: 2024
19+
title: "Sampling (statistics)"
20+
in: "Wikipedia, the free encyclopedia"
21+
pages: "retrieved on 2024-11-22"
22+
url: "https://en.wikipedia.org/wiki/Sampling_(statistics)"
23+
24+
def_id: "D211"
25+
shortcut: "samp-size"
26+
username: "JoramSoch"
27+
---
28+
29+
30+
**Definition:** Let $x = \left\lbrace x_1, \ldots, x_n \right\rbrace$ be a [statistical sample](/D/samp) from a [random variable](/D/rvar) or a set of [random variables](/D/rvar) $X$. Then, the number of elements $n$ in this sample is called the "sample size".

I/ToC.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,8 @@ title: "Table of Contents"
2222
&emsp;&ensp; 1.1.4. *[Probability space](/D/prob-spc)* <br>
2323
&emsp;&ensp; 1.1.5. *[Measured data](/D/data)* <br>
2424
&emsp;&ensp; 1.1.6. *[Statistical sample](/D/samp)* <br>
25-
&emsp;&ensp; 1.1.7. *[Sample statistic](/D/stat)* <br>
25+
&emsp;&ensp; 1.1.7. *[Sample size](/D/samp-size)* <br>
26+
&emsp;&ensp; 1.1.8. *[Sample statistic](/D/stat)* <br>
2627

2728
<p id="Random variables"></p>
2829
1.2. Random variables <br>
@@ -547,6 +548,7 @@ title: "Table of Contents"
547548
&emsp;&ensp; 3.6.7. **[Mode](/P/lognorm-mode)** <br>
548549
&emsp;&ensp; 3.6.8. **[Variance](/P/lognorm-var)** <br>
549550
&emsp;&ensp; 3.6.9. **[Product of independent log-normals](/P/lognorm-prodind)** <br>
551+
&emsp;&ensp; 3.6.10. **[Geometric mean of independent log-normals](/P/lognorm-geomind)** <br>
550552

551553
<p id="Chi-squared distribution"></p>
552554
3.7. Chi-squared distribution <br>

P/lognorm-geomind.md

Lines changed: 98 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,98 @@
1+
---
2+
layout: proof
3+
mathjax: true
4+
5+
author: "Joram Soch"
6+
affiliation: "BCCN Berlin"
7+
e_mail: "joram.soch@bccn-berlin.de"
8+
date: 2024-11-22 11:39:58
9+
10+
title: "The geometric mean of independent log-normal random variables is a log-normal random variable"
11+
chapter: "Probability Distributions"
12+
section: "Univariate continuous distributions"
13+
topic: "Log-normal distribution"
14+
theorem: "Geometric mean of independent log-normals "
15+
16+
sources:
17+
- authors: "Probability Fact"
18+
year: 2022
19+
title: "The geometric mean of independent log-normal random variables has a log-normal distribution"
20+
in: "X"
21+
pages: "retrieved on 2024-11-22"
22+
url: "https://x.com/ProbFact/status/1592989704646848512"
23+
24+
proof_id: "P480"
25+
shortcut: "lognorm-geomind"
26+
username: "JoramSoch"
27+
---
28+
29+
30+
**Theorem:** Let $X_1, \ldots, X_n$ be [independent](/D/ind) [random variables](/D/rvar) following [log-normal distributions](/D/lognorm):
31+
32+
$$ \label{eq:X-lognorm}
33+
X_i \sim \ln \mathcal{N}(\mu_i, \sigma_i^2), \; i = 1, \ldots, n \; .
34+
$$
35+
36+
Then, the [geometric mean](/D/mean-geom) of these random variables also follows a [log-normal distribution](/D/lognorm):
37+
38+
$$ \label{eq:Z-lognorm}
39+
Z = \sqrt[n]{\prod_{i=1}^n X_i} \sim \mathcal{N}(\mu, \sigma^2)
40+
$$
41+
42+
where the [log-normal](/D/lognorm) parameters are given by
43+
44+
$$ \label{eq:Z-lognorm-para}
45+
\mu = \frac{1}{n} \sum_{i=1}^n \mu_i
46+
\quad \text{and} \quad
47+
\sigma^2 = \frac{1}{n^2} \sum_{i=1}^n \sigma_i^2 \; .
48+
$$
49+
50+
51+
**Proof:** A random variable [follows a log-normal distribution, if and only if its natural logarithm follows a normal distribution](/D/lognorm):
52+
53+
$$ \label{eq:lognorm-norm}
54+
X \sim \ln \mathcal{N}(\mu, \sigma^2)
55+
\quad \Leftrightarrow \quad
56+
\ln X \sim \mathcal{N}(\mu, \sigma^2) \; .
57+
$$
58+
59+
Thus, from \eqref{eq:X-lognorm}, we have
60+
61+
$$ \label{eq:Y-norm}
62+
Y_i = \ln X_i \sim \mathcal{N}(\mu_i, \sigma_i^2)
63+
$$
64+
65+
and from \eqref{eq:Z-lognorm}, we have
66+
67+
$$ \label{eq:ln-Z}
68+
\ln Z
69+
= \ln \left( \sqrt[n]{\prod_{i=1}^n X_i} \right)
70+
= \frac{1}{n} \sum_{i=1}^n \ln X_i
71+
= \frac{1}{n} \sum_{i=1}^n Y_i \; .
72+
$$
73+
74+
This means that the logarithm of the geometric mean of independent [log-normal](/D/lognorm) random variables is the arithmetic mean of independent [normal](/D/norm) random variables. This average, like any [linear combination of independent normal random variables, is again normally distributed](/P/norm-lincomb). Thus, combining \eqref{eq:ln-Z} and \eqref{eq:Y-norm}, we have:
75+
76+
$$ \label{eq:ln-Z-norm}
77+
\ln Z
78+
= \frac{1}{n} \sum_{i=1}^n Y_i
79+
\sim \mathcal{N}\left( \frac{1}{n} \sum_{i=1}^n \mu_i, \, \frac{1}{n^2} \sum_{i=1}^n \sigma_i^2 \right) \; .
80+
$$
81+
82+
If a random variable [follows a normal distribution, then its exponential follows a log-normal distribution with the same parameters]:
83+
84+
$$ \label{eq:norm-lognorm}
85+
Y \sim \mathcal{N}(\mu, \sigma^2)
86+
\quad \Leftrightarrow \quad
87+
\exp(Y) \sim \ln \mathcal{N}(\mu, \sigma^2) \; .
88+
$$
89+
90+
Thus, from \eqref{eq:ln-Z-norm}, we have
91+
92+
$$ \label{eq:Z-lognorm-qed}
93+
Z
94+
= \exp(\ln Z)
95+
\sim \ln \mathcal{N}\left( \frac{1}{n} \sum_{i=1}^n \mu_i, \, \frac{1}{n^2} \sum_{i=1}^n \sigma_i^2 \right)
96+
$$
97+
98+
which is equivalent to \eqref{eq:Z-lognorm} and \eqref{eq:Z-lognorm-para}.

0 commit comments

Comments
 (0)