Size of a test and significance level

What is the difference between the two, and why must the significance level always be greater than or equal to the size of the test?

estimation

— 法修

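I'm not familiar with the meaning of "size of a test". Perhaps you mean the "test statistic", such as F, t, or z. In that case, the significance level (p) need not be higher or lower. Are you quoting from a particular source? If so, please include the quote, and no doubt someone will help clarify it for you.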

— rolando2'3

@rolando "Size of a test" is a standard term: see Scholar.google.com/….

— whuber

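Suppose you have a random sample $X_1,\dots,X_n$ from a distribution involving a parameter $\theta$ that takes values in a parameter space $\Theta$ . You partition the parameter space as $\Theta=\Theta_0\cup\Theta_1$ , and you want to test the hypotheses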

$H_0 : \theta \in \Theta_0 \, ,$

$H_1 : \theta \in \Theta_1 \, ,$

which are called the null and alternative hypotheses, respectively.

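Let $\mathscr{X}$ denote the sample space of all possible values of the random vector $X=(X_1,\dots,X_n)$ . Your goal in building a test procedure is to partition the sample space $\mathscr{X}$ into two pieces: the critical region $\mathscr{C}$ , containing the values of $X$ for which you will reject the null hypothesis $H_0$ (and so accept the alternative $H_1$ ), and the acceptance region $\mathscr{A}$ , containing the values of $X$ for which you will not reject the null hypothesis $H_0$ (and so reject the alternative $H_1$ ).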

Formally, a test procedure can be described as a measurable function $\varphi:\mathscr{X}\to\{0,1\}$ , with the obvious interpretation in terms of the decisions made in favor of each of the hypotheses. The critical region is $\mathscr{C}=\varphi^{-1}(\{1\})$ , and the acceptance region is $\mathscr{A}=\varphi^{-1}(\{0\})$ .
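To make the function view concrete, here is a minimal sketch on a toy problem that is not part of the original answer: three Bernoulli observations, so that $\mathscr{X}=\{0,1\}^3$ is finite and both regions can be listed explicitly. The rule used for `phi` (reject only when all three observations equal 1) is an arbitrary assumption chosen for illustration.

```python
from itertools import product

# Hypothetical toy setting: n = 3 Bernoulli observations, so the sample
# space is the finite set {0, 1}^3 with 8 points.
sample_space = list(product([0, 1], repeat=3))

def phi(x):
    """Test procedure: return 1 (reject H0) only if all three observations are 1."""
    return 1 if sum(x) == 3 else 0

# Critical region C = phi^{-1}({1}); acceptance region A = phi^{-1}({0}).
critical_region = [x for x in sample_space if phi(x) == 1]
acceptance_region = [x for x in sample_space if phi(x) == 0]

print("critical region:", critical_region)
print("acceptance region has", len(acceptance_region), "points")
```

The two lists are disjoint and together exhaust the sample space, which is exactly the partition described above.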

For each test procedure $\varphi$ , we define its power function $\pi_\varphi:\Theta\to[0,1]$ by

$\pi_\varphi(\theta) = \Pr(\varphi(X)=1\mid\theta) = \Pr(X\in\mathscr{C}\mid\theta) \, .$

In words, $\pi_\varphi(\theta)$ gives you the probability of rejecting $H_0$ when the parameter value is $\theta$ .
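As an illustration of a power function, consider a setting assumed here for the example only: $X_i \sim N(\theta, 1)$ , $H_0:\theta\leq 0$ against $H_1:\theta>0$ , and the test that rejects when $\sqrt{n}\,\bar{X} > c$ . Since $\sqrt{n}\,\bar{X}\sim N(\sqrt{n}\,\theta, 1)$ , the power function has the closed form $\pi_\varphi(\theta) = 1-\Phi(c-\sqrt{n}\,\theta)$ . The values `n=25` and `c=1.645` below are hypothetical defaults.

```python
from math import sqrt
from statistics import NormalDist

def power(theta, n=25, c=1.645):
    """Pr(sqrt(n) * mean(X) > c | theta) when X_i ~ N(theta, 1).

    Assumed setting for illustration: one-sided z-test with known variance 1.
    """
    # sqrt(n) * mean(X) is N(sqrt(n) * theta, 1), so the rejection
    # probability is the upper tail of a standard normal beyond c - sqrt(n) * theta.
    return 1.0 - NormalDist().cdf(c - sqrt(n) * theta)

for theta in (-0.2, 0.0, 0.2, 0.5):
    print(f"theta = {theta:+.1f}  power = {power(theta):.4f}")
```

In this sketch the power function is increasing in $\theta$ , so over the null region $\theta\leq 0$ the rejection probability is largest at the boundary $\theta=0$ , where it is close to 0.05.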

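The decision to reject $H_0$ when $\theta\in\Theta_0$ is an error. Therefore, for a given problem, you may want to consider only those test procedures $\varphi$ for which $\pi_\varphi(\theta)\leq\alpha$ , for every $\theta\in\Theta_0$ , where $\alpha$ is some significance level ( $0<\alpha<1$ ). Note that the significance level is a property of a class of test procedures. We can describe this class precisely as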

$\mathscr{T}_{\alpha} = \left\{ \varphi\in\{0,1\}^\mathscr{X} : \pi_\varphi(\theta)\leq\alpha, \textrm{for every}\; \theta\in\Theta_0\right\} \, .$

For each individual test procedure $\varphi$ , the maximum probability $\alpha_\varphi=\sup_{\theta\in\Theta_0}\pi_\varphi(\theta)$ of wrongly rejecting $H_0$ is called the size of the test procedure $\varphi$ .

It follows directly from these definitions that, once we have established a significance level $\alpha$ , and therefore determined the class $\mathscr{T}_{\alpha}$ of acceptable test procedures, each test procedure $\varphi$ within this class will have size $\alpha_\varphi\leq\alpha$ , and conversely. Concisely, $\varphi\in\mathscr{T}_{\alpha}$ if and only if $\alpha_\varphi\leq\alpha$ .
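A discrete example shows that the size can be strictly smaller than the significance level. The sketch below assumes a setting not taken from the answer: $X\sim\mathrm{Binomial}(10, p)$ , $H_0: p\leq 0.5$ , tests of the form "reject when $X\geq k$", and $\alpha=0.05$ . The supremum over $\Theta_0$ is approximated on a grid that includes the boundary point $p=0.5$ ; the function names and the grid resolution are illustrative choices.

```python
from math import comb

def power(p, n, k):
    """Pr(X >= k | p) for X ~ Binomial(n, p): the rejection probability."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k, n + 1))

def size(n, k, grid=200):
    """Approximate sup of the power over the null region p in [0, 0.5].

    The grid includes the endpoint p = 0.5, where the supremum is attained
    for this family of tests (the power is nondecreasing in p).
    """
    return max(power(0.5 * i / grid, n, k) for i in range(grid + 1))

n, alpha = 10, 0.05

# Smallest cutoff k whose test belongs to the level-alpha class,
# i.e. whose size does not exceed alpha.
k = next(k for k in range(n + 2) if size(n, k) <= alpha)

print("cutoff k =", k)
print("size =", size(n, k), "<= alpha =", alpha)
```

Under these assumptions the cutoff is $k=9$ with size $11/1024\approx 0.0107$ , while $k=8$ would give size $56/1024\approx 0.0547>\alpha$ . Because the binomial distribution is discrete, no test of this form has size exactly 0.05, yet the test with $k=9$ still belongs to $\mathscr{T}_{0.05}$ .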

— Zen

Wow. Thanks for all the effort you invested in this answer.

— asb

I came here to learn about size vs level and left understanding hypothesis testing better overall. Excellent combination of intuition and notation.

— gwg