在两个样本的置换测试中将尾部加倍

假设我们有两个样本，我们希望确定它们是否来自相同的分布，样本A，B由一些整数组成。

如果我们使用两个样本的置换检验来测试这一点，特别是通过查看样本均值差异与观察到的差异一样极端的置换：是否有理由认为我们可以计算出两尾p-通过观察一条尾巴并加倍概率来获得价值？

这就是我在讲义中似乎要说的，但是我不明白为什么我们可以假设尾巴是对称的（或者为什么它不需要这种假设）。没有解释。

permutation-test

— 哈里
source

测试统计信息的排列分布不保证是对称的，因此您不能那样做。相反，您添加了两条尾巴。在两个独立样本的情况下，原假设是两个位置参数相等。假设两组的连续分布和均等分布，我们在零假设下具有可交换性。检验统计量是均值之差，在零下。 $T$ $E(T) = 0$

为值在原始样品中是，其用于置换值。是短的东西“的数”，例如，是置换检验统计的数目。然后，为双面假设-值是，其中 $T$ $T_{\text{emp}}$ $T^{\star}$ $\sharp(\cdot)$ $\sharp(T^{\star})$ $p$ $p_{\text{ts}} = p_{\text{left}} + p_{\text{right}}$

$p_{\text{left}} = \frac{\sharp(T^{\star} \, <= \, \text{min}(T_{\text{emp}}, -T_{\text{emp}}))}{\sharp(T^{\star})}$

$p_{\text{right}} = \frac{\sharp(T^{\star} \, >= \, \text{max}(T_{\text{emp}}, -T_{\text{emp}}))}{\sharp(T^{\star})}$

（假设我们具有完整的排列分布）。当我们可以计算出精确的（完整的）置换分布时，让我们比较两种独立样本的两种方法。

set.seed(1234)
Nj   <- c(9, 8)                      # group sizes
DVa  <- rnorm(Nj[1], 5, 20)^2        # data group 1
DVb  <- rnorm(Nj[2], 10, 20)^2       # data group 2
DVab <- c(DVa, DVb)                  # data from both groups
IV   <- factor(rep(c("A", "B"), Nj)) # grouping factor
idx  <- seq(along=DVab)              # all indices
idxA <- combn(idx, Nj[1])            # all possible first groups

# function to calculate test statistic for a given permutation x
getDM <- function(x) { mean(DVab[x]) - mean(DVab[!(idx %in% x)]) }
resDM <- apply(idxA, 2, getDM)       # test statistic for all permutations
diffM <- mean(DVa) - mean(DVb)       # empirical stest statistic

现在计算值，并使用R's 包中的实现验证所提出的解决方案。观察，因此计算方式很重要。 $p$ coin $p_{\text{left}} \neq p_{\text{right}}$ $p_{ts}$

> (pL <- sum(resDM <= min(diffM, -diffM)) / length(resDM))  # left p-value
[1] 0.1755245

> (pR <- sum(resDM >= max(diffM, -diffM)) / length(resDM))  # right p-value
[1] 0.1585356

> 2*pL        # doubling left p-value
[1] 0.351049

> 2*pR        # doubling right p-value
[1] 0.3170712

> pL+pR       # two-sided p-value
[1] 0.3340601

> sum(abs(resDM) >= abs(diffM)) / length(resDM)  # two-sided p-value (more concise)
[1] 0.3340601

# validate with coin implementation
> library(coin)              # for oneway_test()    
> oneway_test(DVab ~ IV, alternative="two.sided", distribution="exact")
Exact 2-Sample Permutation Test
data:  DVab by IV (A, B) 
Z = 1.0551, p-value = 0.3341
alternative hypothesis: true mu is not equal to 0

PS对于仅从排列分布中采样的蒙特卡洛情况，将定义如下： $p$

$p_{\text{left}} = \frac{\sharp(T^{\star} \, <= \, \text{min}(T_{\text{emp}}, -T_{\text{emp}})) + 1}{\sharp(T^{\star}) \, + \, 1}$

$p_{\text{right}} = \frac{\sharp(T^{\star} \, >= \, \text{max}(T_{\text{emp}}, -T_{\text{emp}})) +1 }{\sharp(T^{\star}) \, + \, 1}$

$p_{\text{ts}} = \frac{\sharp(\text{abs}(T^{\star}) \, >= \, \text{abs}(T_{\text{emp}})) \, + \, 1 }{\sharp(T^{\star}) + 1}$

$p$

— 卡拉卡尔
source

T

$T$

E (T) = 0

$E(T) = 0$

谢谢，这是一个进步。您能否解释一下在这种假设下统计量如何不能具有对称分布？

— whuber

T^{⋆} = - 1, .5, .5

$T^{\star} = {-1, .5, .5}$

感谢您的澄清：我现在遵循逻辑。

— whuber