来自两个独立的伯努利人口的抽样分布


17

让我们假设我们有两个独立的伯努利随机变量的样本,Ë[Rθ1个Ë[Rθ2

我们如何证明吗?

(X¯1X¯2)(θ1θ2)θ1(1θ1)n1+θ2(1θ2)n2dN(0,1)

假设。n1n2


Z_i = X_1i-X_2i是具有有限均值和方差的iid rv序列。因此,它满足您的结果遵循的Levy-Linderberg中心极限定理。还是您要要求提供证明的证据?
3 Diag

@ThreeDiag您如何应用CLT的LL版本?我认为那是不对的。给我写一个答案以检查详细信息。
一位老人在海里。

所有详细信息已经存在。为了应用LL,您需要一个具有有限均值和方差的iid rv序列。变量Z_i = X_i1和X_i2满足所有三个要求。独立性来自于两个原始bernoulli var的独立性,您可以通过应用E和V的标准属性来看到E(Z_i)和V(Z_i)是有限的
3 Diag

1
“两个独立的伯努利随机变量的样本”-错误的表达。必须为:“来自伯努利分布的两个独立样本”。
维克多

1
请添加“ as ”。ñ1个ñ2
维克多

Answers:


10

b=a=θ1(1θ1)n1=ˉX1-θ1/一个=ˉX2-θ2/b。我们有 dÑ01dÑ01。在的特性功能方面则意味着 φÈÈb=θ2(1θ2)n2A=(X¯1θ1)/aB=(X¯2θ2)/bAdN(0,1), BdN(0,1) 我们要证明 D= a

ϕA(t)EeitAet2/2, ϕB(t)et2/2.
D:=aa2+b2Aba2+b2BdN(0,1)

由于B是独立的,所以 ϕ Dt = ϕ A aAB 因为我们希望它是。

ϕD(t)=ϕA(aa2+b2t)ϕB(ba2+b2ŤË-Ť2/2

该证明不完整。在这里,我们需要一些估计来统一特征函数。但是,在考虑中的情况下,我们可以进行显式计算。把φ X 1 1p=θ1个 =ñ1个t3m-3/20。因此,对于固定的tϕDt=1a2t2

ϕX1,1(t)=1+p(eit1),ϕX¯1(t)=(1+p(eit/m1))m,ϕX¯1θ1(t)=(1+p(eit/m1))meipt,ϕA(t)=(1+p(eit/mp(1p)1))meiptm/p(1p)=((1+p(eit/mp(1p)1))eipt/mp(1p))m=(1t22m+O(t3m3/2))m
t3m3/20t
ϕD(t)=(1a2t22(a2+b2)n1+O(n13/2))n1(1b2t22(a2+b2)n2+O(n23/2))n2et2/2
a0b0|ey(1y/m)m|y2/2m  when  y/m<1/2 (see /math/2566469/uniform-bounds-for-1-y-nn-exp-y/ ).

Note that similar calculations may be done for arbitrary (not necessarily Bernoulli) distributions with finite second moments, using the expansion of characteristic function in terms of the first two moments.


This seems correct. I'll get back to you later on, when I have time to check everything. ;)
An old man in the sea.

-1

Proving your statement is equivalent to proving the (Levy-Lindenberg) Central Limit Theorem which states

If {Zi}i=1n is a sequence of i.i.d random variable with finite mean E(Zi)=μ and finite variance V(Zi)=σ2 then

n(Z¯μ)dN(0,σ2)

Here Z¯=iZi/n that is the sample variance.

Then it is easy to see that if we put

Zi=X1iX2i
with X1i,X2i following a Ber(θ1) and Ber(θ2) respectively the conditions for the theorem are satisfied, in particular

E(Zi)=θ1θ2=μ

and

V(Zi)=θ1(1θ1)+θ2(1θ2)=σ2

(There's a last passage, and you have to adjust this a bit for the general case where n1n2 but I have to go now, will finish tomorrow or you can edit the question with the final passage as an exercise )


I could not obtain what I wanted exactly because of the possibility of n1n2
An old man in the sea.

I will show later if you can't get it. Hint: compute the variance of the sample mean of Z and use that as the variable in the theorem
Three Diag

Three, could you please add the details for when n1n2? Thanks
An old man in the sea.

Will do as soon as find a little timr. There was in fact a subtlety that prevents from using LL clt without adjustment. There are three ways to go, the simplest of which is invoking the fact that for large n1 and n2, X1 and X2 go in distribution to normals, then a linear combination of normal is also normal. This is a property of normals that you can take as given, otherwise you can prove it by characteristic functions.
Three Diag

其他两个或者需要不同的CLT(李雅普诺夫可能)或可替代地对待N1 = i和N2 = I + K。然后对于大号i,您基本上可以忽略k,您可以返回以应用LL(但仍然需要谨慎以确保正确的方差)
3 Diag
By using our site, you acknowledge that you have read and understand our Cookie Policy and Privacy Policy.
Licensed under cc by-sa 3.0 with attribution required.