用Gamma分布构造Dirichlet分布


18

令是相互独立的随机变量,每个变量的伽玛分布参数为表示,与X1,,Xk+1αi,i=1,2,,k+1Yi=XiX1++Xk+1,i=1,,kDirichlet(α1,α2,,αk;αk+1)

的联合PDF。然后找到关节(y_1,\ dots,Y_ {k + 1})的 pdf文件,我找不到jacobian即J(\ frac {x_1,\ dots,x_ {k + 1}} {y_1,\ dots,y_ {k + 1} })Ý1...ÿķ+1ĴX1...X ķ + 1(X1,,Xk+1)=ei=1k+1xix1α11xk+1αk+11Γ(α1)Γ(α2)Γ(αk+1)(Y1,,Yk+1)J(x1,,xk+1y1,,yk+1)


3
请参阅本文档的第13-14页。

@Procrastinator非常感谢您,对于我的问题,文档是最好的答案。
Argha 2012年

2
@Procrastinator-也许您应该将此作为答案,因为OP对此很满意,并添加了几句话,这样您就不会遇到“我们希望提供多句子答案”的警告了吗?
jbowman 2012年

4
该文档现在是404
无法解决。–麻烦

2
Wayback机器进行救援:pdf
mobeets'Apr 19'17 at

Answers:


30

雅各比学派(Jacobians)-变量函数变化的绝对决定因素-看起来很强大并且可能很复杂。然而,它们是变量多元变化的计算中必不可少的部分。似乎没有什么可做的,只是写下一个k+1 x k+1的导数矩阵并进行计算。

有更好的方法。 它在“解决方案”部分的末尾显示。因为这篇文章的目的是向统计学家介绍对于许多人来说可能是一种新方法,所以它的大部分致力于解释解决方案背后的机制。这是微分形式的代数。(差异形式是一个在多个维度上集成的事物。)其中包含一个详细而有效的示例,以帮助使其更加熟悉。


背景

一个多世纪以前,数学家发展了微分代数理论,以处理多维几何中出现的“高阶导数”。行列式是由此类代数操纵的基本对象的特例,这些代数通常是交替的多线性形式这样做的好处在于计算变得如此简单。

这就是您需要了解的所有内容。

  1. 差动是如下形式的表达“ dxi ”。它是“ d ”与任何变量名的串联。

  2. 一种形式是微分的线性组合,例如dx1+dx2或什至x2dx1exp(x2)dx2。即,系数是变量的函数

  3. 可以使用记为楔积乘以“乘积”形式。该乘积是反交换的(也称为交替的):对于任何两种形式的ωηωη

    ωη=ηω.

    此乘法是线性且关联的:换句话说,它以熟悉的方式工作。一个直接的后果是,,这意味着任何一个形式的平方始终为零。这使得乘法非常容易!ωω=ωω

  4. 用于操纵所述积的出现在概率计算的目的,就像表达式可以理解为| d X 1d X 2d X ķ + 1 | dx1dx2dxk+1|dx1dx2dxk+1|

  5. 是一个函数时,则其微分由微分给出:y=g(x1,,xn)

    dy=dg(x1,,xn)=gx1(x1,,xn)dx1++gx1(x1,,xn)dxn.

与雅可比行列式的联系是:变换的雅可比行列式直到符号为止,只是d x的系数(y1,,yn)=F(x1,,xn)=(f1(x1,,xn),,fn(x1,,xn))显示在计算dx1dxn

dy1dyn=df1(x1,,xn)dfn(x1,,xn)

在将每个为规则(5)中d x j的线性组合之后。dfidxj


雅可比定律的简单定义很吸引人。尚未确信这值得吗?考虑将二维积分从笛卡尔坐标为极坐标r θ 的众所周知的问题,其中x y = r cos θ r sin θ 。以下是上述规则的完全机械应用,其中“ * (x,y)(r,θ)(x,y)=(rcos(θ),rsin(θ))()“用于简化将明显凭借规则(3),这意味着消失表达式drdr=dθdθ=0

dxdy=|dxdy|=|d(rcos(θ))d(rsin(θ))|=|(cos(θ)drrsin(θ)dθ)(sin(θ)dr+rcos(θ)dθ|=|()drdr+()dθdθrsin(θ)dθsin(θ)dr+cos(θ)drrcos(θ)dθ|=|0+0+rsin2(θ)drdθ+rcos2(θ)drdθ|=|r(sin2(θ)+cos2(θ))drdθ)|=r drdθ.

这样做的重点是可以轻松执行此类计算,而不会弄乱矩阵,行列式或其他此类多指标对象。您只是将事情乘以,记住楔是反交换的。它比高中代数中的授课要容易


初赛

让我们看看这个微分代数的作用。 在这个问题中,的联合分布的PDF 是单独的PDF的产物(因为X 被假设为独立的)。为了处理变量Y i的变化,我们必须明确要积分的微分元素。这些构成了术语d X 1 d X 2d X ķ + 1(X1,X2,,Xk+1)XiYidx1dx2dxk+1. Including the PDF gives the probability element

fX(x,α)dx1dxk+1(x1α11exp(x1))(xk+1αk+11exp(xk+1))dx1dxk+1=x1α11xk+1αk+11exp((x1++xk+1))dx1dxk+1.

(The normalizing constant has been ignored; it will be recovered at the end.)

Staring at the definitions of the Yi a few seconds ought to reveal the utility of introducing the new variable

Z=X1+X2++Xk+1,

giving the relationships

Xi=YiZ.

This suggests making the change of variables xiyiz in the probability element. The intention is to retain the first k variables y1,,yk along with z and then integrate out z. To do so, we have to re-express all the dxi in terms of the new variables. This is the heart of the problem. It's where the differential algebra takes place. To begin with,

dxi=d(yiz)=yidz+zdyi.

Note that since Y1+Y2++Yk+1=1, then

0=d(1)=d(y1+y2++yk+1)=dy1+dy2++dyk+1.

Consider the one-form

ω=dx1++dxk=z(dy1++dyk)+(y1++yk)dz.

It appears in the differential of the last variable:

dxk+1=zdyk+1+yk+1dz=z(dy1++dyk)+(1y1yk)dz=dzω.

The value of this lies in the observation that

dx1dxkω=0

because, when you expand this product, there is one term containing dx1dx1=0 as a factor, another containing dx2dx2=0, and so on: they all disappear. Consequently,

dx1dxkdxk+1=dx1dxkzdx1dxkω=dx1dxkz.

Whence (because all products dzdz disappear),

dx1dxk+1=(zdy1+y1dz)(zdyk+ykdz)dz=zkdy1dykdz.

The Jacobian is simply |zk|=zk, the coefficient of the differential product on the right hand side.


Solution

The transformation (x1,,xk,xk+1)(y1,,yk,z) is one-to-one: its inverse is given by xi=yiz for 1ik and xk+1=z(1y1yk). Therefore we don't have to fuss any more about the new probability element; it simply is

(zy1)α11(zyk)αk1(z(1y1yk))αk+11exp(z)|zkdy1dykdz|=(zα1++αk+11exp(z)dz)(y1α11ykαk1(1y1yk)αk+11dy1dyk).

That is manifestly a product of a Gamma(α1++αk+1) distribution (for Z) and a Dirichlet(α) distribution (for (Y1,,Yk)). In fact, since the original normalizing constant must have been a product of Γ(αi), we deduce immediately that the new normalizing constant must be divided by Γ(α1++αk+1), enabling the PDF to be written

fY(y,α)=Γ(α1++αk+1)Γ(α1)Γ(αk+1)(y1α11ykαk1(1y1yk)αk+11).
By using our site, you acknowledge that you have read and understand our Cookie Policy and Privacy Policy.
Licensed under cc by-sa 3.0 with attribution required.