I want to compute the spectrum (all eigenvalues) of a large sparse matrix (hundreds of thousands of rows). This is hard.
I am willing to settle for an approximation. Are there approximation methods for doing this?
While I hope for a general answer to this question, I would also be satisfied with an answer in the following special case. My matrix is the normalized Laplacian of a large graph. The eigenvalues will lie between 0 and 2, with many of them clustered around 1.
Answers:
If your graph is undirected (as I suspect), the matrix is symmetric, and you cannot do better than the Lanczos algorithm (with selective reorthogonalization, if needed, for stability). Since the full spectrum consists of 100000 numbers, you are presumably mainly interested in the spectral density.
To get an approximate spectral density, take the spectrum of the leading Krylov subspace of dimension around 100 and replace its discrete density with a smoothed version.
The leading Krylov spectrum will have nearly exact well-isolated eigenvalues (should these exist), approximations of the eigenvalues at the ends of the non-isolated spectrum, and somewhat random values in between, whose cumulative distribution function resembles that of the true spectrum. In exact arithmetic it would converge to the true spectrum as the dimension increases. (If your operator were infinite-dimensional, this would still be the case, and you would get integrals of the true spectral density function over the continuous spectrum.)
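A minimal Python sketch of this recipe (my own illustration, not code from the answer): run plain Lanczos for about 100 steps, take the Ritz values of the resulting tridiagonal matrix, and replace each discrete eigenvalue with a Gaussian bump. The smearing width `sigma` is a free parameter, and no reorthogonalization is done here, so for production use a library implementation would be safer.

```python
import numpy as np
import scipy.sparse as sp

def lanczos_ritz(A, m=100, seed=0):
    """Plain Lanczos (no reorthogonalization); returns the m Ritz values."""
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    v = rng.standard_normal(n)
    v /= np.linalg.norm(v)
    v_prev = np.zeros(n)
    beta = 0.0
    alphas, betas = [], []
    for _ in range(m):
        w = A @ v
        alpha = v @ w
        w = w - alpha * v - beta * v_prev
        beta = np.linalg.norm(w)
        alphas.append(alpha)
        betas.append(beta)
        if beta < 1e-12:          # invariant subspace found early
            break
        v_prev, v = v, w / beta
    T = np.diag(alphas) + np.diag(betas[:-1], 1) + np.diag(betas[:-1], -1)
    return np.linalg.eigvalsh(T)

def smoothed_density(ritz, grid, sigma=0.02):
    """Replace each Ritz value by a unit-mass Gaussian; total mass is 1."""
    bumps = np.exp(-((grid[:, None] - ritz[None, :]) ** 2) / (2 * sigma**2))
    return bumps.sum(axis=1) / (len(ritz) * sigma * np.sqrt(2 * np.pi))
```

For the normalized graph Laplacian case in the question, `grid = np.linspace(0, 2, 400)` would be a natural evaluation grid.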
Arnold Neumaier's answer is discussed in more detail in Section 3.2 of the paper "Approximating Spectral Densities of Large Matrices" by Lin Lin, Yousef Saad, and Chao Yang (2016).
Some other methods are also discussed, but the numerical analysis at the end of the paper shows that the Lanczos method outperforms them.
Here is another way to characterize the spectrum.
The above appears to weight parts of the spectrum more evenly than a similarly smeared Krylov spectral density --- try diag(linspace(0, 1, 150000)) --- although maybe there is a way to correct for this? This is somewhat similar to the pseudospectral approach, but the result indicates the (smeared) number of eigenvalues in the vicinity of a given point, rather than the inverse distance to the nearest eigenvalue.
EDIT: A better-performing alternative for computing the above quantity is to compute Chebyshev moments (via a similar stochastic evaluation as above) and then reconstruct the spectral density from them. This requires neither matrix inversions nor separate computations for each point. See http://theorie2.physik.uni-greifswald.de/downloads/publications/LNP_chapter19.pdf and references therein.
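A hedged sketch of this kernel polynomial method (KPM) idea, under the assumption that the spectrum lies in [0, 2] as for a normalized graph Laplacian (so the matrix is shifted to [-1, 1]): Chebyshev moments are estimated stochastically with random +/-1 vectors, damped with the standard Jackson kernel to suppress Gibbs oscillations, and summed into a smooth density. Parameter choices here are illustrative, not from the answer.

```python
import numpy as np
import scipy.sparse as sp

def kpm_density(A, grid, n_moments=100, n_vectors=10, seed=0):
    """Stochastic KPM estimate of the spectral density of A on `grid`.

    Assumes the eigenvalues of A lie in [0, 2]; A is shifted so that
    B = A - I has spectrum in [-1, 1].
    """
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    B = A - sp.identity(n)                     # shift [0, 2] -> [-1, 1]
    mu = np.zeros(n_moments)
    for _ in range(n_vectors):
        v = rng.choice([-1.0, 1.0], size=n)    # random sign vector
        t0, t1 = v, B @ v                      # T_0(B) v, T_1(B) v
        mu[0] += v @ t0
        mu[1] += v @ t1
        for k in range(2, n_moments):
            t0, t1 = t1, 2 * (B @ t1) - t0     # Chebyshev recurrence
            mu[k] += v @ t1
    mu /= n_vectors * n                        # mu_k ~ tr T_k(B) / n
    # Jackson damping coefficients
    N = n_moments
    k = np.arange(N)
    g = ((N - k + 1) * np.cos(np.pi * k / (N + 1))
         + np.sin(np.pi * k / (N + 1)) / np.tan(np.pi / (N + 1))) / (N + 1)
    # Reconstruct density on the shifted variable x = lambda - 1
    x = np.clip(grid - 1.0, -1.0, 1.0)
    T = np.cos(np.arange(N)[:, None] * np.arccos(x)[None, :])
    series = g[0] * mu[0] * T[0] + 2 * (g[1:, None] * mu[1:, None] * T[1:]).sum(axis=0)
    return series / (np.pi * np.sqrt(np.clip(1 - x**2, 1e-12, None)))
```

Each of the `n_moments * n_vectors` steps costs one sparse matrix-vector product, which is what makes this attractive for matrices with hundreds of thousands of rows.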
See the paper "On Sampling-based Approximate Spectral Decomposition" by Sanjiv Kumar, Mehryar Mohri & Ameet Talwalkar (ICML 2009). It uses sampling of the columns of your matrix.
Since your matrix is symmetric you should do the following:
Let A be your n*n matrix. You want to reduce the computation of the eigenvalues of an n*n matrix to the computation of the eigenvalues of a k*k matrix. First choose your value of k. Let's say you choose k=500, since you can easily compute the eigenvalues of a 500*500 matrix. Then, randomly choose k columns of the matrix A. Construct the matrix B that keeps only these columns, and the corresponding rows.
B = A(x,x) for a random set of k indexes x
B is now a k*k matrix. Compute the eigenvalues of B, and multiply them by (n/k). You now have k values which are approximately distributed like the n eigenvalues of A. Note that you get only k values, not n, but their distribution will be correct (up to the fact that they are an approximation).
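The steps above can be sketched directly (my own illustration of the recipe as stated): sample a random principal submatrix, compute its eigenvalues, and scale by n/k. One sanity check on the scaling: in expectation the sum of the scaled eigenvalues matches trace(A), since a random k-subset of the diagonal carries a fraction k/n of the trace.

```python
import numpy as np
import scipy.sparse as sp

def sampled_spectrum(A, k=500, seed=0):
    """Eigenvalues of a random k-by-k principal submatrix, scaled by n/k."""
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    x = rng.choice(n, size=k, replace=False)   # random index set
    if sp.issparse(A):
        B = A[x][:, x].toarray()               # B = A(x, x)
    else:
        B = A[np.ix_(x, x)]
    return (n / k) * np.linalg.eigvalsh(B)     # k values, scaled by n/k
```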
You can always use the Gershgorin circle theorem bounds to approximate the eigenvalues.
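For a sparse symmetric matrix these bounds are cheap to compute. A small sketch: every eigenvalue lies in the union of the intervals [a_ii - r_i, a_ii + r_i], where r_i is the sum of absolute off-diagonal entries in row i.

```python
import numpy as np
import scipy.sparse as sp

def gershgorin_intervals(A):
    """Interval endpoints [d - r, d + r] for each Gershgorin disc of A."""
    A = sp.csr_matrix(A)
    d = A.diagonal()
    # row sums of absolute values, minus the diagonal contribution
    r = np.asarray(abs(A).sum(axis=1)).ravel() - np.abs(d)
    return d - r, d + r
```

Every eigenvalue of A lies in the union of these intervals, so `lo.min()` and `hi.max()` bound the whole spectrum; for a diagonally dominant matrix the intervals are narrow and quite informative.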
If the off-diagonal terms are small, the diagonal itself is a good approximation of the spectrum. Otherwise, if you end up with an approximation of the eigenspace (by other methods), you could try expressing the matrix in that basis. This leads to a matrix with smaller off-diagonal terms, whose diagonal is a better approximation of the spectrum.