Finding a pair of non-overlapping bit vectors


17

I give you a list of $n$ bitvectors, each of width $k$. Your goal is to return two bitvectors from the list that share no 1s, or to report that no such pair exists.

For example, if I give you $[00110, 01100, 11000]$ then the only solution is $\{00110, 11000\}$. Alternatively, the input $[111, 011, 110, 101]$ has no solution. Any input containing the all-zeros bitvector $000...0$ and another element $e$ has the trivial solution $\{e, 000...0\}$.

Here's a slightly more difficult example with no solution (each row is a bitvector; black squares are 1s and white squares are 0s):

■ ■ ■ ■ □ □ □ □ □ □ □ □ □
■ □ □ □ ■ ■ ■ □ □ □ □ □ □ 
■ □ □ □ □ □ □ ■ ■ ■ □ □ □
■ □ □ □ □ □ □ □ □ □ ■ ■ ■
□ ■ □ □ □ ■ □ □ □ ■ ■ □ □
□ ■ □ □ ■ □ □ □ ■ □ □ □ ■
□ ■ □ □ □ □ ■ ■ □ □ □ ■ □ <-- All row pairs share a black square
□ □ ■ □ □ □ ■ □ ■ □ ■ □ □
□ □ ■ □ □ ■ □ ■ □ □ □ □ ■
□ □ ■ □ ■ □ □ □ □ ■ □ ■ □
□ □ □ ■ ■ □ □ ■ □ □ ■ □ □
□ □ □ ■ □ □ ■ □ □ ■ □ □ ■
□ □ □ ■ □ ■ □ □ ■ □ □ ■ □

How efficiently can we find two non-overlapping bitvectors, or show that none exist?

The naive algorithm, which just compares every possible pair, is $O(n^2 k)$. Is it possible to do better?
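For concreteness, the naive algorithm can be sketched in a few lines of Python (the function name and representation are my own; vectors are 0/1 strings as in the examples above):

```python
from itertools import combinations

def find_disjoint_pair(vectors):
    """Naive O(n^2 k) scan: return a pair of bitvectors sharing no 1,
    or None if every pair overlaps. Vectors are equal-width 0/1 strings."""
    for a, b in combinations(vectors, 2):
        if all(x == '0' or y == '0' for x, y in zip(a, b)):
            return (a, b)
    return None
```

On the first example above this returns the pair `('00110', '11000')`; on the second it returns `None`.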


Possible reduction: you have a graph $G$ with a vertex for each vector, and an edge between two vertices if the two corresponding vectors share a 1. You want to know whether the diameter of the graph is 2. But it seems hard to do better than $O(n^2 k)$.
François Godi

@FrançoisGodi Any connected graph component with three nodes and a missing edge has diameter at least 2. With an adjacency-list representation, that takes $O(V)$ time to check.
Craig Gidney 2015

@Strilanc Of course, if there is no solution the graph is complete (clearer than saying diameter = 1, you're right), but computing the adjacency-list representation may take a long time.
François Godi 2015

Is $k$ smaller than the machine word size?
Raphael

1
@TomvanderZanden That sounds like it violates invariants the data structure may depend on. In particular, that kind of equality should be transitive. I've already been thinking about using a trie, and I don't see how to avoid a factor-of-2 blowup every time the query mask has a 0.
Craig Gidney

Answers:


10

Warmup: random bitvectors

As a warmup, we can start with the case where each bitvector is chosen independently and uniformly at random. It turns out that the problem can then be solved in $O(n^{1.6} \min(k, \lg n))$ time (more precisely, the $1.6$ can be replaced by $\lg 3$).

We'll consider the following two-set variant of the problem:

Given sets $S, T \subseteq \{0,1\}^k$ of bitvectors, determine whether there is a non-overlapping pair $s \in S$, $t \in T$.

The basic technique for solving this problem is divide-and-conquer. Here is an $O(n^{1.6} k)$-time divide-and-conquer algorithm:

  1. Split $S$ and $T$ based upon the first bit position. In other words, form $S_0 = \{s \in S : s_0 = 0\}$, $S_1 = \{s \in S : s_0 = 1\}$, $T_0 = \{t \in T : t_0 = 0\}$, $T_1 = \{t \in T : t_0 = 1\}$.

  2. Now recursively look for a non-overlapping pair from $S_0, T_0$, from $S_0, T_1$, and from $S_1, T_0$. (The pairing $S_1, T_1$ can be skipped, since any such pair overlaps in the first bit.) If any recursive call finds a non-overlapping pair, output it; otherwise output "No non-overlapping pair exists".

Since all bitvectors are chosen at random, we can expect $|S_b| \approx |S|/2$ and $|T_b| \approx |T|/2$. Thus we make three recursive calls and reduce the size of the problem by a factor of two (both sets shrink by a factor of two). After $\lg \min(|S|, |T|)$ splits, one of the two sets is down to size 1, and the problem can be solved in linear time. We get a recurrence of the form $T(n) = 3T(n/2) + O(nk)$, whose solution is $T(n) = O(n^{1.6} k)$. Accounting for running time more precisely in the two-set case, the running time is $O(\min(|S|,|T|)^{0.6} \max(|S|,|T|) \, k)$.
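A hedged Python sketch of this recursion (the naming is my own; bitvectors are tuples of 0/1 ints, with a brute-force base case once the bits run out or either set is down to one element):

```python
def disjoint_pair(S, T, pos=0):
    """Look for s in S, t in T with s AND t == 0, splitting on bit `pos`.
    Recurses on (S0,T0), (S0,T1), (S1,T0); the (S1,T1) pairing is skipped
    because both sides have a 1 at `pos` and therefore overlap."""
    if not S or not T:
        return None
    k = len(S[0])
    if pos >= k or len(S) == 1 or len(T) == 1:
        # Base case: brute force, O(|S| |T| k).
        for s in S:
            for t in T:
                if all(a == 0 or b == 0 for a, b in zip(s, t)):
                    return (s, t)
        return None
    S0 = [s for s in S if s[pos] == 0]
    S1 = [s for s in S if s[pos] == 1]
    T0 = [t for t in T if t[pos] == 0]
    T1 = [t for t in T if t[pos] == 1]
    return (disjoint_pair(S0, T0, pos + 1)
            or disjoint_pair(S0, T1, pos + 1)
            or disjoint_pair(S1, T0, pos + 1))
```

For instance, `disjoint_pair(V, V)` with `V = [(0,0,1,1,0), (0,1,1,0,0), (1,1,0,0,0)]` finds the disjoint pair from the first example, while the $[111, 011, 110, 101]$ instance yields `None`.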

This can be further improved by noting that if $k \geq 2.5 \lg n + 100$, then the probability that a non-overlapping pair exists is exponentially small. In particular, if $x, y$ are two random vectors, the probability that they're non-overlapping is $(3/4)^k$. If $|S| = |T| = n$, there are $n^2$ such pairs, so by a union bound, the probability that a non-overlapping pair exists is at most $n^2 (3/4)^k$. When $k \geq 2.5 \lg n + 100$, this is $\leq 1/2^{100}$. So, as a pre-processing step, if $k \geq 2.5 \lg n + 100$, then we can immediately return "No non-overlapping pair exists" (the probability this is incorrect is negligibly small); otherwise we run the above algorithm.

Thus we achieve a running time of $O(n^{1.6} \min(k, \lg n))$ (or $O(\min(|S|,|T|)^{0.6} \max(|S|,|T|) \min(k, \lg n))$ for the two-set variant proposed above), for the special case where the bitvectors are chosen uniformly at random.

Of course, this is not a worst-case analysis. Random bitvectors are considerably easier than the worst case -- but let's treat it as a warmup, to get some ideas that perhaps we can apply to the general case.

Lessons from the warmup

We can learn a few lessons from the warmup above. First, divide-and-conquer (splitting on a bit position) seems helpful. Second, you want to split on a bit position with as many 1s in that position as possible: the more 0s there are, the less reduction in subproblem size you get.

Third, this suggests that the problem gets harder as the density of 1s gets smaller -- if there are very few 1s among the bitvectors (they are mostly 0s), the problem looks quite hard, as each split reduces the size of the subproblems only a little. So, define the density $\Delta$ to be the fraction of bits that are 1 (i.e., out of all $nk$ bits), and the density $\Delta(i)$ of bit position $i$ to be the fraction of bitvectors that are 1 at position $i$.

Handling very low density

As a next step, we might wonder what happens if the density is extremely small. It turns out that if the density in every bit position is smaller than $1/k$, we're guaranteed that a non-overlapping pair exists: there is a (non-constructive) existence argument showing that some non-overlapping pair must exist. This doesn't help us find it, but at least we know it exists.

Why is this the case? Let's say that a pair of bitvectors $x, y$ is covered by bit position $i$ if $x_i = y_i = 1$. Note that every pair of overlapping bitvectors must be covered by some bit position. Now, if we fix a particular bit position $i$, the number of pairs that can be covered by that bit position is at most $(n \Delta(i))^2 < n^2/k$. Summing across all $k$ of the bit positions, we find that the total number of pairs that are covered by some bit position is $< n^2$. This means there must exist some pair that's not covered by any bit position, which implies that this pair is non-overlapping. So if the density is sufficiently low in every bit position, then a non-overlapping pair surely exists.
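The counting argument is easy to check mechanically. A small Python sketch (names are my own) that evaluates the certificate; like the argument above it is non-constructive, reporting only that a non-overlapping pair must exist without exhibiting one:

```python
def disjoint_pair_certified(vectors):
    """Non-constructive certificate from the covering argument: each bit
    position i can cover at most (number of 1s at i)^2 ordered pairs, so
    if those squares sum to fewer than n^2 ordered pairs in total, some
    pair is uncovered, i.e. non-overlapping."""
    n, k = len(vectors), len(vectors[0])
    counts = [sum(v[i] for v in vectors) for i in range(k)]  # n * density(i)
    return sum(c * c for c in counts) < n * n
```

When every position has density below $1/k$, the sum is below $k \cdot (n/k)^2 = n^2/k \leq n^2$, so the certificate always fires in that regime.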

However, I'm at a loss to identify a fast algorithm to find such a non-overlapping pair in this regime, even though one is guaranteed to exist. I don't immediately see any techniques that would yield a running time with a sub-quadratic dependence on $n$. So, this is a nice special case to focus on, if you want to spend some time thinking about this problem.

Towards a general-case algorithm

In the general case, a natural heuristic seems to be: pick the bit position $i$ with the largest number of 1s (i.e., with the highest density), and split on it. In other words:

  1. Find a bit position $i$ that maximizes $\Delta(i)$.

  2. Split $S$ and $T$ based upon bit position $i$. In other words, form $S_0 = \{s \in S : s_i = 0\}$, $S_1 = \{s \in S : s_i = 1\}$, $T_0 = \{t \in T : t_i = 0\}$, $T_1 = \{t \in T : t_i = 1\}$.

  3. Now recursively look for a non-overlapping pair from $S_0, T_0$, from $S_0, T_1$, and from $S_1, T_0$. If any recursive call finds a non-overlapping pair, output it; otherwise output "No non-overlapping pair exists".
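The steps above can be sketched as follows (a hedged Python illustration with my own names; two assumptions are worth flagging: density is computed over $S$ and $T$ together, and the set of not-yet-split positions is threaded through so that the recursion is guaranteed to terminate):

```python
def densest_split_search(S, T, positions=None):
    """Recursive search splitting on the remaining bit position with the
    most 1s. Returns a non-overlapping (s, t) pair or None."""
    if not S or not T:
        return None
    if positions is None:
        positions = list(range(len(S[0])))
    if not positions or len(S) == 1 or len(T) == 1:
        # Base case: brute force over the full vectors.
        for s in S:
            for t in T:
                if not any(a and b for a, b in zip(s, t)):
                    return (s, t)
        return None
    # Pick the remaining position with the highest density of 1s.
    i = max(positions, key=lambda j: sum(v[j] for v in S) + sum(v[j] for v in T))
    rest = [j for j in positions if j != i]
    S0 = [s for s in S if s[i] == 0]
    S1 = [s for s in S if s[i] == 1]
    T0 = [t for t in T if t[i] == 0]
    T1 = [t for t in T if t[i] == 1]
    # Skip (S1, T1): those pairs overlap at position i.
    return (densest_split_search(S0, T0, rest)
            or densest_split_search(S0, T1, rest)
            or densest_split_search(S1, T0, rest))
```

This is only a sketch of the heuristic's control flow; it does not implement the density-maintenance or cutoff ideas discussed below.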

The challenge is to analyze its performance in the worst case.

Let's assume that as a pre-processing step we first compute the density of every bit position. Also, if $\Delta(i) < 1/k$ for every $i$, assume that the pre-processing step outputs "A non-overlapping pair exists" (I realize that this doesn't exhibit an example of a non-overlapping pair, but let's set that aside as a separate challenge). All this can be done in $O(nk)$ time. The density information can be maintained efficiently as we do recursive calls; it won't be the dominant contributor to the running time.

What will the running time of this procedure be? I'm not sure, but here are a few observations that might help. Each level of recursion reduces the problem size by about $n/k$ bitvectors (e.g., from $n$ bitvectors to $n - n/k$ bitvectors). Therefore, the recursion can only go about $k$ levels deep. However, I'm not immediately sure how to count the number of leaves in the recursion tree (there are far fewer than $3^k$ leaves), so I'm not sure what running time this should lead to.


ad low density: this seems to be some kind of pigeonhole argument. Maybe if we use your general idea (split w.r.t. the column with the most ones), we get better bounds, because the $(S_1, T_1)$ case (which we don't recurse into) already gets rid of "most" ones?
Raphael

The total number of ones may be a useful parameter. You have already shown a lower bound we can use for cutting off the tree; can we show upper bounds, too? For example, if there are more than $ck$ ones, we have at least $c$ overlaps.
Raphael

By the way, how do you propose we do the first split; arbitrarily? Why not just split the whole input set w.r.t. some column $i$? We only need to recurse in the 0-case (there is no solution among those that share a one at $i$). In expectation, that gives, via $T(n) = T(n/2) + O(nk)$, a bound of $O(nk)$ (for fixed $k$). For a general bound, you have shown (assuming the lower-bound cutoff you propose) that we get rid of at least $n/k$ elements with every split, which seems to imply an $O(nk)$ worst-case bound. Or am I missing something?
Raphael

Ah, that's wrong, of course, since it does not consider 0-1-mismatches. That's what I get for trying to think before breakfast, I guess.
Raphael

@Raphael, there are two issues: (a) the vectors might be mostly zeros, so you can't count on getting a 50-50 split; the recurrence would be something more like $T(n) = T(n - n/k) + O(nk)$, (b) more importantly, it's not enough to just recurse on the 0-subset; you also need to examine pairings between a vector from the 0-subset and a vector from the 1-subset, so there's an additional recursion or two to do. (I think? I hope I got that right.)
D.W.

8

Faster solution when $n \approx k$, using matrix multiplication

Suppose that $n = k$. Our goal is to do better than an $O(n^2 k) = O(n^3)$ running time.

We can think of the bitvectors and bit positions as nodes in a graph. There is an edge between a bitvector node and a bit-position node when the bitvector has a 1 in that position. The resulting graph is bipartite (with the bitvector-representing nodes on one side and the bit-position-representing nodes on the other), and has $n + k = 2n$ nodes.

Given the adjacency matrix $M$ of a graph, we can tell whether there is a two-hop path between two vertices by squaring $M$ and checking whether the resulting matrix has an "edge" between those two vertices (i.e. the edge's entry in the squared matrix is non-zero). For our purposes, a zero entry in the squared adjacency matrix corresponds to a non-overlapping pair of bitvectors (i.e. a solution). A lack of any zeroes means there's no solution.

Squaring an $n \times n$ matrix can be done in $O(n^\omega)$ time, where $\omega$ is known to be under $2.373$ and conjectured to be $2$.

So the algorithm is:

  • Convert the bitvectors and bit positions into a bipartite graph with $n + k$ nodes and at most $nk$ edges. This takes $O(nk)$ time.
  • Compute the adjacency matrix of the graph. This takes $O((n+k)^2)$ time and space.
  • Square the adjacency matrix. This takes $O((n+k)^\omega)$ time.
  • Search the bitvector section of the squared matrix for zero entries. This takes $O(n^2)$ time.

The most expensive step is squaring the adjacency matrix. If $n = k$ then the overall algorithm takes $O((n+k)^\omega) = O(n^\omega)$ time, which is better than the naive $O(n^3)$ time.
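A sketch of these steps in Python with NumPy (the plain `@` product stands in for a fast matrix-multiplication routine, so this sketch itself runs in cubic time; the node layout, names, and diagonal masking to exclude pairing a vector with itself are my own):

```python
import numpy as np

def non_overlapping_pair(bitvectors):
    """Find a non-overlapping pair via the bipartite-graph construction:
    build the (n+k) x (n+k) adjacency matrix, square it, and look for a
    zero entry in the bitvector-vs-bitvector block."""
    B = np.array(bitvectors, dtype=np.int64)   # n x k biadjacency block
    n, k = B.shape
    A = np.zeros((n + k, n + k), dtype=np.int64)
    A[:n, n:] = B
    A[n:, :n] = B.T
    A2 = A @ A                                  # two-hop path counts
    block = A2[:n, :n]                          # bitvector-to-bitvector
    np.fill_diagonal(block, 1)                  # ignore self-pairs
    zeros = np.argwhere(block == 0)
    if zeros.size == 0:
        return None
    i, j = zeros[0]
    return int(i), int(j)
```

On the first example from the question this returns the index pair `(0, 2)`.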

This solution is also faster when $k$ grows not-too-much-slower and not-too-much-faster than $n$. As long as $k \in \Omega(n^{\omega-2})$ and $k \in O(n^{2/(\omega-1)})$, then $(n+k)^\omega$ is better than $n^2 k$. For $\omega \approx 2.373$ that translates to $n^{0.373} \lesssim k \lesssim n^{1.457}$ (asymptotically). If $\omega$ reaches its conjectured limit of $2$, then the bounds widen towards $n^\epsilon \lesssim k \lesssim n^{2-\epsilon}$.


1. This is also better than the naive solution if $k = \Omega(n)$ but $k = o(n^{1.457})$. 2. If $k \gg n$, a heuristic could be: pick a random subset of $n$ bit positions, restrict to those bit positions, and use matrix multiplication to enumerate all pairs that don't overlap in those $n$ bit positions; for each such pair, check whether it solves the original problem. If there aren't many pairs that don't overlap in those $n$ bit positions, this provides a speedup over the naive algorithm. However, I don't know a good upper bound on the number of such pairs.
D.W.

4

This is equivalent to finding a bitvector that is a subset of the complement of another vector; i.e., its 1s occur only where 0s occur in the other.

If $k$ (or the number of 1s) is small, you can get $O(n 2^k)$ time by simply generating all the subsets of the complement of each bitvector and putting them in a trie (using backtracking). If a bitvector is found in the trie (we check each before inserting its complement's subsets), then we have a non-overlapping pair.

If the number of 1s or 0s is bounded by an even lower number than $k$, the exponent can be replaced by that bound. The subset-indexing can be done on either each vector or its complement, so long as probing uses the opposite.
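A compact Python sketch of the complement-subset idea (names are my own; a dict of integer masks stands in for the trie, and the standard submask-enumeration trick replaces the backtracking):

```python
def disjoint_pair_submask(vectors, k):
    """Sketch of the complement-subset idea: each vector (an int mask of
    width k) records every submask of its complement in `seen`; a later
    vector equal to a recorded submask is disjoint from the vector that
    recorded it. O(n * 2^k) overall."""
    full = (1 << k) - 1
    seen = {}                        # submask -> index of recording vector
    for idx, v in enumerate(vectors):
        if v in seen:                # v is a submask of some earlier ~u
            return seen[v], idx
        comp = full & ~v
        sub = comp                   # enumerate all submasks of comp
        while True:
            seen.setdefault(sub, idx)
            if sub == 0:
                break
            sub = (sub - 1) & comp
    return None
```

On `[0b00110, 0b01100, 0b11000]` with `k = 5` this returns the index pair `(0, 2)`.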

There's also a scheme for superset-finding in a trie that stores each vector only once, but does bit-skipping during probes, for what I believe is a similar aggregate complexity; i.e., it has $O(k)$ insertion but $O(2^k)$ searches.


Thanks. The complexity of your solution is $n \cdot 2^{(1-p)k}$, where $p$ is the probability of 1s in the bitvectors. A couple of implementation details, though they are only slight improvements: there's no need to compute and store the complements in the trie; just following the complementary branches when checking for a non-overlapping match is enough. And, taking the 0s directly as wildcards, no special wildcard is needed either.
Mauro Lacy

2

Represent the bitvectors as an $n \times k$ matrix $M$. Take $i$ and $j$ between $1$ and $n$.

$$(MM^T)_{ij} = \sum_l M_{il} M_{jl}.$$

$(MM^T)_{ij}$, the dot product of the $i$th and $j$th vectors, is non-zero if, and only if, vectors $i$ and $j$ share a common 1. So, to find a solution, compute $MM^T$ and return the position of a zero entry, if such an entry exists.
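In NumPy this is a few lines (the plain `@` product below is naive, so it illustrates the construction rather than the fast asymptotics; masking the diagonal to exclude pairing a vector with itself is an assumption on my part):

```python
import numpy as np

def disjoint_pair_gram(vectors):
    """Return indices (i, j) of a zero off-diagonal entry of M M^T,
    i.e. two vectors sharing no common 1, or None if all pairs overlap."""
    M = np.array(vectors, dtype=np.int64)   # n x k bit matrix
    G = M @ M.T                             # G[i][j] = <row i, row j>
    np.fill_diagonal(G, 1)                  # ignore the i == j entries
    zeros = np.argwhere(G == 0)
    if zeros.size == 0:
        return None
    return int(zeros[0][0]), int(zeros[0][1])
```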

Complexity

Using naive multiplication, this requires $O(n^2 k)$ arithmetic operations. If $n = k$, it takes $O(n^{2.37})$ operations using the utterly impractical Coppersmith-Winograd algorithm, or $O(n^{2.8})$ using the Strassen algorithm. If $k = O(n^{0.302})$, then the problem may be solved using $n^{2+o(1)}$ operations.


How is this different from Strilanc's answer?
D.W.

1
@D.W. Using an $n$-by-$k$ matrix instead of an $(n+k)$-by-$(n+k)$ matrix is an improvement. Also, it mentions a way to cut off the factor of $k$ when $k \ll n$, so that might be useful.
Craig Gidney
Licensed under cc by-sa 3.0 with attribution required.