I find that heuristic arguments are often quite misleading when considering task scheduling (and closely related problems like bin packing): counter-intuitive things can happen. For such a simple case, it is worthwhile actually doing the probability theory.
Let $n = km$ with $k$ a positive integer. Suppose $T_{ij}$ is the time taken to complete the $j$-th task given to processor $i$. This is a random variable with mean $\mu$ and variance $\sigma^2$. The expected makespan in the first case is
$$E[M] = E\left[\max\left\{\sum_{j=1}^{k} T_{ij} \;\middle|\; i = 1, 2, \dots, m\right\}\right].$$
The sums are all iid with mean $k\mu$ and variance $k\sigma^2$, assuming that the $T_{ij}$ are all iid (this is stronger than pairwise independence).
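To make this concrete, here is a minimal Monte Carlo sketch in Python (numpy). The gamma task-time distribution, the parameter values, and the trial count are illustrative assumptions on my part; the argument above only requires the $T_{ij}$ to be iid with mean $\mu$ and variance $\sigma^2$.

```python
import numpy as np

rng = np.random.default_rng(0)

m, k = 10, 20          # m processors, k tasks per processor (n = k*m tasks)
mu, sigma = 1.0, 0.5   # per-task mean and standard deviation
trials = 20_000        # Monte Carlo repetitions

# Illustrative choice: gamma-distributed task times with the given mean
# and variance (shape*scale = mu, shape*scale^2 = sigma^2).
shape, scale = (mu / sigma) ** 2, sigma ** 2 / mu
T = rng.gamma(shape, scale, size=(trials, m, k))

# Makespan M = max over processors of the sum of that processor's k tasks.
M = T.sum(axis=2).max(axis=1)
print(f"estimated E[M] = {M.mean():.3f} (compare k*mu = {k * mu:.3f})")
```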
Now to obtain the expectation of a maximum, one either needs more information about the distribution, or one has to settle for distribution-free bounds, such as:
- Peter J. Downey, Distribution-free bounds on the expectation of the maximum with scheduling applications, Operations Research Letters 9, 189–201, 1990. doi:10.1016/0167-6377(90)90018-Z
which can be applied if the processor-wise sums are iid. This would not necessarily be the case if the underlying times were just pairwise independent. In particular, by Theorem 1 the expected makespan is bounded above by
$$E[M] \le k\mu + \sigma\sqrt{k}\,\frac{m-1}{\sqrt{2m-1}},$$
since the maximum is taken over $m$ iid sums, each with mean $k\mu$ and standard deviation $\sigma\sqrt{k}$.
Downey also gives a particular distribution achieving this bound, although the distribution changes as $m$ does, and is not exactly natural.
Note that the bound says that the expected makespan can increase as any of the parameters increases: the variance $\sigma^2$, the number of processors $m$, or the number of tasks per processor $k$.
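As a sanity check, one can compare the bound against a simulated makespan; the following sketch reuses the illustrative gamma choice from above (the bound itself depends only on the mean and variance):

```python
import numpy as np

rng = np.random.default_rng(1)

m, k = 10, 20
mu, sigma = 1.0, 0.5
trials = 20_000

# Distribution-free bound applied to the m iid processor sums,
# each with mean k*mu and standard deviation sigma*sqrt(k).
bound = k * mu + sigma * np.sqrt(k) * (m - 1) / np.sqrt(2 * m - 1)

# Monte Carlo estimate under the illustrative gamma task times.
shape, scale = (mu / sigma) ** 2, sigma ** 2 / mu
M = rng.gamma(shape, scale, size=(trials, m, k)).sum(axis=2).max(axis=1)

print(f"bound = {bound:.3f}, simulated E[M] = {M.mean():.3f}")
```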
For your second question, the low-variance scenario resulting in a larger makespan seems to be an unlikely outcome of a thought experiment. Let $X = \max_{i=1}^{m} X_i$ denote the makespan for the first distribution, and $Y = \max_{i=1}^{m} Y_i$ for the second (with all other parameters the same). Here $X_i$ and $Y_i$ denote the sums of $k$ task durations corresponding to processor $i$ under the two distributions. Since the lower-variance sums are more concentrated around the common mean $k\mu$, it is reasonable to assume $\Pr[X_i \le x] \le \Pr[Y_i \le x]$ for all $x \ge k\mu$ (this holds, for instance, for normal distributions with equal means and ordered variances); independence then yields
$$\Pr[X \le x] = \prod_{i=1}^{m} \Pr[X_i \le x] \le \prod_{i=1}^{m} \Pr[Y_i \le x] = \Pr[Y \le x].$$
Since most of the mass of the probability distribution of the maximum will be above its mean, $E[X]$ will therefore tend to be larger than $E[Y]$. This is not a completely rigorous answer, but in short, the second case seems preferable.
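The comparison is easy to check empirically. The sketch below pairs a high-variance distribution (exponential) with a low-variance one (a narrow uniform), both with the same per-task mean; this pairing is my own illustrative choice, not from the question. The high-variance case typically shows the larger expected makespan:

```python
import numpy as np

rng = np.random.default_rng(2)

m, k, trials = 10, 20, 20_000
mu = 1.0

def mean_makespan(draw):
    """Estimate E[max_i sum_j T_ij] given a sampler of task times."""
    T = draw(size=(trials, m, k))
    return T.sum(axis=2).max(axis=1).mean()

# Two task-time distributions with the same mean mu:
#   X: exponential, variance mu^2 (high);
#   Y: uniform on [0.9*mu, 1.1*mu], variance mu^2/300 (low).
EX = mean_makespan(lambda size: rng.exponential(mu, size=size))
EY = mean_makespan(lambda size: rng.uniform(0.9 * mu, 1.1 * mu, size=size))

print(f"high-variance E[X] = {EX:.3f}, low-variance E[Y] = {EY:.3f}")
```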