192

是否存在用于Python的SciPy函数或NumPy函数或模块，用于在给定特定窗口的情况下计算一维数组的运行平均值？

— Shejo284
source

24

对于一个简短，快速的解决方案，它可以在一个循环中完成所有事情，而没有依赖关系，下面的代码效果很好。

mylist = [1, 2, 3, 4, 5, 6, 7]
N = 3
cumsum, moving_aves = [0], []

for i, x in enumerate(mylist, 1):
    cumsum.append(cumsum[i-1] + x)
    if i>=N:
        moving_ave = (cumsum[i] - cumsum[i-N])/N
        #can do stuff with moving_ave here
        moving_aves.append(moving_ave)

— 爱库德
source

44

快速？！该解决方案比Numpy解决方案要慢几个数量级。

— 巴特

3

尽管此本机解决方案很酷，但OP要求使用numpy / scipy函数-大概这些速度会更快。

— Demis

254

UPD：Alleo和jasaarim提出了更有效的解决方案。

您可以使用np.convolve：

np.convolve(x, np.ones((N,))/N, mode='valid')

说明

运行平均值是卷积数学运算的一种情况。对于移动平均值，您可以沿输入滑动窗口并计算窗口内容的平均值。对于离散的一维信号，卷积是相同的事情，除了用平均值代替之外，您还可以计算任意线性组合，即将每个元素乘以相应的系数并相加结果。窗口中每个位置对应的那些系数有时称为卷积核。现在，N个值的算术平均值为(x_1 + x_2 + ... + x_N) / N，因此对应的内核为(1/N, 1/N, ..., 1/N)，这正是我们使用所得到的np.ones((N,))/N。

边缘

的mode参数np.convolve指定如何处理边缘。我在valid这里选择模式是因为我认为大多数人都希望运行平均值起作用，但是您可能还有其他优先事项。这是说明两种模式之间差异的图表：

import numpy as np
import matplotlib.pyplot as plt
modes = ['full', 'same', 'valid']
for m in modes:
    plt.plot(np.convolve(np.ones((200,)), np.ones((50,))/50, mode=m));
plt.axis([-10, 251, -.1, 1.1]);
plt.legend(modes, loc='lower center');
plt.show()

Running mean convolve modes

— 青金石
source

4

我喜欢这个解决方案，因为它很干净（一行）并且相对有效（在numpy内部完成）。但是Alleo的“高效解决方案”使用起来numpy.cumsum具有更好的复杂性。

— Ulrich Stern

2

@denfromufa，我相信文档可以很好地涵盖实现，并且还链接到解释数学的Wikipedia。考虑到问题的重点，您是否认为此答案需要复制那些内容？

— 青金石

@lapis卷积在移动平均线上的用法非常少见且不明显。这是我找到的最佳视觉解释：matlabtricks.com/post-11/moving-average-by-convolution

— denfromufa

对于绘图和相关任务，将其填充为None值将很有帮助。我的建议（不是很漂亮，但很简短）：```def moving_average（x，N，fill = True）：返回np.concatenate（[x for x in [[无] *（N // 2 + N％2） * fill，np.convolve（x，np.ones（（N，））/ N，mode ='valid'），[None] *（N // 2）* fill，]如果len（x）]）` ``代码在SO注释xD中看起来非常丑陋，我不想添加另一个答案，因为答案太多了，但是您可以将其复制并粘贴到您的IDE中。

— Chaoste

143

高效的解决方案

卷积比简单方法好得多，但是（我猜）它使用FFT，因此速度很慢。但是专门用于计算运行，以下方法可以正常工作

def running_mean(x, N):
    cumsum = numpy.cumsum(numpy.insert(x, 0, 0)) 
    return (cumsum[N:] - cumsum[:-N]) / float(N)

要检查的代码

In[3]: x = numpy.random.random(100000)
In[4]: N = 1000
In[5]: %timeit result1 = numpy.convolve(x, numpy.ones((N,))/N, mode='valid')
10 loops, best of 3: 41.4 ms per loop
In[6]: %timeit result2 = running_mean(x, N)
1000 loops, best of 3: 1.04 ms per loop

注意 numpy.allclose(result1, result2)是True，这两种方法是等效的。N越大，时间差越大。

警告：尽管累积速度更快，但浮点错误会增加，这可能会导致结果无效/不正确/不可接受

评论指出了此浮点错误问题，但我在答案中将其变得更加明显。。

# demonstrate loss of precision with only 100,000 points
np.random.seed(42)
x = np.random.randn(100000)+1e6
y1 = running_mean_convolve(x, 10)
y2 = running_mean_cumsum(x, 10)
assert np.allclose(y1, y2, rtol=1e-12, atol=0)

累积的点越多，浮点误差就越大（因此值得注意的是1e5点，1e6点更重要，大于1e6，并且您可能想重置累加器）
您可以通过使用作弊，np.longdouble但是相对较大数量的点（大约> 1e5，但取决于您的数据），浮点错误仍然会变得很明显
您可以绘制误差并看到它相对较快地增加
卷积解算较慢，但没有这种浮点精度损失
Uniform_filter1d解决方案比该cumsum解决方案更快，并且没有这种浮点精度损失

— 等位基因
source

3

不错的解决方案！我的直觉是numpy.convolveO（mn）; 其文档中提到scipy.signal.fftconvolve使用FFT。

— Ulrich Stern

3

此方法不处理数组的边缘，对吗？

— 2016年

6

好的解决方案，但请注意，对于大型数组，它可能会遭受数值误差，因为在数组末尾，您可能要减去两个大数以获得较小的结果。

— 巴斯·温克尔斯

1

这使用整数除法而不是浮点除法：running_mean([1,2,3], 2)Gives array([1, 2])。更换x用[float(value) for value in x]的伎俩。

— ChrisW

4

如果x包含浮点数，则此解决方案的数值稳定性可能会成为问题。范例：在有人期望的情况下running_mean(np.arange(int(1e7))[::-1] + 0.2, 1)[-1] - 0.2返回。更多信息：en.wikipedia.org/wiki/Loss_of_significance0.0031250.0

— 米兰

79

更新：以下示例显示了旧pandas.rolling_mean功能，该功能已在最新版本的熊猫中删除。下面的函数调用的等效形式为

In [8]: pd.Series(x).rolling(window=N).mean().iloc[N-1:].values
Out[8]: 
array([ 0.49815397,  0.49844183,  0.49840518, ...,  0.49488191,
        0.49456679,  0.49427121])

熊猫比NumPy或SciPy更适合于此。它的函数rolling_mean方便地完成工作。当输入是数组时，它还会返回一个NumPy数组。

rolling_mean使用任何自定义的纯Python实现都很难在性能上胜过。这是针对两个建议的解决方案的示例性能：

In [1]: import numpy as np

In [2]: import pandas as pd

In [3]: def running_mean(x, N):
   ...:     cumsum = np.cumsum(np.insert(x, 0, 0)) 
   ...:     return (cumsum[N:] - cumsum[:-N]) / N
   ...:

In [4]: x = np.random.random(100000)

In [5]: N = 1000

In [6]: %timeit np.convolve(x, np.ones((N,))/N, mode='valid')
10 loops, best of 3: 172 ms per loop

In [7]: %timeit running_mean(x, N)
100 loops, best of 3: 6.72 ms per loop

In [8]: %timeit pd.rolling_mean(x, N)[N-1:]
100 loops, best of 3: 4.74 ms per loop

In [9]: np.allclose(pd.rolling_mean(x, N)[N-1:], running_mean(x, N))
Out[9]: True

关于如何处理边缘值，也有不错的选择。

— 贾萨里姆
source

6

Pandas的rolling_mean是完成这项工作的好工具，但已被ndarrays弃用。在将来的Pandas版本中，它将仅在Pandas系列上起作用。现在我们该如何处理非熊猫阵列数据？

— 迈克，

5

@Mikerolling_mean（）已被弃用，但现在您可以分别使用rolling和mean：df.rolling(windowsize).mean()现在可以代替（我可能会很快添加）。对于6,000行系列%timeit test1.rolling(20).mean()返回了1000个循环，最好是每个循环3：1.16毫秒

— Vlox

5

@Vlox df.rolling()运行得很好，问题在于，即使这种形式将来也将不支持ndarrays。要使用它，我们必须首先将数据加载到Pandas Dataframe中。我希望看到将此功能添加到numpy或中scipy.signal。

— 迈克

1

@迈克完全同意。我特别努力将pandas .ewm（）。mean（）速度匹配到自己的数组（而不是必须先将它们加载到df中）。我的意思是，它的速度很快，这很好，但是太频繁地移入和移出数据帧感觉有点笨拙。

— Vlox

6

%timeit bottleneck.move_mean(x, N)比我的PC上的cumsum和pandas方法快3到15倍。查看回购协议README中的基准。

— 2013年

50

您可以使用以下方法计算运行平均值：

import numpy as np

def runningMean(x, N):
    y = np.zeros((len(x),))
    for ctr in range(len(x)):
         y[ctr] = np.sum(x[ctr:(ctr+N)])
    return y/N

但这很慢。

幸运的是，numpy包含一个卷积函数，我们可以使用它来加快处理速度。运行均值等效于对所有成员等于的长x向量进行卷积。卷积的numpy实现包括起始瞬变，因此您必须删除前N-1个点：N1/N

def runningMeanFast(x, N):
    return np.convolve(x, np.ones((N,))/N)[(N-1):]

在我的机器上，快速版本的速度要快20-30倍，具体取决于输入向量的长度和平均窗口的大小。

请注意，convolve确实包含一个'same'模式，该模式似乎应该解决起始瞬态问题，但是它将其在开始和结束之间进行了拆分。

— 地铁
source

请注意，删除前N-1个点仍然会在最后一个点保留边界效果。解决此问题的更简单方法是使用mode='valid'在convolve不需要任何后期处理。

— 青金石2014年

1

@Psycho- mode='valid'从两端消除瞬态，对吗？如果len(x)=10和N=4，那么我希望获得10个结果，但valid返回

— 7。– mtrw 2014年

1

它从末尾消除瞬态，开始时没有瞬态。好吧，我想这是一个优先事项，我并不需要相同数量的结果，而只是要使数据中不存在的零斜率成为代价。顺便说一句，这是一个显示两种模式之间差异的命令：（

modes = ('full', 'same', 'valid'); [plot(convolve(ones((200,)), ones((50,))/50, mode=m)) for m in modes]; axis([-10, 251, -.1, 1.1]); legend(modes, loc='lower center')

使用pyplot和numpy导入）。

— 青金石

runningMean当您离开数组x[ctr:(ctr+N)]右侧时，我对零平均产生副作用。

— mrgloom

runningMeanFast也有这个边界效应的问题。

— mrgloom

21

或用于计算的python模块

在Tradewave.net上进行的测试中，TA-lib总是会赢得：

import talib as ta
import numpy as np
import pandas as pd
import scipy
from scipy import signal
import time as t

PAIR = info.primary_pair
PERIOD = 30

def initialize():
    storage.reset()
    storage.elapsed = storage.get('elapsed', [0,0,0,0,0,0])

def cumsum_sma(array, period):
    ret = np.cumsum(array, dtype=float)
    ret[period:] = ret[period:] - ret[:-period]
    return ret[period - 1:] / period

def pandas_sma(array, period):
    return pd.rolling_mean(array, period)

def api_sma(array, period):
    # this method is native to Tradewave and does NOT return an array
    return (data[PAIR].ma(PERIOD))

def talib_sma(array, period):
    return ta.MA(array, period)

def convolve_sma(array, period):
    return np.convolve(array, np.ones((period,))/period, mode='valid')

def fftconvolve_sma(array, period):    
    return scipy.signal.fftconvolve(
        array, np.ones((period,))/period, mode='valid')    

def tick():

    close = data[PAIR].warmup_period('close')

    t1 = t.time()
    sma_api = api_sma(close, PERIOD)
    t2 = t.time()
    sma_cumsum = cumsum_sma(close, PERIOD)
    t3 = t.time()
    sma_pandas = pandas_sma(close, PERIOD)
    t4 = t.time()
    sma_talib = talib_sma(close, PERIOD)
    t5 = t.time()
    sma_convolve = convolve_sma(close, PERIOD)
    t6 = t.time()
    sma_fftconvolve = fftconvolve_sma(close, PERIOD)
    t7 = t.time()

    storage.elapsed[-1] = storage.elapsed[-1] + t2-t1
    storage.elapsed[-2] = storage.elapsed[-2] + t3-t2
    storage.elapsed[-3] = storage.elapsed[-3] + t4-t3
    storage.elapsed[-4] = storage.elapsed[-4] + t5-t4
    storage.elapsed[-5] = storage.elapsed[-5] + t6-t5    
    storage.elapsed[-6] = storage.elapsed[-6] + t7-t6        

    plot('sma_api', sma_api)  
    plot('sma_cumsum', sma_cumsum[-5])
    plot('sma_pandas', sma_pandas[-10])
    plot('sma_talib', sma_talib[-15])
    plot('sma_convolve', sma_convolve[-20])    
    plot('sma_fftconvolve', sma_fftconvolve[-25])

def stop():

    log('ticks....: %s' % info.max_ticks)

    log('api......: %.5f' % storage.elapsed[-1])
    log('cumsum...: %.5f' % storage.elapsed[-2])
    log('pandas...: %.5f' % storage.elapsed[-3])
    log('talib....: %.5f' % storage.elapsed[-4])
    log('convolve.: %.5f' % storage.elapsed[-5])    
    log('fft......: %.5f' % storage.elapsed[-6])

结果：

[2015-01-31 23:00:00] ticks....: 744
[2015-01-31 23:00:00] api......: 0.16445
[2015-01-31 23:00:00] cumsum...: 0.03189
[2015-01-31 23:00:00] pandas...: 0.03677
[2015-01-31 23:00:00] talib....: 0.00700  # <<< Winner!
[2015-01-31 23:00:00] convolve.: 0.04871
[2015-01-31 23:00:00] fft......: 0.22306

— 精益求精
source

NameError: name 'info' is not defined。先生，我遇到了这个错误。

— Rezwanul Haque博士

1

看起来您的时间序列在平滑后发生了偏移，这是理想的效果吗？

— mrgloom

@mrgloom是的，出于可视化目的；否则它们将在图表上一条线出现; Rezwanul Haque女士，您可以删除所有对PAIR和信息的引用；那些是现已失效的tradewave.net的内部沙盒方法

— litepresence

21

有关即用型解决方案，请参见https://scipy-cookbook.readthedocs.io/items/SignalSmooth.html。它提供了与flat窗口类型的。请注意，这比简单的自己动手卷积方法要复杂得多，因为它试图通过反映数据来处理数据开头和结尾的问题（在您的情况下可能有效，也可能无效）。 ..）。

首先，您可以尝试：

a = np.random.random(100)
plt.plot(a)
b = smooth(a, window='flat')
plt.plot(b)

— 汉斯曼
source

1

该方法依赖于numpy.convolve，仅在改变顺序上有所不同。

— Alleo 2014年

10

当输入和输出具有相同的性质（例如，两个时间信号）时，返回的形状与输入信号形状不同的输出信号总是让我烦恼。它破坏了与相关自变量（例如时间，频率）的对应关系，使得绘图或比较不是直接的问题……无论如何，如果您有共同的感觉，则可能希望将建议函数的最后几行更改为y = np .convolve（w / w.sum（），s，mode ='same'）; return y [window_len-1 :-( window_len-1）]

— Christian O'Reilly

@ ChristianO'Reilly，您应该将其作为一个单独的答案发布，这正是我所要的，因为我确实有两个其他数组必须与平滑数据的长度匹配，进行绘图等。我想知道究竟是怎么做到的- w窗口大小和s数据是多少？

— Demis

@Demis Glad评论有帮助。有关numpy卷积函数的更多信息，请参见docs.scipy.org/doc/numpy-1.15.0/reference/Generated/…卷积函数（en.wikipedia.org/wiki/Convolution）将两个信号相互卷积。在这种情况下，它将信号与归一化（即，单位面积）窗口（w / w.sum（））卷积。

— Christian O'Reilly

18

您可以使用scipy.ndimage.filters.uniform_filter1d：

import numpy as np
from scipy.ndimage.filters import uniform_filter1d
N = 1000
x = np.random.random(100000)
y = uniform_filter1d(x, size=N)

uniform_filter1d：

使输出具有相同的numpy形状（即点数）
允许使用多种方式来处理'reflect'默认的边框，但就我而言，我宁愿'nearest'

它也相当快（比上面给出的cumsum方法快 50倍，快np.convolve2-5倍）：

%timeit y1 = np.convolve(x, np.ones((N,))/N, mode='same')
100 loops, best of 3: 9.28 ms per loop

%timeit y2 = uniform_filter1d(x, size=N)
10000 loops, best of 3: 191 µs per loop

这是3个函数，可让您比较不同实现的错误/速度：

from __future__ import division
import numpy as np
import scipy.ndimage.filters as ndif
def running_mean_convolve(x, N):
    return np.convolve(x, np.ones(N) / float(N), 'valid')
def running_mean_cumsum(x, N):
    cumsum = np.cumsum(np.insert(x, 0, 0))
    return (cumsum[N:] - cumsum[:-N]) / float(N)
def running_mean_uniform_filter1d(x, N):
    return ndif.uniform_filter1d(x, N, mode='constant', origin=-(N//2))[:-(N-1)]

— 莫伊
source

1

这是似乎唯一考虑到边界问题的答案（相当重要，尤其是在绘制时）。谢谢！

— 加百利

1

我用矩形绘制了轮廓uniform_filter1d，然后按。我的结果：（1.）卷积是最慢的。（2.）积/减约快20-30倍。（3.）uniform_filter1d的速度比“求和/减”快2-3倍。赢家肯定是uniform_filter1d。np.convolvenp.cumsumnp.subtract

— Trevor Boyd Smith

使用uniform_filter1d是比快cumsum溶液（约2-5x）。并且uniform_filter1d 不会像cumsum解决方案那样出现大量的浮点错误。

— Trevor Boyd Smith

15

我知道这是一个古老的问题，但这是不使用任何额外数据结构或库的解决方案。它在输入列表中的元素数量上是线性的，我想不出任何其他方法来提高它的效率（实际上，如果有人知道更好的分配结果的方法，请告诉我）。

注意：使用numpy数组而不是列表会更快，但是我想消除所有依赖关系。通过多线程执行还可以提高性能

该函数假定输入列表是一维的，因此要小心。

### Running mean/Moving average
def running_mean(l, N):
    sum = 0
    result = list( 0 for x in l)

    for i in range( 0, N ):
        sum = sum + l[i]
        result[i] = sum / (i+1)

    for i in range( N, len(l) ):
        sum = sum - l[i-N] + l[i]
        result[i] = sum / N

    return result

例

假设我们有一个清单 data = [ 1, 2, 3, 4, 5, 6 ]，我们要在该上计算周期为3的滚动平均值，并且还需要一个输出列表，该列表的大小与输入值的大小相同（通常是这种情况）。

第一个元素的索引为0，因此应该对索引为-2，-1和0的元素计算滚动平均值。显然，我们没有data [-2]和data [-1]（除非您想使用特殊的边界条件），因此我们假设这些元素为0。这等效于对列表进行零填充，除了我们实际上不填充它外，只需跟踪需要填充的索引（从0到N-1）即可。

因此，对于前N个元素，我们只是将这些元素累加到一个累加器中。

result[0] = (0 + 0 + 1) / 3  = 0.333    ==   (sum + 1) / 3
result[1] = (0 + 1 + 2) / 3  = 1        ==   (sum + 2) / 3
result[2] = (1 + 2 + 3) / 3  = 2        ==   (sum + 3) / 3

从元素N + 1开始，简单的累积不起作用。我们期望，result[3] = (2 + 3 + 4)/3 = 3但这与有所不同(sum + 4)/3 = 3.333。

计算正确值的方法是减去，从而 data[0] = 1得出。sum+4sum + 4 - 1 = 9

发生这种情况的原因是当前sum = data[0] + data[1] + data[2]，但对于每一种情况也是如此，i >= N因为在减法之前sum是data[i-N] + ... + data[i-2] + data[i-1]。

— 关系
source

12

我觉得可以通过瓶颈解决这个问题

请参见下面的基本示例：

import numpy as np
import bottleneck as bn

a = np.random.randint(4, 1000, size=100)
mm = bn.move_mean(a, window=5, min_count=1)

“ mm”是“ a”的移动平均值。
“窗口”是移动平均值要考虑的最大条目数。
“ min_count”是移动平均值（例如，对于前几个元素或数组具有nan值）要考虑的最小条目数。

好的部分是Bottleneck有助于处理nan值，而且效率很高。

— 安东尼·安扬武
source

这个库真的很快。纯Python移动平均函数很慢。Bootleneck是一个PyData库，我认为它很稳定，并且可以从Python社区获得持续的支持，那么为什么不使用它呢？

— GoingMyWay

6

我尚未检查这有多快，但是您可以尝试：

from collections import deque

cache = deque() # keep track of seen values
n = 10          # window size
A = xrange(100) # some dummy iterable
cum_sum = 0     # initialize cumulative sum

for t, val in enumerate(A, 1):
    cache.append(val)
    cum_sum += val
    if t < n:
        avg = cum_sum / float(t)
    else:                           # if window is saturated,
        cum_sum -= cache.popleft()  # subtract oldest value
        avg = cum_sum / float(n)

— 克里斯
source

1

这就是我要做的。谁能批评为什么这是一个糟糕的路呢？

— staggart

1

这个简单的python解决方案对我很有效，不需要numpy。我最终将其滚动到一个类中以供重用。

— Matthew Tschiegg '16

6

该答案包含针对三种不同情况使用Python 标准库的解决方案。

与运行平均 `itertools.accumulate`

这是一种内存有效的Python 3.2+解决方案，可利用来计算可迭代值的运行平均值itertools.accumulate。

>>> from itertools import accumulate
>>> values = range(100)

请注意，它values可以是任何可迭代的，包括生成器或任何其他动态生成值的对象。

首先，延迟构造值的累加和。

>>> cumu_sum = accumulate(value_stream)

接下来，enumerate累加和（从1开始），并构造一个生成器，该生成器产生累加值的分数和当前枚举索引。

>>> rolling_avg = (accu/i for i, accu in enumerate(cumu_sum, 1))

您可以发出means = list(rolling_avg)是否需要一次存储在内存中的所有值或next递增调用的问题。
（当然，你也可以遍历rolling_avg一个for循环，这将调用next隐式）。

>>> next(rolling_avg) # 0/1
>>> 0.0
>>> next(rolling_avg) # (0 + 1)/2
>>> 0.5
>>> next(rolling_avg) # (0 + 1 + 2)/3
>>> 1.0

该解决方案可以编写为如下功能。

from itertools import accumulate

def rolling_avg(iterable):
    cumu_sum = accumulate(iterable)
    yield from (accu/i for i, accu in enumerate(cumu_sum, 1))

一个协同程序来，你可以随时发送值

该协程会消耗您发送给它的值，并保持到目前为止所见值的运行平均值。

当您没有可迭代的值但需要在程序的整个生命周期的不同时间获取要平均的值时，此方法很有用。

def rolling_avg_coro():
    i = 0
    total = 0.0
    avg = None

    while True:
        next_value = yield avg
        i += 1
        total += next_value
        avg = total/i

协程的工作方式如下：

>>> averager = rolling_avg_coro() # instantiate coroutine
>>> next(averager) # get coroutine going (this is called priming)
>>>
>>> averager.send(5) # 5/1
>>> 5.0
>>> averager.send(3) # (5 + 3)/2
>>> 4.0
>>> print('doing something else...')
doing something else...
>>> averager.send(13) # (5 + 3 + 13)/3
>>> 7.0

计算大小滑动窗口上的平均值 `N`

该生成器函数具有可迭代的窗口大小，N 并得出窗口内部当前值的平均值。它使用deque，这是类似于列表的数据结构，但针对两个端点的快速修改（pop，append）进行了优化。

from collections import deque
from itertools import islice

def sliding_avg(iterable, N):        
    it = iter(iterable)
    window = deque(islice(it, N))        
    num_vals = len(window)

    if num_vals < N:
        msg = 'window size {} exceeds total number of values {}'
        raise ValueError(msg.format(N, num_vals))

    N = float(N) # force floating point division if using Python 2
    s = sum(window)
    
    while True:
        yield s/N
        try:
            nxt = next(it)
        except StopIteration:
            break
        s = s - window.popleft() + nxt
        window.append(nxt)

这是起作用的功能：

>>> values = range(100)
>>> N = 5
>>> window_avg = sliding_avg(values, N)
>>> 
>>> next(window_avg) # (0 + 1 + 2 + 3 + 4)/5
>>> 2.0
>>> next(window_avg) # (1 + 2 + 3 + 4 + 5)/5
>>> 3.0
>>> next(window_avg) # (2 + 3 + 4 + 5 + 6)/5
>>> 4.0

— 提格布
source

5

聚会晚了一点，但是我做了一个自己的小函数，它不会缠住两端或填充零的垫子，这些垫子也可以用来寻找平均值。进一步的处理是，它还在线性间隔的点上对信号进行重新采样。随意自定义代码以获取其他功能。

该方法是具有归一化的高斯核的简单矩阵乘法。

def running_mean(y_in, x_in, N_out=101, sigma=1):
    '''
    Returns running mean as a Bell-curve weighted average at evenly spaced
    points. Does NOT wrap signal around, or pad with zeros.

    Arguments:
    y_in -- y values, the values to be smoothed and re-sampled
    x_in -- x values for array

    Keyword arguments:
    N_out -- NoOf elements in resampled array.
    sigma -- 'Width' of Bell-curve in units of param x .
    '''
    N_in = size(y_in)

    # Gaussian kernel
    x_out = np.linspace(np.min(x_in), np.max(x_in), N_out)
    x_in_mesh, x_out_mesh = np.meshgrid(x_in, x_out)
    gauss_kernel = np.exp(-np.square(x_in_mesh - x_out_mesh) / (2 * sigma**2))
    # Normalize kernel, such that the sum is one along axis 1
    normalization = np.tile(np.reshape(sum(gauss_kernel, axis=1), (N_out, 1)), (1, N_in))
    gauss_kernel_normalized = gauss_kernel / normalization
    # Perform running average as a linear operation
    y_out = gauss_kernel_normalized @ y_in

    return y_out, x_out

具有正态分布噪声的正弦信号的简单用法：

— 克劳森
source

这对我不起作用（python 3.6）。 1没有名为功能sum，使用np.sum代替2的@操作（不知道那是什么）抛出一个错误。我可能稍后再调查，但我现在没有时间

— -Bastian

的@是矩阵乘法运算符，它实现np.matmul。检查您的y_in数组是否为numpy数组，这可能是问题所在。

— xyzzyqed

5

我建议不要用numpy或scipy来更快地做到这一点：

df['data'].rolling(3).mean()

这将采用“数据”列的3个周期的移动平均值（MA）。您还可以计算偏移版本，例如，不包含当前单元格的版本（向后偏移一次）可以很容易地计算为：

df['data'].shift(periods=1).rolling(3).mean()

— 古瑟尔·卡拉科（Gursel Karacor）
source

这与2016年提出的解决方案有何不同？

— T先生

2

2016年提出的解决方案使用pandas.rolling_mean矿山使用pandas.DataFrame.rolling。您也可以轻松地计算移动min(), max(), sum()等mean()。

— Gursel Karacor

在前一种情况下，您需要使用诸如pandas.rolling_min, pandas.rolling_maxetc等不同的方法。它们相似但有所不同。

— Gursel Karacor

4

上面的答案之一掩盖了mab的评论，上面有此方法。有一个简单的移动平均线：bottleneckmove_mean

import numpy as np
import bottleneck as bn

a = np.arange(10) + np.random.random(10)

mva = bn.move_mean(a, window=2, min_count=1)

min_count是一个方便的参数，基本上可以将移动平均线带到数组中的该点。如果不设置min_count，它将相等window，直到window点为止的一切都是如此nan。

— 话语
source

3

无需使用Numpy，Panda 即可找到移动平均线的另一种方法

import itertools
sample = [2, 6, 10, 8, 11, 10]
list(itertools.starmap(lambda a,b: b/a, 
               enumerate(itertools.accumulate(sample), 1)))

将打印[2.0、4.0、6.0、6.5、7.4、7.833333333333333]

— 德米特里·谢梅诺夫
source

itertools.accumulate在python 2.7中不存在，但是在python 3.4中存在

— grayaii

3

现在，这个问题甚至比NeXuS上个月撰写该问题时还要古老，但是我喜欢他的代码如何处理边缘情况。但是，由于它是“简单的移动平均线”，因此其结果落后于它们应用的数据。我认为，在比与NumPy的方式更令人满意的方式处理边缘情况valid，same以及full可以通过应用类似的方式，以实现convolution()为基础的方法。

我的贡献使用中央移动平均值将其结果与他们的数据保持一致。当可用的点数太少而无法使用完整大小的窗口时，将根据数组边缘处连续较小的窗口来计算移动平均值。[实际上，是从依次更大的窗口开始的，但这是一个实现细节。]

import numpy as np

def running_mean(l, N):
    # Also works for the(strictly invalid) cases when N is even.
    if (N//2)*2 == N:
        N = N - 1
    front = np.zeros(N//2)
    back = np.zeros(N//2)

    for i in range(1, (N//2)*2, 2):
        front[i//2] = np.convolve(l[:i], np.ones((i,))/i, mode = 'valid')
    for i in range(1, (N//2)*2, 2):
        back[i//2] = np.convolve(l[-i:], np.ones((i,))/i, mode = 'valid')
    return np.concatenate([front, np.convolve(l, np.ones((N,))/N, mode = 'valid'), back[::-1]])

它相对较慢，因为它使用convolve()，并且可能由真正的Pythonista大量使用，但是，我相信这个想法是正确的。

— 玛丽萨诺
source

3

上面有很多关于计算运行平均值的答案。我的答案增加了两个额外功能：

忽略nan值
计算不包括感兴趣值本身的N个相邻值的平均值

第二个特征对于确定哪些值与总体趋势相差一定量特别有用。

我使用numpy.cumsum，因为它是最省时的方法（请参见上面的Alleo回答）。

N=10 # number of points to test on each side of point of interest, best if even
padded_x = np.insert(np.insert( np.insert(x, len(x), np.empty(int(N/2))*np.nan), 0, np.empty(int(N/2))*np.nan ),0,0)
n_nan = np.cumsum(np.isnan(padded_x))
cumsum = np.nancumsum(padded_x) 
window_sum = cumsum[N+1:] - cumsum[:-(N+1)] - x # subtract value of interest from sum of all values within window
window_n_nan = n_nan[N+1:] - n_nan[:-(N+1)] - np.isnan(x)
window_n_values = (N - window_n_nan)
movavg = (window_sum) / (window_n_values)

此代码仅适用于Ns。可以通过更改padded_x和n_nan的np.insert来调整奇数。

输出示例（黑色为原始，蓝色为movavg）：

可以轻松地修改此代码，以删除从少于cutoff = 3个非nan值计算出的所有移动平均值。

window_n_values = (N - window_n_nan).astype(float) # dtype must be float to set some values to nan
cutoff = 3
window_n_values[window_n_values<cutoff] = np.nan
movavg = (window_sum) / (window_n_values)

— gtcoder
source

2

仅使用Python标准库（高效存储）

仅给出使用标准库的另一个版本deque。令我惊讶的是，大多数答案都使用pandas或numpy。

def moving_average(iterable, n=3):
    d = deque(maxlen=n)
    for i in iterable:
        d.append(i)
        if len(d) == n:
            yield sum(d)/n

r = moving_average([40, 30, 50, 46, 39, 44])
assert list(r) == [40.0, 42.0, 45.0, 43.0]

其实我在python文档中找到了另一个实现

def moving_average(iterable, n=3):
    # moving_average([40, 30, 50, 46, 39, 44]) --> 40.0 42.0 45.0 43.0
    # http://en.wikipedia.org/wiki/Moving_average
    it = iter(iterable)
    d = deque(itertools.islice(it, n-1))
    d.appendleft(0)
    s = sum(d)
    for elem in it:
        s += elem - d.popleft()
        d.append(elem)
        yield s / n

但是，对我来说，实现似乎比应该的要复杂一些。但这必须在标准python文档中，这是有原因的，有人可以评论我的实现和标准doc吗？

— 数学家
source

2

您每次迭代都会对窗口成员求和，这是一个很大的不同，它们有效地更新了总和（删除一个成员并添加另一个成员）。就复杂性而言，您正在执行 O(n*d) 计算（d即窗口的n大小，可迭代的大小），并且正在执行O(n)

— Iftah

@Iftah，很好，谢谢您的解释，您是对的。

— MaThMaX

2

使用@Aikude的变量，我编写了单行代码。

import numpy as np

mylist = [1, 2, 3, 4, 5, 6, 7]
N = 3

mean = [np.mean(mylist[x:x+N]) for x in range(len(mylist)-N+1)]
print(mean)

>>> [2.0, 3.0, 4.0, 5.0, 6.0]

— 格林泰克
source

1

尽管这里有针对此问题的解决方案，但请查看我的解决方案。它非常简单并且运行良好。

import numpy as np
dataset = np.asarray([1, 2, 3, 4, 5, 6, 7])
ma = list()
window = 3
for t in range(0, len(dataset)):
    if t+window <= len(dataset):
        indices = range(t, t+window)
        ma.append(np.average(np.take(dataset, indices)))
else:
    ma = np.asarray(ma)

— 艾伯克（Ayberk Yavuz）
source

1

通过阅读其他答案，我认为这不是问题所要解决的问题，但是我到达这里的目的是保持一个不断增长的值列表的运行平均值。

因此，如果要保留从某处（站点，测量设备等）获取的值的列表以及最新值的平均值n，可以使用下面的代码，以最大程度地减少添加新值的工作量。元素：

class Running_Average(object):
    def __init__(self, buffer_size=10):
        """
        Create a new Running_Average object.

        This object allows the efficient calculation of the average of the last
        `buffer_size` numbers added to it.

        Examples
        --------
        >>> a = Running_Average(2)
        >>> a.add(1)
        >>> a.get()
        1.0
        >>> a.add(1)  # there are two 1 in buffer
        >>> a.get()
        1.0
        >>> a.add(2)  # there's a 1 and a 2 in the buffer
        >>> a.get()
        1.5
        >>> a.add(2)
        >>> a.get()  # now there's only two 2 in the buffer
        2.0
        """
        self._buffer_size = int(buffer_size)  # make sure it's an int
        self.reset()

    def add(self, new):
        """
        Add a new number to the buffer, or replaces the oldest one there.
        """
        new = float(new)  # make sure it's a float
        n = len(self._buffer)
        if n < self.buffer_size:  # still have to had numbers to the buffer.
            self._buffer.append(new)
            if self._average != self._average:  # ~ if isNaN().
                self._average = new  # no previous numbers, so it's new.
            else:
                self._average *= n  # so it's only the sum of numbers.
                self._average += new  # add new number.
                self._average /= (n+1)  # divide by new number of numbers.
        else:  # buffer full, replace oldest value.
            old = self._buffer[self._index]  # the previous oldest number.
            self._buffer[self._index] = new  # replace with new one.
            self._index += 1  # update the index and make sure it's...
            self._index %= self.buffer_size  # ... smaller than buffer_size.
            self._average -= old/self.buffer_size  # remove old one...
            self._average += new/self.buffer_size  # ...and add new one...
            # ... weighted by the number of elements.

    def __call__(self):
        """
        Return the moving average value, for the lazy ones who don't want
        to write .get .
        """
        return self._average

    def get(self):
        """
        Return the moving average value.
        """
        return self()

    def reset(self):
        """
        Reset the moving average.

        If for some reason you don't want to just create a new one.
        """
        self._buffer = []  # could use np.empty(self.buffer_size)...
        self._index = 0  # and use this to keep track of how many numbers.
        self._average = float('nan')  # could use np.NaN .

    def get_buffer_size(self):
        """
        Return current buffer_size.
        """
        return self._buffer_size

    def set_buffer_size(self, buffer_size):
        """
        >>> a = Running_Average(10)
        >>> for i in range(15):
        ...     a.add(i)
        ...
        >>> a()
        9.5
        >>> a._buffer  # should not access this!!
        [10.0, 11.0, 12.0, 13.0, 14.0, 5.0, 6.0, 7.0, 8.0, 9.0]

        Decreasing buffer size:
        >>> a.buffer_size = 6
        >>> a._buffer  # should not access this!!
        [9.0, 10.0, 11.0, 12.0, 13.0, 14.0]
        >>> a.buffer_size = 2
        >>> a._buffer
        [13.0, 14.0]

        Increasing buffer size:
        >>> a.buffer_size = 5
        Warning: no older data available!
        >>> a._buffer
        [13.0, 14.0]

        Keeping buffer size:
        >>> a = Running_Average(10)
        >>> for i in range(15):
        ...     a.add(i)
        ...
        >>> a()
        9.5
        >>> a._buffer  # should not access this!!
        [10.0, 11.0, 12.0, 13.0, 14.0, 5.0, 6.0, 7.0, 8.0, 9.0]
        >>> a.buffer_size = 10  # reorders buffer!
        >>> a._buffer
        [5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0, 12.0, 13.0, 14.0]
        """
        buffer_size = int(buffer_size)
        # order the buffer so index is zero again:
        new_buffer = self._buffer[self._index:]
        new_buffer.extend(self._buffer[:self._index])
        self._index = 0
        if self._buffer_size < buffer_size:
            print('Warning: no older data available!')  # should use Warnings!
        else:
            diff = self._buffer_size - buffer_size
            print(diff)
            new_buffer = new_buffer[diff:]
        self._buffer_size = buffer_size
        self._buffer = new_buffer

    buffer_size = property(get_buffer_size, set_buffer_size)

您可以使用以下示例进行测试：

def graph_test(N=200):
    import matplotlib.pyplot as plt
    values = list(range(N))
    values_average_calculator = Running_Average(N/2)
    values_averages = []
    for value in values:
        values_average_calculator.add(value)
        values_averages.append(values_average_calculator())
    fig, ax = plt.subplots(1, 1)
    ax.plot(values, label='values')
    ax.plot(values_averages, label='averages')
    ax.grid()
    ax.set_xlim(0, N)
    ax.set_ylim(0, N)
    fig.show()

这使：

— berna1111
source

1

另一个使用标准库和双端队列的解决方案：

from collections import deque
import itertools

def moving_average(iterable, n=3):
    # http://en.wikipedia.org/wiki/Moving_average
    it = iter(iterable) 
    # create an iterable object from input argument
    d = deque(itertools.islice(it, n-1))  
    # create deque object by slicing iterable
    d.appendleft(0)
    s = sum(d)
    for elem in it:
        s += elem - d.popleft()
        d.append(elem)
        yield s / n

# example on how to use it
for i in  moving_average([40, 30, 50, 46, 39, 44]):
    print(i)

# 40.0
# 42.0
# 45.0
# 43.0

— 弗拉德·贝兹登
source

1

出于教育目的，让我添加另外两个Numpy解决方案（比cumsum解决方案要慢）：

import numpy as np
from numpy.lib.stride_tricks import as_strided

def ra_strides(arr, window):
    ''' Running average using as_strided'''
    n = arr.shape[0] - window + 1
    arr_strided = as_strided(arr, shape=[n, window], strides=2*arr.strides)
    return arr_strided.mean(axis=1)

def ra_add(arr, window):
    ''' Running average using add.reduceat'''
    n = arr.shape[0] - window + 1
    indices = np.array([0, window]*n) + np.repeat(np.arange(n), 2)
    arr = np.append(arr, 0)
    return np.add.reduceat(arr, indices )[::2]/window

使用的函数：as_strided，add.reduceat

— 安德烈亚斯（Andreas K.）
source

1

上述所有解决方案均较差，因为它们缺乏

由于使用本地python而不是numpy向量化实现，因此速度更快，
由于numpy.cumsum，或
由于O(len(x) * w)实现为卷积而提高了速度。

给定

import numpy
m = 10000
x = numpy.random.rand(m)
w = 1000

注意x_[:w].sum()等于x[:w-1].sum()。因此，对于所述第一平均的numpy.cumsum(...)增加x[w] / w（通过x_[w+1] / w），并减去0（从x_[0] / w）。这导致x[0:w].mean()

通过cumsum，您将通过加法x[w+1] / w和减法更新第二个平均值x[0] / w，从而得出x[1:w+1].mean()。

这一直持续到x[-w:].mean()达到为止。

x_ = numpy.insert(x, 0, 0)
sliding_average = x_[:w].sum() / w + numpy.cumsum(x_[w:] - x_[:-w]) / w

该解决方案是矢量化，O(m)可读性和数值稳定的。

— 赫伯特
source

1

如何移动平均滤波器？它也是单线的，并且具有优点，如果您需要矩形以外的其他东西，即可以轻松地操纵窗口类型。数组的N长简单移动平均值a：

lfilter(np.ones(N)/N, [1], a)[N:]

并应用三角形窗口：

lfilter(np.ones(N)*scipy.signal.triang(N)/N, [1], a)[N:]

注意：我通常会将前N个样本作为伪造品丢弃，因此[N:]最后将其作为伪造品，但是这不是必需的，只是个人选择的问题。

— mac13k
source

-7

如果您确实选择自己滚动而不是使用现有的库，请注意浮点错误并尝试最小化其影响：

class SumAccumulator:
    def __init__(self):
        self.values = [0]
        self.count = 0

    def add( self, val ):
        self.values.append( val )
        self.count = self.count + 1
        i = self.count
        while i & 0x01:
            i = i >> 1
            v0 = self.values.pop()
            v1 = self.values.pop()
            self.values.append( v0 + v1 )

    def get_total(self):
        return sum( reversed(self.values) )

    def get_size( self ):
        return self.count

如果您所有的值都是大致相同的数量级，那么这将通过始终添加大致相似的数量级的值来帮助保持精度。

— Mayur Patel
source

15

这是一个非常不清楚的答案，至少在代码中有注释，或者解释为什么这有助于浮点错误。

— 加布2014年

在我的最后一句话中，我试图指出为什么它有助于浮点错误。如果两个值的数量级大致相同，则相加的精度会比将非常大的数字相加的精度低。该代码以这样的方式组合“相邻”值：即使是中间和，也应始终在大小上合理地接近，以最大程度地减少浮点误差。没有什么是万无一失的，但是这种方法在生产中节省了几个实施非常差的项目。

— Mayur Patel 2014年

1.应用于原始问题，这将是非常慢的（计算平均值），所以这是无关紧要的。2.遭受64位数字精度的问题，一个人必须总结>> >> 2 ^ 30相等的数字。

— Alleo

@Alleo：您将不做每个值一个加法，而是做两个。证明与位翻转问题相同。但是，此答案的重点不一定是性能，而是精度。平均64位值的内存使用量不会超过缓存中的64个元素，因此它在内存使用方面也很友好。

— Mayur Patel 2014年

是的，这是对的，这比简单的求和操作要多出两倍的操作，这是正确的，但是最初的问题是计算运行均值，而不仅仅是求和。可以在O（n）中完成，但是您的答案需要O（mn），其中m是窗口的大小。

— Alleo 2014年

均线或均线

说明

边缘

高效的解决方案

警告：尽管累积速度更快，但浮点错误会增加，这可能会导致结果无效/不正确/不可接受

与运行平均 itertools.accumulate

一个协同程序来，你可以随时发送值

计算大小滑动窗口上的平均值 N

与运行平均 `itertools.accumulate`

计算大小滑动窗口上的平均值 `N`