Find and replace string values in a list

153

I have this list:

words = ['how', 'much', 'is[br]', 'the', 'fish[br]', 'no', 'really']

I would like to replace [br] with some fantastic value similar to <br /> and thus get a new list:

words = ['how', 'much', 'is<br />', 'the', 'fish<br />', 'no', 'really']

python string list

— Eric Herlitz

274

words = [w.replace('[br]', '<br />') for w in words]

This is called a list comprehension.

— sberry

5

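Comparing this list comprehension with the map approach (posted by @Anthony Kong), the list comprehension was roughly 2x faster. It also allows several replacements to be chained in the same call, for example: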

resname = [name.replace('DA', 'ADE').replace('DC', 'CYT').replace('DG', 'GUA').replace('DT', 'THY') for name in ncp.resname()]

— Steven C. Howell, 2015
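
Chained `.replace()` calls run one after another, so a later replacement can also match text produced by an earlier one. A minimal sketch of a single-pass alternative, using `re.sub` with a function as the replacement: the mapping is taken from the comment above, while the sample `names` list and the helper `replace_all` are invented for illustration (the original `ncp.resname()` is not available here).

```python
import re

# Mapping from the comment above; the sample names below are hypothetical.
replacements = {'DA': 'ADE', 'DC': 'CYT', 'DG': 'GUA', 'DT': 'THY'}

# Try longer keys first so a longer key wins over a shorter one sharing its prefix.
pattern = re.compile('|'.join(
    re.escape(key) for key in sorted(replacements, key=len, reverse=True)))

def replace_all(text):
    """Replace every key found in text in a single pass over the string."""
    return pattern.sub(lambda match: replacements[match.group(0)], text)

names = ['DA', 'DC', 'DG', 'DT', 'HOH']
resname = [replace_all(name) for name in names]
print(resname)  # ['ADE', 'CYT', 'GUA', 'THY', 'HOH']
```

Whether this is faster than chaining depends on the data; its main benefit is that each part of the input is replaced at most once.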

1

@sberry I have a list ['word STRING', 'word_count BIGINT', 'corpus STRING', 'corpus_date BIGINT'] and I am trying to replace ' with an empty string, but it is not working. How can we replace it using this approach?

— Sandeep Singh

What if one of the items is a float/integer?

— Patriots299 '18
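
On the float/integer question: `str.replace` exists only on strings, so the plain comprehension raises `AttributeError` as soon as it reaches a number. One possible way to handle a mixed list, assuming non-string items should simply pass through unchanged (the helper name `replace_in_strings` is made up for this sketch):

```python
def replace_in_strings(items, old, new):
    """Return a new list; only str items are modified, everything else is kept as-is."""
    return [item.replace(old, new) if isinstance(item, str) else item
            for item in items]

mixed = ['how', 3, 'is[br]', 2.5, 'fish[br]']
result = replace_in_strings(mixed, '[br]', '<br />')
print(result)  # ['how', 3, 'is<br />', 2.5, 'fish<br />']
```

If the numbers should be converted to text instead, `str(item).replace(old, new)` would do that, at the cost of changing the item types.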

32

You can use, for example:

words = [word.replace('[br]','<br />') for word in words]

— houbysoft

2

@macetw Actually, this was the first answer.

— CodeIt

Looking at the timestamps, it seems they both answered at the same time; maybe this one was late by a fraction of a second...

— maksbd19

31

Besides a list comprehension, you can also try map:

>>> list(map(lambda x: str.replace(x, "[br]", "<br/>"), words))  # list() is needed on Python 3, where map is lazy
['how', 'much', 'is<br/>', 'the', 'fish<br/>', 'no', 'really']

— Anthony Kong

15

If you are wondering about the performance of the different approaches, here are some timings:

In [1]: words = [str(i) for i in range(10000)]

In [2]: %timeit replaced = [w.replace('1', '<1>') for w in words]
100 loops, best of 3: 2.98 ms per loop

In [3]: %timeit replaced = map(lambda x: str.replace(x, '1', '<1>'), words)
100 loops, best of 3: 5.09 ms per loop

In [4]: %timeit replaced = map(lambda x: x.replace('1', '<1>'), words)
100 loops, best of 3: 4.39 ms per loop

In [5]: import re

In [6]: r = re.compile('1')

In [7]: %timeit replaced = [r.sub('<1>', w) for w in words]
100 loops, best of 3: 6.15 ms per loop

As you can see, for such a simple pattern the accepted list comprehension is the fastest, but look at the following:

In [8]: %timeit replaced = [w.replace('1', '<1>').replace('324', '<324>').replace('567', '<567>') for w in words]
100 loops, best of 3: 8.25 ms per loop

In [9]: r = re.compile('(1|324|567)')

In [10]: %timeit replaced = [r.sub(r'<\1>', w) for w in words]  # raw string, so \1 is a backreference rather than chr(1)
100 loops, best of 3: 7.87 ms per loop

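This shows that for more complicated substitutions a pre-compiled regular expression (as in 9-10) can be faster. It really depends on your problem and on the shortest part of the regular expression.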

— Jörn Hees
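
A note for anyone re-running these timings on Python 3: there `map` returns a lazy iterator, so timing `replaced = map(...)` alone measures only the creation of the iterator, not the replacements. A rough sketch of a fairer comparison with the standard `timeit` module, wrapping `map` in `list()`; the `candidates` dictionary and the loop count are choices made for this sketch, and the numbers will differ from machine to machine.

```python
import re
import timeit

words = [str(i) for i in range(10000)]
r = re.compile('1')

# Each candidate builds the full result list, so the work being timed is comparable.
candidates = {
    'list comprehension': lambda: [w.replace('1', '<1>') for w in words],
    'map + lambda': lambda: list(map(lambda x: x.replace('1', '<1>'), words)),
    'compiled regex': lambda: [r.sub('<1>', w) for w in words],
}

loops = 20
for label, func in candidates.items():
    seconds = timeit.timeit(func, number=loops)
    print(f'{label}: {seconds / loops * 1000:.2f} ms per loop')
```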

3

An example with a for loop (I prefer the list comprehension).

a, b = '[br]', '<br />'
for i, v in enumerate(words):
    if a in v:
        words[i] = v.replace(a, b)
print(words)
# ['how', 'much', 'is<br />', 'the', 'fish<br />', 'no', 'really']

— 郑威琪