TypeError：需要一个类似字节的对象，而在python和CSV中不是'str'

172

TypeError：需要一个类似字节的对象，而不是'str'

在执行以下python代码以将HTML表格数据保存到Csv文件时遇到上述错误。不知道如何获得rideup.pls帮助我。

import csv
import requests
from bs4 import BeautifulSoup

url='http://www.mapsofindia.com/districts-india/'
response=requests.get(url)
html=response.content

soup=BeautifulSoup(html,'html.parser')
table=soup.find('table', attrs={'class':'tableizer-table'})
list_of_rows=[]
for row in table.findAll('tr')[1:]:
    list_of_cells=[]
    for cell in row.findAll('td'):
        list_of_cells.append(cell.text)
    list_of_rows.append(list_of_cells)
outfile=open('./immates.csv','wb')
writer=csv.writer(outfile)
writer.writerow(["SNo", "States", "Dist", "Population"])
writer.writerows(list_of_rows)

在最后一行上方。

— 湿婆
source

的可能重复的类型错误：：蟒3.5的字节状物体是必需的，而不是“STR”

— tripleee

您好-我尝试在MX-Linux上的ATOM上运行此文件-但我得到了回音：“回溯（最近一次通话最近）：文件“ /home/martin/.atom/python/examples/bs_gumtree_pl.py”，行20，在<module> writer.writerows（list_of_rows）UnicodeEncodeError中：'ascii'编解码器无法在位置0编码字符u'\ xa0'：序数不在范围内（128）[在2.015s中完成]继续在这里！？很

— 为零

328

您正在使用Python 2方法而不是Python 3。

更改：

outfile=open('./immates.csv','wb')

至：

outfile=open('./immates.csv','w')

您将获得一个带有以下输出的文件：

SNo,States,Dist,Population
1,Andhra Pradesh,13,49378776
2,Arunachal Pradesh,16,1382611
3,Assam,27,31169272
4,Bihar,38,103804637
5,Chhattisgarh,19,25540196
6,Goa,2,1457723
7,Gujarat,26,60383628
.....

在Python 3中，csv以文本模式获取输入，而在Python 2中，其以二进制模式获取输入。

编辑添加

这是我运行的代码：

url='http://www.mapsofindia.com/districts-india/'
html = urllib.request.urlopen(url).read()
soup = BeautifulSoup(html)
table=soup.find('table', attrs={'class':'tableizer-table'})
list_of_rows=[]
for row in table.findAll('tr')[1:]:
    list_of_cells=[]
    for cell in row.findAll('td'):
        list_of_cells.append(cell.text)
    list_of_rows.append(list_of_cells)
outfile = open('./immates.csv','w')
writer=csv.writer(outfile)
writer.writerow(['SNo', 'States', 'Dist', 'Population'])
writer.writerows(list_of_rows)

— 德斯杜巴
source

19

为了与csv模块一起使用，Python 3 open还应具有newline=''一个参数[ref ]

— Mark Tolonen

1

将“ wb”字符串更改为“ w”对我有用。非常感谢

— Loc Huynh

如果您使用缓冲液，请参阅乙烯基的答案！

— handras

嗨，我-尝试了代码-并获得了以下信息：追溯（最近一次调用是最近一次）：文件</home/martin/.atom/python/examples/bs_gumtree_pl.py”，第20行，在<module> UnicodeEncodeError中：“ASCII”编解码器无法编码的字符的u“\ XA0”在位置0：序不在范围内（128）[成品在1.415s]`我没有胶水什么这里推移

— 零

21

我在Python3中遇到了同样的问题。我的代码正在写入io.BytesIO()。

替换为已io.StringIO()解决。

— 乙烯基
source

我也发生了stringio

— thebeancounter

一个考虑因素是： io.StringIO()内存贪婪，并且对于大文件而言可能会令人头疼。

— Flavio

1

file = open('parsed_data.txt', 'w')
for link in soup.findAll('a', attrs={'href': re.compile("^http")}): print (link)
soup_link = str(link)
print (soup_link)
file.write(soup_link)
file.flush()
file.close()

就我而言，我使用BeautifulSoup用Python 3.x编写了.txt。它有同样的问题。就像@tsduteba所说的那样，将第一行中的'wb'更改为'w'。

— 杨力
source

给出答案时，最好对原因做出一些解释。在这种情况下，此答案与接受的答案有何不同？

— 斯蒂芬·劳赫

@StephenRauch谢谢您的评论。我是新来的，几周前才刚开始学习Python。我将在以后尝试给出更好的答案。

— 杨力

您可以编辑此帖子，并添加更多详细信息。点击帖子下方和左侧的编辑按钮。

— 斯蒂芬·劳奇

@StephenRauch感谢您的提示！

— 杨力

1

只需将wb更改为w

outfile=open('./immates.csv','wb')

至

outfile=open('./immates.csv','w')

— 萨拉斯·阿克
source

0

您正在以二进制模式打开csv文件，应该是 'w'

import csv

# open csv file in write mode with utf-8 encoding
with open('output.csv','w',encoding='utf-8',newline='')as w:
    fieldnames = ["SNo", "States", "Dist", "Population"]
    writer = csv.DictWriter(w, fieldnames=fieldnames)
    # write list of dicts
    writer.writerows(list_of_dicts) #writerow(dict) if write one row at time

— 索汉·达斯（Sohan Das）
source