Question

0 0

一个Python爬虫字符串相关的错误？

用Python爬豆瓣日记，将页面以html格式保存。其中至少20篇日记保存正常，某篇报错。错误信息如下。
是字符编码方式的错误吗，应该如何解决？
该日记名可在终端正常显示。


 file_object = open(dir + '/' + article + '.html', 'w')
IOError: [Errno 2] No such file or directory: 'dir/Even artichokes have doubts - \xe4\xb8\x80\xe4\xb8\xaaYALE\xe5\xa5\xb3\xe7\x94\x9f\xe5\xaf\xb9\xe4\xba\x8eIB/Consulting\xe7\x9a\x84\xe8\xb4\xa8\xe7\x96\x91.html'

python 网络爬虫

11 years, 7 months ago

Danono

share

Danono 11 years, 7 months ago

Answer 1

0

因为日志标题包含 IB/Consulting 字符串，也就是说有斜杠。而 HFS+ 的文件名是不允许有 / 的。

answered 11 years, 7 months ago

开飞机的猛男

share

开飞机的猛男 answered 11 years, 7 months ago

一个Python爬虫字符串相关的错误？

Danono

Answers

开飞机的猛男

Your Answer