scrapy能爬取https的网页么


想获得 https://www.facebook.com/feeds/page.php?format=json&id=1433215316907826 中的alternate,但是爬虫返回404的网页错误

python scrapy

金肛互撸娃 10 years, 10 months ago

到最后发现是proxy的问题。线上测试就没问题

古手·羽 入 answered 10 years, 10 months ago

用Scrapy对https是不行的,你可以用Facebook的SDK

https://github.com/pythonforfacebook/facebook-sdk

荒耶丿宗蓮 answered 10 years, 10 months ago

这有个用scrapy爬取https的解决方法: http://stackoverflow.com/questions/23958073/scrapy-not-scraping-https

kuuuuuu answered 10 years, 10 months ago

Your Answer