python爬虫编码问题

跟着教程写了个爬虫，结果爬到的中文都是乱码的，应该怎么解决

python代码

from __future__ import unicode_literals 
#-*-coding:utf-8-*-
import requests
from bs4 import BeautifulSoup
res = requests.get('http://news.sina.com.cn/china/')
res.encoding='utf-8'
soup=BeautifulSoup(res.text,'html.parser')
for news in soup.select('.news-item'):
    if len(news.select('h2'))>0:
        h2=news.select('h2')[0].text
        a=news.select('a')[0]['href']
        print(h2,a)

爬取结果：

(u'u539fu56fdu52a1u9662u5b98u5458uff1au804cu5de5u65e9u9000u4f11u53bbu8df3u5e7fu573au821eu662fu6d6au8d3
9', u'http://news.sina.com.cn/c/sd/...')
(u'u4e2du56fdu8239u5458u88abu7d22u9a6cu91ccu6d77u76d7u52abu63011671u5929 u79f0u4e0du518du51fau6d77', u'h
ttp://news.sina.com.cn/o/2016-10-29/doc-ifxxfysn8035051.shtml')
(u'u6cb3u5317u6cb3u5357u4f1au8baeu8d39u65b0u89c4uff1au4e00u7c7bu4f1au8baeu6bcfu4ebau6bcfu5929600u5143',
u'http://news.sina.com.cn/c/201...')
(u'u4e2du7eaau59d4u53cdu8150u7247u66ddu514977u540du5b98u5458 u6709u526fu56fdu7ea7u6709u6751u4e3bu4efb',
u'http://news.sina.com.cn/c/sd/...')

阅读 6k

from __future__ import unicode_literals #-*-coding:utf-8-*- import requests from bs4 import BeautifulSoup res = requests.get('http://news.sina.com.cn/china/') res.encoding='utf-8' soup=BeautifulSoup(res.text,'html.parser') for news in soup.select('.news-item'): if len(news.select('h2'))>0: h2=news.select('h2')[0].text a=news.select('a')[0]['href'] test = str((h2, a)) print(test.decode("unicode-escape"))

python爬虫编码问题

你尚未登录，登录后可以

Qt中布局是否只有5种呢？

字节的 trae AI IDE 不支持类似 vscode 的 ssh remote 远程开发怎么办？

这段代码为什么不能获取到数据？

请问一下，如何理解reduce函数呢？

如何使用Python+Selenium爬取Goodreads上万条书评而不崩溃？

如何使用 python 代码实现迅雷磁力链接资源的下载？

在PyCharm开发不同python项目，如果每个项目使用自己的venv环境，是不是每次切换项目都需要修改python interpreter？