爬虫时网页显示的正常中文，源代码却乱了

初学python不久，在用selenium爬实习僧的网站时（搜索奔驰），再用开发者工具看源代码，是这样的
图片描述

源网页是正常的：

读取这个网页的代码是这样的（小部分）：

def get_products():

"""
提取商品数据
"""
#page_source属于str格式
html = browser.page_source
doc = pq(html)
items = doc('.position .position-list li.font').items()
for item in items:
    product = {
        'name': item.find('.name').text(),
        'release_time': item.find('.release-time').text(),
        'company': item.find('.company').text(),
        'area': item.find('.area').text(),
        'info': item.find('.more').text(),
    }
    print(product)

然后在spyder（用的anaconda3）的控制台中的输出是这样的，其中上面截图对应的是‘info’的信息

{'name': 'ue222uee04uf627 uf627uee14uee14uebe3ue321ue817 实习ue194', 'release_time': '2天前', 'company': '戴姆勒奔驰', 'area': '北京', 'info': 'ue83buf591uf591-ue83buf825uf591/天|uf825天/周|uecb6个月'}

之后写入txt文件，用了utf8编码，发现还是一个样子。
代码：

def save_to_text(product):

file = word + '.txt'
with open(file, 'a' , encoding='utf-8') as k:
    for key, value in product.items():
        k.write(key + ':' + value + '\n')

打开文件：
name:  实习
release_time:2天前
company:戴姆勒奔驰
area:北京
info:-/天|天/周|个月

所以到底还是编码的问题么？

阅读 3.1k

爬虫时网页显示的正常中文，源代码却乱了

你尚未登录，登录后可以

字节的 trae AI IDE 不支持类似 vscode 的 ssh remote 远程开发怎么办？

DataCap 中验证码无法显示，后台出现 NullPointerException 错误?

发现深拷贝和浅拷贝效果一致：请问一下有什么区别呢？

如何实现一个深拷贝函数？

Python 成员变量在多个子类实例间共享，如何避免？

为什么 Qwen2.5-Omni-7B 官方教程都报错 Cannot import available module of Qwen2_5OmniModel in modelscope ？

Spark-TTS-0.5B 的 requirements.txt 在哪里？