Python3爬虫数据写入csv

爬虫文学网站,获取了章节数, 点击量, 章节字数, 想写入一个csv文件,可是爬出来的内容和我设的colum 对应不上, 章节字数出现在章节数和点击量前面。我想让它出现在wordcount这个colum下面。
代码如下:

import requests
import re
import json
import csv
from bs4 import BeautifulSoup as bs
start_url = "http://www.jjwxc.net/onebook.php?novelid=3601"
res = requests.get(start_url)
res.encoding = "gb2312"
soup = bs(res.text, "html.parser")

headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 \
    (KHTML, like Gecko) Chrome/67.0.3396.62 Safari/537.36"}

url = "http://s8-static.jjwxc.net/getnovelclick.php?novelid=3601&jsonpcall\
back=novelclick"
web_data = requests.get(url, headers=headers)
web_data.encoding = "gzip"
result = web_data.content.decode()
string = re.findall(r'({.*?})', result)[0]
tmp_dict = json.loads(string)
wordcount = soup.find_all("td", {"itemprop": "wordCount"})
with open("JJWXC Scraping.csv","w") as csvfile:
    writer = csv.writer(csvfile)
    writer.writerow(['Chapters','Views', 'Wordcount'])
    for w in wordcount:
        writer.writerow(w)
    for k in tmp_dict.items():
        writer.writerow(k)

打印出来在csv file 里面显示这样:
Chapters,Views,Wordcount
500
2592
819
1720
2862
4862
1988
1559
1069
2570
1812
2441
3549
2841
6222
1485
5002
2330
1795
3620
3969
5120
4943
4892
4818
6707
5014
6140
2553
4587
1,82799
2,73460
3,52374
4,49213
5,46872
6,43722
7,36363
8,36089
9,35938
10,35594
11,32933
12,34381
13,33675
14,31390
15,33825
16,32669
17,30706
18,32187
19,29489
20,31241
21,30233
22,28571
23,30078
24,28894
25,29471
26,29500
27,29411
28,29703
29,31449
30,53456

wordcount出现在所有文章数和点击量前面了。

阅读 3.4k
1 个回答

csv.writer.writerow() 一次写入一行,你只需要把 wordcount 、文章数、点击量 一同写入便可。
像这样

writer.writerow([1, 30, 1000])
撰写回答
你尚未登录,登录后可以
  • 和开发者交流问题的细节
  • 关注并接收问题和回答的更新提醒
  • 参与内容的编辑和改进,让解决方法与时俱进
推荐问题