I'm trying to read a CSV file with numpy, using the following code:
from numpy import genfromtxt
data = genfromtxt(open('errerr.csv', "r"), names=True, delimiter=',')
which gives the following output:
(nan, nan, nan, nan, nan, nan, nan),
(nan, nan, nan, nan, nan, nan, nan),
(nan, nan, nan, nan, nan, nan, nan)],
dtype=[('name', '<f8'), ('severity', '<f8'), ('Message', '<f8'), ('AppDomainName', '<f8'), ('ProcessName', '<f8'), ('clientid', '<f8'), ('type', '<f8')])
The dtype looks fine to me.
To prove I'm not going crazy, I tried this code:
import csv
f = open('errors.csv', 'rt')
reader = csv.reader(f)
data = []
for r in reader:
    data.append(r)
f.close()
That works fine, but I'd like to understand what is going on with genfromtxt.
Here is a sample from the CSV:
name,severity,Message,AppDomainName,ProcessName,clientid,type
Strings strings,Error,") Thread Name: Extended Properties:",SunDSrvc.exe,C:\Program Files\\SunDSrvc.exe,5DAA9377 ,Client
Strings strings,Error,") Thread Name: Extended Properties:",SunDSrvc.exe,C:\Program Files\\SunDSrvc.exe,5DAA9377 ,Client
Strings strings,Error,") Thread Name: Extended Properties:",SunDSrvc.exe,C:\Program Files\\SunDSrvc.exe,5DAA9377 ,Client
Originally posted by Marcom; translation licensed under CC BY-SA 4.0.
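Your dtype is not fine. It specifies '<f8', a float, for every field, so genfromtxt tries to parse each text value as a number and stores nan when it cannot. You want strings. Try dtype=None, which lets genfromtxt infer each column's type from the data.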
(I have removed the extraneous material about delimiters inside the quotes.)
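To make the difference concrete, here is a minimal sketch that feeds two rows of the sample above to genfromtxt, once with the default dtype and once with dtype=None. It reads from an in-memory io.StringIO instead of the file on disk, and passing encoding="utf-8" is an assumption made so that text columns come back as str rather than bytes on older NumPy versions; neither detail comes from the original post.

```python
import io
import numpy as np

# Header plus two of the sample rows, kept in memory instead of on disk.
# Raw strings keep the backslashes in the Windows path untouched.
header = "name,severity,Message,AppDomainName,ProcessName,clientid,type"
row = (r'Strings strings,Error,") Thread Name: Extended Properties:",'
       r'SunDSrvc.exe,C:\Program Files\\SunDSrvc.exe,5DAA9377 ,Client')
text = "\n".join([header, row, row]) + "\n"

# Default dtype is float: every text field fails to convert and becomes nan.
as_float = np.genfromtxt(io.StringIO(text), names=True, delimiter=",")

# dtype=None asks genfromtxt to infer a type per column from the data.
inferred = np.genfromtxt(io.StringIO(text), names=True, delimiter=",",
                         dtype=None, encoding="utf-8")

print(as_float.dtype)        # all fields are floats
print(inferred.dtype)        # all fields are strings for this sample
print(inferred["severity"])  # access a column by its header name
```

Note that genfromtxt does not interpret CSV quoting, so the quotation marks around the Message field stay part of the value, and a comma inside a quoted field would still be treated as a delimiter. For files like that, the csv module used in the question is the safer choice.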