Using a regular expression, I get:

import re
string = r'http://www.example.com/abc.html'
result = re.search(r'^.*com', string).group()

In pandas, I wrote:

import pandas as pd
df = pd.DataFrame(columns=['index', 'url'])
df.loc[len(df), :] = [1, 'http://www.example.com/abc.html']
df.loc[len(df), :] = [2, 'http://www.hello.com/def.html']
df.str.extract('^.*com')

ValueError: pattern contains no capture groups

How can I fix this?

Thanks.

Originally posted by Chan; translation licensed under CC BY-SA 4.0.

python pandas

2 answers

Community wiki

Posted on 2023-01-08

✓ Accepted

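According to the documentation, str.extract needs a _capture group_ (that is, a parenthesized part of the pattern) to know what to extract.

Series.str.extract(pat, flags=0, expand=True)

For each subject string in the Series, extract groups from the first match of the regular expression pat.

Each capture group becomes its own column in the output.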

df.url.str.extract(r'(.*\.com)')

                        0
0  http://www.example.com
1    http://www.hello.com

# If you need named capture groups, the group name becomes the column name
df.url.str.extract(r'(?P<URL>.*\.com)')

                      URL
0  http://www.example.com
1    http://www.hello.com
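
To illustrate the "one column per capture group" rule, here is a minimal sketch with two named groups. The group names `scheme` and `host` and the pattern itself are assumptions chosen for illustration; they are not part of the original answer.

```python
import pandas as pd

# Same sample URLs as in the question, held in a Series named "url"
urls = pd.Series(
    ["http://www.example.com/abc.html", "http://www.hello.com/def.html"],
    name="url",
)

# Two named capture groups -> two columns named after the groups.
# "scheme" and "host" are illustrative names, not from the original post.
parts = urls.str.extract(r"^(?P<scheme>\w+)://(?P<host>[^/]+)")
print(parts)
```

Each named group labels its own column, so the result can be joined back onto the original DataFrame without renaming.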

Or, if you need a Series,

df.url.str.extract(r'(.*\.com)', expand=False)

0    http://www.example.com
1      http://www.hello.com
Name: url, dtype: object
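
The effect of `expand` can be checked directly. This is a small sketch assuming a recent pandas version where `expand=True` is the default; the variable names are made up for the example.

```python
import pandas as pd

urls = pd.Series(
    ["http://www.example.com/abc.html", "http://www.hello.com/def.html"],
    name="url",
)

# Default (expand=True): a DataFrame, even with a single capture group
as_frame = urls.str.extract(r"(.*\.com)")

# expand=False with a single capture group: a Series
as_series = urls.str.extract(r"(.*\.com)", expand=False)

print(type(as_frame).__name__)
print(type(as_series).__name__)
```

A Series is usually the more convenient shape when the result is assigned to a single new column.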

Originally posted by cs95; translation licensed under CC BY-SA 4.0.

Community wiki

Posted on 2023-01-08

You need to select the url column and wrap the pattern in () to form a capture group:

df['new'] = df['url'].str.extract(r'(^.*com)')
print(df)
  index                              url                     new
0     1  http://www.example.com/abc.html  http://www.example.com
1     2    http://www.hello.com/def.html    http://www.hello.com
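
For completeness, a self-contained sketch that reproduces the error on the url column and then applies the fix. The DataFrame is built from a dict here for brevity, and `error_message` is a name introduced only for this example.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "index": [1, 2],
        "url": [
            "http://www.example.com/abc.html",
            "http://www.hello.com/def.html",
        ],
    }
)

# Without parentheses the pattern has no capture group, so extract() raises
error_message = None
try:
    df["url"].str.extract(r"^.*com")
except ValueError as exc:
    error_message = str(exc)
    print(error_message)

# With a capture group the call succeeds; expand=False returns a Series
df["new"] = df["url"].str.extract(r"(^.*com)", expand=False)
print(df)
```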

Originally posted by jezrael; translation licensed under CC BY-SA 4.0.
