【自学python】关于爬虫int问题出错

代码如下：

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("http://ent.sina.com.cn")


page = BeautifulSoup(html)



def get_next_target(page):
    start_link = page.find('<a href=')
    if start_link == -1:
        url, end_quote = None, 0
        return url, end_quote

    else:
        start_quote = page.find('"', start_link)
        end_quote = page.find('"', start_quote + 1)
        url = page[start_quote + 1:end_quote]

    return url, end_quote



def print_all_links(page):
    while True:
        url, endpos = get_next_target(page)
        if url:
            print( url)
            page = page[endpos:]
        else:
            break

print_all_links(get_next_target(page))

执行结果如下：

复制下来就是：

python@ubuntu:~$ python3 aa.py 
/usr/lib/python3/dist-packages/bs4/__init__.py:166: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("lxml"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

To get rid of this warning, change this:

 BeautifulSoup([your markup])

to this:

 BeautifulSoup([your markup], "lxml")

  markup_type=markup_type))
Traceback (most recent call last):
  File "aa.py", line 35, in <module>
    print_all_links(get_next_target(page))
  File "aa.py", line 19, in get_next_target
    end_quote = page.find('"', start_quote + 1)
TypeError: unsupported operand type(s) for +: 'NoneType' and 'int'

这个是我自制的一个爬虫函数，然后出错了。这个到底错在哪里呢？看不懂

阅读 2.8k

【自学python】关于爬虫int问题出错

你尚未登录，登录后可以

字节的 trae AI IDE 不支持类似 vscode 的 ssh remote 远程开发怎么办？

DataCap 中验证码无法显示，后台出现 NullPointerException 错误?

发现深拷贝和浅拷贝效果一致：请问一下有什么区别呢？

如何实现一个深拷贝函数？

Python 成员变量在多个子类实例间共享，如何避免？

为什么 Qwen2.5-Omni-7B 官方教程都报错 Cannot import available module of Qwen2_5OmniModel in modelscope ？

Spark-TTS-0.5B 的 requirements.txt 在哪里？

【自学python】 关于爬虫int问题出错

你尚未登录，登录后可以

字节的 trae AI IDE 不支持类似 vscode 的 ssh remote 远程开发怎么办？

DataCap 中验证码无法显示，后台出现 NullPointerException 错误?

发现深拷贝和浅拷贝效果一致：请问一下有什么区别呢？

如何实现一个深拷贝函数？

Python 成员变量在多个子类实例间共享，如何避免？

为什么 Qwen2.5-Omni-7B 官方教程都报错 Cannot import available module of Qwen2_5OmniModel in modelscope ？

Spark-TTS-0.5B 的 requirements.txt 在哪里？

【自学python】关于爬虫int问题出错