response = requests.get('https://36kr.com/newsflashes')
all_list=re.findall(
'"title":"(.*?)","catch_title":"","description":".*?","cover":"","news_url_type":"news_url","news_url":"(.*?)","user_id":"344033181","published_at":"(.*?)",',
response.text, re.S)
print(all_list)
print(len(all_list))
我这个正则为啥不能匹配全部呢?只能匹配16个 ,总共是20个标题
https://36kr.com/newsflashes 这是网址
一般这种还是先拿到json再从里边取数据吧,合理,还能多很多数据也许有用.