As the title says: from the page below I can get the product name, but the price comes back empty. I have tried several approaches.
import requests
from lxml import etree

url = "https://item.jd.com/5056201.html"
page = requests.get(url)
selector = etree.HTML(page.text)

# Product name: this XPath returns a list of text nodes
ProductNameP = '/html/body/div[5]/div/div[2]/div[1]/text()'
Name = selector.xpath(ProductNameP)
print(Name)

# Product price: the same approach returns an empty result
ProductPriceP = '/html/body/div[5]/div/div[2]/div[3]/div/div[1]/div[2]/text()'
Price = selector.xpath(ProductPriceP)
print(Price)
The price is rendered by JS, so it is not in the HTML that requests downloads.
Scrape this link instead.
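
For illustration, here is a minimal sketch of the idea in the reply above: instead of parsing the page HTML, request the address that the page's JS calls for the price and read the value from the response. The link from the reply is not reproduced here, so `PRICE_URL` is a hypothetical placeholder (look up the real request in the browser's Network tab), and the JSON shape `[{"id": ..., "p": ...}]` is likewise an assumption, not a confirmed format of JD's response.

```python
import json
import urllib.request

# Hypothetical placeholder, NOT a real endpoint: replace it with the
# request URL that the browser's Network tab shows for the price.
PRICE_URL = "https://example.com/price?sku=5056201"


def fetch_text(url, timeout=10):
    """Download a URL and return the response body as text."""
    req = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def parse_price(payload):
    """Read the price from a JSON payload of the ASSUMED shape
    [{"id": "...", "p": "..."}]. Returns None if the field is absent."""
    data = json.loads(payload)
    if isinstance(data, list) and data and isinstance(data[0], dict):
        return data[0].get("p")
    return None


if __name__ == "__main__":
    # Offline demo on a made-up payload; with a real URL one would call
    # parse_price(fetch_text(PRICE_URL)) instead.
    sample = '[{"id": "5056201", "p": "99.00"}]'
    print(parse_price(sample))
```

If the real response turns out to have a different structure, only `parse_price` needs to change; the XPath code for the product name can stay as it is.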