使用beautifulsoup4提取标题标签元素

aenish 发表于 Dev

艾尼什

想要提取标题中提到的评论评级弹出评级百分比。这里给出了 html：

    a class="a-link-normal" href="http://www.amazon.in/product-reviews/B01FM7GGFI/ref=cm_cr_dp_hist_one/261-4285111-5015802?ie=UTF8&amp;filterByStar=one_star&amp;reviewerType=all_reviews&amp;showViewpoints=0" title="11% of reviews have 1 stars">1 star</a>

beautifulsoup python 脚本：

     from bs4 import BeautifulSoup
     import requests
     url = "http://www.amazon.in/Samsung-G-550FY-On5-Pro-Gold/dp/B01FM7GGFI/ref=lp_4363159031_1_1/261-4285111-5015802?s=electronics&ie=UTF8&qid=1503582445&sr=1-1"

    headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/54.0.2840.71 Safari/537.36'}
    r = requests.get(url, headers=headers)
     soup = BeautifulSoup(r.content, "lxml")

    for link in soup.find_all("div", attrs={"class": "a-fixed-left-grid-col a-col-left"}):
      for link1 in link.find_all("a", attrs={"class": "a-link-normal"}):
         print(link1)

德米特里·菲亚尔科夫斯基

html = '<a class="a-link-normal" href="http://www.amazon.in/product-reviews/B01FM7GGFI/ref=cm_cr_dp_hist_one/261-4285111-5015802?ie=UTF8&amp;filterByStar=one_star&amp;reviewerType=all_reviews&amp;showViewpoints=0" title="11% of reviews have 1 stars">1 star</a>'
soup = BeautifulSoup(html, 'lxml')

a_tags = soup.find_all('a', class_='a-link-normal')
for a in a_tags:
    if 'title' in a.attrs:
        print(a['title'])

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2021-06-14

我来说两句

0 条评论

登录后参与评论

上一篇：从图像的右中心使用 jquery 的图像幻灯片

使用 BeautifulSoup4 提取 XML 标签中的属性

从<script>标签BeautifulSoup4中提取令牌，请求

使用beautifulsoup4提取标题标签元素

使用beautifulsoup4提取标题标签元素

Android Studio Kotlin：提取为常量

IE 11中的FormData未定义

计算数据帧R中的字符串频率

如何在R中转置数据

如何使用Redux-Toolkit重置Redux Store

Excel 2016图表将增长与4个参数进行比较

在 Python 2.7 中。如何从文件中读取特定文本并分配给变量

未捕获的SyntaxError：带有Ajax帖子的意外令牌u

OpenCv：改变 putText() 的位置

ActiveModelSerializer仅显示关联的ID

算术中的c ++常量类型转换

如何开始为Ubuntu开发

将加号/减号添加到jQuery菜单

去噪自动编码器和常规自动编码器有什么区别？

获取并汇总所有关联的数据

OpenGL纹理格式的颜色错误

在 React Native Expo 中使用 react-redux 更改另一个键的值

http：// localhost：3000 /＃！/为什么我在localhost链接中得到“＃！/”。

TreeMap中的自定义排序

Redux动作正常，但减速器无效

如何对treeView的子节点进行排序