使用 BeautifulSoup 从 wiki 类别中抓取数据

Hamza Khan 发表于 Dev

27

哈姆扎汗

我正在尝试从https://dota2.gamepedia.com/Category:Counters.

我试过下面的代码

from bs4 import BeautifulSoup
import requests

source = requests.get('https://dota2.gamepedia.com/Category:Counters').text

soup = BeautifulSoup(source, 'lxml')
link = soup.find('div', class_="mw-category")

for link in link:
    link = link.text
    print(link)

我只需要列表形式的角色（在 DOTA2 中称为英雄）名称。随意尝试自己的代码并检查输出。

HK1911

您的代码将英雄列表作为 HeroName/Counter 给出，我基本上更正了它。hero_names 是您正在寻找的列表，我相信

from bs4 import BeautifulSoup
import requests

source = requests.get('https://dota2.gamepedia.com/Category:Counters').text

soup = BeautifulSoup(source, 'lxml')
link = soup.find('div', class_="mw-category")

heroes_names = []

for link in link:
    link = link.text
    heroes = link.split("\n")

    for i in range(1,len(heroes)):
        heroname = heroes[i].split("/")[0]

        heroes_names.append(heroname)


for hero_name in heroes_names:
    print(hero_name)

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2021-07-29

我来说两句

0 条评论

登录后参与评论

上一篇：编码中的切换案例 Java 错误

相关文章

使用BeautifulSoup抓取亚马逊

如何使用beautifulsoup在h4中抓取数据？

使用BeautifulSoup抓取网页

使用beautifulsoup从脚本标签中抓取数据

使用BeautifulSoup抓取财务数据

BeautifulSoup抓取数据-指定行？子类别？

在404中使用beautifulsoup结果抓取数据

使用 BeautifulSoup 抓取网页

使用 Python-BeautifulSoup 抓取表格数据

使用 BeautifulSoup 从数据框中抓取数据

使用 BeautifulSoup 从 html 中抓取特定数据

使用 BeautifulSoup 抓取 HTML

使用 BeautifulSoup 从网站抓取数据的问题

使用 beautifulsoup 在 Pandas 数据框中抓取问题/错误

如何使用beautifulsoup从python中的url中抓取数据

使用 Beautiful Soup 和 Python 从 wiki 抓取表格数据

使用 Python Beautifulsoup 抓取表格和数据

使用 BeautifulSoup 抓取数据

使用beautifulSoup在元素中抓取数据

如何使用 BeautifulSoup 抓取特定数据

使用 BeautifulSoup 抓取 Web 数据

使用 BeautifulSoup 抓取 url

使用 BeautifulSoup 抓取问题

使用 BeautifulSoup 在具有多个表格的页面上抓取单个 Wiki 表格

不使用beautifulsoup抓取网站数据

使用 BeautifulSoup 从 Zillow.com 抓取数据

使用 Requests 和 Beautifulsoup 抓取数据

如何使用 Python 和 BeautifulSoup 从 html 表中抓取数据？

使用 BeautifulSoup 和 Selenium 抓取数据

TOP 榜单

文章

热门标签

归档