nested tags and attributes in BeautifulSOUP and OpenStreetMap XML

Anna Ghildina

Please help to write the meaningful code for the task: I need to count for all tags "way" in XML OpenStreet Map file, the quantity of "nd" tag in each, and input the id of the tag 'way', which include the biggest quantity of tags "nd". If there are several ide's then input the first one in alphabetical order. Seems easy, but I cannot understand how to operate. (I only think it will be useful to use vocabulary) This is the code:

from urllib.request import urlopen, urlretrieve

from bs4 import BeautifulSoup


resp = urlopen('https://stepik.org/media/attachments/lesson/245681/map2.osm') # 

xml = resp.read().decode('utf8') # 

soup = BeautifulSoup(xml, 'xml') # делаем суп с помощью lxml

cnt = 0

names ={}

for way in soup.find_all('way'): # go through the nodes

    flag=False

    for nd in way('nd'):

        flag=True

        if nd['k'] == 'id':

            name=nd['v']

    if flag:

        if name not in names:

            names[name]=0

        names[name]+=1

print(sort(names))

Andrej Kesely

You can use max() builtin method to find <way> tag with biggest quantity of <nd>.

For example:

import requests
from bs4 import BeautifulSoup


url = 'https://stepik.org/media/attachments/lesson/245681/map2.osm'
soup = BeautifulSoup(requests.get(url).content, 'html.parser')

num_way = len(soup.select('way'))
w = max(sorted(soup.select('way:has(nd)'), reverse=True, key=lambda tag: int(tag['id'])), key=lambda tag: len(tag.select('nd')))

print('number of <way>:', num_way)
print('id:', w['id'])
print('quantity of <nd>:', len(w.select('nd')))

Prints:

number of <way>: 3181
id: 227140108
quantity of <nd>: 249

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2020-12-17

Comments

0 comments

Extracting attributes that are in XML tags with BeautifulSoup4

XSLT to access and create nested xml and convert tags to attributes

Include newline characters between nested XML Tags (BeautifulSoup)

Beautifulsoup Escaping Nested Tags

Searching nested tags with BeautifulSoup

BeautifulSoup shuffles the attributes of html tags

reading xml tags attributes

Regex for nested XML attributes

Change attributes in nested XML

Beautifulsoup accessing nested HTML tags

BeautifulSoup finding nested tags, children

Python nested html tags with Beautifulsoup

Extracting similar XML attributes with BeautifulSoup

XSLT: XML to XML with nested attributes

How to find tags with only certain attributes - BeautifulSoup

Python BeautifulSoup, iterating through tags and attributes

BeautifulSoup to find unofficial HTML tags/attributes

How to select a few tags with attributes (BeautifulSoup, python)

Using beautifulsoup to get multiple tags and attributes data

How to access attributes of XML tags

Nested XML tags in Python

Reading an xml with nested tags

How to extract xml tags with BeautifulSoup?

Strip tags (with tags inside attributes and nested tags) using javascript

Unmarshaling attributes in nested xml golang

Golang nested, renamed XML attributes

How to remove content in nested tags with BeautifulSoup?

How do I parse nested a tags with BeautifulSoup

Create new tag with nested tags using BeautifulSoup

TOP Ranking

Article

nested tags and attributes in BeautifulSOUP and OpenStreetMap XML

nested tags and attributes in BeautifulSOUP and OpenStreetMap XML

pump.io port in URL

Failed to listen on localhost:8000 (reason: Cannot assign requested address)

How to import an asset in swift using Bundle.main.path() in a react-native native module

Inner Loop design for webscrapping

Can't pre-populate phone number and message body in SMS link on iPhones when SMS app is not running in the background

ggplotly no applicable method for 'plotly_build' applied to an object of class "NULL" if statements

mysql.connector.errors.InterfaceError: 2003: Can't connect to MySQL server on '127.0.0.1:3306' (111 Connection refused)

Removed zsh, but forgot to change shell back to bash, and now Ubuntu crashes (wsl)

Ambiguous use of 'init' with CFStringTransform and Swift 3

Resetting Value of <input type="time"> in Firefox

Execute ./script.sh with a crontab

Converting a class method to a property with a backing field

Spring Boot JPA PostgreSQL Web App - Internal Authentication Error

How to update azerothcore-wotlk docker container

How to set tab order for array of cluster,where cluster elements have different data types in LabVIEW?

Grails with Oracle thick OCI driver authenticate to Oracle with wrong user

How to pass data to the ng2-bs3-modal?

Making Array From Page Elements in jQuery

Retrieve Element Tag Value XML Using Bash

Laravel's ORM sync with timestamps doesn't update timestamps

Do animations stop css changes after animation completion?