Python: Parsing an XML file with multiple attributes in one node

Posted by Ize on Dev

Ize

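I'm still new to programming, but I know some Python and am generally familiar with XPath and XML. At the moment I'm working with some XML data that looks like this: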

<foo>
  <bar>
      <unit>
          <structure>
              <token word="Rocky" att1="noun" att2="name">Rocky</token>
              <token word="the" att1="article" att2="">the</token>
              <token word="yellow" att1="adjective" att2="color">yellow</token>
              <token word="dog" att1="noun" att2="animal">dog</token>
          </structure>
      </unit>
  </bar>
</foo>

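What I need to do now is first find one of the attribute values; take

<token word="dog" att1="noun" att2="animal">dog</token>

as an example. Across all the structures in the document, I first want to find every node that has animal as its att2 value, and then put all of that node's siblings into a list. Since each node has several attributes, I'm trying to put each attribute into its own list; that is, to build one list per attribute from all the tokens in any structure where one of the children has animal as its att2 value. For example: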

 listWord = [Rocky, the, yellow, dog]
 listAtt1 = [noun, article, adjective, noun]
 listAtt2 = [name, ,color, animal]

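For now, I just want to know whether this is possible at all. So far I've hit a wall with the attribute structure alone, never mind the empty values.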

Asongtoruin

With the closing token tags included, and assuming your text is saved in test.xml, the following:

import xml.etree.ElementTree

e = xml.etree.ElementTree.parse('test.xml').getroot()

listWord = []
listAtt1 = []
listAtt2 = []

for child in e.iter('token'):
    listWord.append(child.attrib['word'])
    listAtt1.append(child.attrib['att1'])
    listAtt2.append(child.attrib['att2'])

print(listWord)
print(listAtt1)
print(listAtt2)

will return:

['Rocky', 'the', 'yellow', 'dog']
['noun', 'article', 'adjective', 'noun']
['name', '', 'color', 'animal']

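e.iter() lets you iterate over e as the root together with every element beneath it; passing the tag token restricts the results to token elements. child.attrib returns a dictionary of that element's attributes, and we append the value of each attribute to its corresponding list.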
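As a self-contained variation on the loop above, the following sketch parses the sample XML from an inline string instead of test.xml, and reads attributes with Element.get() and a default so that a token missing an attribute would not raise KeyError. The names SAMPLE_XML and collect_attributes are assumptions introduced for illustration and are not part of the original answer.

```python
import xml.etree.ElementTree as ET

# Inline copy of the sample data from the question (assumed name: SAMPLE_XML).
SAMPLE_XML = """
<foo>
  <bar>
    <unit>
      <structure>
        <token word="Rocky" att1="noun" att2="name">Rocky</token>
        <token word="the" att1="article" att2="">the</token>
        <token word="yellow" att1="adjective" att2="color">yellow</token>
        <token word="dog" att1="noun" att2="animal">dog</token>
      </structure>
    </unit>
  </bar>
</foo>
"""


def collect_attributes(root, names=("word", "att1", "att2")):
    """Hypothetical helper: build one list per attribute name from all <token> elements."""
    columns = {name: [] for name in names}
    for token in root.iter("token"):
        for name in names:
            # get() falls back to "" when the attribute is absent.
            columns[name].append(token.get(name, ""))
    return columns


root = ET.fromstring(SAMPLE_XML)
columns = collect_attributes(root)
print(columns["word"])
print(columns["att1"])
print(columns["att2"])
```

Keeping the lists in a dictionary keyed by attribute name avoids repeating one append line per attribute, which may be convenient if the real data has more than three attributes.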

Edit: for the second part of your question, I think the following (though it may not be best practice) will do what you need:

import xml.etree.ElementTree

e = xml.etree.ElementTree.parse('test.xml').getroot()

listWord = []
listAtt1 = []
listAtt2 = []
animal_structs = []

for structure in e.iter('structure'):
    for child in structure.iter('token'):
        if 'att2' in child.keys():
            if child.attrib['att2'] == 'animal':
                animal_structs.append(structure)
                break

for structure in animal_structs:
    for child in structure.iter('token'):
        listWord.append(child.attrib['word'])
        listAtt1.append(child.attrib['att1'])
        listAtt2.append(child.attrib['att2'])

print(listWord)
print(listAtt1)
print(listAtt2)

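We first build a list of all structure elements that have a child token whose att2 is animal, and then collect all the attributes of every token in each of those structures.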
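Since the question mentions XPath, the two-step approach described above could also be sketched with the limited XPath subset that ElementTree supports in Python 3: select the matching tokens with an attribute predicate, then step up to the parent with "..". This is an alternative sketch, not the original answer's method. The second structure element in the sample (the one about a car) is hypothetical and added only to show that structures without an animal token are skipped. The sketch assumes token elements are direct children of structure.

```python
import xml.etree.ElementTree as ET

# Sample data; the second <structure> is a hypothetical addition for illustration.
SAMPLE_XML = """
<foo>
  <bar>
    <unit>
      <structure>
        <token word="Rocky" att1="noun" att2="name">Rocky</token>
        <token word="the" att1="article" att2="">the</token>
        <token word="yellow" att1="adjective" att2="color">yellow</token>
        <token word="dog" att1="noun" att2="animal">dog</token>
      </structure>
      <structure>
        <token word="a" att1="article" att2="">a</token>
        <token word="red" att1="adjective" att2="color">red</token>
        <token word="car" att1="noun" att2="vehicle">car</token>
      </structure>
    </unit>
  </bar>
</foo>
"""

root = ET.fromstring(SAMPLE_XML)

# Find every <token> with att2="animal", then move up to its parent element.
animal_structs = root.findall(".//token[@att2='animal']/..")

list_word, list_att1, list_att2 = [], [], []
for structure in animal_structs:
    for token in structure.iter("token"):
        list_word.append(token.get("word", ""))
        list_att1.append(token.get("att1", ""))
        list_att2.append(token.get("att2", ""))

print(list_word)
print(list_att1)
print(list_att2)
```

For fuller XPath support (for example, nested predicates or sibling axes), a third-party library such as lxml would be needed; the standard library only implements a subset.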


Edited on 2021-05-13
