根据字典填充选定列的 NaN，其键是列名，值是 Python 中另一列的内容

ah bon 发表于 Dev

是这样吗

对于数据框df1如下：

         id  products  black metal  non-ferrous metals  precious metal
0  M0066350    copper          NaN                 NaN             NaN
1  M0066352  aluminum          NaN                 NaN             NaN
2  M0066353      gold          NaN                 NaN             NaN
3  M0066354    silver          NaN                 NaN             NaN
4  S0200837   soybean          NaN                 NaN             NaN
5  S0212350     Apple          NaN                 NaN             NaN
6  S0212351  iron ore          NaN                 NaN             NaN
7  S0212352      coke          NaN                 NaN             NaN
8  S0212353    others          1.0                 NaN             1.0

我希望根据以下内容cols = ['black metal', 'non-ferrous metals', 'precious metal']用1s填充列customized_dict：

customized_dict = {
    'black metal': ['iron ore', 'coke'], 
    'non-ferrous metals': ['copper', 'aluminum'],
    'precious metal': ['gold', 'silver']
                   }

请注意，键来自 in 的列名df1和values来自productsin的内容df1。

所以我的问题是如何获得以下输出：

         id  products  black metal  non-ferrous metals  precious metal
0  M0066350    copper          NaN                 1.0             NaN
1  M0066352  aluminum          NaN                 1.0             NaN
2  M0066353      gold          NaN                 NaN             1.0
3  M0066354    silver          NaN                 NaN             1.0
4  S0200837   soybean          NaN                 NaN             NaN
5  S0212350     Apple          NaN                 NaN             NaN
6  S0212351  iron ore          1.0                 NaN             NaN
7  S0212352      coke          1.0                 NaN             NaN
8  S0212353    others          1.0                 NaN             1.0

编辑：列中有重复的新数据products。

    id  products  black metal  non-ferrous metals  precious metal
0  S0212350     Apple          NaN                 NaN             NaN
1  M0066352  aluminum          NaN                 1.0             NaN
2  S0212352      coke          1.0                 NaN             NaN
3  S0212354      coke          1.0                 NaN             NaN
4  M0066350    copper          NaN                 1.0             NaN
5  M0066353      gold          NaN                 NaN             1.0
6  S0212351  iron ore          1.0                 NaN             NaN
7  S0212353    others          1.0                 NaN             1.0
8  M0066354    silver          NaN                 NaN             1.0
9  S0200837   soybean          NaN                 NaN             NaN

莫兹韦

在列上使用一个简单的循环和update：

customized_dict = {
    'black metal': ['iron ore', 'coke'], 
    'non-ferrous metals': ['copper', 'aluminum'],
    'precious metal': ['gold', 'silver']
                   }
df.update(df.iloc[:,2:].apply(lambda c: c[df['products']
                                         .isin(customized_dict[c.name])]
                                         .fillna(1)))

输出：

         id  products  black metal  non-ferrous metals  precious metal
0  M0066350    copper          NaN                 1.0             NaN
1  M0066352  aluminum          NaN                 1.0             NaN
2  M0066353      gold          NaN                 NaN             1.0
3  M0066354    silver          NaN                 NaN             1.0
4  S0200837   soybean          NaN                 NaN             NaN
5  S0212350     Apple          NaN                 NaN             NaN
6  S0212351  iron ore          1.0                 NaN             NaN
7  S0212352      coke          1.0                 NaN             NaN
8  S0212353    others          1.0                 NaN             1.0

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2022-03-30

我来说两句

0 条评论

登录后参与评论

上一篇：将列表中的值与 Ansible 中的其他列表进行比较

根据字典填充选定列的 NaN，其键是列名，值是 Python 中另一列的内容

根据字典填充选定列的 NaN，其键是列名，值是 Python 中另一列的内容

根据字典填充选定列的 NaN，其键是列名，值是 Python 中另一列的内容

计算数据帧R中的字符串频率

Android Studio Kotlin：提取为常量

Excel 2016图表将增长与4个参数进行比较

获取并汇总所有关联的数据

如何使用Redux-Toolkit重置Redux Store

http：// localhost：3000 /＃！/为什么我在localhost链接中得到“＃！/”。

将加号/减号添加到jQuery菜单

算术中的c ++常量类型转换

TYPO3：将 Formhandler 添加到新闻扩展

TreeMap中的自定义排序

如何开始为Ubuntu开发

在 Python 2.7 中。如何从文件中读取特定文本并分配给变量

无法使用 envoy 访问 .ssh/config

在Ubuntu和Windows中，触摸板有时会滞后。硬件问题？

遍历元素数组以每X秒在浏览器上显示

在Jenkins服务器中使用Selenium和Ruby进行的黄瓜测试失败，但在本地计算机中通过

警告消息：在matrix（unlist（drop.item），ncol = 10，byrow = TRUE）中：数据长度[16]不是列数的倍数[10]>？

未捕获的SyntaxError：带有Ajax帖子的意外令牌u

如何使用tweepy流式传输来自指定用户的推文（仅在该用户发布推文时流式传输）

尝试在Dell XPS13 9360上安装Windows 7时出错

如果从DB接收到的值为空，则JMeter JDBC调用将返回该值作为参数名称