根据某一列过滤行，然后检查另一列的值是否在Python的特定列表中

ahbon 发表于 Dev

阿邦

给定以下玩具构建数据，我想检查何时type == 'other'，在数据框中是否存在一些不规则的值（表示inHall或Parking space）name。

    id    type                          name
0    1  office  Hessel, Macejkovic and Nader
1    2  office                Stiedemann LLC
2    3  office                     Grant Ltd
3    4  office                Anderson Group
4    5  retail                   MacDanald's
5    6  retail                      Wallmart
6    7  retail                      Wallmart
7    8   other                          Hall
8    9   other                 Parking space
9   10   other                 Parking space
10  11   other                   Roberts PLC

对于上面的数据集，我希望它创建一个新列indication并返回N最后一行，因为Roberts PLC它不在中['Hall', 'Parking space']。

    id    type                          name indication
0    1  office  Hessel, Macejkovic and Nader        NaN
1    2  office                Stiedemann LLC        NaN
2    3  office                     Grant Ltd        NaN
3    4  office                Anderson Group        NaN
4    5  retail                   MacDanald's        NaN
5    6  retail                      Wallmart        NaN
6    7  retail                      Wallmart        NaN
7    8   other                          Hall        NaN
8    9   other                 Parking space        NaN
9   10   other                 Parking space        NaN
10  11   other                   Roberts PLC          N

我使用过的代码需要编辑：

m = df1.loc[df1['type'].isin(['other'])]
if m['name'].str.contains('Hall|Parking space', na = False).any():
    print('')
else:
    print('N')

感谢您提前的帮助。

加：

对于打印指示：

if (df["type"].eq("other")) & (~df["name"].str.contains('Hall|Parking space', na = False).any()):
    print('Other type data has irregular data')
else:
    print('No irregular data found in other type data')

出：

ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

奕奕

您可以直接进行分配：

df.loc[(df["type"].eq("other"))&(~df["name"].str.contains('Hall|Parking space', na = False)), "indication"] = "N"

print (df)

    id    type                          name indication
0    1  office  Hessel, Macejkovic and Nader        NaN
1    2  office                Stiedemann LLC        NaN
2    3  office                     Grant Ltd        NaN
3    4  office                Anderson Group        NaN
4    5  retail                   MacDanald's        NaN
5    6  retail                      Wallmart        NaN
6    7  retail                      Wallmart        NaN
7    8   other                          Hall        NaN
8    9   other                 Parking space        NaN
9   10   other                 Parking space        NaN
10  11   other                   Roberts PLC          N

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2021-01-25

我来说两句

0 条评论

登录后参与评论

上一篇：使用声明式管道的waitUntil实现依赖并行任务的更优雅方法

根据某一列过滤行，然后检查另一列的值是否在Python的特定列表中

根据某一列过滤行，然后检查另一列的值是否在Python的特定列表中

隐藏发件人没有短信PHP

Hashchange事件侦听器在将事件处理程序附加到事件之前进行侦听

用日期数据透视表和日期顺序查询

flask-admin 如何自定义删除按钮

在浏览器中请求URL时会发生什么？

材质UI垂直滑块。如何改变在垂直材料UI滑块导轨的厚度（反应）

为什么PlusShare.Builder setRecipients方法不起作用？

OS X-为什么我需要打开WiFi才能确定最近的位置

在Windows 7中无法删除文件（2）

android 背部按下

Swift如何使用Base64Url编码JWT标头和有效负载之类的json对象

PyQt4.QtCore模块无法向sip模块注册

用白色图像隐藏Android Studio中的所有textView

为什么随机森林中的平均降低基尼系数取决于人口规模？

应用发明者仅从列表中选择一个随机项一次

正则表达式，用于查找所有以任何字母开头和数字开头的文件

ArgumentError：错误＃2109：在场景默认设置中未找到默认的帧标签

sshd AllowGroups组未授予访问权限

jQuery无限滚动固定div中的滚动

无法加载文件或程序集System.Runtime.CompilerServices.Unsafe

Jqgrid：多级别组摘要