熊猫：只要条件持续存在并且某个值出现在另一列中，则为该列分配值

Luca91 发表于 Dev

卢卡91

我有一个数据框，其中包括一个特定条件，一个条件连续发生次数的计数器以及一个特定值，看起来像这样：

         date                condition         count        Value1    Value2
    01,01,2018 08:00             A               1            0         0
    01,01,2018 08:01             A               2            0         0
    01,01,2018 08:02             A               3            0         0
    01,01,2018 08:03             B               1            1         1
    01,01,2018 08:04             B               2            0         1
    01,01,2018 08:05             B               3            0         1
    01,01,2018 08:06             B               4            0         0
    01,01,2018 08:07             C               1            0         0
    01,01,2018 08:08             C               2            0         0
    01,01,2018 08:09             C               3            0         0
    01,01,2018 08:10             C               4            0         0
    01,01,2018 08:11             C               5            0         0
    01,01,2018 08:12             A               1            0         0
    01,01,2018 08:13             A               2            0         0
    01,01,2018 08:14             B               1            0         0
    01,01,2018 08:15             B               2            0         1
    01,01,2018 08:16             B               3            0         1
    01,01,2018 08:17             C               1            0         0

我想添加另一列“错误”，该列在条件下具有值1：
如果在count = 1时，如果value1 = 1和condition = B，则只要value2 = 1，就分配error = 1。

它应该看起来像：

         date                condition         count        Value1    Value2    error 
    01,01,2018 08:00             A               1            0         0        0
    01,01,2018 08:01             A               2            0         0        0
    01,01,2018 08:02             A               3            0         0        0
    01,01,2018 08:03             B               1            1         1        1
    01,01,2018 08:04             B               2            0         1        1
    01,01,2018 08:05             B               3            0         1        1
    01,01,2018 08:06             B               4            0         0        0
    01,01,2018 08:07             C               1            0         0        0
    01,01,2018 08:08             C               2            0         0        0
    01,01,2018 08:09             C               3            0         0        0
    01,01,2018 08:10             C               4            0         0        0
    01,01,2018 08:11             C               5            0         0        0
    01,01,2018 08:12             A               1            0         0        0
    01,01,2018 08:13             A               2            0         0        0
    01,01,2018 08:14             B               1            0         0        0
    01,01,2018 08:15             B               2            0         1        0
    01,01,2018 08:16             B               3            0         1        0
    01,01,2018 08:17             C               1            0         0        0

请注意，条件B第二次出现时，value1永远不会等于1，因此即使value2 = 1也没有错误。

我已经尝试过类似的事情：

df['error']=np.where(((df['value1']==1) & (df['condition']=='B') & df['value2']==1)) | ((df['error'].shift(1)=='1')&(df['value2']==1))),'1', 0)

但是它给了我主要的错误，因为我df['error'].shift(1)=='1'在列本身“尚不存在”时调用where条件。任何的想法？预先感谢您的帮助！

耶斯列尔

使用：

#conditions
mask = (df['Value1']==1) & (df['condition']=='B') & (df['Value2']==1)
#series for unique consecutive values
a = df['Value2'].ne(df['Value2'].shift()).cumsum()
#per each consecutive group cal cumulative sum, convert to boolean and then to integers
df['error'] = mask.groupby(a).cumsum().astype(bool).astype(int)
print (df)
                date condition  count  Value1  Value2  error
0   01,01,2018 08:00         A      1       0       0      0
1   01,01,2018 08:01         A      2       0       0      0
2   01,01,2018 08:02         A      3       0       0      0
3   01,01,2018 08:03         B      1       1       1      1
4   01,01,2018 08:04         B      2       0       1      1
5   01,01,2018 08:05         B      3       0       1      1
6   01,01,2018 08:06         B      4       0       0      0
7   01,01,2018 08:07         C      1       0       0      0
8   01,01,2018 08:08         C      2       0       0      0
9   01,01,2018 08:09         C      3       0       0      0
10  01,01,2018 08:10         C      4       0       0      0
11  01,01,2018 08:11         C      5       0       0      0
12  01,01,2018 08:12         A      1       0       0      0
13  01,01,2018 08:13         A      2       0       0      0
14  01,01,2018 08:14         B      1       0       0      0
15  01,01,2018 08:15         B      2       0       1      0
16  01,01,2018 08:16         B      3       0       1      0
17  01,01,2018 08:17         C      1       0       0      0

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2020-12-10

我来说两句

0 条评论

登录后参与评论

熊猫：如果列中的值出现在另一列中，则用第三列中的值替换

熊猫：删除其中一列的值出现在另一列中的任何行的行

熊猫：只要条件持续存在并且某个值出现在另一列中，则为该列分配值

熊猫：只要条件持续存在并且某个值出现在另一列中，则为该列分配值

构建类似于Jarvis的本地语言应用程序

在 Avalonia 中是否有带有柱子的 TreeView 或类似的东西？

Qt Creator Windows 10 - “使用 jom 而不是 nmake”不起作用

SQL Server中的非确定性数据类型

使用next.js时出现服务器错误，错误：找不到react-redux上下文值；请确保组件包装在<Provider>中

Swift 2.1-对单个单元格使用UITableView

Hashchange事件侦听器在将事件处理程序附加到事件之前进行侦听

HttpClient中的角度变化检测

如何了解DFT结果

错误：找不到存根。请确保已调用spring-cloud-contract：convert

Embers js中的更改侦听器上的组合框

在Wagtail管理员中，如何禁用图像和文档的摘要项？

如何避免每次重新编译所有文件？

Java中的循环开关案例

ng升级性能注意事项

Swift中的指针替代品？

如何使用geoChoroplethChart和dc.js在Mapchart的路径上添加标签或自定义值？

使用分隔符将成对相邻的数组元素相互连接

在同一Pushwoosh应用程序上Pushwoosh多个捆绑ID

ggplot：对齐多个分面图-所有大小不同的分面

完全禁用暂停（在内核级别？-必须与使用的DE和登录状态无关！）