add values of one group into another group in R

Ravindar G

I have a question on how to add the value from a group to rest of the elements in the group then delete that row. for ex:

df <- data.frame(Year=c(1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2),
                 Cluster=c("a","a","a","a","a","a","a","a","a","a","a","a","a","a","a","a","a","a","a","a","c","b","b","b","b","b","b","b","b","b","b","b","b","b","b","b","b","b","b","b","b","d"),
                 Seed=c(1,1,1,1,1,2,2,2,2,2,3,3,3,3,3,99,99,99,99,99,99),
                 Day=c(1,2,3,4,5,1,2,3,4,5,1,2,3,4,5,1,2,3,4,5,1),
                 value=c(5,2,1,2,8,6,7,9,3,5,2,1,2,8,6,55,66,77,88,99,10))

in the above example, my data is grouped by Year, Cluster, Seed and Day where seed=99 values need to be added to above rows based on (Year, Cluster and Day) group then delete this row. for ex: Row # 16, is part of (Year=1, Cluster=a,Day=1 and Seed=99) group and the value of Row #16 which is 55 should be added to Row #1 (5+55), Row # 6 (6+55) and Row # 11 (2+55) and row # 16 should be deleted. But when it comes to Row #21, which is in cluster=C with seed=99, should remain in the database as is as it cannot find any matching in year+cluster+day combination.

My actual data is of 1 million records with 10 years, 80 clusters, 500 days and 10+1 (1 to 10 and 99) seeds, so looking for so looking for an efficient solution.

     Year Cluster Seed Day value
1     1       a    1   1    60
2     1       a    1   2    68
3     1       a    1   3    78
4     1       a    1   4    90
5     1       a    1   5   107
6     1       a    2   1    61
7     1       a    2   2    73
8     1       a    2   3    86
9     1       a    2   4    91
10    1       a    2   5   104
11    1       a    3   1    57
12    1       a    3   2    67
13    1       a    3   3    79
14    1       a    3   4    96
15    1       a    3   5   105
16    1       c   99   1    10
17    2       b    1   1    60
18    2       b    1   2    68
19    2       b    1   3    78
20    2       b    1   4    90
21    2       b    1   5   107
22    2       b    2   1    61
23    2       b    2   2    73
24    2       b    2   3    86
25    2       b    2   4    91
26    2       b    2   5   104
27    2       b    3   1    57
28    2       b    3   2    67
29    2       b    3   3    79
30    2       b    3   4    96
31    2       b    3   5   105
32    2       d   99   1    10

arg0naut91

A data.table approach:

library(data.table)

df <- setDT(df)[, `:=` (value = ifelse(Seed != 99, value + value[Seed == 99], value),
                  flag = Seed == 99 & .N == 1), by = .(Year, Cluster, Day)][!(Seed == 99 & flag == FALSE),][, "flag" := NULL]

Output:

df[]

    Year Cluster Seed Day value
 1:    1       a    1   1    60
 2:    1       a    1   2    68
 3:    1       a    1   3    78
 4:    1       a    1   4    90
 5:    1       a    1   5   107
 6:    1       a    2   1    61
 7:    1       a    2   2    73
 8:    1       a    2   3    86
 9:    1       a    2   4    91
10:    1       a    2   5   104
11:    1       a    3   1    57
12:    1       a    3   2    67
13:    1       a    3   3    79
14:    1       a    3   4    96
15:    1       a    3   5   105
16:    1       c   99   1    10
17:    2       b    1   1    60
18:    2       b    1   2    68
19:    2       b    1   3    78
20:    2       b    1   4    90
21:    2       b    1   5   107
22:    2       b    2   1    61
23:    2       b    2   2    73
24:    2       b    2   3    86
25:    2       b    2   4    91
26:    2       b    2   5   104
27:    2       b    3   1    57
28:    2       b    3   2    67
29:    2       b    3   3    79
30:    2       b    3   4    96
31:    2       b    3   5   105
32:    2       d   99   1    10

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2020-12-2

Comments

0 comments

TOP Ranking

Article

add values of one group into another group in R

add values of one group into another group in R

pump.io port in URL

Loopback Error: connect ECONNREFUSED 127.0.0.1:3306 (MAMP)

Can't pre-populate phone number and message body in SMS link on iPhones when SMS app is not running in the background

How to import an asset in swift using Bundle.main.path() in a react-native native module

Failed to listen on localhost:8000 (reason: Cannot assign requested address)

Spring Boot JPA PostgreSQL Web App - Internal Authentication Error

ngClass error (Can't bind ngClass since it isn't a known property of div) in Angular 11.0.3

Using Response.Redirect with Friendly URLS in ASP.NET

Can a 32-bit antivirus program protect you from 64-bit threats

Double spacing in rmarkdown pdf

How to fix "pickle_module.load(f, **pickle_load_args) _pickle.UnpicklingError: invalid load key, '<'" using YOLOv3?

3D Touch Peek Swipe Like Mail

Bootstrap 5 Static Modal Still Closes when I Click Outside

Assembly definition can't resolve namespaces from external packages

Vector input in shiny R and then use it

Emulator wrong screen resolution in Android Studio 1.3

Svchost high CPU from Microsoft.BingWeather app errors

Graphics Context misaligned on first paint

Python connect to firebird docker database

Is this docker-for-mac password dialog legit?

How to save models trained locally in Amazon SageMaker?