Pandas Equivalent for SQL window function and rows range

Marc Published at Dev

Marc

Consider the minimal example

customer   day  purchase
Joe        1       5
Joe        1      10
Joe        2       5
Joe        2       5       
Joe        4      10
Joe        7       5

In BigQuery, one would do something similar to this to get how much the customer spent in the last 2 days for every day:

SELECT customer, day
, sum(purchase) OVER (PARTITION BY customer ORDER BY day ASC RANGE between 2 preceding and 1 preceding)
FROM table

What would be the equivalent in pandas? i.e., expected outcome

customer   day  purchase    amount_last_2d
Joe        1       5             null  -- spent days [-,-]
Joe        1      10             null  -- spent days [-,-]
Joe        2       5               15  -- spent days [-,1]
Joe        2       5               15  -- spent days [-,1]
Joe        4      10               10  -- spent days [2,3]
Joe        7       5                0  -- spent days [5,6]

BENY

Try groupby with shift then reindex back

df['new'] = df.groupby(['customer','day']).purchase.sum().shift().reindex(pd.MultiIndex.from_frame(df[['customer','day']])).values
df
Out[259]: 
  customer  day  purchase   new
0      Joe    1         5   NaN
1      Joe    1        10   NaN
2      Joe    2        10  15.0
3      Joe    2         5  15.0
4      Joe    4        10  15.0

Update

s = df.groupby(['customer','day']).apply(lambda x : df.loc[df.customer.isin(x['customer'].tolist()) & (df.day.isin(x['day']-1)|df.day.isin(x['day']-2)),'purchase'].sum())
df['new'] = s.reindex(pd.MultiIndex.from_frame(df[['customer','day']])).values
df
Out[271]: 
  customer  day  purchase  new
0      Joe    1         5    0
1      Joe    1        10    0
2      Joe    2         5   15
3      Joe    2         5   15
4      Joe    4        10   10
5      Joe    7         5    0

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2021-02-6

Comments

0 comments

What is the pandas equivalent to a sql count window function with a filter?

Pandas equivalent to SQL window functions

Pandas Equivalent for SQL window function and rows range

Pandas Equivalent for SQL window function and rows range

Can't pre-populate phone number and message body in SMS link on iPhones when SMS app is not running in the background

pump.io port in URL

How to import an asset in swift using Bundle.main.path() in a react-native native module

Loopback Error: connect ECONNREFUSED 127.0.0.1:3306 (MAMP)

Failed to listen on localhost:8000 (reason: Cannot assign requested address)

Spring Boot JPA PostgreSQL Web App - Internal Authentication Error

Is this docker-for-mac password dialog legit?

Double spacing in rmarkdown pdf

ngClass error (Can't bind ngClass since it isn't a known property of div) in Angular 11.0.3

Vector input in shiny R and then use it

Assembly definition can't resolve namespaces from external packages

Bootstrap 5 Static Modal Still Closes when I Click Outside

Can a 32-bit antivirus program protect you from 64-bit threats

Using Response.Redirect with Friendly URLS in ASP.NET

BigQuery - concatenate ignoring NULL

How to how increase/decrease compared to adjacent cell

AirflowException: Celery command failed - The recorded hostname does not match this instance's hostname

@RefreshScope annotated Bean registered through BeanDefinitionRegistryPostProcessor not getting refreshed on Cloud Config changes

MTKView Displaying Wide Gamut P3 Colorspace

Displaying attached image with post how to i get it to display

Python connect to firebird docker database