How to set a column by slicing values of other columns

Mario Diez Martínez

I have a dataframe with the ruling party of the US, but the period is stored in the format yyyy-yyyy: 'democrat', and I want my final dataframe to have the format yyyy: 'democrat'. Instead of a range of years per ruling party, I want one column with every year between 1945 and 2022 and another column containing the string 'democrat' or 'republican'.


This is what I've been trying:

us_gov = pd.read_csv('/Users/elgasko/Documents/NUMERO ARMAS NUCLEARES/presidents.csv')
us_gov = us_gov.iloc[31:,1:4]
us_gov=us_gov[['Years In Office','Party']]
us_gov.sort_values(by=['Years In Office'])
years=range(1945,2023)
us_gov_def=pd.DataFrame(years, columns=['Year'])
us_gov_def.set_index('Year', drop=True, append=False, inplace=True, verify_integrity=False)
us_gov_def.insert(0, column='Party', value=np.nan)

for i in range(len(us_gov)):
    string=us_gov.iloc[i]['Years In Office']
    inicio=string[0:4]
    inicio=int(float(inicio))
    final=string[5:9]
    final=int(float(final))
    for j in us_gov_def.index :
        if j in range(inicio,final):
            us_gov_def.loc['Party',us_gov.Party[i]]
            
#https://github.com/awhstin/Dataset-List/blob/master/presidents.csv

ouroboros1

One solution could be as follows:

import pandas as pd

data = {'Years In Office': ['1933-1945','1945-1953','1953-1961'],
      'Party': ['Democratic', 'Democratic', 'Republican']}

df = pd.DataFrame(data)

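# Split 'yyyy-yyyy' into its two years, then build an inclusive range per original row
df['Years In Office'] = df['Years In Office'].str.split('-').explode()\
    .groupby(level=0).apply(lambda x: range(x.astype(int).min(),
                                            x.astype(int).max()+1))
# One row per year; ignore_index=True renumbers the rows 0..n-1 as in the output below
df = df.explode('Years In Office', ignore_index=True)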

print(df)

   Years In Office       Party
0             1933  Democratic
1             1934  Democratic
2             1935  Democratic
3             1936  Democratic
4             1937  Democratic
5             1938  Democratic
6             1939  Democratic
7             1940  Democratic
8             1941  Democratic
9             1942  Democratic
10            1943  Democratic
11            1944  Democratic
12            1945  Democratic
13            1945  Democratic
14            1946  Democratic
15            1947  Democratic
16            1948  Democratic
17            1949  Democratic
18            1950  Democratic
19            1951  Democratic
20            1952  Democratic
21            1953  Democratic
22            1953  Republican
23            1954  Republican
24            1955  Republican
25            1956  Republican
26            1957  Republican
27            1958  Republican
28            1959  Republican
29            1960  Republican
30            1961  Republican
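To see what the chained expression is doing, it may help to look at the intermediate results one step at a time. This is only a sketch on the same sample data; the names `parts`, `flat`, `start` and `end` are illustrative and not part of the original answer.

```python
import pandas as pd

df = pd.DataFrame({
    'Years In Office': ['1933-1945', '1945-1953', '1953-1961'],
    'Party': ['Democratic', 'Democratic', 'Republican'],
})

# Step 1: each 'yyyy-yyyy' string becomes a list of two strings.
parts = df['Years In Office'].str.split('-')

# Step 2: explode puts every year on its own row; both rows that came
# from the same string keep that string's original index label.
flat = parts.explode().astype(int)

# Step 3: grouping on the index label (level=0) recovers the first and
# last year of each original row.
start = flat.groupby(level=0).min()
end = flat.groupby(level=0).max()

print(flat.index.tolist())
print(start.tolist(), end.tolist())
```

The answer's lambda then turns each (first year, last year) pair into `range(first, last + 1)`, which is why both the start year and the end year of every period appear in the result.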

Notice that you will end up with duplicates:

print(df[df['Years In Office'].duplicated(keep=False)])

   Years In Office       Party
12            1945  Democratic
13            1945  Democratic
21            1953  Democratic
22            1953  Republican

This is because consecutive periods share a year: the end year of one period is the start year of the next (e.g. '1933-1945', '1945-1953'). If you don't want this, you could add:

df = df.groupby('Years In Office', as_index=False).agg({'Party':', '.join})
print(df.loc[df['Years In Office'].isin([1945, 1953])])

   Years In Office                   Party
12            1945  Democratic, Democratic
20            1953  Democratic, Republican
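Another way to avoid the overlap is to not create it in the first place, by treating the end year of each period as exclusive (which is what `range(inicio, final)` in the question does). The following is a minimal sketch, assuming every value has the exact form 'yyyy-yyyy'; the names `terms`, `rows` and `per_year` are made up for illustration.

```python
import pandas as pd

data = {'Years In Office': ['1933-1945', '1945-1953', '1953-1961'],
        'Party': ['Democratic', 'Democratic', 'Republican']}
terms = pd.DataFrame(data)

rows = []
for span, party in zip(terms['Years In Office'], terms['Party']):
    start, end = (int(part) for part in span.split('-'))
    # range(start, end) stops at end - 1, so a handover year is listed
    # once, under the period that starts in that year.
    rows.extend((year, party) for year in range(start, end))

per_year = pd.DataFrame(rows, columns=['Year', 'Party'])
print(per_year['Year'].is_unique)
print(per_year[per_year['Year'].isin([1945, 1953])])
```

With this choice the end year of the very last period (1961 in the sample) is left out, so that case would need handling separately if it matters.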

Or, instead of the groupby above, you could drop only the duplicate rows where the ruling party does not change, which keeps both rows for a year in which it does change. E.g.:

df = df[~df.duplicated()].reset_index(drop=True)
print(df.loc[df['Years In Office'].isin([1945, 1953])])

   Years In Office       Party
12            1945  Democratic
20            1953  Democratic
21            1953  Republican
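If exactly one party per year is needed, as in the question (years from 1945 to 2022, one party each), one possible way is to keep a single row per year and then index by year. This is a sketch under two assumptions that are not in the original answer: the rows are in chronological order, and a handover year is assigned to the incoming party (`keep='last'`). The frame `exploded` below is a small subset of rows in the shape of the output above, rebuilt so the example runs on its own.

```python
import pandas as pd

# A few rows in the same shape as the exploded frame shown earlier.
exploded = pd.DataFrame({
    'Years In Office': [1944, 1945, 1945, 1946, 1952, 1953, 1953, 1954],
    'Party': ['Democratic', 'Democratic', 'Democratic', 'Democratic',
              'Democratic', 'Democratic', 'Republican', 'Republican'],
})

per_year = (
    exploded
    # Assumption: for a repeated year, the last row (incoming party) wins.
    .drop_duplicates(subset='Years In Office', keep='last')
    .rename(columns={'Years In Office': 'Year'})
    .set_index('Year')
)

# Keep only the years asked for in the question (1945 to 2022 inclusive).
per_year = per_year[(per_year.index >= 1945) & (per_year.index <= 2022)]
print(per_year)
```

Using `keep='first'` instead would assign the handover year to the outgoing party; which one is right depends on how the year should be counted.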


Edited at 2022-09-28
