How to properly apply a lambda function into a pandas data frame column

Amani :

I have a pandas data frame, sample, with one of the columns called PR to which am applying a lambda function as follows:

sample['PR'] = sample['PR'].apply(lambda x: NaN if x < 90)

I then get the following syntax error message:

sample['PR'] = sample['PR'].apply(lambda x: NaN if x < 90)
                                                         ^
SyntaxError: invalid syntax

What am I doing wrong?

jezrael :

You need mask:

sample['PR'] = sample['PR'].mask(sample['PR'] < 90, np.nan)

Another solution with loc and boolean indexing:

sample.loc[sample['PR'] < 90, 'PR'] = np.nan

Sample:

import pandas as pd
import numpy as np

sample = pd.DataFrame({'PR':[10,100,40] })
print (sample)
    PR
0   10
1  100
2   40

sample['PR'] = sample['PR'].mask(sample['PR'] < 90, np.nan)
print (sample)
      PR
0    NaN
1  100.0
2    NaN

sample.loc[sample['PR'] < 90, 'PR'] = np.nan
print (sample)
      PR
0    NaN
1  100.0
2    NaN

EDIT:

Solution with apply:

sample['PR'] = sample['PR'].apply(lambda x: np.nan if x < 90 else x)

Timings len(df)=300k:

sample = pd.concat([sample]*100000).reset_index(drop=True)

In [853]: %timeit sample['PR'].apply(lambda x: np.nan if x < 90 else x)
10 loops, best of 3: 102 ms per loop

In [854]: %timeit sample['PR'].mask(sample['PR'] < 90, np.nan)
The slowest run took 4.28 times longer than the fastest. This could mean that an intermediate result is being cached.
100 loops, best of 3: 3.71 ms per loop

이 기사는 인터넷에서 수집됩니다. 재 인쇄 할 때 출처를 알려주십시오.

침해가 발생한 경우 연락 주시기 바랍니다[email protected] 삭제

에서 수정2020-09-22

몇 마디 만하겠습니다

0리뷰

로그인참여 후 검토

이전 게시물：Kotlin에서 문자열이 비어 있는지 확인

TOP 리스트

기사

How to properly apply a lambda function into a pandas data frame column

How to properly apply a lambda function into a pandas data frame column

셀레늄의 모델 대화 상자에서 텍스트를 추출하는 방법은 무엇입니까?

Matlab의 반복 Sortino 비율

C # 16 진수 값 0x12는 잘못된 문자입니다.

EventEmitter <string>의 컨텍스트 'this'가 Observable <string> 유형의 'this'메서드에 할당되지 않았습니다.

Python의 csv 파일에서 첫 번째 열 삭제

개체 참조가 개체의 인스턴스로 설정되지 않았습니까? (예외 오류 ~ ASP.NET MVC)

atob은 인코딩 된 base64 문자열을 디코딩하지 않습니다.

팝업처럼 위젯을 표시하는 방법

Excel : 합계가 N보다 크거나 같은 상위 값 찾기

R을 사용하여 추정 된 ERGM 모델에서 nodemix () 항에 대한 출력 해석

병합 셀을 사용하여 워크 시트의 데이터 필터링

일반 메서드에서 클래스 속성에 액세스하는 방법-C #

Matterport Mask-R-CNN의 손실은 정확히 무엇입니까?

ssh를 사용하여 원격에서 로컬로 파일 복사

외부 파일이 포함 된 Runnable Jar 만들기

GStreamer-Java : RTSP- 소스에서 UDP- 싱크로

자바 스크립트에서 점 앞의 물음표 / 반응

main 메소드없이 Java 프로그램을 어떻게 실행할 수 있습니까?

앱바 중간과 상단에 툴바를 배치하는 방법 (vue, vuetify)

Spring Boot에서 HTTP 응답 캐싱을 활성화하는 방법

Samsung Galaxy Tab A 용 AOSP 빌드