In the dataset I'm working on, the Adult dataset, the missing values are indicated with the "?"
string, and I want to discard the rows containing missing values.
The documentation of df.dropna()
lists no argument for passing a custom value to interpret as null/missing.
I know I can solve the problem with something like:
df_str = df.select_dtypes(['object'])  # get the columns containing strings
for col in df_str.columns:
    df = df[df[col] != '?']
but I was wondering whether there is a standard way of achieving this with the pandas API, ideally one that is more flexible and faster.
If you're importing the data from a CSV file, for example, you can use the na_values parameter of pd.read_csv
to define additional strings to recognise as NA/NaN.
Example:
import pandas as pd
from io import StringIO
data = \
"""
A;B;C
1;2;?
4;?;6
?;8;9
"""
df = pd.read_csv(StringIO(data),
                 delimiter=';',
                 na_values='?')
The resulting dataframe looks like this:
  A  |  B  |  C
-----|-----|-----
  1  |  2  | NaN
  4  | NaN |  6
 NaN |  8  |  9
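If the data is already loaded into a DataFrame, you don't need to re-read the CSV: a common pattern is to replace the sentinel string with NaN via df.replace and then call df.dropna(). A minimal sketch, using a small hypothetical frame in place of the Adult dataset:

```python
import pandas as pd
import numpy as np

# Hypothetical sample frame standing in for the Adult dataset,
# with "?" marking missing values
df = pd.DataFrame({
    "A": ["1", "4", "?"],
    "B": ["2", "?", "8"],
    "C": ["3", "6", "9"],
})

# Turn the sentinel into a real missing value, then drop incomplete rows
cleaned = df.replace("?", np.nan).dropna()
print(cleaned)  # only the first row survives
```

This keeps the whole operation vectorised instead of looping over columns, and dropna's subset/thresh parameters give extra control over which rows are discarded.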