How to ffill nan values in a numpy array using the last non-nan values repeating N times

wuya Published at Dev

wuya

I would like to ffill nan values in a numpy array using the last non-nan values repeating N times. If the number of nan values > N, then fill the rest nan values with zero. How do I do it in pure numpy without iteration?

import numpy as np

n = 2
arr = np.array([np.nan, 0, 0, np.nan, 5, 4, 4, np.nan, np.nan, np.nan, 1, 5, 3, np.nan, 2, np.nan, np.nan])

def ffill(arr: np.array, n: int):
    pass
    return arr

result = np.array([0.0, 0.0, 0.0, 0.0, 5.0, 4.0, 4.0, 4.0, 4.0, 0.0, 1.0, 5.0, 3.0, 3.0, 2.0, 2.0, 2.0])

Ffill 4 n times (=2) [... 4, np.nan, np.nan, np.nan ...] -> [... 4, 4, 4, 0 ...]

[Solution]

Thanks for @Homer512 's answer. I improved it when n is very large.

def ffill(arr: np.array, n: int):
    if np.isnan(arr[0]):
        arr[0] = 0

    isnan = np.isnan(arr)
    notnan = ~isnan
    valid = arr[notnan]
    indices = np.cumsum(notnan) - 1
    arr = valid[indices]

    overlimit = np.lib.stride_tricks.sliding_window_view(isnan[:-1][::-1], isnan.size-n)[:, ::-1].all(axis=0)
    overlimit &= isnan[n:]

    indices = np.flatnonzero(overlimit) + n
    arr[indices] = 0
    return arr

Homer512

Here is a trick that works:

Fix the start value

if math.isnan(arr[0]):
    arr[0] = 0

Now we can keep track of the valid indices with np.cumsum

isnan = np.isnan(arr)
notnan = ~isnan
valid = arr[notnan]
indices = np.cumsum(notnan) - 1
arr = valid[indices]

To support your requirement that NaNs are replaced with zeros after N steps, you could use np.convolve(isnan, (1,) * (n + 1), mode='same') > n to find the indices. But because convolve is centered, it's a bit complicated to find the correct index from the convolution. Let's do it manually instead. Yes, this will use an iteration, but only a fixed number for N

overlimit = np.copy(isnan[n:])
for i in range(1, n + 1):
    overlimit &= isnan[n-i:-i]
indices = np.flatnonzero(overlimit) + n
arr[indices] = 0

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2021-11-25

Comments

0 comments

how to find the unique non nan values in a numpy array?

Extrapolate NaN values in a numpy array

How to ffill nan values in a numpy array using the last non-nan values repeating N times

How to ffill nan values in a numpy array using the last non-nan values repeating N times

pump.io port in URL

grouping by column variables and appending a new variable based on condition

Failed to listen on localhost:8000 (reason: Cannot assign requested address)

Can't pre-populate phone number and message body in SMS link on iPhones when SMS app is not running in the background

Group boxplot data while keeping their individual X axis labels in ggplot2 in R

Vector input in shiny R and then use it

BigQuery - concatenate ignoring NULL

Can a 32-bit antivirus program protect you from 64-bit threats

How to remove the extra space from right in a webview?

How to how increase/decrease compared to adjacent cell

android.content.Context.getSharedPreferences(java.lang.String, int)' on a null object reference id DBhandler

Getting 502 Bad Gateway Error While Deploying WordPress On Dockerized Lemp?

Type 'number' is not assignable to type 'NgIterable<any>' when trying to async observe a datasource

Check if a number is a perfect square

FFmpeg resize without upscaling

How do I display Label text character-by-character?

How to show an image in a View with ASP.NET MVC 5? (Many suggestions failed so far)

Json Schema - Conditional Evaluation with RegEx

PlayOnLinux displays weird looking window on 18.04 for MS Office installation

JMeter: Why get error when try to save test plan

Emulator wrong screen resolution in Android Studio 1.3