How can I stop gradient when losses include nan #14217
-
When I run the code below, the resulting gradient contains nan at the masked entries:

```python
import jax
import jax.numpy as jnp
from jax import lax, random

key = random.PRNGKey(0)

def loss(x):
    nan_mask = random.uniform(key, x.shape) > 0.5
    x = x * 2.0
    x = x / (~nan_mask)   # division by zero where masked -> inf
    x = jnp.nan_to_num(x)
    x = jnp.where(nan_mask, 0, x)
    x = (1 - nan_mask) * x
    x = jnp.where(nan_mask, lax.stop_gradient(x), x)
    return x.mean()

print(jax.grad(loss)(jnp.ones(10,)))
```

What I expected is a gradient of zero at the masked entries. Is there any dynamic way to stop (or ignore) the gradient when losses include nan?
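The nan here does not come from the forward value (which the `where`/`nan_to_num` calls do mask) but from the backward pass: the division by zero still produces an `inf`, and the zero cotangent multiplied by `inf` is nan. A minimal sketch isolating that effect (the function name `f` is just illustrative, not from the discussion):

```python
import jax
import jax.numpy as jnp

def f(x):
    y = x / 0.0                      # forward value: inf
    return jnp.where(True, 0.0, y)   # forward result: 0.0, y is masked out

# The untaken branch still participates in the backward pass:
# the zero cotangent is multiplied by d(y)/dx = 1/0.0 = inf, and 0 * inf = nan.
print(jax.grad(f)(1.0))  # nan
```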
Replies: 1 comment
-
This looks related to the situation covered in the following FAQ entry: https://jax.readthedocs.io/en/latest/faq.html#gradients-contain-nan-where-using-where

But if your goal is to simply have the specified entries in `x` not contribute to the gradient, you can do so by zeroing them out:

```python
def loss(x):
    nan_mask = random.uniform(key, x.shape) > 0.5
    x = x * 2.0
    x = jnp.where(nan_mask, 0, x)
    return x.mean()

print(jax.grad(loss)(jnp.ones(10,)))
# [0.2 0.  0.2 0.2 0.2 0.2 0.2 0.2 0.  0. ]
```

Is that the output you're hoping to see?
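For reference, the fix described in that FAQ entry is the "double where" trick: substitute a safe dummy value *before* the dangerous operation, so no inf/nan ever enters the backward pass. A minimal sketch under a different, illustrative loss (`safe_log_loss` and its values are not from the discussion):

```python
import jax
import jax.numpy as jnp

def safe_log_loss(x):
    mask = x > 0
    # First where: replace invalid inputs with a harmless dummy (1.0),
    # so jnp.log never sees them in the forward or backward pass.
    safe_x = jnp.where(mask, x, 1.0)
    # Second where: mask the output, as in the answer above.
    return jnp.where(mask, jnp.log(safe_x), 0.0).mean()

print(jax.grad(safe_log_loss)(jnp.array([-1.0, 1.0, 2.0])))
# [0.         0.33333334 0.16666667]
```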