How Can I Define Only the Gradient for a TensorFlow Subgraph?

Here’s a trick from Sergey Ioffe:

Suppose you want a group of ops that behaves as f(x) in the forward pass but as g(x) in the backward pass. You can implement it as:

t = g(x)                            # g(x): the backward-pass behaviour
y = t + tf.stop_gradient(f(x) - t)  # forward value equals f(x); gradients flow only through t
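
For instance, here is a minimal sketch (not from the original answer) of the trick used as a straight-through estimator: the forward pass rounds x, while gradients flow through the identity.

import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None])
t = x                                        # g(x): identity, used in the backward pass
y = t + tf.stop_gradient(tf.round(x) - t)    # f(x): rounding, used in the forward pass
# y evaluates to tf.round(x), yet dy/dx is 1 everywhere.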

So in your case, g(x) could be an identity op with a custom gradient attached using gradient_override_map.
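
Here is a minimal sketch of that identity-op variant in TF1-style graph mode; the gradient name "ClipGradient" and the clipping rule are only illustrative assumptions, not part of the original answer.

import tensorflow as tf

@tf.RegisterGradient("ClipGradient")         # hypothetical name; any unused string works
def _clip_gradient(op, grad):
    # Custom backward rule: clip incoming gradients to [-1, 1].
    return tf.clip_by_value(grad, -1.0, 1.0)

x = tf.placeholder(tf.float32, shape=[None])
graph = tf.get_default_graph()
with graph.gradient_override_map({"Identity": "ClipGradient"}):
    y = tf.identity(x)                       # forward: y == x; backward: clipped gradient

Note that the override applies only to Identity ops created inside the with block, so the rest of the graph keeps its standard gradients.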
