I made a diagram. The names follow the PyTorch docs, although I renamed num_layers to w.
output comprises all the hidden states in the last layer ("last" depth-wise, not time-wise). (h_n, c_n) comprises the hidden states after the last timestep, t = n, so you could potentially feed them into another LSTM. The batch dimension is not included in the diagram.
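A minimal sketch that checks this distinction in code (the sizes here are made up for illustration; num_layers corresponds to w in the diagram):

```python
import torch
import torch.nn as nn

# Hypothetical sizes, chosen only for illustration
seq_len, batch, input_size = 5, 3, 10
hidden_size, num_layers = 20, 2  # num_layers is "w" in the diagram

lstm = nn.LSTM(input_size, hidden_size, num_layers)
x = torch.randn(seq_len, batch, input_size)
output, (h_n, c_n) = lstm(x)

# output: hidden states of the LAST layer, for every timestep
print(output.shape)  # (seq_len, batch, hidden_size)
# h_n / c_n: hidden and cell states at the LAST timestep, for every layer
print(h_n.shape)     # (num_layers, batch, hidden_size)

# Consequently, the final timestep of output equals the last layer's slice of h_n
assert torch.equal(output[-1], h_n[-1])
```

Because (h_n, c_n) carries the state of every layer at t = n, it is exactly the tuple you would pass as the initial state when feeding into another LSTM of the same shape.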