Welcome to our Support Center

Bidirectional

Description

Define the bidirectional layer according to its parameters. To be used for the TimeDistributed layer. Type : polymorphic.

 

Input parameters

 

parameters : layer parameters.

Β layerΒ :Β rnn layer instance (RNN, GRU, LSTM, SimpleRNN).
Β merge_modeΒ :Β enum,Β mode by which outputs of the forward and backward RNNs will be combined.
Default value β€œconcat”.
Β backward_layer :Β rnn layer instance, (RNN, GRU, LSTM, SimpleRNN).
Β training?Β :Β boolean, whether the layer is in training mode (can store data for backward).
Default value β€œTrue”.
Β store?Β :Β boolean, whether the layer stores the last iteration gradient (accessible via the β€œget_gradients” function).
Default value β€œFalse”.
Β update?Β :Β boolean, whether the layer’s variables should be updated during backward. Equivalent to freeze the layer.
Default value β€œTrue”.
Β lda_coeff :Β float, defines the coefficient by which the loss derivative will be multiplied before being sent to the previous layer (since during the backward run we go backwards).
Default value β€œ1”.

 

Output parameters

 

Bidirectional out : layer bidirectional architecture.

Example

All these exemples are snippets PNG, you can drop these Snippet onto the block diagram and get the depicted code added to your VI (Do not forget to install HAIBAL library to run it).

Bidirectional layer inside TimeDistributed layer

1 – Generate a set of data

We generate an array of data of type single and shape [batch_size = 10, time = 6, timesteps = 7, features = 5].

2 – Define graph

First, we define the first layer of the graph which is an Input layer (explicit input layer method). This layer is setup as an input array shaped [time = 6, timesteps = 7, features = 5].
Then, we add to the graph the TimeDistributed layer which we setup with a Bidirectional layer using the define method.

3 – Run graph

We call the forward method and retrieve the result with the β€œPrediction 3D” method.
This method returns two variables, the first one is the layer information (cluster composed of the layer name, the graph index and the shape of the output layer) and the second one is the prediction with a shape of [batch_size, time, 2*units].
The output dimension depends on the parameters “return-sequences” and merge_mode, refer to the chapter “Dimension” of “add to graph” documentation.

 

Table of Contents