mlpack 3.4.2
Public Member Functions | List of all members
LSTM< InputDataType, OutputDataType > Class Template Reference

Implementation of the LSTM module class. More...

#include <lstm.hpp>

Public Member Functions

 LSTM ()
 Create the LSTM object. More...
 
 LSTM (const size_t inSize, const size_t outSize, const size_t rho=std::numeric_limits< size_t >::max())
 Create the LSTM layer object using the specified parameters. More...
 
template<typename InputType , typename ErrorType , typename GradientType >
void Backward (const InputType &input, const ErrorType &gy, GradientType &g)
 Ordinary feed backward pass of a neural network, calculating the function f(x) by propagating x backwards trough f. More...
 
OutputDataType & Delta ()
 Modify the delta. More...
 
OutputDataType const & Delta () const
 Get the delta. More...
 
template<typename InputType , typename OutputType >
void Forward (const InputType &input, OutputType &output)
 Ordinary feed-forward pass of a neural network, evaluating the function f(x) by propagating the activity forward through f. More...
 
template<typename InputType , typename OutputType >
void Forward (const InputType &input, OutputType &output, OutputType &cellState, bool useCellState=false)
 Ordinary feed-forward pass of a neural network, evaluating the function f(x) by propagating the activity forward through f. More...
 
OutputDataType & Gradient ()
 Modify the gradient. More...
 
OutputDataType const & Gradient () const
 Get the gradient. More...
 
template<typename InputType , typename ErrorType , typename GradientType >
void Gradient (const InputType &input, const ErrorType &error, GradientType &gradient)
 
size_t InSize () const
 Get the number of input units. More...
 
OutputDataType & OutputParameter ()
 Modify the output parameter. More...
 
OutputDataType const & OutputParameter () const
 Get the output parameter. More...
 
size_t OutSize () const
 Get the number of output units. More...
 
OutputDataType & Parameters ()
 Modify the parameters. More...
 
OutputDataType const & Parameters () const
 Get the parameters. More...
 
void Reset ()
 
void ResetCell (const size_t size)
 
size_t & Rho ()
 Modify the maximum number of steps to backpropagate through time (BPTT). More...
 
size_t Rho () const
 Get the maximum number of steps to backpropagate through time (BPTT). More...
 
template<typename Archive >
void serialize (Archive &ar, const unsigned int)
 Serialize the layer. More...
 

Detailed Description

template<typename InputDataType = arma::mat, typename OutputDataType = arma::mat>
class mlpack::ann::LSTM< InputDataType, OutputDataType >

Implementation of the LSTM module class.

The implementation corresponds to the following algorithm:

\begin{eqnarray} i &=& sigmoid(W \cdot x + W \cdot h + W \cdot c + b) \\ f &=& sigmoid(W \cdot x + W \cdot h + W \cdot c + b) \\ z &=& tanh(W \cdot x + W \cdot h + b) \\ c &=& f \odot c + i \odot z \\ o &=& sigmoid(W \cdot x + W \cdot h + W \cdot c + b) \\ h &=& o \odot tanh(c) \end{eqnarray}

Note that if an LSTM layer is desired as the first layer of a neural network, an IdentityLayer should be added to the network as the first layer, and then the LSTM layer should be added.

For more information, see the following.

@article{Graves2013,
author = {Alex Graves and Abdel{-}rahman Mohamed and Geoffrey E. Hinton},
title = {Speech Recognition with Deep Recurrent Neural Networks},
journal = CoRR},
year = {2013},
url = {http://arxiv.org/abs/1303.5778},
}
See also
FastLSTM for a faster LSTM version which combines the calculation of the input, forget, output gates and hidden state in a single step.
Template Parameters
InputDataTypeType of the input data (arma::colvec, arma::mat, arma::sp_mat or arma::cube).
OutputDataTypeType of the output data (arma::colvec, arma::mat, arma::sp_mat or arma::cube).

Definition at line 62 of file lstm.hpp.

Constructor & Destructor Documentation

◆ LSTM() [1/2]

LSTM ( )

Create the LSTM object.

◆ LSTM() [2/2]

LSTM ( const size_t  inSize,
const size_t  outSize,
const size_t  rho = std::numeric_limits< size_t >::max() 
)

Create the LSTM layer object using the specified parameters.

Parameters
inSizeThe number of input units.
outSizeThe number of output units.
rhoMaximum number of steps to backpropagate through time (BPTT).

Member Function Documentation

◆ Backward()

void Backward ( const InputType &  input,
const ErrorType &  gy,
GradientType &  g 
)

Ordinary feed backward pass of a neural network, calculating the function f(x) by propagating x backwards trough f.

Using the results from the feed forward pass.

Parameters
inputThe propagated input activation.
gyThe backpropagated error.
gThe calculated gradient.

◆ Delta() [1/2]

OutputDataType & Delta ( )
inline

Modify the delta.

Definition at line 161 of file lstm.hpp.

◆ Delta() [2/2]

OutputDataType const & Delta ( ) const
inline

Get the delta.

Definition at line 159 of file lstm.hpp.

◆ Forward() [1/2]

void Forward ( const InputType &  input,
OutputType &  output 
)

Ordinary feed-forward pass of a neural network, evaluating the function f(x) by propagating the activity forward through f.

Parameters
inputInput data used for evaluating the specified function.
outputResulting output activation.

◆ Forward() [2/2]

void Forward ( const InputType &  input,
OutputType &  output,
OutputType &  cellState,
bool  useCellState = false 
)

Ordinary feed-forward pass of a neural network, evaluating the function f(x) by propagating the activity forward through f.

Parameters
inputInput data used for evaluating the specified function.
outputResulting output activation.
cellStateCell state of the LSTM.
useCellStateUse the cellState passed in the LSTM cell.

◆ Gradient() [1/3]

OutputDataType & Gradient ( )
inline

Modify the gradient.

Definition at line 166 of file lstm.hpp.

◆ Gradient() [2/3]

OutputDataType const & Gradient ( ) const
inline

Get the gradient.

Definition at line 164 of file lstm.hpp.

◆ Gradient() [3/3]

void Gradient ( const InputType &  input,
const ErrorType &  error,
GradientType &  gradient 
)

◆ InSize()

size_t InSize ( ) const
inline

Get the number of input units.

Definition at line 169 of file lstm.hpp.

◆ OutputParameter() [1/2]

OutputDataType & OutputParameter ( )
inline

Modify the output parameter.

Definition at line 156 of file lstm.hpp.

◆ OutputParameter() [2/2]

OutputDataType const & OutputParameter ( ) const
inline

Get the output parameter.

Definition at line 154 of file lstm.hpp.

◆ OutSize()

size_t OutSize ( ) const
inline

Get the number of output units.

Definition at line 172 of file lstm.hpp.

◆ Parameters() [1/2]

OutputDataType & Parameters ( )
inline

Modify the parameters.

Definition at line 151 of file lstm.hpp.

◆ Parameters() [2/2]

OutputDataType const & Parameters ( ) const
inline

Get the parameters.

Definition at line 149 of file lstm.hpp.

◆ Reset()

void Reset ( )

◆ ResetCell()

void ResetCell ( const size_t  size)

◆ Rho() [1/2]

size_t & Rho ( )
inline

Modify the maximum number of steps to backpropagate through time (BPTT).

Definition at line 146 of file lstm.hpp.

◆ Rho() [2/2]

size_t Rho ( ) const
inline

Get the maximum number of steps to backpropagate through time (BPTT).

Definition at line 144 of file lstm.hpp.

◆ serialize()

void serialize ( Archive &  ar,
const unsigned int   
)

Serialize the layer.


The documentation for this class was generated from the following files: