How to use LSTM to solve seq2seq problem in MATLAB?

Question

YP 2024-9-5

https://ww2.mathworks.cn/matlabcentral/answers/2150234-how-to-use-lstm-to-solve-seq2seq-problem-in-matlab

Commented: Subhajyoti 2024-9-13

I'm struggling with a seq2seq problem: given 200 past values as input, I want an LSTM network to predict the next 10 values in a single pass (without using closed-loop forecasting). I tried the code shown below, but it reports an error: "Invalid training data. For regression tasks, responses must be a vector, a matrix, or a 4-D array of real numeric responses. Responses must not contain NaNs."

inputSize = 5;        % Number of input features
numTimeSteps = 200;   % Number of time steps
numSamples = 100;     % Number of samples
% Create random input data: a cell array with one 5-by-200 matrix per sample
X = cell(numSamples, 1);
for i = 1:numSamples
    X{i} = rand(inputSize, numTimeSteps);
end
outputSize = 1;       % Number of output features for each time step
outputTimeSteps = 10; % Number of output time steps
numSamples = 100;     % Number of samples
% Create random output data: a cell array with one 1-by-10 matrix per sample
Y = cell(numSamples, 1);
for i = 1:numSamples
    Y{i} = rand(outputSize, outputTimeSteps);
end
numHiddenUnits1 = 128; % Number of hidden units in the first LSTM layer
numHiddenUnits2 = 64;  % Number of hidden units in the second LSTM layer
outputSize = 1;        % Number of output features
layers = [
    sequenceInputLayer(inputSize)                 % Input layer, number of input features is 5
    lstmLayer(numHiddenUnits1, 'OutputMode', 'last') % First LSTM layer, outputs the last time step
    fullyConnectedLayer(numHiddenUnits2)          % Fully connected layer, transforms the input dimensions for the second LSTM layer
    functionLayer(@(X) repmat(X, [1, outputTimeSteps]), 'Name', 'replicate10') % Expand the output of the last time step to 10 time steps
    lstmLayer(numHiddenUnits2, 'OutputMode', 'sequence') % Second LSTM layer, outputs a sequence of 10 time steps
    fullyConnectedLayer(outputSize)               % Fully connected layer, number of output features is 1
    regressionLayer];                             % Regression layer, used for regression tasks
        
% Training options
options = trainingOptions('adam', ...
    'MaxEpochs', 50, ...
    'MiniBatchSize', 32, ...
    'Shuffle', 'every-epoch', ...
    'Verbose', false);
% Train the network
net = trainNetwork(X, Y, layers, options);

Now I have two questions:

(1) What data format should be used for the input and output of the model during training? Cell arrays or 3D arrays?

(2) How to control the time expansion steps of the input and output for each layer (input layer, LSTM layer, fully connected layer) in an LSTM network?

Answer 1

Subhajyoti 2024-9-9

https://ww2.mathworks.cn/matlabcentral/answers/2150234-how-to-use-lstm-to-solve-seq2seq-problem-in-matlab#answer_1513314

Hi @YP,

For sequence data in Deep Learning Toolbox, cell arrays are used to manage input and output data: one cell per sample, each holding a numFeatures-by-numTimeSteps matrix.
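As a concrete illustration of the data formats, here is a minimal sketch that trains without the reported error. It rests on an assumption not confirmed in this thread: because the first LSTM layer uses 'OutputMode','last', trainNetwork treats the task as sequence-to-one regression and expects the responses as a numeric numSamples-by-numResponses matrix rather than a cell array. The sketch also swaps the two-LSTM design from the question for a simpler one in which a single fully connected layer emits all 10 future values at once. The names Ymat and YPred and the reduced epoch count are illustrative choices.

```matlab
inputSize = 5;         % number of input features
numTimeSteps = 200;    % past time steps per sample
outputTimeSteps = 10;  % future values predicted in one pass
numSamples = 100;

% Predictors: numSamples-by-1 cell array of inputSize-by-numTimeSteps matrices
X = cell(numSamples, 1);
Y = cell(numSamples, 1);
for i = 1:numSamples
    X{i} = rand(inputSize, numTimeSteps);
    Y{i} = rand(1, outputTimeSteps);
end

% Assumption: stack the 1-by-10 responses into a numSamples-by-10 matrix,
% the layout used for sequence-to-one regression
Ymat = cell2mat(Y);

layers = [
    sequenceInputLayer(inputSize)
    lstmLayer(128, 'OutputMode', 'last')   % keep only the final time step
    fullyConnectedLayer(outputTimeSteps)   % one output per future value
    regressionLayer];

options = trainingOptions('adam', ...
    'MaxEpochs', 5, ...                    % kept small for the sketch
    'MiniBatchSize', 32, ...
    'Verbose', false);

net = trainNetwork(X, Ymat, layers, options);
YPred = predict(net, X);                   % one row of 10 values per sample
```

Each row of YPred holds the 10 predicted values for one sample, so no closed-loop forecasting is involved.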

You can also use a 'dlarray' object to handle data in deep learning tasks. It is well integrated with MATLAB's Deep Learning Toolbox, which makes complex data structures easier to work with. It stores data with optional data format labels for custom training loops and lets functions compute and use derivatives through automatic differentiation.

You can control the time steps at each layer through the time dimension, which is labeled "T" in the 'fmt' (data format) input argument of 'dlarray()'.
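A short sketch of what the "T" label looks like in practice. The sizes and variable names (data, dlX, dlLast) are assumptions chosen for illustration: 5 channels, a batch of 32 observations, and 200 time steps.

```matlab
% Assumed layout: 5 features (C), 32 observations (B), 200 time steps (T)
data = rand(5, 32, 200);
dlX = dlarray(data, "CBT");

fmt = dims(dlX);             % data format labels of the array
tDim = finddim(dlX, "T");    % index of the time dimension
numSteps = size(dlX, tDim);  % number of time steps

% Indexing along the T dimension keeps only the last 10 time steps
dlLast = dlX(:, :, end-9:end);
```

Here the number of time steps is a property of the data rather than a layer setting, which is why it is read from the array instead of being configured.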

Refer to the following MathWorks documentation page to learn more about 'dlarray' in MATLAB:

https://www.mathworks.com/help/deeplearning/ref/dlarray.html

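The error message also states that responses must not contain NaNs, so check for NaNs before training the network. You can use the following snippet to check both the inputs and the responses:

% Check every sample of the inputs and the responses for NaNs
inputHasNaNs = any(cellfun(@(x) any(isnan(x), 'all'), X));
responseHasNaNs = any(cellfun(@(y) any(isnan(y), 'all'), Y));
if inputHasNaNs || responseHasNaNs
    error('Training data contains NaNs.');
end

The above snippet throws an error if NaNs are detected, ensuring that training only begins after the data is cleaned. Each check is reduced to a single logical value so that the 'if' condition fires when even one element is NaN.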
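To see which samples are affected rather than only whether NaNs exist, a small sketch along these lines may help. The toy data and the names Xdemo and badIdx are hypothetical, and a NaN is planted deliberately so the result can be inspected.

```matlab
% Toy data: 4 samples, each a 5-by-200 matrix (sizes assumed for illustration)
Xdemo = cell(4, 1);
for i = 1:4
    Xdemo{i} = rand(5, 200);
end
Xdemo{2}(3, 17) = NaN;   % plant one NaN in sample 2

% Logical flag per sample, then the indices of the affected samples
hasNaN = cellfun(@(x) any(isnan(x), 'all'), Xdemo);
badIdx = find(hasNaN);

% One possible cleanup: drop the affected samples
Xclean = Xdemo(~hasNaN);
```

Dropping samples is only one option. If the responses are stored in a matching cell array, the same logical mask would need to be applied to them so that predictors and responses stay aligned.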

Additionally, you can refer to the following resource to learn more about 'Sequence-to-Sequence Regression Using Deep Learning' in MATLAB:

https://www.mathworks.com/help/deeplearning/ug/sequence-to-sequence-regression-using-deep-learning.html

YP 2024-9-13

Thanks, Subhajyoti. Your advice helps me a lot.

Subhajyoti 2024-9-13

I'm glad to hear that the solution helped you!

You can accept the answer if you think it will also help others.
