To integrate "CNN" and "LSTM" blocks together, you should apply the "CNN" layer to each input in the sequence. The extracted feature maps can then be passed onto "BLSTM" or "LSTM" layers for sequential processing. Here's an existing example in R2024b that demonstrates this:
openExample("deeplearning_shared/SequenceClassificationUsingCNNLSTMNetworkExample")
For more information, check out the "Define 2-D CNN LSTM Architecture" section at the following link: