gpucoder.batchedMatrixMultiplyAdd
Optimized GPU implementation of batched matrix multiply with add operation
Since R2020a
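Syntax
[D1,D2] = gpucoder.batchedMatrixMultiplyAdd(A1,B1,C1,A2,B2,C2)
___ = gpucoder.batchedMatrixMultiplyAdd(___,Name,Value)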
Description
[D1,D2] = gpucoder.batchedMatrixMultiplyAdd(A1,B1,C1,A2,B2,C2) performs matrix-matrix multiplication and addition on a batch of matrices A1,B1,C1 and A2,B2,C2. For each instance in the batch, the gpucoder.batchedMatrixMultiplyAdd function computes an expression of the form:
D = αAB + βC
where α and β are scalar multiplication factors and A, B, C, and D are matrices with dimensions m-by-k, k-by-n, m-by-n, and m-by-n, respectively. A and B can optionally be transposed or Hermitian-conjugated. By default, α and β are set to one and the matrices are not transposed. To specify a different scalar multiplication factor or to perform transpose operations on the input matrices, use the Name,Value pair arguments.
All the batches passed to the gpucoder.batchedMatrixMultiplyAdd function must be uniform. That is, all instances must have the same dimensions m, n, and k.
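The default behavior described above can be illustrated with a plain-MATLAB reference computation. This is a minimal sketch, not the GPU implementation: it assumes the default scalar factors of one and no transposition, and the matrix values and the variable names m, k, and n are chosen only for illustration.

```matlab
% Reference computation for a uniform batch of two instances.
% Assumption: default scalar factors (one) and no transposition.
m = 2; k = 3; n = 2;             % every instance shares m, k, and n

A1 = ones(m,k);   B1 = ones(k,n); C1 = eye(m,n);
A2 = 2*ones(m,k); B2 = ones(k,n); C2 = zeros(m,n);

% Uniformity check: all instances must have matching dimensions.
assert(isequal(size(A1),size(A2)) && isequal(size(B1),size(B2)) ...
    && isequal(size(C1),size(C2)));

% Each batch instance is an independent multiply-add.
D1 = A1*B1 + C1;
D2 = A2*B2 + C2;

% With GPU Coder available, the documented six-input syntax is:
% [D1,D2] = gpucoder.batchedMatrixMultiplyAdd(A1,B1,C1,A2,B2,C2);
```

Each output has the same m-by-n size as the corresponding C input, which is why the instances can be processed as one uniform batch.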
___ = gpucoder.batchedMatrixMultiplyAdd(___,Name,Value) performs the batched matrix multiply and add operation by using the options specified by one or more Name,Value pair arguments.
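To show what non-default scalar factors and a transposed input mean numerically, the following sketch computes one batch instance in plain MATLAB. The option names are not listed in this section, so the sketch uses ordinary MATLAB operators instead of guessing at them. The variables alpha and beta and all matrix values are illustrative assumptions.

```matlab
% Reference computation for one instance with scaling and a transposed A.
% Assumption: alpha and beta are hypothetical names for the two scalar
% factors; the transpose is applied to A before the multiplication.
alpha = 2;
beta  = 0.5;

A1 = [1 2; 3 4; 5 6];   % stored as k-by-m, so A1.' is m-by-k
B1 = ones(3,2);         % k-by-n
C1 = 2*eye(2);          % m-by-n

% Non-conjugate transpose of A; for complex data, A1' would give the
% Hermitian (conjugate) transpose instead.
D1 = alpha*(A1.'*B1) + beta*C1;
```

When an input is transposed, the inner dimension k must match after the transpose is applied, so A1 is stored here as k-by-m rather than m-by-k.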
Examples
Input Arguments
Output Arguments
Version History
Introduced in R2020a
See Also
Apps
Functions
codegen | coder.gpu.kernel | coder.gpu.kernelfun | gpucoder.stridedMatrixMultiply | gpucoder.stridedMatrixMultiplyAdd | gpucoder.batchedMatrixMultiply