Main Content

Implement Hardware-Efficient Real Partial-Systolic Q-less QR Decomposition

This example shows how to implement a hardware-efficient Q-less QR decomposition using the Real Partial-Systolic Q-less QR Decomposition block.

Economy Size Q-less QR Decomposition

The Real Partial-Systolic Q-less QR Decomposition block performs the first step of solving the matrix equation A'AX = B which transforms A in-place to upper-triangular R, then solves the transformed system R'RX = B, where R'R = A'A.

Define Matrix Dimensions

Specify the number of rows and columns in matrix A.

m = 5;  % Number of rows in matrix A
n = 3;  % Number of columns in matrix A

Generate Matrix A

Use the helper function realUniformRandomArray to generate a random matrix A such that the elements of A are between -1 and +1, and A is full rank.

rng('default')
A = fixed.example.realUniformRandomArray(-1,1,m,n);

Select Fixed-Point Data Types

Use the helper function qlessqrFixedpointTypes to select fixed-point data types for matrix A that guarantee no overflow will occur in the transformation of A in-place to R.

max_abs_A = 1;        % Upper bound on max(abs(A(:))
precisionBits = 24;   % Number of bits of precision
T = fixed.qlessqrFixedpointTypes(m,max_abs_A,precisionBits);
A = cast(A,'like',T.A);

Open the Model

model = 'RealPartialSystolicQlessQRModel';
open_system(model);

AMBA AXI Handshaking Process

The Data Handler subsystem in this model takes real matrix A as input. It sends rows of A to the QR Decomposition block using the AMBA AXI handshake protocol. The validIn signal indicates when data is available. The ready signal indicates that the block can accept the data. Transfer of data occurs only when both the validIn and ready signals are high. You can set delay for the feeding in rows of A in the Data Handler to emulate the processing time of the upstream block. validOut signal of the Data Handler remain high when rowDelay is set to 0 because this indicates the Data Handler always has data available.

Set Variables in the Model Workspace

Use the helper function setModelWorkspace to add the variables defined above to the model workspace. These variables correspond to the block parameters for the Real Partial-Systolic Q-less QR Decomposition block.

numSamples = 1; % Number of sample matrices
rowDelay = 1; % Delay of clock cycles between feeding in rows of A
fixed.example.setModelWorkspace(model,'A',A,'m',m,'n',n,...
    'numSamples',numSamples,'rowDelay',rowDelay);

Simulate the Model

out = sim(model);

Construct the Solution from the Output Data

The Real Partial-Systolic QR Decomposition block outputs matrix R at each time step. When a valid result matrix is output, the block sets validOut to true.

R = out.R;

R is an upper-triangular matrix.

R
R = 

    1.5379    0.0432   -0.1395
         0    1.5978    0.4742
         0         0    1.5192

          DataTypeMode: Fixed-point: binary point scaling
            Signedness: Signed
            WordLength: 28
        FractionLength: 24
isequal(R,triu(R))
ans =

  logical

   1

Verify the Accuracy of the Output

To evaluate the accuracy of the Real Partial-Systolic Q-less QR Decomposition block, compute the relative error.

relative_error = norm(double(R'*R - A'*A))/norm(double(A'*A))
relative_error =

   9.4841e-07

Suppress mlint warnings.

%#ok<*NOPTS>