?gesvda_batch

Developer Reference for Intel® oneAPI Math Kernel Library for Fortran

Download PDF

ID 766686

Date 6/24/2024

Version

Public

A newer version of this document is available. Customers should click here to go to the newest version.

Visible to Intel only — GUID: GUID-985AE4B9-55A8-406A-AF76-E644BECA466A

View Details

?gesvda_batch_strided

Computes the truncated SVD of a group of general m-by-n matrices that are stored at a constant stride from each other in a contiguous block of memory.

Syntax

call sgesvda_batch_strided (iparm, irank, m, n, A, lda, stride_a, S, stride_s, U, ldu, stride_u, Vt, ldvt, stride_vt, tolerance, residual, work, lwork, batch_size, info )

call dgesvda_batch_strided (iparm, irank, m, n, A, lda, stride_a, S, stride_s, U, ldu, stride_u, Vt, ldvt, stride_vt, tolerance, residual, work, lwork, batch_size, info )

call cgesvda_batch_strided (iparm, irank, m, n, A, lda, stride_a, S, stride_s, U, ldu, stride_u, Vt, ldvt, stride_vt, tolerance, residual, work, lwork, batch_size, info )

call zgesvda_batch_strided (iparm, irank, m, n, A, lda, stride_a, S, stride_s, U, ldu, stride_u, Vt, ldvt, stride_vt, tolerance, residual, work, lwork, batch_size, info )

Include Files

mkl.fi

Description

The ?gesvda_batch_strided routines compute the truncated SVD for a group of general m-by-n matrices.

All matrices have the same parameters (matrix size, leading dimension) and are stored at constant stride_a from each other in a contiguous block of memory. The operation is defined as

for i = 0 … batch_size-1
    A_i is a matrix at offset i * stride_a from A
    A_i := U_i * S_i*V_i^T
    A_i := U_i * S_i *
end for

where Ui and Vi are orthogonal matrices, and Si is a diagonal matrix with singular values on the diagonal. Singular values are nonnegative and listed in decreasing order. A truncated SVD of a given mxn matrix produces matrices with the specified number of columns, where the number of columns is defined by the user or determined at runtime with the help of the user-defined tolerance threshold.

An approximation of each matrix can be also obtained as a product of two low-rank matrices (low-rank product):

A_i=P_i×Q_i

where P_i=U_i×S_i , Q_i=V_i^T if m≥n, and P_i=U_i , Q_i=S_i × V_i^T otherwise.

The routines provide three possible ways to compute truncated SVD:

Compute truncated SVD with the help of the input array rank where rank(i) specifies the number of singular values and vectors to be computed in parameters U_i ,V_i and S_i for each matrix A_i.
Compute truncated SVD using a tolerance threshold. While computing SVD, singular values that are less than the user-defined tolerance are treated as zero, and they are not computed but set to zero.
Compute truncated SVD using the effective rank. The effective rank of A is determined by treating as zero those singular values that are less than the user-defined tolerance threshold times the largest singular value.

The routines can be also used for computing singular values only.

Input Parameters

iparm

INTEGER. Array of dimension 16 specifying options to compute truncated SVD. Also specifies the type of returned SVD decomposition form. The individual components of the iparm parameter appear below. Default values are denoted with an asterisk (*).

iparm(1)

Specifies a criterion for treating singular values as zeros.

`-1`	Use default `iparm` values (`iparm(1:3)=0`, `iparm(4)=1`) . All other `iparm` settings are ignored.
`= 0`*	Computes the truncated SVD with the help of the input array `irank`.
`= 1`	Computes the truncated SVD using the parameter `tolerance`.
`= 2`	Computes the truncated SVD using the effective rank. The effective rank of A is determined by treating as zero those singular values that are less than the user-defined tolerance multiplied by the largest singular value.

iparm(2)

Specifies the option for computing singular vectors.

`0`*	Both singular values and singular vectors are computed.
`1`	Only singular values are computed.

iparm(3)

Specifies the type of the returned SVD decomposition.

0*

Computes the truncated SVD as a product of three matrices:

A_i=U_i×S_i×V_i^T

1

Computes the truncated SVD as a low-rank product:

A_i=P_i×Q_i

iparm(4)

Specifies the option for computing the residual vector.

`0`	The residual vector is not computed.
`1`*	Computes the residual vector.

NOTE:

iparm(5)–iparm(16) are reserved for future use.

irank

INTEGER. Array of size at least batch_size. If iparm(1)=0 or iparm(1)=-1, element irank(i) specifies the number of singular values and/or singular vectors to be computed in U_i , V_i^T, and S_i for each matrix A_i.

m

INTEGER. The number of rows in the matrices A_i (m ≥ 0).

n

INTEGER. The number of columns in the matrices A_i (n ≥ 0).

A

REAL for sgesvda_batch_strided.

DOUBLE PRECISION for dgesvda_batch_strided.

COMPLEX for cgesvda_batch_strided.

DOUBLE COMPLEX for zgesvda_batch_strided.

Array of size at least stride_a * batch_size holding input matrices A_i.

lda

INTEGER. Specifies the leading dimension of the A_i matrices: lda ≥ max(1, m).

strde_a

INTEGER. Stride between two consecutive A_i matrices: stride_a ≥max(1, lda * n).

stride_s

INTEGER. The stride between two consecutive S_i matrices: stride_s ≥ max(1, min(m,n)).

ldu

INTEGER. Specifies the leading dimension of the U_i matrices: ldu ≥ max(1, m).

stride_u

INTEGER. The stride between two consecutive U_i matrices: stride_u ≥ max(1, ldu * m).

ldvt

INTEGER. Specifies the leading dimension of the V_i^T matrices: ldvt ≥ max(1, n).

stride_vt

INTEGER. The stride between two consecutive V_i^T matrices: stride_vt ≥ max(1, ldvt * n).

tolerance

REAL for single precision flavors.

DOUBLE PRECISION for double precision flavors.

Specifies the tolerance threshold for computing truncated SVD in the cases of iparm(1)=1 and iparm(1)=2. Not used otherwise.

batch_size

INTEGER. The number of problems in a batch. Must be at least 0.

work

REAL for sgesvda_batch_strided.

DOUBLE PRECISION for dgesvda_batch_strided.

COMPLEX for cgesvda_batch_strided.

DOUBLE COMPLEX for zgesvda_batch_strided.

Workspace array with dimension max(1, lwork).

lwork

INTEGER. The dimension of the array work.

If lwork = -1, a workspace query is assumed: the routine only calculates the optimal size of the work array and returns this value as the first entry of the work array, and no error message related to lwork is issued by xerbla. If lwork is less than the required minimum size but is positive, the routine internally allocates the needed memory.

Output Parameters

irank	On exit, if `iparm(1)=1` or `iparm(1)=2`, element `irank(i)` is the number of computed singular values and/or singular vectors for matrix A_i.
A	Unchanged on exit if the residual vector is not required. Otherwise, contains the residual matrix A_i:=A_i- U_i×S_i×V_i^T if `iparm(3)=0`, and A_i:=A_i-P_i×Q_i otherwise.
S	REAL for single precision flavors. DOUBLE PRECISION for double precision flavors. Array of size at least `min(m,n)*batch_size` to store a batch of singular values S_i.
U	REAL for sgesvda_batch_strided. DOUBLE PRECISION for dgesvda_batch_strided. COMPLEX for cgesvda_batch_strided. DOUBLE COMPLEX for zgesvda_batch_strided. Array of size at least `stride_u*batch_size` to store a batch of U_i if `iparm(3)=0`, or to store a batch of P_i if `iparm(3)=1`.
Vt	REAL for sgesvda_batch_strided. DOUBLE PRECISION for dgesvda_batch_strided. COMPLEX for cgesvda_batch_strided. DOUBLE COMPLEX for zgesvda_batch_strided. Array of size at least `stride_vt*batch_size` to store a batch of V_i^T if `iparm(3)=0`, or to store a batch of Q_i if `iparm(3)=1`.
residual	REAL for single precision flavors. DOUBLE PRECISION for double precision flavors. Array of dimension `batch_size`. If `iparm(4)=1`, `residual(i)` is the Frobenius norm of the matrix `\|\|A_i - U_i×S_i×V_i^T\|\|` if `iparm(3)=0`, and `\|\|A_i - P_i×Q_i\|\|` if `iparm(3)=1`.
info	INTEGER. Array of size at least `batch_size`, which reports the status for each matrix. If `info(i) = 0`, the execution is successful for A_i. If `info(1) = -j`, the j-th parameter had an illegal value. If `info(1) = 1,` an internal memory allocation failed. If `info(i) = 2`, an input parameter contains an invalid value. If `info(i) = 3`, an error in algorithm while computing singular values of A_i occurred. If `info(1) = 4`, the routine encountered an empty structure or matrix array.

Parent topic: Singular Value Decomposition: LAPACK Driver Routines

Select Your Language

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in