Developer Reference for Intel® oneAPI Math Kernel Library for Fortran

ID 766686
Date 6/24/2024
Public

A newer version of this document is available. Customers should click here to go to the newest version.

Document Table of Contents

?larrv2

Computes the eigenvectors of the tridiagonal matrix T = L*D*LT given L, D and the eigenvalues of L*D*LT.

Syntax

call slarrv2( n, vl, vu, d, l, pivmin, isplit, m, dol, dou, needil, neediu, minrgp, rtol1, rtol2, w, werr, wgap, iblock, indexw, gers, sdiam, z, ldz, isuppz, work, iwork, vstart, finish, maxcls, ndepth, parity, zoffset, info )

call dlarrv2( n, vl, vu, d, l, pivmin, isplit, m, dol, dou, needil, neediu, minrgp, rtol1, rtol2, w, werr, wgap, iblock, indexw, gers, sdiam, z, ldz, isuppz, work, iwork, vstart, finish, maxcls, ndepth, parity, zoffset, info )

Description

?larrv2 computes the eigenvectors of the tridiagonal matrix T = LDLT given L, D and approximations to the eigenvalues of LDLT. The input eigenvalues should have been computed by larre2a or by previous calls to ?larrv2.

The major difference between the parallel and the sequential construction of the representation tree is that in the parallel case, not all eigenvalues of a given cluster might be computed locally. Other processors might "own" and refine part of an eigenvalue cluster. This is crucial for scalability. Thus there might be communication necessary before the current level of the representation tree can be parsed.

Please note:

  • The calling sequence has two additional integer parameters, dol and dou, that should satisfy mdoudol1. These parameters are only relevant when both eigenvalues and eigenvectors are computed (stegr2b parameter jobz = 'V'). ?larrv2 only computes the eigenvectors corresponding to eigenvalues dol through dou in w. (That is, instead of computing the eigenvectors belonging to w(1) through w(m), only the eigenvectors belonging to eigenvalues w(dol) through w(dou) are computed. In this case, only the eigenvalues dol:dou are guaranteed to be accurately refined to all figures by Rayleigh-Quotient iteration.

  • The additional arguments vstart, finish, ndepth, parity, zoffset are included as a thread-safe implementation equivalent to save variables. These variables store details about the local representation tree which is computed layerwise. For scalability reasons, eigenvalues belonging to the locally relevant representation tree might be computed on other processors. These need to be communicated before the inspection of the RRRs can proceed on any given layer. Note that only when the variable finish is true, the computation has ended. All eigenpairs between dol and dou have been computed. m is set to dou - dol + 1.

  • ?larrv2 needs more workspace in z than the sequential slarrv. It is used to store the conformal embedding of the local representation tree.

Product and Performance Information

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.

Notice revision #20201201

Input Parameters

n

INTEGER

The order of the matrix. n 0.

vl, vu

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Lower and upper bounds of the interval that contains the desired eigenvalues. vl < vu. Needed to compute gaps on the left or right end of the extremal eigenvalues in the desired range. vu is currently not used but kept as parameter in case needed.

d

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size n

The n diagonal elements of the diagonal matrix d. On exit, d is overwritten.

l

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size n

The (n-1) subdiagonal elements of the unit bidiagonal matrix L are in elements 1 to n-1 of l (if the matrix is not split.) At the end of each block is stored the corresponding shift as given by ?larre. On exit, l is overwritten.

pivmin

DOUBLE PRECISION

The minimum pivot allowed in the sturm sequence.

isplit

INTEGER

Array of size n

The splitting points, at which T breaks up into blocks. The first block consists of rows/columns 1 to isplit( 1 ), the second of rows/columns isplit( 1 ) + 1 through isplit( 2 ), etc.

m

INTEGER

The total number of input eigenvalues. 0 mn.

dol, dou

INTEGER

If you want to compute only selected eigenvectors from all the eigenvalues supplied, you can specify an index range dol:dou. Or else the setting dol=1, dou=m should be applied. Note that dol and dou refer to the order in which the eigenvalues are stored in w. If you want to compute only selected eigenpairs, the columns dol-1 to dou+1 of the eigenvector space z contain the computed eigenvectors. All other columns of z are set to zero.

If dol > 1, then z(:,dol-1-zoffset) is used.

If dou < m, then z(:,dou+1-zoffset) is used.

needil, neediu

INTEGER

Describe which are the left and right outermost eigenvalues that still need to be included in the computation. These indices indicate whether eigenvalues from other processors are needed to correctly compute the conformally embedded representation tree.

When dolneedilneediudou, all required eigenvalues are local to the processor and no communication is required to compute its part of the representation tree.

minrgp

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

The minimum relative gap threshold to decide whether an eigenvalue or a cluster boundary is reached.

rtol1, rtol2

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Parameters for bisection. An interval [left,right] has converged if right-left < max( rtol1*gap, rtol2*max(|left|,|right|) )

w

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size n

The first m elements of w contain the approximate eigenvalues for which eigenvectors are to be computed. The eigenvalues should be grouped by split-off block and ordered from smallest to largest within the block. (The output array w from ?stegr2a is expected here.) Furthermore, they are with respect to the shift of the corresponding root representation for their block.

werr

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size n

The first m elements contain the semiwidth of the uncertainty interval of the corresponding eigenvalue in w.

wgap

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size n

The separation from the right neighbor eigenvalue in w.

iblock

INTEGERArray of size n

The indices of the blocks (submatrices) associated with the corresponding eigenvalues in w; iblock(i)=1 if eigenvalue w(i) belongs to the first block from the top, iblock(i)=2 if w(i) belongs to the second block, and so on.

indexw

INTEGERArray of size n

The indices of the eigenvalues within each block (submatrix). For example: indexw(i)= 10 and iblock(i)=2 imply that the i-th eigenvalue w(i) is the 10th eigenvalue in block 2.

gers

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size 2*n

The n Gerschgorin intervals (the i-th Gerschgorin interval is (gers(2*i-1), gers(2*i))). The Gerschgorin intervals should be computed from the original unshifted matrix.

Not used but kept as parameter for possible future use.

sdiam

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array of size n

The spectral diameters for all unreduced blocks.

ldz

INTEGER

The leading dimension of the array z. ldz 1, and if stegr2b parameter jobz = 'V', ldz max(1,n).

work

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

(workspace) array of size 12*n

iwork

(workspace) INTEGERArray of size 7*n

vstart

LOGICAL

.TRUE. on initialization, set to .FALSE. afterwards.

finish

LOGICAL

A flag that indicates whether all eigenpairs have been computed.

maxcls

INTEGER

The largest cluster worked on by this processor in the representation tree.

ndepth

INTEGER

The current depth of the representation tree. Set to zero on initial pass, changed when the deeper levels of the representation tree are generated.

parity

INTEGER

An internal parameter needed for the storage of the clusters on the current level of the representation tree.

zoffset

INTEGER

Offset for storing the eigenpairs when z is distributed in 1D-cyclic fashion.

OUTPUT Parameters

needil, neediu

w

Unshifted eigenvalues for which eigenvectors have already been computed.

werr

Contains refined values of its input approximations.

wgap

Contains refined values of its input approximations. Very small gaps are changed.

z

REAL for slarrv2

DOUBLE PRECISION for dlarrv2

Array, dimension (ldz, max(1,m) )

If info = 0, the first m columns of z contain the orthonormal eigenvectors of the matrix T corresponding to the input eigenvalues, with the i-th column of z holding the eigenvector associated with w(i).

In the distributed version, only a subset of columns is accessed, see dol, dou, and zoffset.

isuppz

INTEGER

Array of size 2*max(1,m)

The support of the eigenvectors in z, i.e., the indices indicating the non-zero elements in z. The i-th eigenvector is non-zero only in elements isuppz( 2*i-1 ) through isuppz( 2*i ).

vstart

.TRUE. on initialization, set to .FALSE. afterwards.

finish

A flag that indicates whether all eigenpairs have been computed.

maxcls

The largest cluster worked on by this processor in the representation tree.

ndepth

The current depth of the representation tree. Set to zero on initial pass, changed when the deeper levels of the representation tree are generated.

parity

An internal parameter needed for the storage of the clusters on the current level of the representation tree.

info

INTEGER

= 0: successful exit

> 0: A problem occured in ?larrv2.

< 0: One of the called subroutines signaled an internal problem.

Needs inspection of the corresponding parameter info for further information.

=-1: Problem in ?larrb2 when refining a child's eigenvalues.

=-2: Problem in ?larrf2 when computing the RRR of a child. When a child is inside a tight cluster, it can be difficult to find an RRR. A partial remedy from the user's point of view is to make the parameter minrgp smaller and recompile. However, as the orthogonality of the computed vectors is proportional to 1/minrgp, be aware that decreasing minrgp might be reduce precision.

=-3: Problem in ?larrb2 when refining a single eigenvalue after the Rayleigh correction was rejected.

= 5: The Rayleigh Quotient Iteration failed to converge to full accuracy.

See Also