Visible to Intel only — GUID: GUID-3A086451-4C82-4BB1-B742-FF93EBF60DA3
Visible to Intel only — GUID: GUID-3A086451-4C82-4BB1-B742-FF93EBF60DA3
prefetch/noprefetch
Invites the compiler to issue or disable requests to prefetch data from memory. This pragma applies only to Intel® Advanced Vector Extensions 512 (Intel® AVX-512).
Syntax
#pragma prefetch #pragma prefetch *:hint[:distance] #pragma prefetch [var1 [: hint1 [: distance1]] [, var2 [: hint2 [: distance2]]]...] #pragma noprefetch [var1 [, var2]...] |
Arguments
var |
An optional memory reference (data to be prefetched) |
hint |
An optional hint to the compiler to specify the type of prefetch. Possible values:
To use this argument, you must also specify var. |
distance |
An optional integer argument with a value greater than 0. It indicates the number of loop iterations ahead of which a prefetch is issued, before the corresponding load or store instruction. To use this argument, you must also specify var and hint. |
Description
The prefetch pragma hints to the compiler to generate data prefetches for some memory references. These hints affect the heuristics used in the compiler. Prefetching data can minimize the effects of memory latency.
If you specify the prefetch pragma with no arguments, all arrays accessed in the immediately following loop are prefetched.
If the loop includes the expression A(j), placing #pragma prefetch A in front of the loop instructs the compiler to insert prefetches for A(j + d) within the loop. Here, d is the number of iterations ahead of which to prefetch the data, and is determined by the compiler.
If you specify #pragma prefetch *, then hint and distance prefetches all array accesses in the loop.
To use these pragmas, compiler option [Q]opt-prefetch must be set (it is turned on by default if the compiler general optimization level is set at option O2 or higher).
The noprefetch pragma hints to the compiler not to generate data prefetches for some memory references. This affects the heuristics used in the compiler.
Examples
Use the prefetch pragma:
#pragma prefetch htab_p:1:30
#pragma prefetch htab_p:0:6
// Issue vprefetch1 for htab_p with a distance of 30 vectorized iterations ahead
// Issue vprefetch0 for htab_p with a distance of 6 vectorized iterations ahead
// If pragmas are not present, compiler chooses both distance values
for (j=0; j<2*N; j++) { htab_p[i*m1 + j] = -1; }
Use noprefetch and prefetch pragmas together:
#pragma noprefetch b
#pragma prefetch a
for(i=0; i<m; i++) { a[i]=b[i]+1; }
Use noprefetch and prefetch pragmas together:
for (i=i0; i!=i1; i+=is) {
float sum = b[i];
int ip = srow[i];
int c = col[ip];
#pragma noprefetch col
#pragma prefetch value:1:80
#pragma prefetch x:1:40
for(; ip<srow[i+1]; c=col[++ip])
sum -= value[ip] * x[c];
y[i] = sum;
}