The CUDA driver API starting version 6.5 offers a set of runtime functions to calculate several occupancy-related measures, as a replacement for the occupancy calculator spreadsheet. This patch adds a heuristic for default runtime launch geometry, based on the new runtime function cuOccupancyMaxPotentialBlockSize. Build on x86_64 with nvptx accelerator and ran libgomp testsuite. 2018-08-13 Cesar Philippidis <cesar@codesourcery.com> Tom de Vries <tdevries@suse.de> PR target/85590 * plugin/cuda/cuda.h (CUoccupancyB2DSize): New typedef. (cuOccupancyMaxPotentialBlockSize): Declare. * plugin/cuda-lib.def (cuOccupancyMaxPotentialBlockSize): New CUDA_ONE_CALL_MAYBE_NULL. * plugin/plugin-nvptx.c (CUDA_VERSION < 6050): Define CUoccupancyB2DSize and declare cuOccupancyMaxPotentialBlockSize. (nvptx_exec): Use cuOccupancyMaxPotentialBlockSize to set the default num_gangs and num_workers when the driver supports it. Co-Authored-By: Tom de Vries <tdevries@suse.de> From-SVN: r263505
Name |
Last commit
|
Last update |
---|---|---|
.. | ||
config | Loading commit data... | |
plugin | Loading commit data... | |
testsuite | Loading commit data... | |
ChangeLog | Loading commit data... | |
ChangeLog.graphite | Loading commit data... | |
Makefile.am | Loading commit data... | |
Makefile.in | Loading commit data... | |
acinclude.m4 | Loading commit data... | |
aclocal.m4 | Loading commit data... | |
affinity.c | Loading commit data... | |
alloc.c | Loading commit data... | |
atomic.c | Loading commit data... | |
barrier.c | Loading commit data... | |
config.h.in | Loading commit data... | |
configure | Loading commit data... | |
configure.ac | Loading commit data... | |
configure.tgt | Loading commit data... | |
critical.c | Loading commit data... | |
env.c | Loading commit data... | |
error.c | Loading commit data... | |
fortran.c | Loading commit data... | |
hashtab.h | Loading commit data... | |
icv-device.c | Loading commit data... | |
icv.c | Loading commit data... | |
iter.c | Loading commit data... | |
iter_ull.c | Loading commit data... | |
libgomp-plugin.c | Loading commit data... | |
libgomp-plugin.h | Loading commit data... | |
libgomp.h | Loading commit data... | |
libgomp.map | Loading commit data... | |
libgomp.spec.in | Loading commit data... | |
libgomp.texi | Loading commit data... | |
libgomp_f.h.in | Loading commit data... | |
libgomp_g.h | Loading commit data... | |
lock.c | Loading commit data... | |
loop.c | Loading commit data... | |
loop_ull.c | Loading commit data... | |
oacc-async.c | Loading commit data... | |
oacc-cuda.c | Loading commit data... | |
oacc-host.c | Loading commit data... | |
oacc-init.c | Loading commit data... | |
oacc-int.h | Loading commit data... | |
oacc-mem.c | Loading commit data... | |
oacc-parallel.c | Loading commit data... | |
oacc-plugin.c | Loading commit data... | |
oacc-plugin.h | Loading commit data... | |
omp.h.in | Loading commit data... | |
omp_lib.f90.in | Loading commit data... | |
omp_lib.h.in | Loading commit data... | |
openacc.f90 | Loading commit data... | |
openacc.h | Loading commit data... | |
openacc_lib.h | Loading commit data... | |
ordered.c | Loading commit data... | |
parallel.c | Loading commit data... | |
priority_queue.c | Loading commit data... | |
priority_queue.h | Loading commit data... | |
sections.c | Loading commit data... | |
secure_getenv.h | Loading commit data... | |
single.c | Loading commit data... | |
splay-tree.c | Loading commit data... | |
splay-tree.h | Loading commit data... | |
target.c | Loading commit data... | |
task.c | Loading commit data... | |
taskloop.c | Loading commit data... | |
team.c | Loading commit data... | |
work.c | Loading commit data... |