1. 13 Nov, 2019 27 commits
    • GCN Libgomp Plugin · 237957cc
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Kwok Cheung Yeung  <kcy@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      	    Tom de Vries  <tom@codesourcery.com>
      
      	libgomp/
      	* plugin/Makefrag.am: Add amdgcn plugin support.
      	* plugin/configfrag.ac: Likewise.
      	* plugin/plugin-gcn.c: New file.
      	* configure: Regenerate.
      	* Makefile.in: Regenerate.
      	* testsuite/Makefile.in: Regenerate.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      Co-Authored-By: Kwok Cheung Yeung <kcy@codesourcery.com>
      Co-Authored-By: Tom de Vries <tom@codesourcery.com>
      
      From-SVN: r278138
      Andrew Stubbs committed
    • Use a single worker for OpenACC on AMD GCN · fe22e0d4
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      
      	gcc/
      	* config/gcn/gcn.c (gcn_goacc_validate_dims): Ensure
      	flag_worker_partitioning is not set.
      	(TARGET_GOACC_WORKER_PARTITIONING): Remove target hook definition.
      	* config/gcn/gcn.opt (macc-experimental-workers): Default to off.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      
      From-SVN: r278137
      Andrew Stubbs committed
    • Optimize GCN OpenMP malloc performance · cee16451
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      
      	libgomp/
      	* config/gcn/team.c (gomp_gcn_enter_kernel): Set up the team arena
      	and use team_malloc variants.
      	(gomp_gcn_exit_kernel): Use team_free.
      	* libgomp.h (TEAM_ARENA_SIZE): Define.
      	(TEAM_ARENA_START): Define.
      	(TEAM_ARENA_FREE): Define.
      	(TEAM_ARENA_END): Define.
      	(team_malloc): New function.
      	(team_malloc_cleared): New function.
      	(team_free): New function.
      	* team.c (gomp_new_team): Initialize and use team_malloc.
      	(free_team): Use team_free.
      	(gomp_free_thread): Use team_free.
      	(gomp_pause_host): Use team_free.
      	* work.c (gomp_init_work_share): Use team_malloc.
      	(gomp_fini_work_share): Use team_free.
      
      From-SVN: r278136
      Andrew Stubbs committed
    • GCN libgomp port · fa499995
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Kwok Cheung Yeung  <kcy@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      	    Tom de Vries  <tom@codesourcery.com>
      
      	include/
      	* gomp-constants.h (GOMP_DEVICE_GCN): Define.
      	(GOMP_VERSION_GCN): Define.
      
      	libgomp/
      	* Makefile.am (libgomp_la_SOURCES): Add oacc-target.c.
      	* Makefile.in: Regenerate.
      	* config.h.in (PLUGIN_GCN): Add new undef.
      	* config/accel/openacc.f90 (acc_device_gcn): New parameter.
      	* config/gcn/affinity-fmt.c: New file.
      	* config/gcn/bar.c: New file.
      	* config/gcn/bar.h: New file.
      	* config/gcn/doacross.h: New file.
      	* config/gcn/icv-device.c: New file.
      	* config/gcn/oacc-target.c: New file.
      	* config/gcn/simple-bar.h: New file.
      	* config/gcn/target.c: New file.
      	* config/gcn/task.c: New file.
      	* config/gcn/team.c: New file.
      	* config/gcn/time.c: New file.
      	* configure.ac: Add amdgcn*-*-*.
      	* configure: Regenerate.
      	* configure.tgt: Add amdgcn*-*-*.
      	* libgomp-plugin.h (offload_target_type): Add OFFLOAD_TARGET_TYPE_GCN.
      	* libgomp.h (gcn_thrs): Add amdgcn variant.
      	(set_gcn_thrs): Likewise.
      	(gomp_thread): Likewise.
      	* oacc-int.h (goacc_thread): Likewise.
      	* oacc-target.c: New file.
      	* openacc.f90 (acc_device_gcn): New parameter.
      	* openacc.h (acc_device_t): Add acc_device_gcn.
      	* team.c (gomp_free_pool_helper): Add amdgcn support.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      Co-Authored-By: Kwok Cheung Yeung <kcy@codesourcery.com>
      Co-Authored-By: Tom de Vries <tom@codesourcery.com>
      
      From-SVN: r278135
      Andrew Stubbs committed
    • Add device number to GOMP_OFFLOAD_openacc_async_construct · d2903ce0
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      
      	libgomp/
      	* libgomp-plugin.h (GOMP_OFFLOAD_openacc_async_construct): Add int
      	parameter.
      	* oacc-async.c (lookup_goacc_asyncqueue): Pass device number to the
      	queue constructor.
      	* oacc-host.c (host_openacc_async_construct): Add device parameter.
      	* plugin/plugin-nvptx.c (GOMP_OFFLOAD_openacc_async_construct): Add
      	device parameter.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      
      From-SVN: r278134
      Andrew Stubbs committed
    • GCN mkoffload · 15e9e679
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Kwok Cheung Yeung  <kcy@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      	    Tom de Vries  <tom@codesourcery.com>
      
      	gcc/
      	* config/gcn/mkoffload.c: New file.
      	* config/gcn/offload.h: New file.
      
      From-SVN: r278133
      Andrew Stubbs committed
    • Move generic libgomp files from nvptx to accel · b3d14b3a
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      
      	libgomp/
      	* configure.tgt (nvptx*-*-*): Add "accel" directory.
      	* config/nvptx/libgomp-plugin.c: Move ...
      	* config/accel/libgomp-plugin.c: ... to here.
      	* config/nvptx/lock.c: Move ...
      	* config/accel/lock.c: ... to here.
      	* config/nvptx/mutex.c: Move ...
      	* config/accel/mutex.c: ... to here.
      	* config/nvptx/mutex.h: Move ...
      	* config/accel/mutex.h: ... to here.
      	* config/nvptx/oacc-async.c: Move ...
      	* config/accel/oacc-async.c: ... to here.
      	* config/nvptx/oacc-cuda.c: Move ...
      	* config/accel/oacc-cuda.c: ... to here.
      	* config/nvptx/oacc-host.c: Move ...
      	* config/accel/oacc-host.c: ... to here.
      	* config/nvptx/oacc-init.c: Move ...
      	* config/accel/oacc-init.c: ... to here.
      	* config/nvptx/oacc-mem.c: Move ...
      	* config/accel/oacc-mem.c: ... to here.
      	* config/nvptx/oacc-plugin.c: Move ...
      	* config/accel/oacc-plugin.c: ... to here.
      	* config/nvptx/omp-lock.h: Move ...
      	* config/accel/omp-lock.h: ... to here.
      	* config/nvptx/openacc.f90: Move ...
      	* config/accel/openacc.f90: ... to here.
      	* config/nvptx/pool.h: Move ...
      	* config/accel/pool.h: ... to here.
      	* config/nvptx/proc.c: Move ...
      	* config/accel/proc.c: ... to here.
      	* config/nvptx/ptrlock.c: Move ...
      	* config/accel/ptrlock.c: ... to here.
      	* config/nvptx/ptrlock.h: Move ...
      	* config/accel/ptrlock.h: ... to here.
      	* config/nvptx/sem.c: Move ...
      	* config/accel/sem.c: ... to here.
      	* config/nvptx/sem.h: Move ...
      	* config/accel/sem.h: ... to here.
      	* config/nvptx/thread-stacksize.h: Move ...
      	* config/accel/thread-stacksize.h: ... to here.
      
      From-SVN: r278132
      Andrew Stubbs committed
    • Move gcn-run heap into GPU memory. · e3d0ee4a
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      
      	gcc/
      	* config/gcn/gcn-run.c (heap_region): New global variable.
      	(struct hsa_runtime_fn_info): Add hsa_memory_assign_agent_fn.
      	(init_hsa_runtime_functions): Initialize hsa_memory_assign_agent.
      	(get_kernarg_region): Move contents to ....
      	(get_memory_region): .... here.
      	(get_heap_region): New function.
      	(init_device): Initialize the heap_region.
      	(device_malloc): Add region parameter.
      	(struct kernargs): Move heap ....
      	(heap): ... to global scope.
      	(main): Allocate heap separate to kernargs.
      
      From-SVN: r278131
      Andrew Stubbs committed
    • c-ada-spec.c (get_underlying_decl): Do not look through typedefs. · 2dbad62d
      	* c-ada-spec.c (get_underlying_decl): Do not look through typedefs.
      	(dump_forward_type): Do not generate a declaration for function types.
      	(dump_nested_type) <ARRAY_TYPE>: Do not generate a nested declaration
      	of the component type if it is declared in another file.
      
      From-SVN: r278129
      Eric Botcazou committed
    • PR fortran/92470 Fixes for CFI_address · 46927ade
              libgfortran/
              PR fortran/92470
              * runtime/ISO_Fortran_binding.c (CFI_establish): Set lower_bound to 0
              also for CFI_attribute_other.
      
              gcc/testsuite/
              PR fortran/92470
              * gfortran.dg/ISO_Fortran_binding_1.c (establish_c): Add assert for
              lower_bound == 0.
      
      From-SVN: r278128
      Tobias Burnus committed
    • ipa-prop.c (ipa_print_node_jump_functions, [...]): Print info about missing summaries. · 0302955a
      	* ipa-prop.c (ipa_print_node_jump_functions,
      	ipa_print_node_params): Print info about missing summaries.
      
      From-SVN: r278127
      Jan Hubicka committed
    • Enable libsanitizer build on riscv64 · 4a96e1c4
      	* configure.tgt (riscv64-*-linux*): Enable build.
      
      From-SVN: r278126
      Andreas Schwab committed
    • Account for the cost of generating loop masks · 61e5f2df
      We didn't take the cost of generating loop masks into account, and so
      tended to underestimate the cost of loops that need multiple masks.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* tree-vect-loop.c (vect_estimate_min_profitable_iters): Include
      	the cost of generating loop masks.
      
      gcc/testsuite/
      	* gcc.target/aarch64/sve/mask_struct_store_3.c: Add
      	-fno-vect-cost-model.
      	* gcc.target/aarch64/sve/mask_struct_store_3_run.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_2.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_2_run.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_3.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_3_run.c: Likewise.
      
      From-SVN: r278125
      Richard Sandiford committed
    • Avoid accounting for non-existent vector loop versioning · 6eed64b9
      vect_analyze_loop_costing uses two profitability thresholds: a runtime
      one and a static compile-time one.  The runtime one is simply the point
      at which the vector loop is cheaper than the scalar loop, while the
      static one also takes into account the cost of choosing between the
      scalar and vector loops at runtime.  We compare this static cost against
      the expected execution frequency to decide whether it's worth generating
      any vector code at all.
      
      However, we never reclaimed the cost of applying the runtime threshold
      if it turned out that the vector code can always be used.  And we only
      know whether that's true once we've calculated what the runtime
      threshold would be.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* tree-vectorizer.h (vect_apply_runtime_profitability_check_p):
      	New function.
      	* tree-vect-loop-manip.c (vect_loop_versioning): Use it.
      	* tree-vect-loop.c (vect_analyze_loop_2): Likewise.
      	(vect_transform_loop): Likewise.
      	(vect_analyze_loop_costing): Don't take the cost of versioning
      	into account for the static profitability threshold if it turns
      	out that no versioning is needed.
      
      From-SVN: r278124
      Richard Sandiford committed
    • ipa.c (cgraph_build_static_cdtor): Pass optimization_default_node and… · d4b44b83
      ipa.c (cgraph_build_static_cdtor): Pass optimization_default_node and target_option_default_node to get...
      
      	* ipa.c (cgraph_build_static_cdtor): Pass optimization_default_node
      	and target_option_default_node to get -fprofile-generate ctors working
      	right with LTO.
      
      From-SVN: r278123
      Jan Hubicka committed
    • Don't assign a cost to vectorizable_assignment · e4020b28
      vectorizable_assignment handles true SSA-to-SSA copies (which hopefully
      we don't see in practice) and no-op conversions that are required
      to maintain correct gimple, such as changes between signed and
      unsigned types.  These cases shouldn't generate any code and so
      shouldn't count against either the scalar or vector costs.
      
      Later patches test this, but it seemed worth splitting out.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* tree-vectorizer.h (vect_nop_conversion_p): Declare.
      	* tree-vect-stmts.c (vect_nop_conversion_p): New function.
      	(vectorizable_assignment): Don't add a cost for nop conversions.
      	* tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
      	Likewise.
      	* tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
      
      From-SVN: r278122
      Richard Sandiford committed
    • Fix vectorizable_conversion costs · 3f446c27
      This patch makes two tweaks to vectorizable_conversion.  The first
      is to use "modifier" to distinguish between promotion, demotion,
      and neither promotion nor demotion, rather than using a code for
      some cases and "modifier" for others.  The second is to take ncopies
      into account for the promotion and demotion costs; previously we gave
      multiple copies the same cost as a single copy.
      
      Later patches test this, but it seemed worth splitting out.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* tree-vect-stmts.c (vect_model_promotion_demotion_cost): Take the
      	number of ncopies as an additional argument.
      	(vectorizable_conversion): Update call accordingly.  Use "modifier"
      	to check whether a conversion is between vectors with the same
      	numbers of units.
      
      From-SVN: r278121
      Richard Sandiford committed
    • [AArch64] Use aarch64_sve_int_mode in SVE ACLE code · 86194087
      This is a like-for-like change at the moment, but is a prerequisite
      for removing mode_for_int_vector.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* config/aarch64/aarch64-sve-builtins-functions.h
      	(unary_count::expand): Use aarch64_sve_int_mode instead of
      	mode_for_int_vector.
      
      From-SVN: r278120
      Richard Sandiford committed
    • [C++] Fix interaction between aka changes and DR1558 (PR92206) · ae83b9de
      One of the changes in r277281 was to make the typedef variant
      handling in strip_typedefs pass the raw DECL_ORIGINAL_TYPE to the
      recursive call, instead of applying TYPE_MAIN_VARIANT first.
      This PR shows that that interacts badly with the implementation
      of DR1558, because we then refuse to strip aliases with dependent
      template parameters and trip:
      
        gcc_assert (!typedef_variant_p (result)
      	      || ((flags & STF_USER_VISIBLE)
      		  && !user_facing_original_type_p (result)));
      
      Keeping the current behaviour but suppressing the ICE leads to a
      duplicate error (the dg-bogus in the first test), so that didn't
      seem like a good fix.
      
      I assume keeping the alias should never actually be necessary for
      DECL_ORIGINAL_TYPEs, because it will already have been checked
      somewhere, even for implicit TYPE_DECLs.  This patch therefore
      passes a flag to say that we can safely strip aliases with
      dependent template parameters.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/cp/
      	PR c++/92206
      	* cp-tree.h (STF_STRIP_DEPENDENT): New constant.
      	* tree.c (strip_typedefs): Add STF_STRIP_DEPENDENT to the flags
      	when calling strip_typedefs recursively on a DECL_ORIGINAL_TYPE.
      	Don't apply the fix for DR1558 in that case; allow aliases with
      	dependent template parameters to be stripped instead.
      
      gcc/testsuite/
      	PR c++/92206
      	* g++.dg/cpp0x/alias-decl-pr92206-1.C: New test.
      	* g++.dg/cpp0x/alias-decl-pr92206-2.C: Likewise.
      	* g++.dg/cpp0x/alias-decl-pr92206-3.C: Likewise.
      
      From-SVN: r278119
      Richard Sandiford committed
    • Update comment in opts.c. · 9b6e95d3
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* opts.c: Update comment about OPT_LEVELS_2_PLUS_SPEED_ONLY.
      
      From-SVN: r278118
      Martin Liska committed
    • Fix params.exp by parsing output of --help=params -Q. · 32b0081d
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* gcc.dg/params/params.exp: Restore test by parsing output
      	of --help=params -Q.
      
      From-SVN: r278117
      Martin Liska committed
    • Remove leftover call to finalize_options_struct. · 9c40fd46
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* tree-streamer-in.c (lto_input_ts_function_decl_tree_pointers):
      	Remove call to finalize_options_struct.
      
      From-SVN: r278116
      Martin Liska committed
    • re PR target/92055 ([avr] Support 64-bit double) · af73373e
      	PR target/92055
      	* config/avr/t-avr (avr-mcus): Do not depend on
      	$(srcdir)/config/avr/t-multilib.
      
      From-SVN: r278115
      Georg-Johann Lay committed
    • Fortran] Use proper type for hidden is-present argument · b0896310
              gcc/fortran/
              * trans-expr.c (gfc_conv_procedure_call): Fold hidden
              is-present argument to the right type.
      
      From-SVN: r278114
      Tobias Burnus committed
    • re PR target/92473 (test pr92324-2.c fails on arm and aarch64) · efc40c1f
      2019-11-13  Richard Biener  <rguenther@suse.de>
      
      	PR tree-optimization/92473
      	* tree-vect-loop.c (vect_create_epilog_for_reduction): Perform
      	direct optab reduction in the correct type.
      
      From-SVN: r278113
      Richard Biener committed
    • Add option -fweb for pr47763.c · e08f64f0
      This case is testing 'web' on ignore naked clobber.
      -funroll-loops no longer implies -fweb for powerpc.
      So, add -fweb to enable 'web' for this case.
      
      gcc.testsuite/
      2019-11-13  Jiufu Guo  <guojiufu@linux.ibm.com>
      
      	PR target/92465
      	* gcc.dg/pr47763.c: Add option -fweb.
      
      From-SVN: r278112
      Jiufu Guo committed
    • Daily bump. · eb67187b
      From-SVN: r278111
      GCC Administrator committed
  2. 12 Nov, 2019 13 commits