1. 14 Nov, 2019 5 commits
    • Support extended aggregate jump function in ipa-cp · eb270950
      2019-11-14  Feng Xue  <fxue@os.amperecomputing.com>
      
      	PR ipa/91682
      	* ipa-prop.h (jump_func_type): New value IPA_JF_LOAD_AGG.
      	(ipa_load_agg_data, ipa_agg_value, ipa_agg_value_set): New structs.
      	(ipa_agg_jf_item): Add new field jftype and type, redefine field value.
      	(ipa_agg_jump_function): Remove member function equal_to.
      	(ipa_agg_jump_function_p): Remove typedef.
      	(ipa_copy_agg_values, ipa_release_agg_values): New functions.
      	* ipa-prop.c (ipa_print_node_jump_functions_for_edge): Dump
      	information for aggregate jump function.
      	(get_ssa_def_if_simple_copy): Add new parameter rhs_stmt to
      	record last definition statement.
      	(load_from_unmodified_param_or_agg): New function.
      	(ipa_known_agg_contents_list): Add new field type and value, remove
      	field constant.
      	(build_agg_jump_func_from_list): Rename parameter const_count to
      	value_count, build aggregate jump function from ipa_load_agg_data.
      	(analyze_agg_content_value): New function.
      	(extract_mem_content): Analyze memory store assignment to prepare
      	information for aggregate jump function generation.
      	(determine_known_aggregate_parts): Add new parameter fbi, remove
      	parameter aa_walk_budeget_p.
      	(update_jump_functions_after_inlining): Update aggregate jump function.
      	(ipa_find_agg_cst_for_param): Change type of parameter agg.
      	(try_make_edge_direct_simple_call): Add new parameter new_root.
      	(try_make_edge_direct_virtual_call): Add new parameter new_root and
      	new_root_info.
      	(update_indirect_edges_after_inlining): Pass new argument to
      	try_make_edge_direct_simple_call and try_make_edge_direct_virtual_call.
      	(ipa_write_jump_function): Write aggregate jump function to file.
      	(ipa_read_jump_function): Read aggregate jump function from file.
      	(ipa_agg_value::equal_to): Migrate from ipa_agg_jf_item::equal_to.
      	* ipa-cp.c (ipa_get_jf_arith_result): New function.
      	(ipa_agg_value_from_node): Likewise.
      	(ipa_agg_value_set_from_jfunc): Likewise.
      	(propagate_vals_across_arith_jfunc): Likewise.
      	(propagate_aggregate_lattice): Likewise.
      	(ipa_get_jf_pass_through_result): Call ipa_get_jf_arith_result.
      	(propagate_vals_across_pass_through): Call
      	propagate_vals_across_arith_jfunc.
      	(get_clone_agg_value): Move forward.
      	(propagate_aggs_across_jump_function): Handle value propagation for
      	aggregate jump function.
      	(agg_jmp_p_vec_for_t_vec): Remove.
      	(context_independent_aggregate_values): Replace vec<ipa_agg_jf_item>
      	with vec<ipa_agg_value>.
      	(copy_plats_to_inter, intersect_with_plats): Likewise.
      	(agg_replacements_to_vector, intersect_with_agg_replacements): Likewise.
      	(intersect_aggregate_with_edge): Likewise.
      	(find_aggregate_values_for_callers_subset): Likewise.
      	(cgraph_edge_brings_all_agg_vals_for_node): Likewise.
      	(estimate_local_effects): Replace vec<ipa_agg_jump_function> and
      	vec<ipa_agg_jump_function_p> with vec<ipa_agg_value_set>.
      	(gather_context_independent_values): Likewise.
      	(perform_estimation_of_a_value, decide_whether_version_node): Likewise.
      	* ipa-fnsummary.c (evaluate_conditions_for_known_args): Replace
      	vec<ipa_agg_jump_function_p> with vec<ipa_agg_value_set>.
      	(evaluate_properties_for_edge): Likewise.
      	(estimate_edge_devirt_benefit): Likewise.
      	(estimate_edge_size_and_time):  Likewise.
      	(estimate_calls_size_and_time): Likewise.
      	(ipa_call_context::ipa_call_context): Likewise.
      	(estimate_ipcp_clone_size_and_time):  Likewise.
      	* ipa-fnsummary.h (ipa_call_context): Replace
      	vec<ipa_agg_jump_function_p> with vec<ipa_agg_value_set>.
      	* ipa-inline-analysis.c (do_estimate_edge_time): Replace
      	vec<ipa_agg_jump_function_p> with vec<ipa_agg_value_set>.
      	(do_estimate_edge_size): Likewise.
      	(do_estimate_edge_hints): Likewise.
      
      2019-11-14  Feng Xue  <fxue@os.amperecomputing.com>
      
              PR ipa/91682
              * gcc.dg/ipa/ipcp-agg-10.c: Change dg-scan string.
              * gcc.dg/ipa/ipcp-agg-11.c: New test.
      
      From-SVN: r278193
      Feng Xue committed
    • re PR ipa/92421 (ICE in inline_small_functions, at ipa-inline.c:2001 since r277759) · 3e7cf2e6
      	PR ipa/92421
      	* g++.dg/torture/pr92421.C: Add -Wno-return-type to
      	dg-additional-options.  Avoid -Wwrite-string warnings, most of
      	-Wreturn-type warnings, define bf ctor.  Use struct instead of class
      	with public: at the start.
      
      From-SVN: r278192
      Jakub Jelinek committed
    • alias-decl-pr92206-3.C: Require effective target c++14 rather than c++11. · 49084d39
      	* g++.dg/cpp0x/alias-decl-pr92206-3.C: Require effective target c++14
      	rather than c++11.
      
      From-SVN: r278191
      Jakub Jelinek committed
    • ISO_Fortran_binding_17.c: Include ../../../libgfortran/ISO_Fortran_binding.h rather than... · dec231be
      	* gfortran.dg/ISO_Fortran_binding_17.c: Include
      	../../../libgfortran/ISO_Fortran_binding.h rather than
      	ISO_Fortran_binding.h.
      
      From-SVN: r278190
      Jakub Jelinek committed
    • Daily bump. · d82f3812
      From-SVN: r278189
      GCC Administrator committed
  2. 13 Nov, 2019 35 commits
    • ipa-cp.c (propagate_vr_across_jump_function): Propagate also across binary operations. · 2b89b748
      	* ipa-cp.c (propagate_vr_across_jump_function): Propagate also across
      	binary operations.
      
      From-SVN: r278185
      Jan Hubicka committed
    • ipa-profile.c (check_argument_count): Check properly that e_info is non-NULL; do… · d772e360
      ipa-profile.c (check_argument_count): Check properly that e_info is non-NULL; do not check descriptors.
      
      	* ipa-profile.c (check_argument_count): Check properly that e_info
      	is non-NULL; do not check descriptors.
      
      From-SVN: r278184
      Jan Hubicka committed
    • ipa-inline-analysis.c (do_estimate_edge_time): Relax check for ipa profiles. · 367c959f
      	* ipa-inline-analysis.c (do_estimate_edge_time): Relax
      	check for ipa profiles.
      
      From-SVN: r278179
      Jan Hubicka committed
    • re PR ipa/92421 (ICE in inline_small_functions, at ipa-inline.c:2001 since r277759) · b914768c
      
      	PR c++/92421
      	* ipa-prop.c (update_indirect_edges_after_inlining):
      	Mark parameter as used.
      	* ipa-inline.c (recursive_inlining): Reset node cache
      	after inlining.
      	(inline_small_functions): Remove checking ifdef.
      	* ipa-inline-analysis.c (do_estimate_edge_time): Verify
      	cache consistency.
      	* g++.dg/torture/pr92421.C: New testcase.
      
      From-SVN: r278159
      Jan Hubicka committed
    • re PR ipa/92498 (gcc.dg/tree-prof/crossmodule-indircall-1.c fails starting with r278100) · 7b34a284
      	PR ipa/92498
      	* ipa-profile.c (check_argument_count): Do not ICE when descriptors
      	is NULL.
      	(ipa_profile): Fix reversed test.
      
      From-SVN: r278157
      Jan Hubicka committed
    • ipa-cp.c (ignore_edge_p): Do not check caller flags. · 223f4b10
      	* ipa-cp.c (ignore_edge_p): Do not check caller flags.
      	(propagate_constants_topo): Fix typo.
      
      From-SVN: r278156
      Jan Hubicka committed
    • Move plain value_range things to value-range.[hc]*. · cca78449
      From-SVN: r278153
      Aldy Hernandez committed
    • Sanitize the constant argument for rotr<mode>3 · 425ea30f
      This was dormant for quite some time, but it started happening for me
      on gcc.c-torture/compile/pr65153.c sometime after r276645 for -mabi=32 linux runs.
      
      The pattern accepts any SMALL_OPERAND constant value while it asserts during the final
      that the value is in the mode size range. I this case it happens that combine_and_move_insns
      during ira makes a pattern with negative "shift count" which fails at final stage.
      
      This simple fix just truncates the constant operand to mode size the same as shift patterns.
      
      gcc/ChangeLog:
      
      2019-11-13  Dragan Mladjenovic  <dmladjenovic@wavecomp.com>
      
      	* config/mips/mips.md (rotr<mode>3): Sanitize the constant argument
      	instead of asserting its value.
      
      From-SVN: r278152
      Dragan Mladjenovic committed
    • download_prerequisites: Use http instead of ftp for downloading. · d51dd316
      2019-11-13  Janne Blomqvist  <jb@gcc.gnu.org>
      
              * download_prerequisites: Use http instead of ftp for downloading.
      
      From-SVN: r278151
      Janne Blomqvist committed
    • pr92090.c: Limit -mbig to powerpc64le-*-*. · 5ea9ae01
              * gcc.target/powerpc/pr92090.c: Limit -mbig to powerpc64le-*-*.
              * gcc.target/powerpc/pr92090-2.c: Likewise.
      
      From-SVN: r278150
      David Edelsohn committed
    • libsupc++: Implement comparison algorithms for C++20 · 0ff15d21
      This is incomplete because std::strong_order doesn't support
      floating-point types.
      
      The partial_order and weak_order tests use VERIFY instead of
      static_assert because of PR 92431.
      
      	* libsupc++/compare (strong_order, weak_order, partial_order)
      	(compare_strong_order_fallback, compare_weak_order_fallback)
      	(compare_partial_order_fallback): Define customization point objects
      	for C++20.
      	* testsuite/18_support/comparisons/algorithms/partial_order.cc: New
      	test.
      	* testsuite/18_support/comparisons/algorithms/strong_order.cc: New
      	test.
      	* testsuite/18_support/comparisons/algorithms/weak_order.cc: New test.
      
      From-SVN: r278149
      Jonathan Wakely committed
    • Rewrite value_range constructors to the value_range_kind is at the end, and defaults to VR_RANGE. · 5d462877
      Rewrite value_range constructors to the value_range_kind is at the
      end, and defaults to VR_RANGE.  Similarly for set() methods.
      
      From-SVN: r278148
      Aldy Hernandez committed
    • PR c++/89070 - bogus [[nodiscard]] warning in SFINAE. · 7b521fbd
      This is a complaint that we issue a [[nodiscard]] warning even in SFINAE
      contexts.  Here 'complain' is tf_decltype, but not tf_warning so I guess
      we can fix it as below.
      
      	* cvt.c (convert_to_void): Guard maybe_warn_nodiscard calls with
      	tf_warning.
      
      	* g++.dg/cpp1z/nodiscard7.C: New test.
      
      From-SVN: r278147
      Marek Polacek committed
    • tree-dump.c (dequeue_and_dump): Print first tree operand for VIEW_CONVERT_EXPR. · 2aaf32ab
      2019-11-13  Ulrich Drepper  <drepper@redhat.com>
      
      	* tree-dump.c (dequeue_and_dump): Print first tree operand
      	for VIEW_CONVERT_EXPR.
      
      From-SVN: r278146
      Ulrich Drepper committed
    • Add C2x *_NORM_MAX constants to <float.h>. · 00be2a5f
      C2x adds <float.h> constants FLT_NORM_MAX, DBL_NORM_MAX and
      LDBL_NORM_MAX.  These are for the maximum "normalized" finite
      floating-point number, where the given definition of normalized is
      that all possible values with MANT_DIG significand digits (leading one
      not zero) can be represented with that exponent.  The effect of that
      definition is that these macros are the same as the corresponding MAX
      macros for all formats except IBM long double, where the NORM_MAX
      value has exponent 1 smaller than the MAX one so that all 106 digits
      can be 1.
      
      This patch adds those macros to GCC.  They are only defined for float,
      double and long double; C2x does not include such macros for DFP
      types, and while the integration of TS 18661-3 into C2x has not yet
      occurred, the draft proposed text does not add them for the _FloatN /
      _FloatNx types (where they would always be the same as the MAX
      macros).
      
      Bootstrapped with no regressions on x86_64-pc-linux-gnu.  Also tested
      compilation of the new test for powerpc-linux-gnu to confirm the check
      of LDBL_NORM_MAX in the IBM long double case does get properly
      optimized out.
      
      gcc:
      	* ginclude/float.c [__STDC_VERSION__ > 201710L] (FLT_NORM_MAX,
      	DBL_NORM_MAX, LDBL_NORM_MAX): Define.
      	* real.c (get_max_float): Add norm_max argument.
      	* real.h (get_max_float): Update prototype.
      	* builtins.c (fold_builtin_interclass_mathfn): Update calls to
      	get_max_float.
      
      gcc/c-family:
      	* c-cppbuiltin.c (builtin_define_float_constants): Also define
      	NORM_MAX constants.  Update call to get_max_float.
      	(LAZY_HEX_FP_VALUES_CNT): Update value to include NORM_MAX
      	constants.
      
      gcc/d:
      	* d-target.cc (define_float_constants): Update call to
      	get_max_float.
      
      gcc/testsuite:
      	* gcc.dg/c11-float-3.c, gcc.dg/c2x-float-1.c: New tests.
      
      From-SVN: r278145
      Joseph Myers committed
    • Come up with selftests for dbgcnt. · 5024c8bb
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* dbgcnt.c (test_sorted_dbg_counters): New.
      	(dbgcnt_c_tests): Likewise.
      	* selftest-run-tests.c (selftest::run_tests): Likewise.
      	* selftest.h (dbgcnt_c_tests): Likewise.
      
      From-SVN: r278144
      Martin Liska committed
    • Add a few missing checks that IPA_NODE_REF is not NULL (PR 92454) · 3c4fa8a8
      2019-11-13  Jan Hubicka  <hubicka@ucw.cz>
      	    Martin Jambor  <mjambor@suse.cz>
      
      	PR ipa/92454
      	* ipa-cp.c (spread_undeadness): Check that IPA_NODE_REF exists.
      	(identify_dead_nodes): Likewise.
      
      	testsuite/
      	* g++.dg/ipa/pr92454.C: New test.
      
      From-SVN: r278142
      Martin Jambor committed
    • Do not overuse push/pop_cfun in IPA ICF. · a5e2beb9
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* ipa-icf.c (sem_function::equals_private): Do not overuse
      	push/pop_cfun functions.
      
      From-SVN: r278141
      Martin Liska committed
    • Enhance syntax of -fdbg-cnt. · 83a49336
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* common.opt: Document change of -fdbg-cnt option.
      	* dbgcnt.c (DEBUG_COUNTER): Remove.
      	(dbg_cnt_is_enabled): Remove.
      	(dbg_cnt): Work with new intervals.
      	(dbg_cnt_set_limit_by_index): Set to new
      	list of intervals.
      	(dbg_cnt_set_limit_by_name): Likewise.
      	(dbg_cnt_process_single_pair): Process new format.
      	(dbg_cnt_process_opt): Likewise.
      	(dbg_cnt_list_all_counters): Likewise.
      	* doc/invoke.texi: Document change of -fdbg-cnt option.
      	(cmp_tuples): New.
      2019-11-13  Martin Liska  <mliska@suse.cz>
      
      	* gcc.dg/ipa/ipa-icf-39.c: Update -fdbg-cnt to the new format.
      	* gcc.dg/pr68766.c: Likewise.
      
      From-SVN: r278140
      Martin Liska committed
    • ipa-inline.c (ipa_inline): Check that function is defined before flattening. · 2895b172
      
      	* ipa-inline.c (ipa_inline): Check that function is defined before
      	flattening.
      	* gcc.c-torture/compile/flatten.c: New testcase.
      
      From-SVN: r278139
      Jan Hubicka committed
    • GCN Libgomp Plugin · 237957cc
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Kwok Cheung Yeung  <kcy@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      	    Tom de Vries  <tom@codesourcery.com>
      
      	libgomp/
      	* plugin/Makefrag.am: Add amdgcn plugin support.
      	* plugin/configfrag.ac: Likewise.
      	* plugin/plugin-gcn.c: New file.
      	* configure: Regenerate.
      	* Makefile.in: Regenerate.
      	* testsuite/Makefile.in: Regenerate.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      Co-Authored-By: Kwok Cheung Yeung <kcy@codesourcery.com>
      Co-Authored-By: Tom de Vries <tom@codesourcery.com>
      
      From-SVN: r278138
      Andrew Stubbs committed
    • Use a single worker for OpenACC on AMD GCN · fe22e0d4
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      
      	gcc/
      	* config/gcn/gcn.c (gcn_goacc_validate_dims): Ensure
      	flag_worker_partitioning is not set.
      	(TARGET_GOACC_WORKER_PARTITIONING): Remove target hook definition.
      	* config/gcn/gcn.opt (macc-experimental-workers): Default to off.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      
      From-SVN: r278137
      Andrew Stubbs committed
    • Optimize GCN OpenMP malloc performance · cee16451
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      
      	libgomp/
      	* config/gcn/team.c (gomp_gcn_enter_kernel): Set up the team arena
      	and use team_malloc variants.
      	(gomp_gcn_exit_kernel): Use team_free.
      	* libgomp.h (TEAM_ARENA_SIZE): Define.
      	(TEAM_ARENA_START): Define.
      	(TEAM_ARENA_FREE): Define.
      	(TEAM_ARENA_END): Define.
      	(team_malloc): New function.
      	(team_malloc_cleared): New function.
      	(team_free): New function.
      	* team.c (gomp_new_team): Initialize and use team_malloc.
      	(free_team): Use team_free.
      	(gomp_free_thread): Use team_free.
      	(gomp_pause_host): Use team_free.
      	* work.c (gomp_init_work_share): Use team_malloc.
      	(gomp_fini_work_share): Use team_free.
      
      From-SVN: r278136
      Andrew Stubbs committed
    • GCN libgomp port · fa499995
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Kwok Cheung Yeung  <kcy@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      	    Tom de Vries  <tom@codesourcery.com>
      
      	include/
      	* gomp-constants.h (GOMP_DEVICE_GCN): Define.
      	(GOMP_VERSION_GCN): Define.
      
      	libgomp/
      	* Makefile.am (libgomp_la_SOURCES): Add oacc-target.c.
      	* Makefile.in: Regenerate.
      	* config.h.in (PLUGIN_GCN): Add new undef.
      	* config/accel/openacc.f90 (acc_device_gcn): New parameter.
      	* config/gcn/affinity-fmt.c: New file.
      	* config/gcn/bar.c: New file.
      	* config/gcn/bar.h: New file.
      	* config/gcn/doacross.h: New file.
      	* config/gcn/icv-device.c: New file.
      	* config/gcn/oacc-target.c: New file.
      	* config/gcn/simple-bar.h: New file.
      	* config/gcn/target.c: New file.
      	* config/gcn/task.c: New file.
      	* config/gcn/team.c: New file.
      	* config/gcn/time.c: New file.
      	* configure.ac: Add amdgcn*-*-*.
      	* configure: Regenerate.
      	* configure.tgt: Add amdgcn*-*-*.
      	* libgomp-plugin.h (offload_target_type): Add OFFLOAD_TARGET_TYPE_GCN.
      	* libgomp.h (gcn_thrs): Add amdgcn variant.
      	(set_gcn_thrs): Likewise.
      	(gomp_thread): Likewise.
      	* oacc-int.h (goacc_thread): Likewise.
      	* oacc-target.c: New file.
      	* openacc.f90 (acc_device_gcn): New parameter.
      	* openacc.h (acc_device_t): Add acc_device_gcn.
      	* team.c (gomp_free_pool_helper): Add amdgcn support.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      Co-Authored-By: Kwok Cheung Yeung <kcy@codesourcery.com>
      Co-Authored-By: Tom de Vries <tom@codesourcery.com>
      
      From-SVN: r278135
      Andrew Stubbs committed
    • Add device number to GOMP_OFFLOAD_openacc_async_construct · d2903ce0
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      
      	libgomp/
      	* libgomp-plugin.h (GOMP_OFFLOAD_openacc_async_construct): Add int
      	parameter.
      	* oacc-async.c (lookup_goacc_asyncqueue): Pass device number to the
      	queue constructor.
      	* oacc-host.c (host_openacc_async_construct): Add device parameter.
      	* plugin/plugin-nvptx.c (GOMP_OFFLOAD_openacc_async_construct): Add
      	device parameter.
      
      Co-Authored-By: Julian Brown <julian@codesourcery.com>
      
      From-SVN: r278134
      Andrew Stubbs committed
    • GCN mkoffload · 15e9e679
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      	    Kwok Cheung Yeung  <kcy@codesourcery.com>
      	    Julian Brown  <julian@codesourcery.com>
      	    Tom de Vries  <tom@codesourcery.com>
      
      	gcc/
      	* config/gcn/mkoffload.c: New file.
      	* config/gcn/offload.h: New file.
      
      From-SVN: r278133
      Andrew Stubbs committed
    • Move generic libgomp files from nvptx to accel · b3d14b3a
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      
      	libgomp/
      	* configure.tgt (nvptx*-*-*): Add "accel" directory.
      	* config/nvptx/libgomp-plugin.c: Move ...
      	* config/accel/libgomp-plugin.c: ... to here.
      	* config/nvptx/lock.c: Move ...
      	* config/accel/lock.c: ... to here.
      	* config/nvptx/mutex.c: Move ...
      	* config/accel/mutex.c: ... to here.
      	* config/nvptx/mutex.h: Move ...
      	* config/accel/mutex.h: ... to here.
      	* config/nvptx/oacc-async.c: Move ...
      	* config/accel/oacc-async.c: ... to here.
      	* config/nvptx/oacc-cuda.c: Move ...
      	* config/accel/oacc-cuda.c: ... to here.
      	* config/nvptx/oacc-host.c: Move ...
      	* config/accel/oacc-host.c: ... to here.
      	* config/nvptx/oacc-init.c: Move ...
      	* config/accel/oacc-init.c: ... to here.
      	* config/nvptx/oacc-mem.c: Move ...
      	* config/accel/oacc-mem.c: ... to here.
      	* config/nvptx/oacc-plugin.c: Move ...
      	* config/accel/oacc-plugin.c: ... to here.
      	* config/nvptx/omp-lock.h: Move ...
      	* config/accel/omp-lock.h: ... to here.
      	* config/nvptx/openacc.f90: Move ...
      	* config/accel/openacc.f90: ... to here.
      	* config/nvptx/pool.h: Move ...
      	* config/accel/pool.h: ... to here.
      	* config/nvptx/proc.c: Move ...
      	* config/accel/proc.c: ... to here.
      	* config/nvptx/ptrlock.c: Move ...
      	* config/accel/ptrlock.c: ... to here.
      	* config/nvptx/ptrlock.h: Move ...
      	* config/accel/ptrlock.h: ... to here.
      	* config/nvptx/sem.c: Move ...
      	* config/accel/sem.c: ... to here.
      	* config/nvptx/sem.h: Move ...
      	* config/accel/sem.h: ... to here.
      	* config/nvptx/thread-stacksize.h: Move ...
      	* config/accel/thread-stacksize.h: ... to here.
      
      From-SVN: r278132
      Andrew Stubbs committed
    • Move gcn-run heap into GPU memory. · e3d0ee4a
      2019-11-13  Andrew Stubbs  <ams@codesourcery.com>
      
      	gcc/
      	* config/gcn/gcn-run.c (heap_region): New global variable.
      	(struct hsa_runtime_fn_info): Add hsa_memory_assign_agent_fn.
      	(init_hsa_runtime_functions): Initialize hsa_memory_assign_agent.
      	(get_kernarg_region): Move contents to ....
      	(get_memory_region): .... here.
      	(get_heap_region): New function.
      	(init_device): Initialize the heap_region.
      	(device_malloc): Add region parameter.
      	(struct kernargs): Move heap ....
      	(heap): ... to global scope.
      	(main): Allocate heap separate to kernargs.
      
      From-SVN: r278131
      Andrew Stubbs committed
    • c-ada-spec.c (get_underlying_decl): Do not look through typedefs. · 2dbad62d
      	* c-ada-spec.c (get_underlying_decl): Do not look through typedefs.
      	(dump_forward_type): Do not generate a declaration for function types.
      	(dump_nested_type) <ARRAY_TYPE>: Do not generate a nested declaration
      	of the component type if it is declared in another file.
      
      From-SVN: r278129
      Eric Botcazou committed
    • PR fortran/92470 Fixes for CFI_address · 46927ade
              libgfortran/
              PR fortran/92470
              * runtime/ISO_Fortran_binding.c (CFI_establish): Set lower_bound to 0
              also for CFI_attribute_other.
      
              gcc/testsuite/
              PR fortran/92470
              * gfortran.dg/ISO_Fortran_binding_1.c (establish_c): Add assert for
              lower_bound == 0.
      
      From-SVN: r278128
      Tobias Burnus committed
    • ipa-prop.c (ipa_print_node_jump_functions, [...]): Print info about missing summaries. · 0302955a
      	* ipa-prop.c (ipa_print_node_jump_functions,
      	ipa_print_node_params): Print info about missing summaries.
      
      From-SVN: r278127
      Jan Hubicka committed
    • Enable libsanitizer build on riscv64 · 4a96e1c4
      	* configure.tgt (riscv64-*-linux*): Enable build.
      
      From-SVN: r278126
      Andreas Schwab committed
    • Account for the cost of generating loop masks · 61e5f2df
      We didn't take the cost of generating loop masks into account, and so
      tended to underestimate the cost of loops that need multiple masks.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* tree-vect-loop.c (vect_estimate_min_profitable_iters): Include
      	the cost of generating loop masks.
      
      gcc/testsuite/
      	* gcc.target/aarch64/sve/mask_struct_store_3.c: Add
      	-fno-vect-cost-model.
      	* gcc.target/aarch64/sve/mask_struct_store_3_run.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_2.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_2_run.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_3.c: Likewise.
      	* gcc.target/aarch64/sve/peel_ind_3_run.c: Likewise.
      
      From-SVN: r278125
      Richard Sandiford committed
    • Avoid accounting for non-existent vector loop versioning · 6eed64b9
      vect_analyze_loop_costing uses two profitability thresholds: a runtime
      one and a static compile-time one.  The runtime one is simply the point
      at which the vector loop is cheaper than the scalar loop, while the
      static one also takes into account the cost of choosing between the
      scalar and vector loops at runtime.  We compare this static cost against
      the expected execution frequency to decide whether it's worth generating
      any vector code at all.
      
      However, we never reclaimed the cost of applying the runtime threshold
      if it turned out that the vector code can always be used.  And we only
      know whether that's true once we've calculated what the runtime
      threshold would be.
      
      2019-11-13  Richard Sandiford  <richard.sandiford@arm.com>
      
      gcc/
      	* tree-vectorizer.h (vect_apply_runtime_profitability_check_p):
      	New function.
      	* tree-vect-loop-manip.c (vect_loop_versioning): Use it.
      	* tree-vect-loop.c (vect_analyze_loop_2): Likewise.
      	(vect_transform_loop): Likewise.
      	(vect_analyze_loop_costing): Don't take the cost of versioning
      	into account for the static profitability threshold if it turns
      	out that no versioning is needed.
      
      From-SVN: r278124
      Richard Sandiford committed
    • ipa.c (cgraph_build_static_cdtor): Pass optimization_default_node and… · d4b44b83
      ipa.c (cgraph_build_static_cdtor): Pass optimization_default_node and target_option_default_node to get...
      
      	* ipa.c (cgraph_build_static_cdtor): Pass optimization_default_node
      	and target_option_default_node to get -fprofile-generate ctors working
      	right with LTO.
      
      From-SVN: r278123
      Jan Hubicka committed