1. 23 Nov, 2016 40 commits
    • re PR sanitizer/69278 (Confusion option handling for -sanitize-recovery=alll) · c944c6a2
      	PR sanitizer/69278
      	* opts.c (parse_sanitizer_options): For -fsanitize=undefined,
      	restore enabling also SANITIZE_UNREACHABLE and SANITIZE_RETURN.
      
      	* g++.dg/ubsan/return-7.C: New test.
      	* c-c++-common/ubsan/unreachable-4.c: New test.
      
      From-SVN: r242795
      Jakub Jelinek committed
    • re PR tree-optimization/78482 (wrong code at -O3 in both 32-bit and 64-bit modes… · 871a2c6c
      re PR tree-optimization/78482 (wrong code at -O3 in both 32-bit and 64-bit modes on x86_64-linux-gnu)
      
      	PR tree-optimization/78482
      	* gcc.dg/torture/pr78482.c (c, d): Use signed char instead of char.
      	(bar): New function.
      	(main): Call bar instead of printf.
      
      From-SVN: r242794
      Jakub Jelinek committed
    • re PR middle-end/69183 (ICE when using OpenMP PRIVATE keyword in OMP DO loop not… · c39dad64
      re PR middle-end/69183 (ICE when using OpenMP PRIVATE keyword in OMP DO loop not explicitly encapsulated in OMP PARALLEL region)
      
      	PR middle-end/69183
      	* omp-low.c (build_outer_var_ref): Change lastprivate argument
      	to code, pass it recursively, adjust uses.  For OMP_CLAUSE_PRIVATE
      	on worksharing constructs, treat it like clauses on simd construct.
      	Formatting fix.
      	(lower_rec_input_clauses): For OMP_CLAUSE_PRIVATE_OUTER_REF pass
      	OMP_CLAUSE_PRIVATE as last argument to build_outer_var_ref.
      	(lower_lastprivate_clauses): Pass OMP_CLAUSE_LASTPRIVATE instead
      	of true as last argument to build_outer_var_ref.
      
      	* gfortran.dg/gomp/pr69183.f90: New test.
      
      From-SVN: r242793
      Jakub Jelinek committed
    • re PR target/78230 (Compile pr66178.c fail for mips64el-elf with N64 abi) · 62567da0
      	PR target/78230
      	* gcc.dg/torture/pr66178.c (test): Use uintptr_t instead of int.
      	(test2) Ditto.
      
      From-SVN: r242792
      Kito Cheng committed
    • i386.md (*movqi_internal): Calculate mode attribute of alternatives 7,8,9… · 46e89251
      i386.md (*movqi_internal): Calculate mode attribute of alternatives 7,8,9 depending on TARGET_AVX512DQ.
      
      	* gcc.target/config/i386.md (*movqi_internal): Calculate mode
      	attribute of alternatives 7,8,9 depending on TARGET_AVX512DQ.
      	<TYPE_MSKMOV>: Emit kmovw for MODE_HI insn mode attribute.
      	(*k<logic><mode>): Calculate mode attribute depending on
      	TARGET_AVX512DQ.  Emit k<logic>w for MODE_HI insn mode attribute.
      	(*andqi_1): Calculate mode attribute of alternative 3 depending
      	on TARGET_AVX512DQ.  Emit kandw for MODE_HI insn mode attribute.
      	(kandn<mode>): Calculate mode attribute of alternative 2 depending
      	on TARGET_AVX512DQ.  Emit kandnw for MODE_HI insn mode attribute.
      	(kxnor<mode>): Merge insn patterns using SWI1248_AVX512BW mode
      	iterator.  Calculate mode attribute of alternative 1 depending
      	on TARGET_AVX512DQ.  Emit kxnorw for MODE_HI insn mode attribute.
      	(*one_cmplqi2_1): Calculate mode attribute of alternative 2 depending
      	on TARGET_AVX512DQ.  Emit knotw for MODE_HI insn mode attribute.
      
      From-SVN: r242791
      Uros Bizjak committed
    • re PR c++/77907 (Add "const" to argument of constexpr constructor causes the… · 25cb6b33
      re PR c++/77907 (Add "const" to argument of constexpr constructor causes the object to be left in unconstructed state)
      
      	PR c++/77907
      	* cp-gimplify.c (cp_fold) <case CALL_EXPR>: When calling constructor
      	and maybe_constant_value returns non-CALL_EXPR, create INIT_EXPR
      	with the object on lhs and maybe_constant_value returned expr on rhs.
      
      	* g++.dg/cpp0x/pr77907.C: New test.
      
      From-SVN: r242790
      Jakub Jelinek committed
    • OpenMP offloading to NVPTX: libgomp changes · 6103184e
      	* Makefile.am (libgomp_la_SOURCES): Add atomic.c, icv.c, icv-device.c.
      	* Makefile.in. Regenerate.
      	* configure.ac [nvptx*-*-*] (libgomp_use_pthreads): Set and use it...
      	(LIBGOMP_USE_PTHREADS): ...here; new define.
      	* configure: Regenerate.
      	* config.h.in: Likewise.
      	* config/posix/affinity.c: Move to...
      	* affinity.c: ...here (new file).  Guard use of Pthreads-specific
      	interface by LIBGOMP_USE_PTHREADS. 
      	* critical.c: Split out GOMP_atomic_{start,end} into...
      	* atomic.c: ...here (new file).
      	* env.c: Split out ICV definitions into...
      	* icv.c: ...here (new file) and...
      	* icv-device.c: ...here. New file.
      	* config/linux/lock.c (gomp_init_lock_30): Move to generic lock.c.
      	(gomp_destroy_lock_30): Ditto.
      	(gomp_set_lock_30): Ditto.
      	(gomp_unset_lock_30): Ditto.
      	(gomp_test_lock_30): Ditto.
      	(gomp_init_nest_lock_30): Ditto.
      	(gomp_destroy_nest_lock_30): Ditto.
      	(gomp_set_nest_lock_30): Ditto.
      	(gomp_unset_nest_lock_30): Ditto.
      	(gomp_test_nest_lock_30): Ditto.
      	* lock.c: New.
      	* config/nvptx/lock.c: New.
      	* config/nvptx/bar.c: New.
      	* config/nvptx/bar.h: New.
      	* config/nvptx/doacross.h: New.
      	* config/nvptx/error.c: New.
      	* config/nvptx/icv-device.c: New.
      	* config/nvptx/mutex.h: New.
      	* config/nvptx/pool.h: New.
      	* config/nvptx/proc.c: New.
      	* config/nvptx/ptrlock.h: New.
      	* config/nvptx/sem.h: New.
      	* config/nvptx/simple-bar.h: New.
      	* config/nvptx/target.c: New.
      	* config/nvptx/task.c: New.
      	* config/nvptx/team.c: New.
      	* config/nvptx/time.c: New.
      	* config/posix/simple-bar.h: New.
      	* libgomp.h: Guard pthread.h inclusion.  Include simple-bar.h.
      	(gomp_num_teams_var): Declare.
      	(struct gomp_thread_pool): Change threads_dock member to
      	gomp_simple_barrier_t.
      	[__nvptx__] (gomp_thread): New implementation.
      	(gomp_thread_attr): Guard by LIBGOMP_USE_PTHREADS.
      	(gomp_thread_destructor): Ditto.
      	(gomp_init_thread_affinity): Ditto.
      	* team.c: Guard uses of Pthreads-specific interfaces by
      	LIBGOMP_USE_PTHREADS.  Adjust all uses of threads_dock.
      	(gomp_free_thread) [__nvptx__]: Do not call 'free'.
      
      	* config/nvptx/alloc.c: Delete.
      	* config/nvptx/barrier.c: Ditto.
      	* config/nvptx/fortran.c: Ditto.
      	* config/nvptx/iter.c: Ditto.
      	* config/nvptx/iter_ull.c: Ditto.
      	* config/nvptx/loop.c: Ditto.
      	* config/nvptx/loop_ull.c: Ditto.
      	* config/nvptx/ordered.c: Ditto.
      	* config/nvptx/parallel.c: Ditto.
      	* config/nvptx/priority_queue.c: Ditto.
      	* config/nvptx/sections.c: Ditto.
      	* config/nvptx/single.c: Ditto.
      	* config/nvptx/splay-tree.c: Ditto.
      	* config/nvptx/work.c: Ditto.
      
      	* testsuite/libgomp.fortran/fortran.exp (lang_link_flags): Pass
      	-foffload=-lgfortran in addition to -lgfortran.
      	* testsuite/libgomp.oacc-fortran/fortran.exp (lang_link_flags): Ditto.
      
      	* plugin/plugin-nvptx.c: Include <limits.h>.
      	(struct targ_fn_descriptor): Add new fields.
      	(struct ptx_device): Ditto.  Set them...
      	(nvptx_open_device): ...here.
      	(nvptx_adjust_launch_bounds): New.
      	(nvptx_host2dev): Allow NULL 'nvthd'.
      	(nvptx_dev2host): Ditto.
      	(GOMP_OFFLOAD_get_caps): Add GOMP_OFFLOAD_CAP_OPENMP_400.
      	(link_ptx): Adjust log sizes.
      	(nvptx_host2dev): Allow NULL 'nvthd'.
      	(nvptx_dev2host): Ditto.
      	(nvptx_set_clocktick): New.  Use it...
      	(GOMP_OFFLOAD_load_image): ...here.  Set new targ_fn_descriptor
      	fields.
      	(GOMP_OFFLOAD_dev2dev): New.
      	(nvptx_adjust_launch_bounds): New.
      	(nvptx_stacks_size): New.
      	(nvptx_stacks_alloc): New.
      	(nvptx_stacks_free): New.
      	(GOMP_OFFLOAD_run): New.
      	(GOMP_OFFLOAD_async_run): New (stub).
      
      Co-Authored-By: Dmitry Melnik <dm@ispras.ru>
      Co-Authored-By: Jakub Jelinek <jakub@redhat.com>
      
      From-SVN: r242789
      Alexander Monakov committed
    • * config/mcore/mcore.c (emit_new_cond_insn): Fix prototype. · 6251fe93
      From-SVN: r242788
      Jeff Law committed
    • iq2000.c (iq2000_rtx_costs): Avoid multiplication in boolean context warning. · 7d1f0a00
      	* config/iq2000/iq2000.c (iq2000_rtx_costs): Avoid multiplication
      	in boolean context warning.
      
      From-SVN: r242787
      Jeff Law committed
    • re PR middle-end/78153 (strlen return value can be assumed to be less than PTRDIFF_MAX) · cfe3d653
      2016-11-23  Prathamesh Kulkarni  <prathamesh.kulkarni@linaro.org>
      
      	PR middle-end/78153
      	* gimple-fold.c (fold_stmt_1): Handle case for GIMPLE_RETURN.
      	* tree-vrp.c (extract_range_basic): Handle case for
      	CFN_BUILT_IN_STRLEN.
      
      testsuite/
      	* gcc.dg/tree-ssa/pr78153-1.c: New test.
      	* gcc.dg/tree-ssa/pr78153-2.c: Likewise.
      
      From-SVN: r242786
      Prathamesh Kulkarni committed
    • * config/ia64/ia64.c (ia64_emit_insn_before): Fix prototype. · 017fdefe
      From-SVN: r242785
      Jeff Law committed
    • [Patch ARM 17/17] Enable _Float16 for ARM and fix PR target/63250 · 5774b1fa
      gcc/
      
      	PR target/63250
      	* config/arm/arm-builtins.c (arm_simd_floatHF_type_node): Rename to...
      	(arm_fp16_type_node): ...This, make visibile.
      	(arm_simd_builtin_std_type): Rename arm_simd_floatHF_type_node to
      	arm_fp16_type_node.
      	(arm_init_simd_builtin_types): Likewise.
      	(arm_init_fp16_builtins): Likewise.
      	* config/arm/arm.c (arm_excess_precision): New.
      	(arm_floatn_mode): Likewise.
      	(TARGET_C_EXCESS_PRECISION): Likewise.
      	(TARGET_FLOATN_MODE): Likewise.
      	(arm_promoted_type): Only promote arm_fp16_type_node.
      	* config/arm/arm.h (arm_fp16_type_node): Declare.
      
      gcc/testsuite/
      
      	PR target/63250
      	* lib/target-supports.exp (add_options_for_float16): Add
      	-mfp16-format=ieee when testign arm*-*-*.
      
      From-SVN: r242784
      James Greenhalgh committed
    • [Patch 16/17 libgcc ARM] Half to double precision conversions · 5e0f10a0
      gcc/
      
      	* config/arm/arm.c (arm_convert_to_type): Delete.
      	(TARGET_CONVERT_TO_TYPE): Delete.
      	(arm_init_libfuncs): Enable trunc_optab from DFmode to HFmode.
      	(arm_libcall_uses_aapcs_base): Add trunc_optab from DF- to HFmode.
      	* config/arm/arm.h (TARGET_FP16_TO_DOUBLE): New.
      	* config/arm/arm.md (truncdfhf2): Only convert through SFmode if we
      	are in fast math mode, and have no single step hardware instruction.
      	(extendhfdf2): Only expand through SFmode if we don't have a
      	single-step hardware instruction.
      	* config/arm/vfp.md (*truncdfhf2): New.
      	(extendhfdf2): Likewise.
      
      gcc/testsuite/
      
      	* gcc.target/arm/fp16-rounding-alt-1.c (ROUNDED): Change expected
      	result.
      	* gcc.target/arm/fp16-rounding-ieee-1.c (ROUNDED): Change expected
      	result.
      
      From-SVN: r242783
      James Greenhalgh committed
    • [Patch 15/17 libgcc ARM] Add double to half conversions. · bea64ca3
      libgcc/
      
      	* config/arm/fp16.c (binary64): New.
      	(__gnu_d2h_internal): New.
      	(__gnu_d2h_ieee): New.
      	(__gnu_d2h_alternative): New.
      
      
      
      Co-Authored-By: Matthew Wahab <matthew.wahab@arm.com>
      
      From-SVN: r242782
      James Greenhalgh committed
    • [Patch 14/17] [libgcc, ARM] Generalise float-to-half conversion function. · 8630cadb
      libgcc/
      
      	* config/arm/fp16.c (struct format): New.
      	(binary32): New.
      	(__gnu_float2h_internal): New.  Body moved from
      	__gnu_f2h_internal and generalize.
      	(_gnu_f2h_internal): Move body to function __gnu_float2h_internal.
      	Call it with binary32.
      
      
      
      Co-Authored-By: Matthew Wahab <matthew.wahab@arm.com>
      
      From-SVN: r242781
      James Greenhalgh committed
    • [Patch testsuite patch 10/17] Add options for floatN when checking effective target for support · ba75a174
      gcc/testsuite/
      
      	* lib/target-supports.exp (check_effective_target_float16): Add
      	options for _Float16.
      	(check_effective_target_float32): Add options for _Float32.
      	(check_effective_target_float64): Add options for _Float64.
      	(check_effective_target_float128): Add options for _Float128.
      	(check_effective_target_float32x): Add options for _Float32x.
      	(check_effective_target_float64x): Add options for _Float64x.
      	(check_effective_target_float128x): Add options for _Float128x.
      
      From-SVN: r242780
      James Greenhalgh committed
    • [Patch libgcc 9/17] Update soft-fp from glibc · 5823ef23
      libgcc/
      
      	* soft-fp/extendhftf2.c: New.
      	* soft-fp/fixhfti.c: Likewise.
      	* soft-fp/fixunshfti.c: Likewise.
      	* soft-fp/floattihf.c: Likewise.
      	* soft-fp/floatuntihf.c: Likewise.
      	* soft-fp/half.h: Likewise.
      	* soft-fp/trunctfhf2.c: Likewise.
      
      From-SVN: r242779
      James Greenhalgh committed
    • [Patch 8/17] Make _Float16 available if HFmode is available · 3479b7af
      gcc/
      
      	* targhooks.c (default_floatn_mode): Enable _Float16 if a target
      	provides HFmode.
      
      From-SVN: r242778
      James Greenhalgh committed
    • [Patch 7/17] Delete TARGET_FLT_EVAL_METHOD and poison it. · 8f23c89a
      gcc/
      
      	* config/s390/s390.h (TARGET_FLT_EVAL_METHOD): Delete.
      	* config/m68k/m68k.h (TARGET_FLT_EVAL_METHOD): Delete.
      	* config/i386/i386.h (TARGET_FLT_EVAL_METHOD): Delete.
      	* defaults.h (TARGET_FLT_EVAL_METHOD): Delete.
      	* doc/tm.texi.in (TARGET_FLT_EVAL_METHOD): Delete.
      	* doc/tm.texi: Regenerate.
      	* system.h (TARGET_FLT_EVAL_METHOD): Poison.
      
      From-SVN: r242777
      James Greenhalgh committed
    • [Patch 6/17] Migrate excess precision logic to use TARGET_EXCESS_PRECISION · 56d8ffc1
      gcc/
      
      	* toplev.c (init_excess_precision): Delete most logic.
      	* tree.c (excess_precision_type): Rewrite to use
      	TARGET_EXCESS_PRECISION.
      	* doc/invoke.texi (-fexcess-precision): Document behaviour in a
      	more generic fashion.
      	* ginclude/float.h: Wrap definition of FLT_EVAL_METHOD in
      	__STDC_WANT_IEC_60559_TYPES_EXT__.
      
      gcc/c-family/
      
      	* c-common.c (excess_precision_mode_join): New.
      	(c_ts18661_flt_eval_method): New.
      	(c_c11_flt_eval_method): Likewise.
      	(c_flt_eval_method): Likewise.
      	* c-common.h (excess_precision_mode_join): New.
      	(c_flt_eval_method): Likewise.
      	* c-cppbuiltin.c (c_cpp_flt_eval_method_iec_559): New.
      	(cpp_iec_559_value): Call it.
      	(c_cpp_builtins): Modify logic for __LIBGCC_*_EXCESS_PRECISION__,
      	call c_flt_eval_method to set __FLT_EVAL_METHOD__ and
      	__FLT_EVAL_METHOD_TS_18661_3__.
      
      gcc/testsuite/
      
      	* gcc.dg/fpermitted-flt-eval-methods_3.c: New.
      	* gcc.dg/fpermitted-flt-eval-methods_4.c: Likewise.
      
      From-SVN: r242776
      James Greenhalgh committed
    • [Patch 5/17] Add -fpermitted-flt-eval-methods=[c11|ts-18661-3] · 04f0fcf7
      gcc/c-family/
      
      	* c-opts.c (c_common_post_options): Add logic to handle the default
      	case for -fpermitted-flt-eval-methods.
      
      gcc/
      
      	* common.opt (fpermitted-flt-eval-methods): New.
      	* doc/invoke.texi (-fpermitted-flt-eval-methods): Document it.
      	* flag_types.h (permitted_flt_eval_methods): New.
      
      gcc/testsuite/
      
      	* gcc.dg/fpermitted-flt-eval-methods_1.c: New.
      	* gcc.dg/fpermitted-flt-eval-methods_2.c: New.
      
      From-SVN: r242775
      James Greenhalgh committed
    • [Patch 4/17] Implement TARGET_C_EXCESS_PRECISION for m68k · b89de1b8
      gcc/
      
      	* config/m68k/m68k.c (m68k_excess_precision): New.
      	(TARGET_C_EXCESS_PRECISION): Define.
      
      From-SVN: r242774
      James Greenhalgh committed
    • [Patch 3/17] Implement TARGET_C_EXCESS_PRECISION for s390 · 638108bd
      	* config/s390/s390.c (s390_excess_precision): New.
      	(TARGET_C_EXCESS_PRECISION): Define.
      
      From-SVN: r242773
      James Greenhalgh committed
    • [Patch 2/17] Implement TARGET_C_EXCESS_PRECISION for i386 · e97dfd54
      gcc/
      	* config/i386/i386.c (ix86_excess_precision): New.
      	(TARGET_C_EXCESS_PRECISION): Define.
      
      From-SVN: r242772
      James Greenhalgh committed
    • [Patch 1/17] Add a new target hook for describing excess precision intentions · 42e02b20
      gcc/
      
      	* target.def (excess_precision): New hook.
      	* target.h (flt_eval_method): New.
      	(excess_precision_type): Likewise.
      	* targhooks.c (default_excess_precision): New.
      	* targhooks.h (default_excess_precision): New.
      	* doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): New.
      	* doc/tm.texi: Regenerate.
      
      From-SVN: r242771
      James Greenhalgh committed
    • PR middle-end/78461 - [7 Regression] ICE: in operator+= · aecc521b
      gcc/testsuite/ChangeLog:
      
      	PR middle-end/78461
      	* gcc.dg/tree-ssa/builtin-sprintf-4.c: New test.
      	* gcc.dg/tree-ssa/builtin-sprintf-warn-2.c: Adjust warning text.
      
      gcc/ChangeLog:
      
      	PR middle-end/78461
      	* gimple-ssa-sprintf.c (format_string): Correct the maxima and
      	set the minimum number of bytes for an unknown string to zero.
      
      From-SVN: r242769
      Martin Sebor committed
    • re PR c++/71450 (ICE on invalid C++11 code on x86_64-linux-gnu: in tree check:… · 1b3314dd
      re PR c++/71450 (ICE on invalid C++11 code on x86_64-linux-gnu: in tree check: expected record_type or union_type or qual_union_type, have template_type_parm in lookup_base, at cp/search.c:203)
      
      	PR c++/71450
      	* pt.c (tsubst_copy): Return error_mark_node when mark_used
      	fails, even when complain & tf_error.
      
      	* g++.dg/cpp0x/pr71450-1.C: New test.
      	* g++.dg/cpp0x/pr71450-2.C: New test.
      
      From-SVN: r242767
      Jakub Jelinek committed
    • re PR c++/77739 (internal compiler error: in create_tmp_var, at gimple-expr.c:524) · e2df2328
      	PR c++/77739
      	* cp-gimplify.c (cp_gimplify_tree) <case VEC_INIT_EXPR>: Pass
      	false as handle_invisiref_parm_p to cp_genericize_tree.
      	(struct cp_genericize_data): Add handle_invisiref_parm_p field.
      	(cp_genericize_r): Don't wrap is_invisiref_parm into references
      	if !wtd->handle_invisiref_parm_p.
      	(cp_genericize_tree): Add handle_invisiref_parm_p argument,
      	set wtd.handle_invisiref_parm_p to it.
      	(cp_genericize): Pass true as handle_invisiref_parm_p to
      	cp_genericize_tree.  Formatting fix.
      
      	* g++.dg/cpp1y/pr77739.C: New test.
      
      From-SVN: r242766
      Jakub Jelinek committed
    • Fix PR number in ChangeLog · 75631692
      From-SVN: r242765
      Jonathan Wakely committed
    • backport: hsa-builtins.def: New file. · 56b1c60e
      Merge from HSA branch to trunk
      
      2016-11-23  Martin Jambor  <mjambor@suse.cz>
      	    Martin Liska  <mliska@suse.cz>
      
      gcc/
      	* hsa-builtins.def: New file.
      	* Makefile.in (BUILTINS_DEF): Add hsa-builtins.def dependency.
      	* builtins.def: Include hsa-builtins.def.
      	(DEF_HSA_BUILTIN): New macro.
      	* dumpfile.h (OPTGROUP_OPENMP): Define.
      	* dumpfile.c (optgroup_options): Added OPTGROUP_OPENMP.
      	* gimple.h (gf_mask): Added elements GF_OMP_FOR_GRID_INTRA_GROUP and
      	GF_OMP_FOR_GRID_GROUP_ITER.
      	(gimple_omp_for_grid_phony): Added checking assert.
      	(gimple_omp_for_set_grid_phony): Likewise.
      	(gimple_omp_for_grid_intra_group): New function.
      	(gimple_omp_for_set_grid_intra_group): Likewise.
      	(gimple_omp_for_grid_group_iter): Likewise.
      	(gimple_omp_for_set_grid_group_iter): Likewise.
      	* omp-low.c (check_omp_nesting_restrictions): Allow GRID loop where
      	previosuly only distribute loop was permitted.
      	(lower_lastprivate_clauses): Allow non tcc_comparison predicates.
      	(grid_get_kernel_launch_attributes): Support multiple HSA grid
      	dimensions.
      	(grid_expand_omp_for_loop): Likewise and also support standalone
      	distribute constructs.  New parameter INTRA_GROUP, updated both users.
      	(grid_expand_target_grid_body): Support standalone distribute
      	constructs.
      	(pass_data_expand_omp): Changed optinfo_flags to OPTGROUP_OPENMP.
      	(pass_data_expand_omp_ssa): Likewise.
      	(pass_data_omp_device_lower): Likewsie.
      	(pass_data_lower_omp): Likewise.
      	(pass_data_diagnose_omp_blocks): Likewise.
      	(pass_data_oacc_device_lower): Likewise.
      	(pass_data_omp_target_link): Likewise.
      	(grid_lastprivate_predicate): New function.
      	(lower_omp_for_lastprivate): Call grid_lastprivate_predicate for
      	gridified loops.
      	(lower_omp_for): Support standalone distribute constructs.
      	(grid_prop): New type.
      	(grid_safe_assignment_p): Check for assignments to group_sizes, new
      	parameter GRID.
      	(grid_seq_only_contains_local_assignments): New parameter GRID, pass
      	it to callee.
      	(grid_find_single_omp_among_assignments_1): Likewise, improve missed
      	optimization info messages.
      	(grid_find_single_omp_among_assignments): Likewise.
      	(grid_find_ungridifiable_statement): Do not bail out for SIMDs.
      	(grid_parallel_clauses_gridifiable): New function.
      	(grid_inner_loop_gridifiable_p): Likewise.
      	(grid_dist_follows_simple_pattern): Likewise.
      	(grid_gfor_follows_tiling_pattern): Likewise.
      	(grid_call_permissible_in_distribute_p): Likewise.
      	(grid_handle_call_in_distribute): Likewise.
      	(grid_dist_follows_tiling_pattern): Likewise.
      	(grid_target_follows_gridifiable_pattern): Support standalone distribute
      	constructs.
      	(grid_var_segment): New enum.
      	(grid_mark_variable_segment): New function.
      	(grid_copy_leading_local_assignments): Call grid_mark_variable_segment
      	if a new argument says so.
      	(grid_process_grid_body): New function.
      	(grid_eliminate_combined_simd_part): Likewise.
      	(grid_mark_tiling_loops): Likewise.
      	(grid_mark_tiling_parallels_and_loops): Likewise.
      	(grid_process_kernel_body_copy): Support standalone distribute
      	constructs.
      	(grid_attempt_target_gridification): New grid variable holding overall
      	gridification state.  Support standalone distribute constructs and
      	collapse clauses.
      	* doc/optinfo.texi (Optimization groups): Document OPTGROUP_OPENMP.
      	* hsa.h (hsa_bb): Add method method append_phi.
      	(hsa_insn_br): Renamed to hsa_insn_cbr, renamed all
      	occurences in all files too.
      	(hsa_insn_br): New class, now the ancestor of hsa_incn_cbr.
      	(is_a_helper <hsa_insn_br *>::test): New function.
      	(is_a_helper <hsa_insn_cbr *>::test): Adjust to only cover conditional
      	branch instructions.
      	(hsa_insn_signal): Make a direct descendant of
      	hsa_insn_basic.  Add memorder constructor parameter and
      	m_memory_order and m_signalop member variables.
      	(hsa_insn_queue): Changed constructor parameters to common form.
      	Added m_segment and m_memory_order member variables.
      	(hsa_summary_t): Add private member function
      	process_gpu_implementation_attributes.
      	(hsa_function_summary): Rename m_binded_function to
      	m_bound_function.
      	(hsa_insn_basic_p): Remove typedef.
      	(hsa_op_with_type): Change hsa_insn_basic_p into plain pointers.
      	(hsa_op_reg_p): Remove typedef.
      	(hsa_function_representation): Change hsa_op_reg_p into plain
      	pointers.
      	(hsa_insn_phi): Removed new and delete operators.
      	(hsa_insn_br): Likewise.
      	(hsa_insn_cbr): Likewise.
      	(hsa_insn_sbr): Likewise.
      	(hsa_insn_cmp): Likewise.
      	(hsa_insn_mem): Likewise.
      	(hsa_insn_atomic): Likewise.
      	(hsa_insn_signal): Likewise.
      	(hsa_insn_seg): Likewise.
      	(hsa_insn_call): Likewise.
      	(hsa_insn_arg_block): Likewise.
      	(hsa_insn_comment): Likewise.
      	(hsa_insn_srctype): Likewise.
      	(hsa_insn_packed): Likewise.
      	(hsa_insn_cvt): Likewise.
      	(hsa_insn_alloca): Likewise.
      	* hsa.c (hsa_destroy_insn): Also handle instances of hsa_insn_br.
      	(process_gpu_implementation_attributes): New function.
      	(link_functions): Move some functionality into it.  Adjust after
      	renaming m_binded_functions to m_bound_functions.
      	(hsa_insn_basic::op_output_p): Add BRIG_OPCODE_DEBUGTRAP
      	to the list of instructions with no output registers.
      	(get_in_type): Return this if it is a register of
      	matching size.
      	(hsa_get_declaration_name): Moved to...
              * hsa-gen.c (hsa_get_declaration_name): ...here.  Allocate
      	temporary string on an obstack instead from ggc.
      	(query_hsa_grid): Renamed to query_hsa_grid_dim, reimplemented, cut
      	down to two overloads.
      	(hsa_allocp_operand_address): Removed.
      	(hsa_allocp_operand_immed): Likewise.
      	(hsa_allocp_operand_reg): Likewise.
      	(hsa_allocp_operand_code_list): Likewise.
      	(hsa_allocp_operand_operand_list): Likewise.
      	(hsa_allocp_inst_basic): Likewise.
      	(hsa_allocp_inst_phi): Likewise.
      	(hsa_allocp_inst_mem): Likewise.
      	(hsa_allocp_inst_atomic): Likewise.
      	(hsa_allocp_inst_signal): Likewise.
      	(hsa_allocp_inst_seg): Likewise.
      	(hsa_allocp_inst_cmp): Likewise.
      	(hsa_allocp_inst_br): Likewise.
      	(hsa_allocp_inst_sbr): Likewise.
      	(hsa_allocp_inst_call): Likewise.
      	(hsa_allocp_inst_arg_block): Likewise.
      	(hsa_allocp_inst_comment): Likewise.
      	(hsa_allocp_inst_queue): Likewise.
      	(hsa_allocp_inst_srctype): Likewise.
      	(hsa_allocp_inst_packed): Likewise.
      	(hsa_allocp_inst_cvt): Likewise.
      	(hsa_allocp_inst_alloca): Likewise.
      	(hsa_allocp_bb): Likewise.
      	(hsa_obstack): New.
      	(hsa_init_data_for_cfun): Initialize obstack.
      	(hsa_deinit_data_for_cfun): Release memory of the obstack.
      	(hsa_op_immed::operator new): Use obstack instead of object_allocator.
      	(hsa_op_reg::operator new): Likewise.
      	(hsa_op_address::operator new): Likewise.
      	(hsa_op_code_list::operator new): Likewise.
      	(hsa_op_operand_list::operator new): Likewise.
      	(hsa_insn_basic::operator new): Likewise.
      	(hsa_insn_phi::operator new): Likewise.
      	(hsa_insn_br::operator new): Likewise.
      	(hsa_insn_sbr::operator new): Likewise.
      	(hsa_insn_cmp::operator new): Likewise.
      	(hsa_insn_mem::operator new): Likewise.
      	(hsa_insn_atomic::operator new): Likewise.
      	(hsa_insn_signal::operator new): Likewise.
      	(hsa_insn_seg::operator new): Likewise.
      	(hsa_insn_call::operator new): Likewise.
      	(hsa_insn_arg_block::operator new): Likewise.
      	(hsa_insn_comment::operator new): Likewise.
      	(hsa_insn_srctype::operator new): Likewise.
      	(hsa_insn_packed::operator new): Likewise.
      	(hsa_insn_cvt::operator new): Likewise.
      	(hsa_insn_alloca::operator new): Likewise.
      	(hsa_init_new_bb): Likewise.
      	(hsa_bb::append_phi): New function.
      	(gen_hsa_phi_from_gimple_phi): Use it.
      	(get_symbol_for_decl): Fix dinstinguishing between
      	global and local functions.  Put local variables into a segment
      	according to their attribute or static flag, if there is one.
      	(hsa_insn_br::hsa_insn_br): New.
      	(hsa_insn_br::operator new): Likewise.
      	(hsa_insn_cbr::hsa_insn_cbr): Set width via ancestor constructor.
      	(query_hsa_grid_nodim): New function.
      	(multiply_grid_dim_characteristics): Likewise.
      	(gen_get_num_threads): Likewise.
      	(gen_get_num_teams): Reimplemented.
      	(gen_get_team_num): Likewise.
      	(gen_hsa_insns_for_known_library_call): Updated calls to the above
      	helper functions.
      	(get_memory_order_name): Removed.
      	(get_memory_order): Likewise.
      	(hsa_memorder_from_tree): New function.
      	(gen_hsa_ternary_atomic_for_builtin): Renamed to
      	gen_hsa_atomic_for_builtin, can also create signals.
      	(gen_hsa_insns_for_call): Handle many new builtins.  Adjust to use
      	hsa_memory_order_from_tree and gen_hsa_atomic_for_builtin.
      	(hsa_insn_atomic): Fix function comment.
      	(hsa_insn_signal::hsa_insn_signal): Fix comment.  Update call to
      	ancestor constructor and initialization of new member variables.
      	(hsa_insn_queue::hsa_insn_queue): Added initialization of new
      	member variables.
      	(hsa_get_host_function): Handle functions with no bound CPU
      	implementation.  Fix binded to bound.
      	(get_brig_function_name): Likewise.
      	(HSA_SORRY_ATV): Remove semicolon after macro.
      	(HSA_SORRY_AT): Likewise.
      	(omp_simple_builtin::generate): Add missing semicolons.
      	(hsa_insn_phi::operator new): Removed.
      	(hsa_insn_br::operator new): Likewise.
      	(hsa_insn_cbr::operator new): Likewise.
      	(hsa_insn_sbr::operator new): Likewise.
      	(hsa_insn_cmp::operator new): Likewise.
      	(hsa_insn_mem::operator new): Likewise.
      	(hsa_insn_atomic::operator new): Likewise.
      	(hsa_insn_signal::operator new): Likewise.
      	(hsa_insn_seg::operator new): Likewise.
      	(hsa_insn_call::operator new): Likewise.
      	(hsa_insn_arg_block::operator new): Likewise.
      	(hsa_insn_comment::operator new): Likewise.
      	(hsa_insn_srctype::operator new): Likewise.
      	(hsa_insn_packed::operator new): Likewise.
      	(hsa_insn_cvt::operator new): Likewise.
      	(hsa_insn_alloca::operator new): Likewise.
      	(get_symbol_for_decl): Accept CONST_DECLs, put them to
      	readonly segment.
      	(gen_hsa_addr): Also process CONST_DECLs.
      	(gen_hsa_addr_insns): Process CONST_DECLs by creating private
      	copies.
      	(gen_hsa_unary_operation): Make sure the function does
      	not use bittype source type for firstbit and lastbit operations.
      	(gen_hsa_popcount_to_dest): Make sure the function uses a bittype
      	source type.
      	* hsa-brig.c (emit_insn_operands): Cope with zero operands in an
      	instruction.
      	(emit_branch_insn): Renamed to emit_cond_branch_insn.
      	Emit the width stored in the class.
      	(emit_generic_branch_insn): New function.
      	(emit_insn): Call emit_generic_branch_insn.
      	(emit_signal_insn): Remove obsolete comment.  Update
      	member variable name, pick a type according to profile.
      	(emit_alloca_insn): Remove obsolete comment.
      	(emit_atomic_insn): Likewise.
      	(emit_queue_insn): Get segment and memory order from the IR object.
      	(hsa_brig_section): Make allocate_new_chunk, chunks
      	and cur_chunk provate, add a default NULL parameter to add method.
      	(hsa_brig_section::add): Added a new parameter, store pointer to
      	output data there if it is non-NULL.
      	(emit_function_directives): Use this new parameter instead of
      	calculating the pointer itself, fix function comment.
      	(hsa_brig_emit_function): Add forgotten endian conversion.
      	(hsa_output_kernels): Remove unnecessary building of
      	kernel_dependencies_vector_type.
      	(emit_immediate_operand): Declare.
      	(emit_directive_variable): Also emit initializers of CONST_DECLs.
      	(gen_hsa_insn_for_internal_fn_call): Also handle IFN_RSQRT.
      	(verify_function_arguments): Properly detect variadic
      	arguments.
      	* hsa-dump.c (hsa_width_specifier_name): New function.
      	(dump_hsa_insn_1): Dump generic branch instructions, update signal
      	member variable name.  Special dumping for queue objects.
      	* ipa-hsa.c (process_hsa_functions): Adjust after renaming
      	m_binded_functions to m_bound_functions.  Copy externally visible flag
      	to the node.
      	(ipa_hsa_write_summary): Likewise.
      	(ipa_hsa_read_section): Likewise.
      
      
      	gcc/fortran/
              * f95-lang.c (DEF_HSA_BUILTIN): New macro.
      
      gcc/testsuite/
      	* c-c++-common/gomp/gridify-1.c: Update scan string.
      	* gfortran.dg/gomp/gridify-1.f90: Likewise.
      	* c-c++-common/gomp/gridify-2.c: New test.
      	* c-c++-common/gomp/gridify-3.c: Likewise.
      
      libgomp/
      	* testsuite/libgomp.hsa.c/bits-insns.c: New test.
      	* testsuite/libgomp.hsa.c/tiling-1.c: Likewise.
      	* testsuite/libgomp.hsa.c/tiling-2.c: Likewise.
      
      
      Co-Authored-By: Martin Liska <mliska@suse.cz>
      
      From-SVN: r242761
      Martin Jambor committed
    • PR78494 add missing returns to propagate_const · f6cdfe82
      2016-11-23  Felix Morgner  <felix.morgner@gmail.com>
      	    Jonathan Wakely  <jwakely@redhat.com>
      
      	PR libstdc++/78494
      	* include/experimental/propagate_const (propagate_const::operator=):
      	Add missing return statements.
      	* testsuite/experimental/propagate_const/assignment/move_neg.cc:
      	Adjust dg-error line numbers.
      	* testsuite/experimental/propagate_const/requirements2.cc: Likewise.
      
      Co-Authored-By: Jonathan Wakely <jwakely@redhat.com>
      
      From-SVN: r242760
      Felix Morgner committed
    • re PR tree-optimization/78396 (gcc.dg/vect/bb-slp-cond-1.c FAILs after fix for PR77848) · e600f219
      2016-11-23  Richard Biener  <rguenther@suse.de>
      
      	PR tree-optimization/78396
      	* tree-vectorizer.c (vectorize_loops): If an innermost loop didn't
      	vectorize try vectorizing an if-converted body using BB vectorization.
      
      	* gcc.dg/vect/bb-slp-cond-1.c: Adjust.
      
      From-SVN: r242759
      Richard Biener committed
    • Rework subreg_get_info · 07ce1466
      This isn't intended to change the behaviour, just rewrite the
      existing logic in a different (and hopefully clearer) way.
      The new form -- particularly the part based on the "block"
      concept -- is easier to convert to polynomial sizes.
      
      gcc/
      2016-11-15  Richard Sandiford  <richard.sandiford@arm.com>
      	    Alan Hayward  <alan.hayward@arm.com>
      	    David Sherwood  <david.sherwood@arm.com>
      
      	* rtlanal.c (subreg_get_info): Use more local variables.
      	Remark that for HARD_REGNO_NREGS_HAS_PADDING, each scalar unit
      	occupies at least one register.  Assume that full hard registers
      	have consistent endianness.  Share previously-duplicated if block.
      	Rework the main handling so that it operates on independently-
      	addressable YMODE-sized blocks.  Use subreg_size_lowpart_offset
      	to check lowpart offsets, without trying to find an equivalent
      	integer mode first.  Handle WORDS_BIG_ENDIAN != REG_WORDS_BIG_ENDIAN
      	as a final register-endianness correction.
      
      Co-Authored-By: Alan Hayward <alan.hayward@arm.com>
      Co-Authored-By: David Sherwood <david.sherwood@arm.com>
      
      From-SVN: r242758
      Richard Sandiford committed
    • combine: Convert subreg-of-lshiftrt to zero_extract properly (PR78390) · 712871dd
      r242414, for PR77881, introduces some bugs (PR78390, PR78438, PR78477).
      It all has the same root cause: that patch makes combine convert every
      lowpart subreg of a logical shift right to a zero_extract.  This cannot
      work at all if it is not a constant shift, and it has to be a bit more
      careful exactly which bits it extracts.
      
      
      	PR target/77881
      	PR bootstrap/78390
      	PR target/78438
      	PR bootstrap/78477
      	* combine.c (make_compound_operation_int): Do not convert a subreg of
      	a non-constant logical shift right to a zero_extract.  Handle the case
      	where some zero bits have been shifted into the range covered by that
      	subreg.
      
      From-SVN: r242757
      Segher Boessenkool committed
    • Add more subreg offset helpers · 33951763
      Provide versions of subreg_lowpart_offset and subreg_highpart_offset
      that work on mode sizes rather than modes.  Also provide a routine
      that converts an lsb position to a subreg offset.
      
      The intent (in combination with later patches) is to move the
      handling of the BYTES_BIG_ENDIAN != WORDS_BIG_ENDIAN case into
      just two places, so that for other combinations we don't have
      to split offsets into words and subwords.
      
      gcc/
      2016-11-15  Richard Sandiford  <richard.sandiford@arm.com>
      	    Alan Hayward  <alan.hayward@arm.com>
      	    David Sherwood  <david.sherwood@arm.com>
      
      	* rtl.h (subreg_size_offset_from_lsb): Declare.
      	(subreg_offset_from_lsb): New function.
      	(subreg_size_lowpart_offset): Declare.
      	(subreg_lowpart_offset): Turn into an inline function.
      	(subreg_size_highpart_offset): Declare.
      	(subreg_highpart_offset): Turn into an inline function.
      	* emit-rtl.c (subreg_size_lowpart_offset): New function.
      	(subreg_size_highpart_offset): Likewise
      	* rtlanal.c (subreg_size_offset_from_lsb): Likewise.
      
      Co-Authored-By: Alan Hayward <alan.hayward@arm.com>
      Co-Authored-By: David Sherwood <david.sherwood@arm.com>
      
      From-SVN: r242755
      Richard Sandiford committed
    • re PR tree-optimization/78482 (wrong code at -O3 in both 32-bit and 64-bit modes… · cbb88345
      re PR tree-optimization/78482 (wrong code at -O3 in both 32-bit and 64-bit modes on x86_64-linux-gnu)
      
      2016-11-23  Richard Biener  <rguenther@suse.de>
      
      	PR tree-optimization/78482
      	* tree-cfgcleanup.c: Include tree-ssa-loop-niter.h.
      	(remove_forwarder_block_with_phi): When merging with a loop
      	header creates a new latch reset number of iteration information
      	of the loop.
      
      	* gcc.dg/torture/pr78482.c: New testcase.
      
      From-SVN: r242754
      Richard Biener committed
    • sparc.md (*ashrsi3_extend): Rename to... · abe40574
      	* config/sparc/sparc.md (*ashrsi3_extend): Rename to...
      	(*ashrsi3_extend0): ...this.  Accept constant integers.
      	(*ashrsi3_extend2): Rename to...
      	(*ashrsi3_extend1): ...this.
      	(*ashrsi3_extend2): New pattern.
      	(*lshrsi3_extend1): Accept constant integers.
      	(*lshrsi3_extend2): Fix condition on operand 2.
      
      From-SVN: r242753
      Eric Botcazou committed
    • i386.c: Initialize function pointer to NULL to prevent · 4987b1e0
      	* config/i386/i386.c: Initialize function pointer to NULL.
      
      From-SVN: r242752
      Martin Liska committed
    • fold-const.c (fold_cond_expr_with_comparison): Move simplification for A == C1 ? A : C2 to below. · e2535011
      	* fold-const.c (fold_cond_expr_with_comparison): Move simplification
      	for A == C1 ? A : C2 to below.
      	* match.pd: Move from above to here:
      	(cond (eq (convert1? x) c1) (convert2? x) c2)
      	  -> (cond (eq x c1) c1 c2).
      
      From-SVN: r242751
      Bin Cheng committed
    • fold-const.c (fold_cond_expr_with_comparison): Move simplification for A cmp C1… · b5481987
      fold-const.c (fold_cond_expr_with_comparison): Move simplification for A cmp C1 ? A : C2 to below, also simplify remaining code.
      
      	* fold-const.c (fold_cond_expr_with_comparison): Move simplification
      	for A cmp C1 ? A : C2 to below, also simplify remaining code.
      	* match.pd: Move and extend simplification from above to here:
      	(cond (cmp (convert1? x) c1) (convert2? x) c2) -> (minmax (x c)).
      	* tree-if-conv.c (ifcvt_follow_ssa_use_edges): New func.
      	(predicate_scalar_phi): Call fold_stmt using the new valueize func.
      
      	gcc/testsuite
      	* gcc.dg/fold-cond_expr-1.c: New test.
      	* gcc.dg/fold-condcmpconv-1.c: New test.
      	* gcc.dg/fold-condcmpconv-2.c: New test.
      
      From-SVN: r242750
      Bin Cheng committed