06 Jul, 2016 9 commits
    • [6/7] Explicitly classify vector loads and stores · 2de001ee
      This is the main patch in the series.  It adds a new enum and routines
      for classifying a vector load or store implementation.
      
      Originally there were three motivations:
      
            (1) Reduce cut-&-paste
      
      (2) Make the chosen vectorisation strategy more obvious.  At the
          moment this is derived implicitly from various other bits of
          state (GROUPED, STRIDED, SLP, etc.).
      
            (3) Decouple the vectorisation strategy from those other bits of state,
                so that there can be a choice of implementation for a given scalar
                statement.  The specific problem here is that we class:
      
                    for (...)
                      {
                        ... = a[i * x];
                        ... = a[i * x + 1];
                      }
      
                as "strided and grouped" but:
      
                    for (...)
                      {
                        ... = a[i * 7];
                        ... = a[i * 7 + 1];
                      }
      
                as "non-strided and grouped".  Before the patch, "strided and
                grouped" loads would always try to use separate scalar loads
                while "non-strided and grouped" loads would always try to use
                load-and-permute.  But load-and-permute is never supported for
                a group size of 7, so the effect was that the first loop was
                vectorisable and the second wasn't.  It seemed odd that not
                knowing x (but accepting it could be 7) would allow more
                optimisation opportunities than knowing x is 7.
      
      Unfortunately, it looks like we underestimate the cost of separate
      scalar accesses on at least aarch64, so I've disabled (3) for now;
      see the "if" statement at the end of get_load_store_type.  I think
      the patch still does (1) and (2), so that's the justification for
      it in its current form.  It also means that (3) is now simply a
      case of removing the FIXME code, once the cost model problems have
      been sorted out.  (I did wonder about adding a --param, but that
      seems overkill.  I hope to get back to this during GCC 7 stage 1.)
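
As a rough sketch of the kind of classification the new enum provides
(the enumerator names here are an assumption based on the description
above, not copied from the patch):

      /* Sketch: one value per load/store implementation strategy.  */
      enum vect_memory_access_type {
        VMAT_INVARIANT,          /* Accesses that are invariant across the loop.  */
        VMAT_CONTIGUOUS,         /* Plain contiguous vector accesses.  */
        VMAT_CONTIGUOUS_PERMUTE, /* Contiguous accesses plus a permute.  */
        VMAT_LOAD_STORE_LANES,   /* Lane instructions such as LD3/ST3.  */
        VMAT_ELEMENTWISE,        /* Separate scalar accesses per element.  */
        VMAT_STRIDED_SLP,        /* Strided SLP accesses built up from pieces.  */
        VMAT_GATHER_SCATTER      /* Gather loads or scatter stores.  */
      };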
      
      Tested on aarch64-linux-gnu and x86_64-linux-gnu.
      
      gcc/
      	* tree-vectorizer.h (vect_memory_access_type): New enum.
      	(_stmt_vec_info): Add a memory_access_type field.
      	(STMT_VINFO_MEMORY_ACCESS_TYPE): New macro.
      	(vect_model_store_cost): Take an access type instead of a boolean.
      	(vect_model_load_cost): Likewise.
      	* tree-vect-slp.c (vect_analyze_slp_cost_1): Update calls to
      	vect_model_store_cost and vect_model_load_cost.
      	* tree-vect-stmts.c (vec_load_store_type): New enum.
      	(vect_model_store_cost): Take an access type instead of a
      	store_lanes_p boolean.  Simplify tests.
      	(vect_model_load_cost): Likewise, but for load_lanes_p.
      	(get_group_load_store_type, get_load_store_type): New functions.
      	(vectorizable_store): Use get_load_store_type.  Record the access
      	type in STMT_VINFO_MEMORY_ACCESS_TYPE.
      	(vectorizable_load): Likewise.
      	(vectorizable_mask_load_store): Likewise.  Replace is_store
      	variable with vls_type.
      
      From-SVN: r238038
      Richard Sandiford committed
    • [5/7] Move the fix for PR65518 · 4fb8ba9d
This patch moves the fix for PR65518 to the code that checks whether
load-and-permute operations are supported.  If the group size is
greater than the vectorisation factor, it would still be possible
to fall back to elementwise loads (as for strided groups) rather
than failing vectorisation entirely.
      
      Tested on aarch64-linux-gnu and x86_64-linux-gnu.
      
      gcc/
      	* tree-vectorizer.h (vect_grouped_load_supported): Add a
      	single_element_p parameter.
      	* tree-vect-data-refs.c (vect_grouped_load_supported): Likewise.
      	Check the PR65518 case here rather than in vectorizable_load.
	* tree-vect-loop.c (vect_analyze_loop_2): Update call accordingly.
      	* tree-vect-stmts.c (vectorizable_load): Likewise.
      
      From-SVN: r238037
      Richard Sandiford committed
    • [4/7] Add a gather_scatter_info structure · 134c85ca
      This patch just refactors the gather/scatter support so that all
      information is in a single structure, rather than separate variables.
      This reduces the number of arguments to a function added in patch 6.
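
A sketch of what such a structure might hold; the field names are
inferred from the commit message and surrounding description, not
copied from the patch:

      /* Sketch: everything vect_check_gather_scatter needs in order
         to describe a gather load or scatter store.  */
      struct gather_scatter_info {
        tree decl;            /* The gather/scatter built-in to call.  */
        tree base;            /* Loop-invariant base of the address.  */
        tree offset;          /* Per-element offset from the base.  */
        int scale;            /* Scale factor applied to the offset.  */
        enum vect_def_type offset_dt;  /* Definition type of the offset.  */
        tree offset_vectype;  /* Vector type used for the offset.  */
      };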
      
      Tested on aarch64-linux-gnu and x86_64-linux-gnu.
      
      gcc/
      	* tree-vectorizer.h (gather_scatter_info): New structure.
      	(vect_check_gather_scatter): Return a bool rather than a decl.
      	Replace return-by-pointer arguments with a single
      	gather_scatter_info *.
      	* tree-vect-data-refs.c (vect_check_gather_scatter): Likewise.
      	(vect_analyze_data_refs): Update call accordingly.
      	* tree-vect-stmts.c (vect_mark_stmts_to_be_vectorized): Likewise.
      	(vectorizable_mask_load_store): Likewise.  Also record the
      	offset dt and vectype in the gather_scatter_info.
      	(vectorizable_store): Likewise.
      	(vectorizable_load): Likewise.
      
      From-SVN: r238036
      Richard Sandiford committed
    • [3/7] Fix load/store costs for strided groups · 071e8018
      vect_model_store_cost had:
      
            /* Costs of the stores.  */
            if (STMT_VINFO_STRIDED_P (stmt_info)
                && !STMT_VINFO_GROUPED_ACCESS (stmt_info))
              {
                /* N scalar stores plus extracting the elements.  */
                inside_cost += record_stmt_cost (body_cost_vec,
      				       ncopies * TYPE_VECTOR_SUBPARTS (vectype),
      				       scalar_store, stmt_info, 0, vect_body);
      
      But non-SLP strided groups also use individual scalar stores rather than
      vector stores, so I think we should skip this only for SLP groups.
      
      The same applies to vect_model_load_cost.
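
In other words, the condition presumably becomes something like this
(illustrative rather than verbatim):

      /* Cost N scalar stores for all strided accesses except SLP
         groups, which still use vector stores.  */
      if (STMT_VINFO_STRIDED_P (stmt_info)
          && !(slp_node && STMT_VINFO_GROUPED_ACCESS (stmt_info)))
        {
          /* N scalar stores plus extracting the elements.  */
          inside_cost += record_stmt_cost (body_cost_vec,
                          ncopies * TYPE_VECTOR_SUBPARTS (vectype),
                          scalar_store, stmt_info, 0, vect_body);
        }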
      
      Tested on aarch64-linux-gnu and x86_64-linux-gnu.
      
      gcc/
      	* tree-vect-stmts.c (vect_model_store_cost): For non-SLP
      	strided groups, use the cost of N scalar accesses instead
      	of ncopies vector accesses.
      	(vect_model_load_cost): Likewise.
      
      From-SVN: r238035
      Richard Sandiford committed
    • [2/7] Clean up vectorizer load/store costs · 892a981f
      Add a bit more commentary and try to make the structure more obvious.
      The horrendous:
      
            if (grouped_access_p
                && represents_group_p
                && !store_lanes_p
                && !STMT_VINFO_STRIDED_P (stmt_info)
                && !slp_node)
      
      checks go away in patch 6.
      
      Tested on aarch64-linux-gnu and x86_64-linux-gnu.
      
      gcc/
      	* tree-vect-stmts.c (vect_cost_group_size): Delete.
      	(vect_model_store_cost): Avoid calling it.  Use first_stmt_p
      	variable to indicate when once-per-group costs are being used.
      	(vect_model_load_cost): Likewise.  Fix comment and misindented code.
      
      From-SVN: r238034
      Richard Sandiford committed
    • [1/7] Remove unnecessary peeling for gaps check · c01e092f
I recently relaxed the peeling-for-gaps conditions for LD3 but
kept them as-is for load-and-permute.  I don't think the conditions
are needed for load-and-permute either, though.  No current
load-and-permute should load outside the group, so if there is no
gap at the end, the final vector element loaded will correspond to
an element loaded by the original scalar loop.
      
The patch for PR68559 (a missed optimisation PR) increased the peeled
cases from "exact_log2 (groupsize) == -1" to "vf % group_size == 0"
(exact_log2 returns -1 only for values that are not powers of 2), so
before that fix, we didn't peel for gaps if there was no gap at the
end of the group and the group size was a power of 2.
      
      The only current non-power-of-2 load-and-permute size is 3, which
      doesn't require loading more than 3 vectors.
      
      The testcase is based on gcc.dg/vect/pr49038.c.
      
      Tested on aarch64-linux-gnu and x86_64-linux-gnu.
      
      gcc/
      	* tree-vect-stmts.c (vectorizable_load): Remove unnecessary
      	peeling-for-gaps condition.
      
      gcc/testsuite/
      	* gcc.dg/vect/group-no-gaps-1.c: New test.
      
      From-SVN: r238033
      Richard Sandiford committed
    • S/390: Fix vecinit expansion. · a07189f4
The fallback routine in the S/390 vecinit expander did not check
whether each of the initializer elements is a proper general_operand.
Since revision r236582 the expander has also been invoked with
operands such as symbol refs with an odd addend, resulting in
invalid insns.
      
      Fixed by forcing the element into a register in such cases.
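
A sketch of the fix (illustrative; the actual change is in
s390_expand_vec_init, and inner_mode here stands for the vector's
element mode):

      /* If an element is not a valid general_operand, e.g. a symbol
         ref with an odd addend, load it into a register before
         building the vector.  */
      if (!general_operand (elem, inner_mode))
        elem = force_reg (inner_mode, elem);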
      
      gcc/ChangeLog:
      
      2016-07-06  Andreas Krebbel  <krebbel@linux.vnet.ibm.com>
      
      	* config/s390/s390.c (s390_expand_vec_init): Force initializer
      	element to register if it doesn't match general_operand.
      
      From-SVN: r238032
      Andreas Krebbel committed
    • Fix MPX tests on systems with MPX disabled · 8070763a
      I have a Skylake system with MPX in the CPU, but MPX is disabled
      in the kernel configuration.
      
This makes all the MPX tests fail, because they assume that if MPX
appears in CPUID it works.

Check the output of XGETBV too, to detect kernels without MPX support.
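
A sketch of the kind of runtime check involved (the helper name is
made up; MPX requires the BNDREGS and BNDCSR state components, bits 3
and 4 of XCR0, to be enabled by the kernel):

      #include <stdint.h>

      /* Read XCR0 via xgetbv and check that the kernel has enabled
         both MPX state components.  */
      static int
      mpx_os_enabled (void)
      {
        uint32_t eax, edx;
        __asm__ ("xgetbv" : "=a" (eax), "=d" (edx) : "c" (0));
        return (eax & 0x18) == 0x18;  /* BNDREGS | BNDCSR.  */
      }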
      
      gcc/testsuite/:
      
      2016-07-05  Andi Kleen  <ak@linux.intel.com>
      
      	* gcc.target/i386/mpx/mpx-check.h: Check XGETBV output
      	if kernel supports MPX.
      
      From-SVN: r238031
      Andi Kleen committed
    • Daily bump. · 8217ad20
      From-SVN: r238029
      GCC Administrator committed