1. 09 Dec, 2014 26 commits
    • Add jit-tempdir.{c|h} · d1e5f2c7
      gcc/jit/ChangeLog:
              PR jit/64206
      	* Make-lang.in (jit_OBJS): Add jit/jit-tempdir.o.
      	* jit-common.h (gcc::jit::tempdir): New forward decl.
      	* jit-playback.c: Include jit-tempdir.h.
      	(gcc::jit::playback::context::context): Initialize m_tempdir.
      	(gcc::jit::playback::context::~context): Move tempdir
      	cleanup to new file jit-tempdir.c
      	(make_tempdir_path_template): Move to new file jit-tempdir.c.
      	(gcc::jit::playback::context::compile): Move tempdir creation
      	to new tempdir object in new file jit-tempdir.c.
      	(gcc::jit::playback::context::make_fake_args): Get path from
      	tempdir object rather than from member data.
      	(gcc::jit::playback::context::convert_to_dso): Likewise.
      	(gcc::jit::playback::context::dlopen_built_dso): Likewise.
      	(gcc::jit::playback::context::dump_generated_code): Likewise.
      	(gcc::jit::playback::context::get_path_c_file): New function.
      	(gcc::jit::playback::context::get_path_s_file): New function.
      	(gcc::jit::playback::context::get_path_so_file): New function.
      	* jit-playback.h (gcc::jit::playback::context::get_path_c_file):
      	New function.
      	(gcc::jit::playback::context::get_path_s_file): New function.
      	(gcc::jit::playback::context::get_path_so_file): New function.
      	(gcc::jit::playback::context): Move fields "m_path_template",
      	"m_path_tempdir", "m_path_c_file", "m_path_s_file",
      	"m_path_so_file" to new jit::tempdir class; add field "m_tempdir".
      	* jit-tempdir.c: New file.
      	* jit-tempdir.h: New file.
      
      From-SVN: r218533
      David Malcolm committed
    • [AArch64] Fix ICE on non-constant indices to __builtin_aarch64_im_lane_boundsi · 661fce82
      gcc/:
      
      	* config/aarch64/aarch64-builtins.c (aarch64_types_binopv_qualifiers,
      	TYPES_BINOPV): Delete.
      	(enum aarch64_builtins): Add AARCH64_BUILTIN_SIMD_LANE_CHECK and
      	AARCH64_SIMD_PATTERN_START.
      	(aarch64_init_simd_builtins): Register
      	__builtin_aarch64_im_lane_boundsi; use  AARCH64_SIMD_PATTERN_START.
      	(aarch64_simd_expand_builtin): Handle AARCH64_BUILTIN_LANE_CHECK; use
      	AARCH64_SIMD_PATTERN_START.
      
      	* config/aarch64/aarch64-simd.md (aarch64_im_lane_boundsi): Delete.
      	* config/aarch64/aarch64-simd-builtins.def (im_lane_bound): Delete.
      
      	* config/aarch64/arm_neon.h (__AARCH64_LANE_CHECK): New.
      	(__aarch64_vget_lane_f64, __aarch64_vget_lane_s64,
      	__aarch64_vget_lane_u64, __aarch64_vset_lane_any, vdupd_lane_f64,
      	vdupd_lane_s64, vdupd_lane_u64, vext_f32, vext_f64, vext_p8, vext_p16,
      	vext_s8, vext_s16, vext_s32, vext_s64, vext_u8, vext_u16, vext_u32,
      	vext_u64, vextq_f32, vextq_f64, vextq_p8, vextq_p16, vextq_s8,
      	vextq_s16, vextq_s32, vextq_s64, vextq_u8, vextq_u16, vextq_u32,
      	vextq_u64, vmulq_lane_f64): Use __AARCH64_LANE_CHECK.
      
      gcc/testsuite/:
      
      	* gcc.target/aarch64/simd/vset_lane_s16_const_1.c: New test.
      
      From-SVN: r218532
      Alan Lawrence committed
    • [AArch64]Fix ICE at -O0 on vld1_lane intrinsics · 2310e29f
      gcc/:
      
      	* config/aarch64/arm_neon.h (__AARCH64_NUM_LANES, __aarch64_lane *2):
      	New.
      	(aarch64_vset_lane_any): Redefine using previous, same for BE + LE.
      	(vset_lane_f32, vset_lane_f64, vset_lane_p8, vset_lane_p16,
      	vset_lane_s8, vset_lane_s16, vset_lane_s32, vset_lane_s64,
      	vset_lane_u8, vset_lane_u16, vset_lane_u32, vset_lane_u64): Remove
      	number of lanes.
      	(vld1_lane_f32, vld1_lane_f64, vld1_lane_p8, vld1_lane_p16,
      	vld1_lane_s8, vld1_lane_s16, vld1_lane_s32, vld1_lane_s64,
      	vld1_lane_u8, vld1_lane_u16, vld1_lane_u32, vld1_lane_u64): Call
      	__aarch64_vset_lane_any rather than vset_lane_xxx.
      
      gcc/testsuite/:
      
      	* gcc.target/aarch64/vld1_lane-o0.c: New test.
      
      From-SVN: r218531
      Alan Lawrence committed
    • vabs_intrinsic_2.c: New test. · 8f905d69
      gcc/testsuite/:
      
              * gcc.target/aarch64/vabs_intrinsic_2.c: New test.
      
      From-SVN: r218530
      Alan Lawrence committed
    • Fix bogus ChangeLog entry from r218521 · 82e0c914
      In r218521 I erroneously added the entries meant for gcc/jit/ChangeLog
      to gcc/ChangeLog instead.
      
      Move them to the correct ChangeLog file.
      
      Sorry for the noise.
      
      From-SVN: r218529
      David Malcolm committed
    • Guard less code with the JIT mutex · 38771e4e
      gcc/jit/ChangeLog:
      	* jit-playback.c (gcc::jit::playback::context::compile): Acquire the
      	mutex here, immediately before using toplev, and release it here, on
      	each exit path after acquisition.
      	(jit_mutex): Move this variable here, from jit-recording.c.
      	(gcc::jit::playback::context::acquire_mutex): New function, based on
      	code in jit-recording.c.
      	(gcc::jit::playback::context::release_mutex): Likewise.
      	* jit-playback.h (gcc::jit::playback::context::acquire_mutex): New
      	function.
      	(gcc::jit::playback::context::release_mutex): New function.
      	* jit-recording.c (jit_mutex): Move this variable to jit-playback.c.
      	(gcc::jit::recording::context::compile): Move mutex-handling from
      	here into jit-playback.c's gcc::jit::playback::context::compile.
      	* notes.txt: Update to show the new locations of ACQUIRE_MUTEX
      	and RELEASE_MUTEX.
      
      From-SVN: r218528
      David Malcolm committed
    • jit-playback.c: Move dlopen code into a new function · 38f4f641
      gcc/jit/ChangeLog:
      	* jit-playback.c (gcc::jit::playback::context::compile): Move the
      	dlopen code into...
      	(gcc::jit::playback::context::dlopen_built_dso): ...this new
      	function.
      	* jit-playback.h (gcc::jit::playback::context::dlopen_built_dso):
      	New function.
      
      From-SVN: r218527
      David Malcolm committed
    • [AArch64] Add TARGET_SCHED_REASSOCIATION_WIDTH · cee66c68
      2014-12-09  Wilco Dijkstra  <wilco.dijkstra@arm.com>
      
      	* gcc/config/aarch64/aarch64-protos.h (tune-params): Add reasociation
      	tuning parameters.
      	* gcc/config/aarch64/aarch64.c (TARGET_SCHED_REASSOCIATION_WIDTH):
      	Define.
      	(aarch64_reassociation_width): New function.
      	(generic_tunings): Add reassociation tuning parameters.
      	(cortexa53_tunings): Likewise.
      	(cortexa57_tunings): Likewise.
      	(thunderx_tunings): Likewise.
      
      From-SVN: r218526
      Wilco Dijkstra committed
    • [AARCH64][5/5] Add macro fusion support for cmp/b.X for ThunderX · 3759108f
      	* config/aarch64/aarch64.c (AARCH64_FUSE_CMP_BRANCH): New define.
      	(thunderx_tunings): Add AARCH64_FUSE_CMP_BRANCH to fuseable_ops.
      	(aarch_macro_fusion_pair_p): Handle AARCH64_FUSE_CMP_BRANCH.
      
      From-SVN: r218525
      Andrew Pinski committed
    • jit: Add new testcase missing from r218521 · 3b21bfb1
      I intended for r218521 to contain this file:
      
      gcc/testsuite/ChangeLog:
      	* jit.dg/test-error-unrecognized-dump.c: New file.
      
      Add it.
      
      From-SVN: r218522
      David Malcolm committed
    • PR jit/64166: Add API entrypoint gcc_jit_context_enable_dump · 463366a0
      gcc/jit/ChangeLog:
      	PR jit/64166
      	* docs/topics/contexts.rst (Debugging): Add description of
      	gcc_jit_context_enable_dump.
      	* docs/_build/texinfo/libgccjit.texi: Regenerate.
      	* jit-playback.c: Include context.h.
      	(class auto_argvec): New class.
      	(auto_argvec::~auto_argvec): New function.
      	(gcc::jit::playback::context::compile): Convert fake_args to be
      	an auto_argvec, so that it can contain dynamically-allocated
      	strings.   Construct a vec of all requested dumps, and pass it to
      	make_fake_args.  Extract requested dumps between the calls to
      	toplev::main and toplev::finalize.
      	(gcc::jit::playback::context::make_fake_args): Convert param
      	"argvec" to be a vec <char *>, and gain a "requested_dumps"
      	param.  Convert to dynamically-allocated arg strings by converting
      	ADD_ARG to take a copy of the arg, and add ADD_ARG_TAKE_OWNERSHIP
      	for args that are already a copy.  Add args for all requested dumps.
      	(gcc::jit::playback::context::extract_any_requested_dumps): New
      	function.
      	(gcc::jit::playback::context::read_dump_file): New function.
      	* jit-playback.h (gcc::jit::playback::context::make_fake_args):
      	Convert param "argvec" to be a vec <char *>, and gain a
      	"requested_dumps" param.
      	(gcc::jit::playback::context::extract_any_requested_dumps): New
      	function.
      	(gcc::jit::playback::context::read_dump_file): New function.
      	* jit-recording.c (gcc::jit::recording::context::enable_dump): New
      	function.
      	(gcc::jit::recording::context::get_all_requested_dumps): New
      	function.
      	* jit-recording.h (gcc::jit::recording::requested_dump): New
      	struct.
      	(gcc::jit::recording::context::enable_dump): New function.
      	(gcc::jit::recording::context::get_all_requested_dumps): New
      	function.
      	(gcc::jit::recording::context::m_requested_dumps): New field.
      	* libgccjit.c (gcc_jit_context_enable_dump): New API entrypoint.
      	* libgccjit.h (gcc_jit_context_enable_dump): New API entrypoint.
      	* libgccjit.map (gcc_jit_context_enable_dump): New API entrypoint.
      
      gcc/testsuite/ChangeLog:
      	PR jit/64166
      	PR jit/64020
      	* jit.dg/harness.h (CHECK_STRING_CONTAINS): New macro.
      	(check_string_contains): New function.
      	* jit.dg/test-error-unrecognized-dump.c: New file.
      	* jit.dg/test-functions.c (trig_sincos_dump): New variable.
      	(trig_statistics_dump): New variable.
      	(create_test_of_builtin_trig): Enable dumping of "sincos" and
      	"statistics" into "trig_sincos_dump" and "trig_statistics_dump".
      	(verify_test_of_builtin_trig): Verify the sincos and statistics
      	dumps.
      	* jit.dg/test-sum-of-squares.c (dump_vrp1): New variable.
      	(create_code): Enable dumping of "tree-vrp1" into dump_vrp1.
      	(verify_code): Verify the tree-vrp1 dump.
      
      From-SVN: r218521
      David Malcolm committed
    • PR jit/64166: Add methods to gcc::dump_manager needed by JIT testing · 799505ae
      gcc/ChangeLog:
      	PR jit/64166
      	* dumpfile.c (gcc::dump_manager::get_dump_file_info_by_switch):
      	New function.
      	(gcc::dump_manager::get_dump_file_name): Split out bulk of
      	implementation into a new overloaded variant taking a
      	dump_file_info *.
      	* dumpfile.h (gcc::dump_manager::get_dump_file_info_by_switch):
      	New function.
      	(gcc::dump_manager::get_dump_file_name): New overloaded variant of
      	this function, taking a dump_file_info *.
      
      From-SVN: r218520
      David Malcolm committed
    • re PR bootstrap/64213 (gimple-match.c:1523:6: error: ‘GIMPLE’ was not declared in this scope) · 0cc67c94
      	PR bootstrap/64213
      	Revert:
      	2014-11-28  H.J. Lu  <hongjiu.lu@intel.com>
      
      	PR rtl-optimization/64037
      	* combine.c (setup_incoming_promotions): Pass the argument
      	before any promotions happen to promote_function_mode.
      
      testsuite/ChangeLog:
      
      	PR bootstrap/64213
      	Revert:
      	2014-11-28  H.J. Lu  <hongjiu.lu@intel.com>
      
      	PR rtl-optimization/64037
      	* g++.dg/pr64037.C: New test.
      
      From-SVN: r218516
      Uros Bizjak committed
    • re PR tree-optimization/64193 (Decreased performance after r173250) · 92a5094e
      2014-12-09  Richard Biener  <rguenther@suse.de>
      
      	PR tree-optimization/64193
      	* tree-ssa-alias.c (walk_non_aliased_vuses): Add valueize parameter
      	and valueize the VUSE before looking up the def stmt.
      	* tree-ssa-alias.h (walk_non_aliased_vuses): Adjust prototype.
      	* tree-ssa-sccvn.c (vn_reference_lookup_pieces): Pass vn_valueize
      	to walk_non_aliased_vuses.
      	(vn_reference_lookup): Likewise.
      	* tree-ssa-dom.c (lookup_avail_expr): Pass NULL as valueize
      	callback to walk_non_aliased_vuses.
      
      	* gcc.dg/tree-ssa/ssa-fre-43.c: New testcase.
      
      From-SVN: r218515
      Richard Biener committed
    • Local Vim config with GNU formatting. · c3e1e693
          
      2014-12-09  Laurynas Biveinis  <laurynas.biveinis@gmail.com>
      	    Yury Gribov  <y.gribov@samsung.com>
          
      /
      	* .gitignore: Added .local.vimrc and .lvimrc.
      	* Makefile.tpl (vimrc, .lvimrc, .local.vimrc): New targets.
      	* Makefile.in: Regenerate.
      
      contrib/
      	* vimrc: New file.
      
      From-SVN: r218514
      Yury Gribov committed
    • re PR tree-optimization/64199 (ICE: tree check: expected class 'constant', have… · 85fd4c28
      re PR tree-optimization/64199 (ICE: tree check: expected class 'constant', have 'binary' (plus_expr) in fold_binary_loc, at fold-const.c:10404 with -ffast-math -frounding-math)
      
      2014-12-09  Richard Biener  <rguenther@suse.de>
      
      	PR middle-end/64199
      	* fold-const.c (fold_binary_loc): Use TREE_OVERFLOW_P.
      
      	* gcc.dg/torture/pr64199.c: New testcase.
      
      From-SVN: r218513
      Richard Biener committed
    • re PR libstdc++/64203 (shared_mutex compile errors on bare-metal targets) · 60b6534a
      	PR libstdc++/64203
      	* include/std/shared_mutex: Fix preprocessor conditions.
      	* testsuite/experimental/feat-cxx14.cc: Check conditions.
      
      From-SVN: r218511
      Jonathan Wakely committed
    • re PR tree-optimization/64191 (indirect clobbers messes up dead code elimination… · ac6aeab4
      re PR tree-optimization/64191 (indirect clobbers messes up dead code elimination in loop calling dtor)
      
      2014-12-09  Richard Biener  <rguenther@suse.de>
      
      	PR tree-optimization/64191
      	* tree-vect-stmts.c (vect_stmt_relevant_p): Clobbers are
      	not relevant (nor are their uses).
      
      From-SVN: r218509
      Richard Biener committed
    • lto-partition.c (privatize_symbol_name): Correctly privatize instrumentation clones. · 6c779727
      gcc/
      
      	* lto/lto-partition.c (privatize_symbol_name): Correctly
      	privatize instrumentation clones.
      
      gcc/testsuite/
      
      	* gcc.dg/lto/lto.exp: Load mpx-dg.exp.
      	* gcc.dg/lto/chkp-privatize_0.c: New.
      	* gcc.dg/lto/chkp-privatize_1.c: New.
      
      From-SVN: r218508
      Ilya Enkovich committed
    • lto-cgraph.c (input_cgraph_1): Don't break existing instrumentation clone references. · 6a73516d
      	* lto-cgraph.c (input_cgraph_1): Don't break existing
      	instrumentation clone references.
      	* lto/lto-symtab.c (lto_cgraph_replace_node): Redirect
      	instrumented_version references appropriately.
      
      From-SVN: r218507
      Ilya Enkovich committed
    • re PR bootstrap/63995 (Bootstrap error with -mmpx -fcheck-pointer-bounds) · 227eabce
      gcc/
      
      	PR bootstrap/63995
      	* tree-chkp.c (chkp_make_static_bounds): Share bounds var
      	between nodes sharing assembler name.
      
      gcc/testsuite/
      
      	PR bootstrap/63995
      	* g++.dg/dg.exp: Add mpx-dg.exp.
      	* g++.dg/pr63995-1.C: New.
      
      From-SVN: r218506
      Ilya Enkovich committed
    • re PR target/64204 (gcc.dg/c11-atomic-2.c fails on powerpc 64-bit little endian… · f9ea9950
      re PR target/64204 (gcc.dg/c11-atomic-2.c fails on powerpc 64-bit little endian after -mupper-regs patches went in)
      
      2014-12-08  Michael Meissner  <meissner@linux.vnet.ibm.com>
      
      	PR target/64204
      	* config/rs6000/rs6000.c (rs6000_emit_move): Do not split TFmode
      	constant moves if -mupper-regs-df.
      
      	* config/rs6000/rs6000.md (mov<mode>_64bit_dm): Optimize moving
      	0.0L to TFmode.
      	(movtd_64bit_nodm): Likewise.
      	(mov<mode>_32bit, FMOVE128 case): Likewise.
      
      From-SVN: r218505
      Michael Meissner committed
    • simplify-rtx.c (simplify_relational_operation_1): Handle simplification… · 10828a01
      simplify-rtx.c (simplify_relational_operation_1): Handle simplification identities for BICS patterns.
      
      2014-12-08  Sandra Loosemore  <sandra@codesourcery.com>
      
      	gcc/
      	* simplify-rtx.c (simplify_relational_operation_1): Handle
      	simplification identities for BICS patterns.
      
      	gcc/testsuite/
      	* gcc.target/aarch64/bics_4.c: New.
      
      From-SVN: r218503
      Sandra Loosemore committed
    • move gimple_canonical_types htab out of gc memory · 58dddbd2
      lto/
      
      	* lto.c (read_cgraph_and_symbols): allocate gimple_canonical_types
      	htab with malloc instead of ggc.
      
      From-SVN: r218502
      Trevor Saunders committed
    • remove param_is from nvptx · f3dba894
      gcc/
      
      	* config/nvptx/nvptx.c: Convert htabs to hash_table.
      
      From-SVN: r218501
      Trevor Saunders committed
    • Daily bump. · 13ef9090
      From-SVN: r218500
      GCC Administrator committed
  2. 08 Dec, 2014 14 commits
    • re PR target/64226 (Secondary reload incorrect TOC address) · ca9f165c
              PR target/64226
              * config/rs6000/rs6000.c (rs6000_secondary_reload_inner)
              [SYMBOL_REF]: Do not explicitly call create_TOC_reference for
              TARGET_TOC. Always use rs6000_emit_move.
      
      From-SVN: r218497
      David Edelsohn committed
    • DWARFv5 Emit DW_TAG_atomic_type for C11 _Atomic. · bc3518da
      This implements the DW_TAG_atomic_type for C11 _Atomic proposal as adopted
      in the latest DWARF5 draft. http://dwarfstd.org/ShowIssue.php?issue=131112.1
      
      gcc/ChangeLog
      
      	PR debug/60782
      	* dwarf2out.c (modified_type_die): Handle TYPE_QUAL_ATOMIC.
      
      gcc/testsuite/ChangeLog
      
      	PR debug/60782
      	* gcc.dg/debug/dwarf2/atomic.c: New test.
      	* gcc.dg/debug/dwarf2/stacked-qualified-types-3.c: Likewise.
      
      include/ChangeLog
      
      	PR debug/60782
      	* dwarf2.def: Add DWARFv5 DW_TAG_atomic_type.
      
      From-SVN: r218496
      Mark Wielaard committed
    • Fix typo in ChangeLog · 009a3480
      From-SVN: r218495
      Jeff Law committed
    • re PR inline-asm/61692 (ICE in extract_insn in recog.c for asm with many parameters) · c475c36c
              PR target/61692
      	* cfgexpand.c (expand_asm_operands): Count all inline asm params.
      
      	PR target/61692
              * gcc.dg/pr61692.c: New test.
      
      From-SVN: r218494
      David Wohlferd committed
    • Added myself to Write After Approval list · eecea65c
      From-SVN: r218491
      Lynn Boger committed
    • PR jit/63854: Introduce xstrdup_for_dump · 2a72a953
      gcc/ChangeLog:
      	PR jit/63854
      	* cgraph.h (xstrdup_for_dump): New function.
      	* cgraph.c (cgraph_node::get_create): Replace use of xstrdup
      	within fprintf with xstrdup_for_dump.
      	(cgraph_edge::make_speculative): Likewise.
      	(cgraph_edge::resolve_speculation): Likewise.
      	(cgraph_edge::redirect_call_stmt_to_callee): Likewise.
      	(cgraph_node::dump): Likewise.
      	* cgraphclones.c (symbol_table::materialize_all_clones): Likewise.
      	* ipa-cp.c (perhaps_add_new_callers): Likewise.
      	* ipa-inline.c (report_inline_failed_reason): Likewise.
      	(want_early_inline_function_p): Likewise.
      	(edge_badness): Likewise.
      	(update_edge_key): Likewise.
      	(flatten_function): Likewise.
      	(inline_always_inline_functions): Likewise.
      	* ipa-profile.c (ipa_profile): Likewise.
      	* ipa-prop.c (ipa_print_node_jump_functions): Likewise.
      	(ipa_make_edge_direct_to_target): Likewise.
      	(remove_described_reference): Likewise.
      	(propagate_controlled_uses): Likewise.
      	* ipa-utils.c (ipa_merge_profiles): Likewise.
      
      From-SVN: r218490
      David Malcolm committed
    • libgccjit++.h: use indentation to show inheritance · 53b730ff
      gcc/jit/ChangeLog:
      	* libgccjit++.h: Indent the forward declarations of the classes to
      	show the inheritance hierarchy.
      
      From-SVN: r218489
      David Malcolm committed
    • Make jit/notes.txt better reflect current status quo · 0f379445
      gcc/jit/ChangeLog:
      	* notes.txt: Show the beginning and ending of
      	recording::context::compile vs playback::context::compile.  Show
      	the creation and unlinking of the tempdir.  Show toplev::finalize.
      	Move "RELEASE MUTEX" to the correct location.  Show
      	gcc_jit_result_release, and indicate where the
      	dlopen/dlsym/dlclose occur.
      
      From-SVN: r218488
      David Malcolm committed
    • re PR ipa/64049 (r215898 caused wrong code at -O3) · b09e592e
      2014-12-08  Bernd Edlinger  <bernd.edlinger@hotmail.de>
      
              PR ipa/64049
              * ipa-polymorphic-call.c
              (pa_polymorphic_call_context::ipa_polymorphic_call): Allow RESULT_DECL.
      
      testsuite/ChangeLog:
      2014-12-08  Bernd Edlinger  <bernd.edlinger@hotmail.de>
      
              PR ipa/64049
              * g++.dg/ipa/pr64049.h: New.
              * g++.dg/ipa/pr64049-1.C: New.
              * g++.dg/ipa/pr64049-2.C: New.
      
      From-SVN: r218487
      Bernd Edlinger committed
    • Bics instruction generation for aarch64 · cd8fc5ee
      gcc/
      
      	* config/aarch64/aarch64.md (and_one_cmpl<mode>3_compare0_no_reuse):
      	New define_insn.
      	* (and_one_cmpl_<SHIFT:optab><mode>3_compare0_no_reuse):
      	Likewise.
      
      gcc/testsuite/
      
      	* gcc.target/aarch64/bics_3.c : New testcase.
      
      From-SVN: r218486
      Alex Velenko committed
    • re PR go/64198 (ICE in gofrontend) · 942f7f5a
      	PR go/64198
      compiler: Don't crash on invalid ++.
      
      From-SVN: r218485
      Ian Lance Taylor committed
    • arm_neon.h (vrecpe_u32, [...]): Rewrite using builtin functions. · 58a3bd25
              * config/aarch64/arm_neon.h (vrecpe_u32, vrecpeq_u32): Rewrite using
              builtin functions.
              (vfma_f32, vfmaq_f32, vfmaq_f64, vfma_n_f32, vfmaq_n_f32, vfmaq_n_f64,
              vfms_f32, vfmsq_f32, vfmsq_f64): Likewise.
              (vhsub_s8, vhsub_u8, vhsub_s16, vhsub_u16, vhsub_s32, vhsub_u32,
              vhsubq_s8, vhsubq_u8, vhsubq_s16, vhsubq_u16, vhsubq_s32, vhsubq_u32,
              vsubhn_s16, vsubhn_u16, vsubhn_s32, vsubhn_u32, vsubhn_s64, vsubhn_u66,
              vrsubhn_s16, vrsubhn_u16, vrsubhn_s32, vrsubhn_u32, vrsubhn_s64,
              vrsubhn_u64, vsubhn_high_s16, vsubhn_high_u16, vsubhn_high_s32,
              vsubhn_high_u32, vsubhn_high_s64, vsubhn_high_u64, vrsubhn_high_s16,
              vrsubhn_high_u16, vrsubhn_high_s32, vrsubhn_high_u32, vrsubhn_high_s64,
              vrsubhn_high_u64): Likewise.
              * config/aarch64/iterators.md (VDQ_SI): New mode iterator.
              * config/aarch64/aarch64.md (define_c_enum "unspec"): Add UNSPEC_URECPE.
              * config/aarch64/aarch64-simd.md (aarch64_urecpe<mode>): New pattern.
              * config/aarch64/aarch64-simd-builtins.def (shsub, uhsub, subhn, rsubhn,
              subhn2, rsubhn2, urecpe): New builtins.
      
      Co-Authored-By: Haijian Zhang <z.zhanghaijian@huawei.com>
      Co-Authored-By: Jiji Jiang <jiangjiji@huawei.com>
      Co-Authored-By: Pengfei Sui <suipengfei@huawei.com>
      
      From-SVN: r218484
      Felix Yang committed
    • Enable non-const v64qi permutations. · 28adf6e7
      gcc/
      	* config/i386/i386.c (ix86_expand_vec_perm_vpermi2): Handle v64qi.
      	* config/i386/sse.md (VEC_PERM_AVX2): Add v64qi.
      
      From-SVN: r218483
      Ilya Tocar committed
    • Enable const permutations for V64QImode. · 9f9f6115
      gcc/
      	* config/i386/i386.c (expand_vec_perm_broadcast_1): Handle v64qi.
      	(expand_vec_perm_vpermi2_vpshub2): New.
      	(ix86_expand_vec_perm_const_1): Use it.
      	(ix86_vectorize_vec_perm_const_ok): Handle v64qi.
      	* config/i386/sse.md (VEC_PERM_CONST): Add v64qi.
      
      From-SVN: r218482
      Ilya Tocar committed