Commit 5c38262d by Jakub Jelinek Committed by Jakub Jelinek

re PR tree-optimization/82374 (#pragma GCC optimize is not applied to openmp-generated functions)

	PR tree-optimization/82374
	* omp-low.c (create_omp_child_function): Copy DECL_ATTRIBUTES,
	DECL_FUNCTION_SPECIFIC_OPTIMIZATION,
	DECL_FUNCTION_SPECIFIC_TARGET and DECL_FUNCTION_VERSIONED from
	current_function_decl to the new decl.

	* gcc.dg/gomp/pr82374.c: New test.

From-SVN: r253395
parent 21375f0d
2017-10-04 Jakub Jelinek <jakub@redhat.com>
PR tree-optimization/82374
* omp-low.c (create_omp_child_function): Copy DECL_ATTRIBUTES,
DECL_FUNCTION_SPECIFIC_OPTIMIZATION,
DECL_FUNCTION_SPECIFIC_TARGET and DECL_FUNCTION_VERSIONED from
current_function_decl to the new decl.
2017-10-03 Michael Meissner <meissner@linux.vnet.ibm.com>
* config/rs6000/rs6000-builtin.def (BU_FLOAT128_2_HW): Define new
......@@ -1626,6 +1626,14 @@ create_omp_child_function (omp_context *ctx, bool task_copy)
DECL_CONTEXT (decl) = NULL_TREE;
DECL_INITIAL (decl) = make_node (BLOCK);
BLOCK_SUPERCONTEXT (DECL_INITIAL (decl)) = decl;
DECL_ATTRIBUTES (decl) = DECL_ATTRIBUTES (current_function_decl);
DECL_FUNCTION_SPECIFIC_OPTIMIZATION (decl)
= DECL_FUNCTION_SPECIFIC_OPTIMIZATION (current_function_decl);
DECL_FUNCTION_SPECIFIC_TARGET (decl)
= DECL_FUNCTION_SPECIFIC_TARGET (current_function_decl);
DECL_FUNCTION_VERSIONED (decl)
= DECL_FUNCTION_VERSIONED (current_function_decl);
if (omp_maybe_offloaded_ctx (ctx))
{
cgraph_node::get_create (decl)->offloadable = 1;
......
2017-10-04 Jakub Jelinek <jakub@redhat.com>
PR tree-optimization/82374
* gcc.dg/gomp/pr82374.c: New test.
2017-10-04 Richard Biener <rguenther@suse.de>
* gfortran.dg/graphite/id-17.f: For ilp32 allow graphite codegen
......
/* PR tree-optimization/82374 */
/* { dg-do compile } */
/* { dg-options "-O2 -fno-tree-vectorize -fdump-tree-vect-details" } */
/* { dg-additional-options "-mavx -mno-avx2" { target i?86-*-* x86_64-*-* } } */
/* { dg-additional-options "-mvsx" { target powerpc_vsx_ok } } */
#define SIZE (1024 * 1024 * 1024)
float a[SIZE];
float b[SIZE];
float c[SIZE];
float d[SIZE];
__attribute__((optimize ("O2", "tree-vectorize"))) void
foo (void)
{
int i;
#pragma omp parallel for
for (i = 0; i < SIZE; i++)
c[i] = a[i] + b[i];
}
__attribute__((optimize ("O2", "tree-vectorize"))) void
bar (void)
{
int i;
for (i = 0; i < SIZE; i++)
d[i] = a[i] + b[i];
}
/* { dg-final { scan-tree-dump-times "vectorized 1 loops" 2 "vect" { target { { i?86-*-* x86_64-*-* } || { powerpc_vsx_ok } } } } } */
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment