date:20130620

Re: mips SNaN/QNaN is swapped

2013-06-20 Thread Thomas Schwinge

Hi!

On Thu, 20 Jun 2013 15:08:16 -0700, Ian Lance Taylor  wrote:
> Thomas Schwinge  writes:
> > On Sun, 05 May 2013 18:55:09 -0700, Ian Lance Taylor  wrote:
> >> Thomas Schwinge  writes:
> >> 
> >> >> libgcc/
> >> >> * fp-bit.c (unpack_d, pack_d): Properly preserve and restore a
> >> >> NaN's payload.
> >> >> * fp-bit.h [FLOAT] (QUIET_NAN): Correct value.
> >> 
> >> This is OK.
> >
> > Joseph suggested these two bug-fix commits (trunk r198621 and r198622)
> > should be applied to earlier release branches, too.  OK?

Joseph, which release branches would you like me to commit these two
patches to?

> The RM's, including Joseph, are in charge of the release branches.  You
> don't need my separate OK.

OK, thanks.


Grüße,
 Thomas


pgppyEOJLrXYZ.pgp
Description: PGP signature

[RS6000] powerpc64le vec splat

2013-06-20 Thread Alan Modra

A number of places in the rs6000 backend assume the value for a vec
splat can be found at element nunits-1 of a vector constant, which is
wrong for little-endian.  This patch fixes them and the ICE found
when running altivec-consts.c on powerpc64le.

I've also updated the testcase so that it passes for little-endian.
vspltish() and vspltisw() don't do the right thing little-endian, nor
is it easy to make the function work.  Consider
  v16qi v = { 0, 15, 0, 15, 0, 15, 0, 15, 0, 15, 0, 15, 0, 15, 0, 15 };
  vspltish (w, 15);
vs
  v8hi v = { 15, 15, 15, 15, 15, 15, 15, 15 };
  vspltish (w, 15);
So rather than write two or three variants to build constants the
right way, I just simply use static arrays.  I've also added a
scan-assembler-not to make sure none of the vector constants are
loaded from memory, and a new test that has constants appropriate for
little-endian optimisation.

Bootstrapped and regression tested powerpc64-linux.  OK for mainline
and 4.8?

gcc/
* config/rs6000/rs6000.c (vspltis_constant): Correct for little-endian.
(gen_easy_altivec_constant): Likewise.
* config/rs6000/predicates.md (easy_vector_constant_add_self,
easy_vector_constant_msb): Likewise.
gcc/testsuite/
* gcc.target/powerpc/altivec-consts.c: Correct for little-endian.
Add scan-assembler-not "lvx".
* gcc.target/powerpc/le-altivec-consts.c: New.

Index: gcc/config/rs6000/rs6000.c
===
--- gcc/config/rs6000/rs6000.c  (revision 200274)
+++ gcc/config/rs6000/rs6000.c  (working copy)
@@ -4657,7 +4662,7 @@ vspltis_constant (rtx op, unsigned step, unsigned
   bitsize = GET_MODE_BITSIZE (inner);
   mask = GET_MODE_MASK (inner);
 
-  val = const_vector_elt_as_int (op, nunits - 1);
+  val = const_vector_elt_as_int (op, BYTES_BIG_ENDIAN ? nunits - 1 : 0);
   splat_val = val;
   msb_val = val > 0 ? 0 : -1;
 
@@ -4697,7 +4702,7 @@ vspltis_constant (rtx op, unsigned step, unsigned
   for (i = 0; i < nunits - 1; ++i)
 {
   HOST_WIDE_INT desired_val;
-  if (((i + 1) & (step - 1)) == 0)
+  if (((BYTES_BIG_ENDIAN ? i + 1 : i) & (step - 1)) == 0)
desired_val = val;
   else
desired_val = msb_val;
@@ -4782,13 +4787,13 @@ gen_easy_altivec_constant (rtx op)
 {
   enum machine_mode mode = GET_MODE (op);
   int nunits = GET_MODE_NUNITS (mode);
-  rtx last = CONST_VECTOR_ELT (op, nunits - 1);
+  rtx val = CONST_VECTOR_ELT (op, BYTES_BIG_ENDIAN ? nunits - 1 : 0);
   unsigned step = nunits / 4;
   unsigned copies = 1;
 
   /* Start with a vspltisw.  */
   if (vspltis_constant (op, step, copies))
-return gen_rtx_VEC_DUPLICATE (V4SImode, gen_lowpart (SImode, last));
+return gen_rtx_VEC_DUPLICATE (V4SImode, gen_lowpart (SImode, val));
 
   /* Then try with a vspltish.  */
   if (step == 1)
@@ -4797,7 +4802,7 @@ gen_easy_altivec_constant (rtx op)
 step >>= 1;
 
   if (vspltis_constant (op, step, copies))
-return gen_rtx_VEC_DUPLICATE (V8HImode, gen_lowpart (HImode, last));
+return gen_rtx_VEC_DUPLICATE (V8HImode, gen_lowpart (HImode, val));
 
   /* And finally a vspltisb.  */
   if (step == 1)
@@ -4806,7 +4811,7 @@ gen_easy_altivec_constant (rtx op)
 step >>= 1;
 
   if (vspltis_constant (op, step, copies))
-return gen_rtx_VEC_DUPLICATE (V16QImode, gen_lowpart (QImode, last));
+return gen_rtx_VEC_DUPLICATE (V16QImode, gen_lowpart (QImode, val));
 
   gcc_unreachable ();
 }
Index: gcc/config/rs6000/predicates.md
===
--- gcc/config/rs6000/predicates.md (revision 200274)
+++ gcc/config/rs6000/predicates.md (working copy)
@@ -527,9 +527,11 @@
(match_test "easy_altivec_constant (op, mode)")))
 {
   HOST_WIDE_INT val;
+  int elt;
   if (mode == V2DImode || mode == V2DFmode)
 return 0;
-  val = const_vector_elt_as_int (op, GET_MODE_NUNITS (mode) - 1);
+  elt = BYTES_BIG_ENDIAN ? GET_MODE_NUNITS (mode) - 1 : 0;
+  val = const_vector_elt_as_int (op, elt);
   val = ((val & 0xff) ^ 0x80) - 0x80;
   return EASY_VECTOR_15_ADD_SELF (val);
 })
@@ -541,9 +543,11 @@
(match_test "easy_altivec_constant (op, mode)")))
 {
   HOST_WIDE_INT val;
+  int elt;
   if (mode == V2DImode || mode == V2DFmode)
 return 0;
-  val = const_vector_elt_as_int (op, GET_MODE_NUNITS (mode) - 1);
+  elt = BYTES_BIG_ENDIAN ? GET_MODE_NUNITS (mode) - 1 : 0;
+  val = const_vector_elt_as_int (op, elt);
   return EASY_VECTOR_MSB (val, GET_MODE_INNER (mode));
 })
 
Index: gcc/testsuite/gcc.target/powerpc/le-altivec-consts.c
===
--- gcc/testsuite/gcc.target/powerpc/le-altivec-consts.c(revision 0)
+++ gcc/testsuite/gcc.target/powerpc/le-altivec-consts.c(revision 0)
@@ -0,0 +1,253 @@
+/* { dg-do run { target { powerpc*-*-* && vmx_hw } } } */
+/* { dg-do compile { target { powerpc*-*-* && { ! vmx_hw } } } } */
+/* { dg-require-effective-target pow

[c++-concepts] merge from trunk

2013-06-20 Thread Gabriel Dos Reis


At revision 200282.

Andrew -- 
there was a slight conflict with the new usage of is_binary_trait.
I think I resolved it properly, but double check.

-- Gaby

Go patch committed: Really only make descriptors when needed

2013-06-20 Thread Ian Lance Taylor

And this patch fixes the last patch to *really* only make function
descriptors when they are needed.  Sigh.  Bootstrapped and ran Go
testsuite on x86_64-unknown-linux-gnu.  Committed to mainline and 4.8
branch.

Ian

diff -r 3081c3c4d620 go/expressions.cc
--- a/go/expressions.cc	Thu Jun 20 17:08:43 2013 -0700
+++ b/go/expressions.cc	Thu Jun 20 17:16:00 2013 -0700
@@ -1242,24 +1242,6 @@
 	  : Expression::traverse(&this->closure_, traverse));
 }
 
-// Lower a function reference.  If this reference is not called
-// directly, make sure there is a function descriptor.
-
-Expression*
-Func_expression::do_lower(Gogo* gogo, Named_object*, Statement_inserter*, int)
-{
-  // Make sure that the descriptor exists.  FIXME: If the function is
-  // only ever called, and is never referenced otherwise, then we
-  // don't need the descriptor.  We could do that with another pass
-  // over the tree.
-  if (this->closure_ == NULL
-  && this->function_->is_function()
-  && !this->function_->func_value()->is_method())
-this->function_->func_value()->descriptor(gogo, this->function_);
-
-  return this;
-}
-
 // Return the type of a function expression.
 
 Type*
diff -r 3081c3c4d620 go/expressions.h
--- a/go/expressions.h	Thu Jun 20 17:08:43 2013 -0700
+++ b/go/expressions.h	Thu Jun 20 17:16:00 2013 -0700
@@ -1524,9 +1524,6 @@
   int
   do_traverse(Traverse*);
 
-  Expression*
-  do_lower(Gogo*, Named_object*, Statement_inserter*, int);
-
   Type*
   do_type();

Re: C++ PATCH for c++/55149 (lambda VLA capture)

2013-06-20 Thread Jason Merrill


On 06/20/2013 08:26 PM, Jason Merrill wrote:

Instantiation of a VLA capture in a template wasn't working properly;
this fixes it.


Along with this, which fixes the type of an instantiation of the MINUS_EXPR.


commit c37ab236406b9acf4423e80554055bf72d7be5fa
Author: Jason Merrill 
Date:   Thu Jun 20 17:42:16 2013 -0400

	* decl.c (compute_array_index_type): Use size_one_node.

diff --git a/gcc/cp/decl.c b/gcc/cp/decl.c
index 9eb1d12..dad1e10 100644
--- a/gcc/cp/decl.c
+++ b/gcc/cp/decl.c
@@ -8241,7 +8241,7 @@ compute_array_index_type (tree name, tree size, tsubst_flags_t complain)
 	 constant. Just build the index type and mark that it requires
 	 structural equality checks.  */
   itype = build_index_type (build_min (MINUS_EXPR, sizetype,
-	   size, integer_one_node));
+	   size, size_one_node));
   TYPE_DEPENDENT_P (itype) = 1;
   TYPE_DEPENDENT_P_VALID (itype) = 1;
   SET_TYPE_STRUCTURAL_EQUALITY (itype);

C++ PATCH for c++/55149 (lambda VLA capture)

2013-06-20 Thread Jason Merrill

Instantiation of a VLA capture in a template wasn't working properly; 
this fixes it.


Tested x86_64-pc-linux-gnu, applying to trunk.
commit 9e88ab6e258122ce7d4a709919df56f7c1514c06
Author: Jason Merrill 
Date:   Thu Jun 13 09:29:11 2013 -0400

	PR c++/55149
	* decl.c (compute_array_index_type): Don't reject VLAs in SFINAE
	context if we're in C++14 mode.
	* tree.c (array_of_runtime_bound_p): Return true for a dependent
	bound that is not potentually constant.
	* cp-tree.h (DECL_VLA_CAPTURE_P, REFERENCE_VLA_OK): New.
	* pt.c (tsubst) [REFERENCE_TYPE]: Check REFERENCE_VLA_OK.
	* semantics.c (build_lambda_object): Don't rvalue a VLA capture.
	(build_capture_proxy): Set REFERENCE_VLA_OK.
	(vla_capture_type): Make it a proper C++ class.
	(add_capture): Set DECL_VLA_CAPTURE_P.  Don't pre-digest the
	initializer.

diff --git a/gcc/cp/cp-tree.h b/gcc/cp/cp-tree.h
index 9fc4aeb..cf54acf 100644
--- a/gcc/cp/cp-tree.h
+++ b/gcc/cp/cp-tree.h
@@ -126,6 +126,7 @@ c-common.h, not after.
5: CLASS_TYPE_P (in RECORD_TYPE and UNION_TYPE)
   ENUM_FIXED_UNDERLYING_TYPE_P (in ENUMERAL_TYPE)
   AUTO_IS_DECLTYPE (in TEMPLATE_TYPE_PARM)
+  REFERENCE_VLA_OK (in REFERENCE_TYPE)
6: TYPE_DEPENDENT_P_VALID
 
Usage of DECL_LANG_FLAG_?:
@@ -139,6 +140,7 @@ c-common.h, not after.
   DECL_MEMBER_TEMPLATE_P (in TEMPLATE_DECL)
   FUNCTION_PARAMETER_PACK_P (in PARM_DECL)
   USING_DECL_TYPENAME_P (in USING_DECL)
+  DECL_VLA_CAPTURE_P (in FIELD_DECL)
2: DECL_THIS_EXTERN (in VAR_DECL or FUNCTION_DECL).
   DECL_IMPLICIT_TYPEDEF_P (in a TYPE_DECL)
3: DECL_IN_AGGR_P.
@@ -2979,6 +2981,11 @@ extern void decl_shadowed_for_var_insert (tree, tree);
&& (TREE_CODE (TREE_TYPE (TREE_OPERAND ((NODE), 0)))	\
== REFERENCE_TYPE))
 
+/* True if NODE is a REFERENCE_TYPE which is OK to instantiate to be a
+   reference to VLA type, because it's used for VLA capture.  */
+#define REFERENCE_VLA_OK(NODE) \
+  (TYPE_LANG_FLAG_5 (REFERENCE_TYPE_CHECK (NODE)))
+
 #define NEW_EXPR_USE_GLOBAL(NODE) \
   TREE_LANG_FLAG_0 (NEW_EXPR_CHECK (NODE))
 #define DELETE_EXPR_USE_GLOBAL(NODE) \
@@ -3616,6 +3623,11 @@ more_aggr_init_expr_args_p (const aggr_init_expr_arg_iterator *iter)
 #define DECL_THIS_STATIC(NODE) \
   DECL_LANG_FLAG_6 (VAR_FUNCTION_OR_PARM_DECL_CHECK (NODE))
 
+/* Nonzero for FIELD_DECL node means that this field is a lambda capture
+   field for an array of runtime bound.  */
+#define DECL_VLA_CAPTURE_P(NODE) \
+  DECL_LANG_FLAG_1 (FIELD_DECL_CHECK (NODE))
+
 /* Nonzero for FIELD_DECL node means that this field is a base class
of the parent object, as opposed to a member field.  */
 #define DECL_FIELD_IS_BASE(NODE) \
diff --git a/gcc/cp/decl.c b/gcc/cp/decl.c
index dad1e10..225f131 100644
--- a/gcc/cp/decl.c
+++ b/gcc/cp/decl.c
@@ -8296,7 +8296,8 @@ compute_array_index_type (tree name, tree size, tsubst_flags_t complain)
   else if (TREE_CONSTANT (size)
 	   /* We don't allow VLAs at non-function scopes, or during
 	  tentative template substitution.  */
-	   || !at_function_scope_p () || !(complain & tf_error))
+	   || !at_function_scope_p ()
+	   || (cxx_dialect < cxx1y && !(complain & tf_error)))
 {
   if (!(complain & tf_error))
 	return error_mark_node;
diff --git a/gcc/cp/pt.c b/gcc/cp/pt.c
index 256f9ab..4a0f411 100644
--- a/gcc/cp/pt.c
+++ b/gcc/cp/pt.c
@@ -11608,7 +11608,9 @@ tsubst (tree t, tree args, tsubst_flags_t complain, tree in_decl)
 	  r = cp_build_reference_type (type, TYPE_REF_IS_RVALUE (t));
 	r = cp_build_qualified_type_real (r, cp_type_quals (t), complain);
 
-	if (cxx_dialect >= cxx1y && array_of_runtime_bound_p (type))
+	if (cxx_dialect >= cxx1y
+	&& !(TREE_CODE (t) == REFERENCE_TYPE && REFERENCE_VLA_OK (t))
+	&& array_of_runtime_bound_p (type))
 	  {
 	if (complain & tf_warning_or_error)
 	  pedwarn
diff --git a/gcc/cp/semantics.c b/gcc/cp/semantics.c
index 0a700b7..135ef74 100644
--- a/gcc/cp/semantics.c
+++ b/gcc/cp/semantics.c
@@ -9033,6 +9033,7 @@ build_lambda_object (tree lambda_expr)
   if (TREE_CODE (TREE_TYPE (field)) == ARRAY_TYPE)
 	val = build_array_copy (val);
   else if (DECL_NORMAL_CAPTURE_P (field)
+	   && !DECL_VLA_CAPTURE_P (field)
 	   && TREE_CODE (TREE_TYPE (field)) != REFERENCE_TYPE)
 	{
 	  /* "the entities that are captured by copy are used to
@@ -9404,8 +9405,7 @@ build_capture_proxy (tree member)
 
   type = lambda_proxy_type (object);
 
-  if (TREE_CODE (type) == RECORD_TYPE
-  && TYPE_NAME (type) == NULL_TREE)
+  if (DECL_VLA_CAPTURE_P (member))
 {
   /* Rebuild the VLA type from the pointer and maxindex.  */
   tree field = next_initializable_field (TYPE_FIELDS (type));
@@ -9414,8 +9414,9 @@ build_capture_proxy (tree member)
   tree max = build_simple_component_ref (object, field);
   type = build_array_type (TREE_TYPE (TREE_TYPE (ptr)),
 			   build_index_type (max));
-  object = convert (build_reference_type (type), p

Go patch committed: Only make function descriptors if needed

2013-06-20 Thread Ian Lance Taylor

This is a follow up to the last big patch, adding function descriptors
to the Go frontend.  This patch adds another compilation pass to only
create the function descriptors if they are needed: if the function is
exported, or if it is referenced in some way other than calling it.
Bootstrapped and ran Go testsuite on x86_64-unknown-linux-gnu.
Committed to mainline.

Ian

diff -r a4bae46290c3 go/expressions.cc
--- a/go/expressions.cc	Thu Jun 20 13:21:55 2013 -0700
+++ b/go/expressions.cc	Thu Jun 20 17:04:56 2013 -0700
@@ -1385,62 +1385,24 @@
   return new Func_expression(function, closure, location);
 }
 
-// A function descriptor.  A function descriptor is a struct with a
-// single field pointing to the function code.  This is used for
-// functions without closures.
-
-class Func_descriptor_expression : public Expression
-{
- public:
-  Func_descriptor_expression(Named_object* fn, Named_object* dfn)
-: Expression(EXPRESSION_FUNC_DESCRIPTOR, fn->location()),
-  fn_(fn), dfn_(dfn), dvar_(NULL)
-  {
-go_assert(!fn->is_function() || !fn->func_value()->needs_closure());
-  }
-
-  // Make the function descriptor type, so that it can be converted.
-  static void
-  make_func_descriptor_type();
-
- protected:
-  int
-  do_traverse(Traverse*)
-  { return TRAVERSE_CONTINUE; }
-
-  Type*
-  do_type();
-
-  void
-  do_determine_type(const Type_context*)
-  { }
-
-  Expression*
-  do_copy()
-  { return Expression::make_func_descriptor(this->fn_, this->dfn_); }
-
-  bool
-  do_is_addressable() const
-  { return true; }
-
-  tree
-  do_get_tree(Translate_context*);
-
-  void
-  do_dump_expression(Ast_dump_context* context) const
-  { context->ostream() << "[descriptor " << this->fn_->name() << "]"; }
-
- private:
-  // The type of all function descriptors.
-  static Type* descriptor_type;
-
-  // The function for which this is the descriptor.
-  Named_object* fn_;
-  // The descriptor function.
-  Named_object* dfn_;
-  // The descriptor variable.
-  Bvariable* dvar_;
-};
+// Class Func_descriptor_expression.
+
+// Constructor.
+
+Func_descriptor_expression::Func_descriptor_expression(Named_object* fn)
+  : Expression(EXPRESSION_FUNC_DESCRIPTOR, fn->location()),
+fn_(fn), dfn_(NULL), dvar_(NULL)
+{
+  go_assert(!fn->is_function() || !fn->func_value()->needs_closure());
+}
+
+// Traversal.
+
+int
+Func_descriptor_expression::do_traverse(Traverse*)
+{
+  return TRAVERSE_CONTINUE;
+}
 
 // All function descriptors have the same type.
 
@@ -1464,6 +1426,18 @@
   return Func_descriptor_expression::descriptor_type;
 }
 
+// Copy a Func_descriptor_expression;
+
+Expression*
+Func_descriptor_expression::do_copy()
+{
+  Func_descriptor_expression* fde =
+Expression::make_func_descriptor(this->fn_);
+  if (this->dfn_ != NULL)
+fde->set_descriptor_wrapper(this->dfn_);
+  return fde;
+}
+
 // The tree for a function descriptor.
 
 tree
@@ -1519,12 +1493,20 @@
   return var_to_tree(bvar);
 }
 
+// Print a function descriptor expression.
+
+void
+Func_descriptor_expression::do_dump_expression(Ast_dump_context* context) const
+{
+  context->ostream() << "[descriptor " << this->fn_->name() << "]";
+}
+
 // Make a function descriptor expression.
 
-Expression*
-Expression::make_func_descriptor(Named_object* fn, Named_object* dfn)
-{
-  return new Func_descriptor_expression(fn, dfn);
+Func_descriptor_expression*
+Expression::make_func_descriptor(Named_object* fn)
+{
+  return new Func_descriptor_expression(fn);
 }
 
 // Make the function descriptor type, so that it can be converted.
diff -r a4bae46290c3 go/expressions.h
--- a/go/expressions.h	Thu Jun 20 13:21:55 2013 -0700
+++ b/go/expressions.h	Thu Jun 20 17:04:56 2013 -0700
@@ -32,6 +32,7 @@
 class Binary_expression;
 class Call_expression;
 class Func_expression;
+class Func_descriptor_expression;
 class Unknown_expression;
 class Index_expression;
 class Map_index_expression;
@@ -161,10 +162,9 @@
   // Make a function descriptor, an immutable struct with a single
   // field that points to the function code.  This may only be used
   // with functions that do not have closures.  FN is the function for
-  // which we are making the descriptor.  DFN is the descriptor
-  // function wrapper.
-  static Expression*
-  make_func_descriptor(Named_object* fn, Named_object* dfn);
+  // which we are making the descriptor.
+  static Func_descriptor_expression*
+  make_func_descriptor(Named_object* fn);
 
   // Make a reference to the code of a function.  This is used to set
   // descriptor and closure fields.
@@ -1562,6 +1562,63 @@
   Expression* closure_;
 };
 
+// A function descriptor.  A function descriptor is a struct with a
+// single field pointing to the function code.  This is used for
+// functions without closures.
+
+class Func_descriptor_expression : public Expression
+{
+ public:
+  Func_descriptor_expression(Named_object* fn);
+
+  // Set the descriptor wrapper.
+  void
+  set_descriptor_wrapper(Named_object* dfn)
+  {
+go_assert(this->dfn_ == NULL);
+

Re: mips SNaN/QNaN is swapped

2013-06-20 Thread Ian Lance Taylor

Thomas Schwinge  writes:

> Hi!
>
> On Sun, 05 May 2013 18:55:09 -0700, Ian Lance Taylor  wrote:
>> Thomas Schwinge  writes:
>> 
>> >> libgcc/
>> >>   * fp-bit.c (unpack_d, pack_d): Properly preserve and restore a
>> >>   NaN's payload.
>> >>   * fp-bit.h [FLOAT] (QUIET_NAN): Correct value.
>> 
>> This is OK.
>
> Joseph suggested these two bug-fix commits (trunk r198621 and r198622)
> should be applied to earlier release branches, too.  OK?

The RM's, including Joseph, are in charge of the release branches.  You
don't need my separate OK.

Ian

Re: mips SNaN/QNaN is swapped

2013-06-20 Thread Thomas Schwinge

Hi!

On Sun, 05 May 2013 18:55:09 -0700, Ian Lance Taylor  wrote:
> Thomas Schwinge  writes:
> 
> >> libgcc/
> >>* fp-bit.c (unpack_d, pack_d): Properly preserve and restore a
> >>NaN's payload.
> >>* fp-bit.h [FLOAT] (QUIET_NAN): Correct value.
> 
> This is OK.

Joseph suggested these two bug-fix commits (trunk r198621 and r198622)
should be applied to earlier release branches, too.  OK?


Grüße,
 Thomas


pgpFd3zW4t43_.pgp
Description: PGP signature

[linaro/gcc-4_8-branch] Backport from trunk

2013-06-20 Thread Christophe Lyon

Hi,
I have just backported r198683 from trunk to linaro/gcc-4_8-branch (as
rev 200267).
(It's the support of Address Sanitizer for ARM)

Thanks,

Christophe.

Re: [patch] set MULTIARCH_DIRNAME for multilib architectures

2013-06-20 Thread Aurelien Jarno

On Thu, Jun 20, 2013 at 02:26:12PM +0200, Matthias Klose wrote:
> Am 13.06.2013 11:42, schrieb Richard Sandiford:
> > "Bernhard Reutner-Fischer"  writes:
> >> On 12 June 2013 20:20:50 Richard Sandiford  
> >> wrote:
> >>> Matthias Klose  writes:
>  Index: config/mips/t-linux64
>  ===
>  --- config/mips/t-linux64(revision 200012)
>  +++ config/mips/t-linux64(working copy)
>  @@ -24,3 +24,13 @@
>   ../lib32$(call 
> >>> if_multiarch,:mips64$(MIPS_EL)-linux-gnuabin32$(MIPS_SOFT)) \
>   ../lib$(call 
>  if_multiarch,:mips$(MIPS_EL)-linux-gnu$(MIPS_SOFT)) \
>   ../lib64$(call
>  if_multiarch,:mips64$(MIPS_EL)-linux-gnuabi64$(MIPS_SOFT))
>  +
>  +ifneq (,$(findstring abin32,$(target)))
>  +MULTIARCH_DIRNAME = $(call 
> >>> if_multiarch,mips64$(MIPS_EL)-linux-gnuabin32$(MIPS_SOFT))
>  +else
>  +ifneq (,$(findstring abi64,$(target)))
>  +MULTIARCH_DIRNAME = $(call 
> >>> if_multiarch,mips64$(MIPS_EL)-linux-gnuabi64$(MIPS_SOFT))
>  +else
>  +MULTIARCH_DIRNAME = $(call 
> >>> if_multiarch,mips$(MIPS_EL)-linux-gnu$(MIPS_SOFT))
>  +endif
>  +endif
> >>>
> >>> findstring seems a bit fragile for a full triple.  I think it would
> >>> be better to have something similar to the current MIPS_SOFT definition:
> >>>
> >>> MIPS_SOFT = $(if $(strip $(filter MASK_SOFT_FLOAT_ABI, 
> >>> $(target_cpu_default)) $(filter soft, $(with_float))),soft)
> >>>
> >>> but for ABIs.  It could then also take with_abi into account.
> >>> Maybe something like:
> >>>
> >>> MIPS_ABI = $(or $(with_abi), \
> >>> $(if $(filter MIPS_ABI_DEFAULT=ABI_N32, \
> >>> $(target_cpu_default)), n32), \
> >>> o32)
> >>>
> >>> (completely untested).
> >>
> >> Bikeshedding:
> >> Doko would know, but ISTR that $(or) did not exist in make-3.80 which is 
> >> currently the minimum prerequisite, fwiw.
> > 
> > Gah, that's a pity.  Thanks for the catch though.  Maybe firstword
> > would be OK instead.
> > 
> > I see I also fell into the usual ABI trap.  It wouldn't have affected
> > the use in this patch, but the default ought to be "32" rather than "o32".
> 
> the define is in tm_defines, not target_cpu_default.
> 
> MIPS_ABI = $(or $(with_abi), \
> $(if $(filter MIPS_ABI_DEFAULT=ABI_N32, \
> $(tm_defines)), n32), \
>  32)
> 
> However I can't see yet how this should be used. Maybe Aurelian could come up
> with a tested version of this patch?
> 

I don't have a lot of time right now, will look at that in details next
week.


-- 
Aurelien Jarno  GPG: 1024D/F1BCDB73
aurel...@aurel32.net http://www.aurel32.net

Re: [PATCH] Fix typo in `aot-compile' option list

2013-06-20 Thread Andrew Haley

On 06/20/2013 09:09 PM, Roland Lutz wrote:
> Signed-off-by: Roland Lutz 
> ---
>   libjava/contrib/aot-compile.in |2 +-
>   1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/libjava/contrib/aot-compile.in b/libjava/contrib/aot-compile.in
> index 91cfc67..2ee6739 100644
> --- a/libjava/contrib/aot-compile.in
> +++ b/libjava/contrib/aot-compile.in
> @@ -52,7 +52,7 @@ try:
>   sys.argv[1:],
>   "M:C:D:m:c:l:e:",
>   ["make=", "gcj=", "dbtool=",
> - "makeflags=" "gcjflags=", "ldflags=",
> + "makeflags=", "gcjflags=", "ldflags=",
>"exclude="])
>   srcdir, dstdir = args
>   except:
> 

OK with a ChangeLog entry.

Andrew.

[PATCH] Fix typo in `aot-compile' option list

2013-06-20 Thread Roland Lutz


Signed-off-by: Roland Lutz 
---
 libjava/contrib/aot-compile.in |2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/libjava/contrib/aot-compile.in b/libjava/contrib/aot-compile.in
index 91cfc67..2ee6739 100644
--- a/libjava/contrib/aot-compile.in
+++ b/libjava/contrib/aot-compile.in
@@ -52,7 +52,7 @@ try:
 sys.argv[1:],
 "M:C:D:m:c:l:e:",
 ["make=", "gcj=", "dbtool=",
- "makeflags=" "gcjflags=", "ldflags=",
+ "makeflags=", "gcjflags=", "ldflags=",
  "exclude="])
 srcdir, dstdir = args
 except:
--
1.7.10.4

PATCH to add include from system.h

2013-06-20 Thread Jason Merrill

Since we poison "malloc" and friends in system.h, any C++ code that 
includes a standard library header such as , which in turn 
includes , will get poisoning errors due to lines like



#undef malloc
  using ::malloc;


The solution is to include  before poisoning those identifiers.

Oleg posted a variant of this patch last year, which included  
instead of ; my patch includes both to avoid issues with other 
library implementations in which  might not declare functions 
in the global namespace.


Tested x86_64-pc-linux-gnu, applying to trunk.

commit 33be921c304a8bf38bbf5ae30b3489a01be4763c
Author: Jason Merrill 
Date:   Mon Jun 10 21:17:00 2013 -0400

	* system.h: Include  as well as .

diff --git a/gcc/system.h b/gcc/system.h
index 41cd565..f10ba4a 100644
--- a/gcc/system.h
+++ b/gcc/system.h
@@ -226,6 +226,14 @@ extern int errno;
 # include 
 #endif
 
+/* When compiling C++ we need to include  as well as  so
+   that it is processed before we poison "malloc"; otherwise, if a source
+   file uses a standard library header that includes , we will get
+   an error about 'using std::malloc'.  */
+#ifdef __cplusplus
+#include 
+#endif
+
 /* Undef vec_free from AIX stdlib.h header which conflicts with vec.h.  */
 #undef vec_free

Re: patch to fix PR57604

2013-06-20 Thread Richard Sandiford

Thanks Vlad.

Vladimir Makarov  writes:
> Index: lra.c
> ===
> --- lra.c (revision 200174)
> +++ lra.c (working copy)
> @@ -242,6 +242,42 @@ lra_delete_dead_insn (rtx insn)
>lra_set_insn_deleted (insn);
>  }
>  
> +/* Emit insn x = y + z.  Return NULL if we failed to do it.
> +   Otherwise, return the insn.  We don't use gen_add3_insn as it might
> +   clobber CC.  */
> +static rtx
> +emit_add3_insn (rtx x, rtx y, rtx z)
> +{
> +  rtx insn, last;
> +
> +  last = get_last_insn ();
> +  insn = emit_insn (gen_rtx_SET (VOIDmode, x,
> +  gen_rtx_PLUS (GET_MODE (y), y, z)));
> +  if (recog_memoized (insn) < 0)
> +{
> +  delete_insns_since (last);
> +  insn = NULL_RTX;
> +}
> +  return insn;
> +}
> +
> +/* Emit insn x = x + y.  Return the insn.  We use gen_add2_insn as the
> +   last resort.  */
> +static rtx
> +emit_add2_insn (rtx x, rtx y)
> +{
> +  rtx insn;
> +
> +  insn = emit_add3_insn (x, x, y);
> +  if (insn == NULL_RTX)
> +{
> +  insn = gen_add2_insn (x, y);
> +  if (insn != NULL_RTX)
> + emit_insn (insn);
> +}
> +  return insn;
> +}

Why is gen_add2_insn allowed and gen_add3_insn not?  They both use
the same add3 optab, so one seems just as likely as the other
to clobber the flags.

Maybe it would help to have an addptr3 optab that can only be used
for values that are known to be pointers and that is specifically not
allowed to clobber the flags.  Or an "lea2" optab that is required
to accept all "m" addresses.

> @@ -306,12 +342,14 @@ lra_emit_add (rtx x, rtx y, rtx z)
> || (disp != NULL_RTX && ! CONSTANT_P (disp))
> || (scale != NULL_RTX && ! CONSTANT_P (scale)))
>   {
> -   /* It is not an address generation.   Probably we have no 3 op
> -  add.  Last chance is to use 2-op add insn.  */
> +   /* Probably we have no 3 op add.  Last chance is to use 2-op
> +  add insn.  To succeed, don't move Z to X as an address
> +  segment always comes in Y.  Otherwise, we might fail when
> +  adding the address segment to register.  */

Do you mean "segment" in the x86 sense, or something else?  I still don't
really understand.

Thanks,
Richard

C++ PATCH to build a TEMPLATE_DECL for a partial specialization

2013-06-20 Thread Jason Merrill

It has always seemed odd that we didn't build a TEMPLATE_DECL for a 
partial specialization, and with the ongoing concepts work it has become 
more important to have one so that we can attach any requirements to it. 
 Note that there is still no way to get from the type of a partial 
specialization to its template other than by way of the primary 
template's list of partial specializations.


The second patch adds some missing _CHECK uses.

Tested x86_64-pc-linux-gnu, applying to trunk.
commit 5961bf601b6e2cf859d0cedf9b6a730e2160d8f4
Author: Jason Merrill 
Date:   Wed Jun 19 14:39:17 2013 -0400

	* pt.c (process_partial_specialization): Build a TEMPLATE_DECL for
	a partial specialization.
	(tsubst_decl): Don't clobber CLASSTYPE_TI_TEMPLATE of a partial
	specialization.
	(most_specialized_class): Adjust.

diff --git a/gcc/cp/cp-tree.h b/gcc/cp/cp-tree.h
index 7896386..cf54acf 100644
--- a/gcc/cp/cp-tree.h
+++ b/gcc/cp/cp-tree.h
@@ -3739,11 +3739,10 @@ more_aggr_init_expr_args_p (const aggr_init_expr_arg_iterator *iter)
specializations of this template.  (Full specializations are not
recorded on this list.)  The TREE_PURPOSE holds the arguments used
in the partial specialization (e.g., for `template  struct
-   S' this will be `T*'.)  The arguments will also include
-   any outer template arguments.  The TREE_VALUE holds the innermost
-   template parameters for the specialization (e.g., `T' in the
-   example above.)  The TREE_TYPE is the _TYPE node for the partial
-   specialization.
+   S' this will be `T*, int'.)  The arguments will also include
+   any outer template arguments.  The TREE_VALUE holds the TEMPLATE_DECL
+   for the partial specialization.  The TREE_TYPE is the _TYPE node for
+   the partial specialization.
 
This list is not used for other templates.  */
 #define DECL_TEMPLATE_SPECIALIZATIONS(NODE) \
@@ -3813,9 +3812,10 @@ more_aggr_init_expr_args_p (const aggr_init_expr_arg_iterator *iter)
 #define SET_DECL_SELF_REFERENCE_P(NODE) \
   (DECL_LANG_FLAG_4 (NODE) = 1)
 
-/* A `primary' template is one that has its own template header.  A
-   member function of a class template is a template, but not primary.
-   A member template is primary.  Friend templates are primary, too.  */
+/* A `primary' template is one that has its own template header and is not
+   a partial specialization.  A member function of a class template is a
+   template, but not primary.  A member template is primary.  Friend
+   templates are primary, too.  */
 
 /* Returns the primary template corresponding to these parameters.  */
 #define DECL_PRIMARY_TEMPLATE(NODE) \
diff --git a/gcc/cp/pt.c b/gcc/cp/pt.c
index 5e3cf07..4a0f411 100644
--- a/gcc/cp/pt.c
+++ b/gcc/cp/pt.c
@@ -4246,8 +4246,16 @@ process_partial_specialization (tree decl)
   /* We should only get here once.  */
   gcc_assert (!COMPLETE_TYPE_P (type));
 
+  tree tmpl = build_template_decl (decl, current_template_parms,
+   DECL_MEMBER_TEMPLATE_P (maintmpl));
+  TREE_TYPE (tmpl) = type;
+  DECL_TEMPLATE_RESULT (tmpl) = decl;
+  SET_DECL_TEMPLATE_SPECIALIZATION (tmpl);
+  DECL_TEMPLATE_INFO (tmpl) = build_template_info (maintmpl, specargs);
+  DECL_PRIMARY_TEMPLATE (tmpl) = maintmpl;
+
   DECL_TEMPLATE_SPECIALIZATIONS (maintmpl)
-= tree_cons (specargs, inner_parms,
+= tree_cons (specargs, tmpl,
  DECL_TEMPLATE_SPECIALIZATIONS (maintmpl));
   TREE_TYPE (DECL_TEMPLATE_SPECIALIZATIONS (maintmpl)) = type;
 
@@ -10091,7 +10099,10 @@ tsubst_decl (tree t, tree args, tsubst_flags_t complain)
 	  RETURN (error_mark_node);
 
 	TREE_TYPE (r) = new_type;
-	CLASSTYPE_TI_TEMPLATE (new_type) = r;
+	/* For a partial specialization, we need to keep pointing to
+	   the primary template.  */
+	if (!DECL_TEMPLATE_SPECIALIZATION (t))
+	  CLASSTYPE_TI_TEMPLATE (new_type) = r;
 	DECL_TEMPLATE_RESULT (r) = TYPE_MAIN_DECL (new_type);
 	DECL_TI_ARGS (r) = CLASSTYPE_TI_ARGS (new_type);
 	DECL_CONTEXT (r) = TYPE_CONTEXT (new_type);
@@ -18115,7 +18126,8 @@ most_specialized_class (tree type, tree tmpl, tsubst_flags_t complain)
 {
   tree partial_spec_args;
   tree spec_args;
-  tree parms = TREE_VALUE (t);
+  tree spec_tmpl = TREE_VALUE (t);
+  tree orig_parms = DECL_INNERMOST_TEMPLATE_PARMS (spec_tmpl);
 
   partial_spec_args = CLASSTYPE_TI_ARGS (TREE_TYPE (t));
 
@@ -18123,24 +18135,14 @@ most_specialized_class (tree type, tree tmpl, tsubst_flags_t complain)
 
   if (outer_args)
 	{
-	  int i;
-
 	  /* Discard the outer levels of args, and then substitute in the
 	 template args from the enclosing class.  */
 	  partial_spec_args = INNERMOST_TEMPLATE_ARGS (partial_spec_args);
 	  partial_spec_args = tsubst_template_args
 	(partial_spec_args, outer_args, tf_none, NULL_TREE);
 
-	  /* PARMS already refers to just the innermost parms, but the
-	 template parms in partial_spec_args had their levels lowered
-	 by tsubst, so we need to

Re: [c++-concepts] code review

2013-06-20 Thread Jason Merrill


On 06/20/2013 01:23 PM, Jason Merrill wrote:

Since Gaby prefers SVN, let's keep using the SVN branch; it really isn't
much less convenient than a git-only branch.  The main difference is
'git svn rebase'/'git svn dcommit' instead of 'git pull'/'git push'.


The one caveat is that git-svn historically hasn't handled merges very 
well.  I think git-svn v1.8.3 or newer can do the right thing, but I 
haven't tested it, so it's probably best to keep doing merges with the 
svn client for the time being.


Jason

Re: [PATCH 1/2] Improve __atomic_clear/test_and_set documentation

2013-06-20 Thread Richard Henderson

On 06/20/2013 11:14 AM, Andi Kleen wrote:
>>> +It should be only used for operands of type bool or atomic_flag. For 
>>> +other types only part of the value may be set.
>>
>> @code{bool}.  We have no definition for atomic_flag.  Perhaps just 
>> @code{char}
>> for now?
>>
>>> +It should be only used for operands of type bool or atomic_flag and 
>>
>> Same.
>>
>>> +in conjunction with __atomic_test_and_set.
>>
>> @code{__atomic_test_and_set}.
> 
> Ok with these changes?

Yeah.


r~

[PATCH] Set $ac_aux_dir before use in libdecnumber/configure

2013-06-20 Thread Simon Baldwin

Set $ac_aux_dir before use in libdecnumber/configure.

libdecnumber/configure uses $ac_aux_dir before it is set, causing incorrect
MISSING value.  Fix with explicit AC_CONFIG_AUX_DIR.

Bootstrapped for c/c++.  Okay for trunk?


libdecnumber/ChangeLog
2013-06-20  Simon Baldwin  

* configure.ac: Add AC_CONFIG_AUX_DIR.
* configure: Regenerated.


Index: libdecnumber/configure.ac
===
--- libdecnumber/configure.ac   (revision 200246)
+++ libdecnumber/configure.ac   (working copy)
@@ -23,6 +23,7 @@ AC_PREREQ(2.64)
 AC_INIT(libdecnumber, [ ], gcc-b...@gcc.gnu.org, libdecnumber)
 AC_CONFIG_SRCDIR(decNumber.h)
 AC_CONFIG_MACRO_DIR(../config)
+AC_CONFIG_AUX_DIR(..)
 
 # Checks for programs.
 AC_PROG_MAKE_SET
Index: libdecnumber/configure
===
--- libdecnumber/configure  (revision 200246)
+++ libdecnumber/configure  (working copy)
@@ -2220,6 +2220,29 @@ ac_compiler_gnu=$ac_cv_c_compiler_gnu
 
 
 
+ac_aux_dir=
+for ac_dir in .. "$srcdir"/..; do
+  for ac_t in install-sh install.sh shtool; do
+if test -f "$ac_dir/$ac_t"; then
+  ac_aux_dir=$ac_dir
+  ac_install_sh="$ac_aux_dir/$ac_t -c"
+  break 2
+fi
+  done
+done
+if test -z "$ac_aux_dir"; then
+  as_fn_error "cannot find install-sh, install.sh, or shtool in .. 
\"$srcdir\"/.." "$LINENO" 5
+fi
+
+# These three variables are undocumented and unsupported,
+# and are intended to be withdrawn in a future Autoconf release.
+# They can cause serious problems if a builder's source tree is in a directory
+# whose full name contains unusual characters.
+ac_config_guess="$SHELL $ac_aux_dir/config.guess"  # Please don't use this var.
+ac_config_sub="$SHELL $ac_aux_dir/config.sub"  # Please don't use this var.
+ac_configure="$SHELL $ac_aux_dir/configure"  # Please don't use this var.
+
+
 
 # Checks for programs.
 { $as_echo "$as_me:${as_lineno-$LINENO}: checking whether ${MAKE-make} sets 
\$(MAKE)" >&5
@@ -4458,29 +4481,6 @@ else
 fi
 
 
-ac_aux_dir=
-for ac_dir in "$srcdir" "$srcdir/.." "$srcdir/../.."; do
-  for ac_t in install-sh install.sh shtool; do
-if test -f "$ac_dir/$ac_t"; then
-  ac_aux_dir=$ac_dir
-  ac_install_sh="$ac_aux_dir/$ac_t -c"
-  break 2
-fi
-  done
-done
-if test -z "$ac_aux_dir"; then
-  as_fn_error "cannot find install-sh, install.sh, or shtool in \"$srcdir\" 
\"$srcdir/..\" \"$srcdir/../..\"" "$LINENO" 5
-fi
-
-# These three variables are undocumented and unsupported,
-# and are intended to be withdrawn in a future Autoconf release.
-# They can cause serious problems if a builder's source tree is in a directory
-# whose full name contains unusual characters.
-ac_config_guess="$SHELL $ac_aux_dir/config.guess"  # Please don't use this var.
-ac_config_sub="$SHELL $ac_aux_dir/config.sub"  # Please don't use this var.
-ac_configure="$SHELL $ac_aux_dir/configure"  # Please don't use this var.
-
-
 # Make sure we can run config.sub.
 $SHELL "$ac_aux_dir/config.sub" sun4 >/dev/null 2>&1 ||
   as_fn_error "cannot run $SHELL $ac_aux_dir/config.sub" "$LINENO" 5

Re: [PATCH 1/2] Improve __atomic_clear/test_and_set documentation

2013-06-20 Thread Andi Kleen

> > +It should be only used for operands of type bool or atomic_flag. For 
> > +other types only part of the value may be set.
> 
> @code{bool}.  We have no definition for atomic_flag.  Perhaps just @code{char}
> for now?
> 
> > +It should be only used for operands of type bool or atomic_flag and 
> 
> Same.
> 
> > +in conjunction with __atomic_test_and_set.
> 
> @code{__atomic_test_and_set}.

Ok with these changes?

-Andi


-- 
a...@linux.intel.com -- Speaking for myself only.

[PATCH, i386]: Fix PR57655, ICE: in create_pre_exit, at mode-switching.c:418

2013-06-20 Thread Uros Bizjak

Hello!

x86_64 ABI mandates long double returns in x87 register, so it is
invalid to use --mno-fp-ret-in-387. It just happened that vzeroupper
insertion mode-switching pass tripped on invalid register usage.

2013-06-20  Uros Bizjak  

PR target/57655
* config/i386/i386.c (construct_container): Report error if
long double is used with disabled x87 float returns.

testsuite/ChangeLog:

2013-06-20  Uros Bizjak  

PR target/57655
* gcc.target/i386/pr57655.c: New test.

Patch was tested on x86_64-pc-linux-gnu {,-m32} and was committed to
mainline SVN. The patch will be backported to all release branches.

Uros.
Index: config/i386/i386.c
===
--- config/i386/i386.c  (revision 200247)
+++ config/i386/i386.c  (working copy)
@@ -6498,7 +6498,7 @@ construct_container (enum machine_mode mode, enum
 
   /* Likewise, error if the ABI requires us to return values in the
  x87 registers and the user specified -mno-80387.  */
-  if (!TARGET_80387 && in_return)
+  if (!TARGET_FLOAT_RETURNS_IN_80387 && in_return)
 for (i = 0; i < n; i++)
   if (regclass[i] == X86_64_X87_CLASS
  || regclass[i] == X86_64_X87UP_CLASS
Index: testsuite/gcc.target/i386/pr57655.c
===
--- testsuite/gcc.target/i386/pr57655.c (revision 0)
+++ testsuite/gcc.target/i386/pr57655.c (working copy)
@@ -0,0 +1,10 @@
+/* { dg-do compile } */
+/* { dg-options "-mavx -mvzeroupper -mno-fp-ret-in-387" }
+
+/* { dg-error "x87 register return with x87 disabled" "" { target { ! ia32 } } 
8 } */
+
+long double
+foo (long double x)
+{
+  return __builtin_ilogbl (x);
+}

Re: [PATCH 2/2] Fix HLE example in manual

2013-06-20 Thread Richard Henderson

On 06/20/2013 06:20 AM, Andi Kleen wrote:
> gcc/:
> 2013-06-13  Andi Kleen  
> 
>   * doc/extend.texi: Dont use __atomic_clear in HLE
>   example.  Fix typo.

Ok.


r~

Re: [PATCH 1/2] Improve __atomic_clear/test_and_set documentation

2013-06-20 Thread Richard Henderson

On 06/20/2013 06:20 AM, Andi Kleen wrote:
> From: Andi Kleen 
> 
> Document that __atomic_clear and __atomic_test_and_set should
> only be used with bool.
> 
> gcc/:
> 2013-06-13  Andi Kleen  
> 
>   * doc/extend.texi: Document that __atomic_clear and
> __atomic_test_and_set should only be used with bool.
> ---
>  gcc/doc/extend.texi | 6 ++
>  1 file changed, 6 insertions(+)
> 
> diff --git a/gcc/doc/extend.texi b/gcc/doc/extend.texi
> index 1e1f8b3..aa3abef 100644
> --- a/gcc/doc/extend.texi
> +++ b/gcc/doc/extend.texi
> @@ -7438,6 +7438,8 @@ This built-in function performs an atomic test-and-set 
> operation on
>  the byte at @code{*@var{ptr}}.  The byte is set to some implementation
>  defined nonzero ``set'' value and the return value is @code{true} if and only
>  if the previous contents were ``set''.
> +It should be only used for operands of type bool or atomic_flag. For 
> +other types only part of the value may be set.

@code{bool}.  We have no definition for atomic_flag.  Perhaps just @code{char}
for now?

> +It should be only used for operands of type bool or atomic_flag and 

Same.

> +in conjunction with __atomic_test_and_set.

@code{__atomic_test_and_set}.



r~

Re: [PATCH] Cilk Plus Array Notation for C++

2013-06-20 Thread Richard Henderson

> +/* Returns true if there is a length mismatch among exprssions that are at 
> the
> +   same dimension and one the same side of the equal sign.  The Array 
> notation
> +   lengths (LIST) is passed in as a 2D vector of trees.  */
> +
> +static bool
> +cp_length_mismatch_in_expr_p (location_t loc, vec >list)

Other than working on a vec<>, how does this differ from the

  length_mismatch_in_expr_p (location_t loc, tree **list, size_t x, size_t y)

in c-family?

> +static inline void
> +clear_all_an_vectors (vec  > *value, vec > *start,
> +   vec  > *length, vec > *stride,
> +   vec > *is_vector, vec > *count_down,
> +   vec *incr, vec *cmp, vec *ind_init,
> +   vec *var)
> +{
> +  value->release ();
> +  start->release ();
> +  length->release ();
> +  stride->release ();
> +  is_vector->release ();
> +  count_down->release ();
> +  incr->release ();
> +  cmp->release ();
> +  ind_init->release ();
> +  var->release ();
> +}

10 arrays kept in sync?  It's at this point that we should realize that there's
a mistake in how we're arranging the data.  This should be one array, with the
element being a structure.

That should significantly improve quite a lot of the functions in this file.

> +  init = ARITHMETIC_TYPE_P (new_var_type) ? build_zero_cst (new_var_type)
> + : integer_zero_node;

Why the test for ARITHMETIC_TYPE_P?  Surely we always want something of
new_var_type.

> +  if (an_type == BUILT_IN_CILKPLUS_SEC_REDUCE_ADD
> +  || an_type == BUILT_IN_CILKPLUS_SEC_REDUCE_MUL)
> +new_expr = build_x_modify_expr (location, *new_var, code, func_parm,
> + tf_warning_or_error);
> +  if (an_type == BUILT_IN_CILKPLUS_SEC_REDUCE_ALL_ZERO
> +  || an_type == BUILT_IN_CILKPLUS_SEC_REDUCE_ALL_NONZERO
> +  || an_type == BUILT_IN_CILKPLUS_SEC_REDUCE_ANY_ZERO
> +  || an_type == BUILT_IN_CILKPLUS_SEC_REDUCE_ANY_NONZERO)

Why not another switch statement here?


r~

Re: [google gcc-4_8] Change size accounting during inlining in lipo mode

2013-06-20 Thread Xinliang David Li

Looks good.

thanks,

David

On Thu, Jun 20, 2013 at 10:38 AM, Easwaran Raman  wrote:
> In lipo mode, this patch updates the overall unit size only when the
> eventual function to which the callee is inlined is in primary module.
> This is to avoid the situation where the module growth budget is used
> up by inlines into auxiliary module functions that never get inlined
> into some primary module function. Ok for google/gcc-4_8 branch?
>
> Thanks,
> Easwaran

[google gcc-4_8] Change size accounting during inlining in lipo mode

2013-06-20 Thread Easwaran Raman

In lipo mode, this patch updates the overall unit size only when the
eventual function to which the callee is inlined is in primary module.
This is to avoid the situation where the module growth budget is used
up by inlines into auxiliary module functions that never get inlined
into some primary module function. Ok for google/gcc-4_8 branch?

Thanks,
Easwaran


lipo_inline.patch
Description: Binary data

Re: [c++-concepts] code review

2013-06-20 Thread Jason Merrill


On 06/20/2013 11:50 AM, Andrew Sutton wrote:

The c++-concepts SVN branch won't come in with a clone; you need to add it
to the fetch list with

git config --add remote.origin.fetch
refs/remotes/c++-concepts:refs/remotes/origin/c++-concepts

Then you can check out a local branch based on it.


Already did that. I probably need to do this, right?

   git checkout -b c++-concepts origin/c++-concepts


Right.


I just setup using a git-only branch, so the latter. I set up to push
changes to asutton/c++-concepts, so submitted should be visible there.


Since Gaby prefers SVN, let's keep using the SVN branch; it really isn't 
much less convenient than a git-only branch.  The main difference is 
'git svn rebase'/'git svn dcommit' instead of 'git pull'/'git push'.



Out of curiosity, how will reviews work? Just walk through the most
recently pushed changes on my branch?


That's what I was thinking, yes.


I should be able to produce diffs to send to Gaby also. I should also
be able to use those to patch an SVN checkout and commit from a
different repo.


git-svn is much more convenient :)

Jason

[PATCH] Fix PR tree-optimization/57660

2013-06-20 Thread Jeff Law

This test is failing on targets with low branch costs.  For those 
targets we expand the code radically differently and the test really 
doesn't make much sense -- much like vrp87.c.


This patch copies the list of targets where the test is meaningless from 
vrp87.c.


Installed onto the trunk.

diff --git a/gcc/testsuite/ChangeLog b/gcc/testsuite/ChangeLog
index 5adbc01..f2d9895 100644
--- a/gcc/testsuite/ChangeLog
+++ b/gcc/testsuite/ChangeLog
@@ -1,5 +1,9 @@
 2013-06-20  Jeff Law  
 
+   PR tree-optimization/57660
+   * gcc.dg/tree-ssa/forwprop-28.c: Don't run test on various targets
+   based on their branch cost.
+
* gcc.dg/tree-ssa/forwprop-28.c: Add missing dg-final.
 
 2013-06-20  Tobias Burnus  
diff --git a/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c 
b/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c
index 8e870b9..a64987b 100644
--- a/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c
+++ b/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c
@@ -1,4 +1,4 @@
-/* { dg-do compile } */
+/* { dg-do compile { target { ! "m68k*-*-* mmix*-*-* mep*-*-* bfin*-*-* 
v850*-*-* picochip*-*-* moxie*-*-* cris*-*-* m32c*-*-* fr30*-*-* mcore*-*-* 
powerpc*-*-* xtensa*-*-*"} } } */
 /* { dg-options "-O2 -fdump-tree-forwprop1" } */
 
 extern char *frob (void);

Re: [patch] libitm: Fix handling of reentrancy in the HTM fastpath

2013-06-20 Thread Andreas Krebbel

On 20/06/13 05:13, Peter Bergner wrote:
> On Thu, 2013-06-20 at 00:51 +0200, Torvald Riegel wrote:
>> On Wed, 2013-06-19 at 14:43 -0500, Peter Bergner wrote: 
> I'd still like to hear from Andreas, whether the reentrant.c test case
> with both patches, now works on S390.

The patches fix the testcase also on s390. Thanks!

Bye,

-Andreas-

Re: [PATCH, x86] Use vector moves in memmove expanding

2013-06-20 Thread Michael Zolotukhin

It seems that one of the tests needed a small fix.
Attached is a corrected version.

On 20 June 2013 17:16, Michael Zolotukhin
 wrote:
> Hi,
> I added two tests to verify we generate vector instructions when
> vector_loop is used.
> Is the patch ok with that change?
>
> Thanks, Michael
>
> On 5 June 2013 18:10, Michael Zolotukhin  
> wrote:
>> I'll prepare some tests shortly,
>> What about the rest questions?
>>
>> Thanks, Michael
>>
>> On 15 May 2013 19:45, H.J. Lu  wrote:
>>> On Wed, May 15, 2013 at 5:47 AM, Michael Zolotukhin
>>>  wrote:
 Hi HJ,
> You use Pmode as the largest integer mode.  Is word_mode better
> than Pmode since word_mode is DI and Pmode may be SI for x32.
 I updated the patch to use word_mode instead of Pmode.  Bootstrap,
 make check and Specs2000 are ok.

 Thanks, Michael

 On 14 May 2013 19:55, H.J. Lu  wrote:
> On Tue, May 14, 2013 at 7:34 AM, Michael Zolotukhin
>  wrote:
>> Hi,
>> I attached an updated version of the patch.  Looks like the 64-bit issue 
>> is
>> resolved in it (now a vector mode is explicitly chosen instead of TI- or
>> another integer mode).  Also, some of remarks are fixed in it - some
>> others are still not changed, because I don't know what to do with them 
>> right
>> now (see my questions in the previous letter).
>>
>> Could you take a look at the patch?
>>
>> I checked it on i386/x86_64 bootstrap and make check, and on 
>> Specs2000/2006
>> (I checked only stability).
>>
>
> You use Pmode as the largest integer mode.  Is word_mode better
> than Pmode since word_mode is DI and Pmode may be SI for x32.
>
>
>>>
>>> You should include tests to verify that it works as
>>> expected.
>>>
>>>
>>> --
>>> H.J.
>>
>>
>>
>> --
>> ---
>> Best regards,
>> Michael V. Zolotukhin,
>> Software Engineer
>> Intel Corporation.
>
>
>
> --
> ---
> Best regards,
> Michael V. Zolotukhin,
> Software Engineer
> Intel Corporation.



-- 
---
Best regards,
Michael V. Zolotukhin,
Software Engineer
Intel Corporation.


memmov-5.patch
Description: Binary data

Re: [PATCH] libgomp testsuite fixes

2013-06-20 Thread Mike Stump

On May 30, 2013, at 12:59 PM, Cesar Philippidis  wrote:
> Here is a patch from our backlog at Mentor Graphics that addresses a 
> libgomp issue where setting ENABLE_LTO=1 in site.exp causes the following 
> error with dejagnu

> Is it OK for trunk?

Ok.

Committed revision 200253.

> 2013-05-30  Iain Sandoe  
>   Cesar Philippidis  
> 
>   libgomp/
>   * testsuite/lib/libgomp.exp: Reorder lib loads into dependency order.
>   Do not load_gcc_lib gcc-dg.exp and add a comment as to why.
>   * testsuite/libgomp.c/c.exp: load_gcc_lib gcc-dg.exp.
>   * testsuite/libgomp.fortran/fortran.exp: Likewise.
>   * testsuite/libgomp.graphite/graphite.exp: Likewise.
>   * testsuite/libgomp.c++/c++.exp: load_gcc_lib gcc-dg.exp.
>   Use dg-runtest rather than gfortran-dg-runtest.

Re: [PATCH] libitm testsuite fixes

2013-06-20 Thread Mike Stump

On May 30, 2013, at 1:02 PM, Cesar Philippidis  wrote:
> Here is a patch from our backlog that addresses a libitm issue where 
> setting ENABLE_LTO=1

> Is this OK for trunk?

Ok.

Committed revision 200252.

> Cesar Philippidis
> 
> 
> 2013-05-30  Iain Sandoe  
>   Cesar Philippidis  
> 
>   libitm/
>   * testsuite/lib/libitm.exp: Reorder lib loads into dependency order.
>   Do not load_gcc_lib gcc-dg.exp and add a comment as to why.
>   * testsuite/libitm.c/c.exp: load_gcc_lib gcc-dg.exp.
>   * testsuite/libitm.c++/c++.exp: load_gcc_lib gcc-dg.exp.

Re: [patch] libitm: Fix handling of reentrancy in the HTM fastpath

2013-06-20 Thread Richard Henderson

On 06/20/2013 02:49 AM, Torvald Riegel wrote:
> commit c8352d4d9fa5cfa3453a61c581956835de9753e5
> Author: Torvald Riegel 
> Date:   Thu Jun 20 00:46:59 2013 +0200
> 
> libitm: Handle HTM fastpath in status query functions.

Ok.


r~

Re: [patch] libitm: Fix handling of reentrancy in the HTM fastpath

2013-06-20 Thread Richard Henderson

On 06/19/2013 07:57 AM, Torvald Riegel wrote:
> commit 185af84e365e1bae31aea5afd6e67e81f3c32c72
> Author: Torvald Riegel 
> Date:   Wed Jun 19 16:42:24 2013 +0200
> 
> libitm: Fix handling of reentrancy in the HTM fastpath.
> 
> PR libitm/57643

Ok.


r~

[PATCH] Add missing dg-final to new test

2013-06-20 Thread Jeff Law

The newly added forwprop-28.c test was missing a dg-final clause to 
remove the debugging dump is scans.  This trivial patch adds the missing 
dg-final.


Installed as obvious.

diff --git a/gcc/testsuite/ChangeLog b/gcc/testsuite/ChangeLog
index 6dca973..5adbc01 100644
--- a/gcc/testsuite/ChangeLog
+++ b/gcc/testsuite/ChangeLog
@@ -1,3 +1,7 @@
+2013-06-20  Jeff Law  
+
+   * gcc.dg/tree-ssa/forwprop-28.c: Add missing dg-final.
+
 2013-06-20  Tobias Burnus  
 
PR fortran/57633
diff --git a/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c 
b/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c
index 09d96ad..8e870b9 100644
--- a/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c
+++ b/gcc/testsuite/gcc.dg/tree-ssa/forwprop-28.c
@@ -74,3 +74,5 @@ test_8 (int code)
 }
 
 /* { dg-final { scan-tree-dump-times "Replaced" 8 "forwprop1"} } */
+/* { dg-final { cleanup-tree-dump "forwprop1" } } */
+

Re: [c++-concepts] code review

2013-06-20 Thread Andrew Sutton

> The c++-concepts SVN branch won't come in with a clone; you need to add it
> to the fetch list with
>
> git config --add remote.origin.fetch
> refs/remotes/c++-concepts:refs/remotes/origin/c++-concepts
>
> Then you can check out a local branch based on it.

Already did that. I probably need to do this, right?

  git checkout -b c++-concepts origin/c++-concepts



>> How will this change the workflow? What's the process for submitting
>> changes using git?
>
>
> That depends whether you want to keep it as an SVN branch or switch to a
> git-only branch.  Both processes are documented in the wiki page:
>
> http://gcc.gnu.org/wiki/GitMirror#Commit_upstream_.28git-svn.29
> http://gcc.gnu.org/wiki/GitMirror#Git-only_branches
>
> If it's unclear, please let me know so I can update the page.

I just setup using a git-only branch, so the latter. I set up to push
changes to asutton/c++-concepts, so submitted should be visible there.

Out of curiosity, how will reviews work? Just walk through the most
recently pushed changes on my branch?

I should be able to produce diffs to send to Gaby also. I should also
be able to use those to patch an SVN checkout and commit from a
different repo.

Andrew

Re: [PATCH] Redirect calls to non-functions to builtin_unreachable

2013-06-20 Thread Martin Jambor

sHi,

On Thu, Jun 20, 2013 at 03:47:11PM +0100, Marcus Shawcroft wrote:
> Hi,  I've been looking at an issue in mysql compilation which appears
> to be due to this patch.
> 
> On 10 May 2013 18:27, Martin Jambor  wrote:
> > Hi,
> >
> > as we discover targets of previously indirect calls in ipa-inline and
> > ipa-cp, we sometimes figure out that the targets are not a function.
> > One typical example is when NULL is passed as a function pointer
> 
> OK, this part makes sense to me.
> 
> > parameter, another is when C++ member-pointer points to a virtual
> > function and the overloaded field of the structure which can also hold
> > pointers to non-virtual methods contains odd integer constants.
> 
> I'm struggling to understand why such a member-pointer call would be
> illegal in a well formed program.
> 
> Attached is a fragment of code that demonstrates the issue I've been
> looking at.  When compiled at -O3 the 047i.inline dump tells me that:
> 
> ipa-prop: Discovered direct call to non-function in unsigned int
> A::foo(unsigned int (H::*)() const) const/11, making it unreachable.
>   not inlinable: unsigned int A::foo(unsigned int (H::*)() const)
> const/11 -> void __builtin_unreachable()/12, function body not
> available
> 
> This behavior appears to be the explicit intent of the original patch,
> the call to the member function pointer has been replaced with
> __builtin_unreachable, but that looks like a legitimate  call to me.
> What am I missing?

Hm, the reason why I did this was that I misremembered that each
branch, the one for virtual calls and the one for direct calls.  But
when I checked dumps, I realized it was not so, there is only one call
when the branches join.  I'll fix this promptly.

Thanks for the report,

Martin

Re: [c++-concepts] code review

2013-06-20 Thread Gabriel Dos Reis

Jason Merrill  writes:

| On 06/20/2013 11:22 AM, Gabriel Dos Reis wrote:
| > Jason Merrill  writes:
| >
| > | On 06/20/2013 10:17 AM, Andrew Sutton wrote:
| > | > It looks like I should be able to switch directly to the c++-concepts
| > | > branch. Or is there some configuration that has to happen on the
| > | > remote site?
| > |
| > | The c++-concepts SVN branch won't come in with a clone; you need to
| > | add it to the fetch list with
| > |
| > | git config --add remote.origin.fetch
| > | refs/remotes/c++-concepts:refs/remotes/origin/c++-concepts
| > |
| > | Then you can check out a local branch based on it.
| >
| > Does that mean I also need to switch to git in order to continue
| > maintaining the branch?  I would rather stay with SVN for now.
| 
| No, the above is just how you check out the SVN branch under git.

Great!  Thanks for the clarification.

-- Gaby

Re: [c++-concepts] code review

2013-06-20 Thread Jason Merrill


On 06/20/2013 11:22 AM, Gabriel Dos Reis wrote:

Jason Merrill  writes:

| On 06/20/2013 10:17 AM, Andrew Sutton wrote:
| > It looks like I should be able to switch directly to the c++-concepts
| > branch. Or is there some configuration that has to happen on the
| > remote site?
|
| The c++-concepts SVN branch won't come in with a clone; you need to
| add it to the fetch list with
|
| git config --add remote.origin.fetch
| refs/remotes/c++-concepts:refs/remotes/origin/c++-concepts
|
| Then you can check out a local branch based on it.

Does that mean I also need to switch to git in order to continue
maintaining the branch?  I would rather stay with SVN for now.


No, the above is just how you check out the SVN branch under git.


I prefer an SVN branch for now.


OK.

Jason

Re: [c++-concepts] code review

2013-06-20 Thread Gabriel Dos Reis

Jason Merrill  writes:

| On 06/20/2013 10:17 AM, Andrew Sutton wrote:
| > It looks like I should be able to switch directly to the c++-concepts
| > branch. Or is there some configuration that has to happen on the
| > remote site?
| 
| The c++-concepts SVN branch won't come in with a clone; you need to
| add it to the fetch list with
| 
| git config --add remote.origin.fetch
| refs/remotes/c++-concepts:refs/remotes/origin/c++-concepts
| 
| Then you can check out a local branch based on it.

Does that mean I also need to switch to git in order to continue
maintaining the branch?  I would rather stay with SVN for now.

| > How will this change the workflow? What's the process for submitting
| > changes using git?
| 
| That depends whether you want to keep it as an SVN branch or switch to
| a git-only branch.  Both processes are documented in the wiki page:

I prefer an SVN branch for now.
| 
| http://gcc.gnu.org/wiki/GitMirror#Commit_upstream_.28git-svn.29
| http://gcc.gnu.org/wiki/GitMirror#Git-only_branches
| 
| If it's unclear, please let me know so I can update the page.
| 
| Jason

-- Gaby

Re: [c++-concepts] code review

2013-06-20 Thread Gabriel Dos Reis

Andrew Sutton  writes:

| That works. I think the current patch addresses all of Jason's comments.

OK, that sounds good.

| I'll also create a github version of this branch, so can avoid email patches.

Well, actually I prefer the email patches because I can read them them
right away when I have only email connections (yes, I am one those
dinausors who still read emails with emacs remotely from a terminal.)
I wish I had the luxury of organizing everything around browsers, but I don't.

-- Gaby

Re: [c++-concepts] code review

2013-06-20 Thread Jason Merrill


On 06/20/2013 10:17 AM, Andrew Sutton wrote:

It looks like I should be able to switch directly to the c++-concepts
branch. Or is there some configuration that has to happen on the
remote site?


The c++-concepts SVN branch won't come in with a clone; you need to add 
it to the fetch list with


git config --add remote.origin.fetch 
refs/remotes/c++-concepts:refs/remotes/origin/c++-concepts


Then you can check out a local branch based on it.


How will this change the workflow? What's the process for submitting
changes using git?


That depends whether you want to keep it as an SVN branch or switch to a 
git-only branch.  Both processes are documented in the wiki page:


http://gcc.gnu.org/wiki/GitMirror#Commit_upstream_.28git-svn.29
http://gcc.gnu.org/wiki/GitMirror#Git-only_branches

If it's unclear, please let me know so I can update the page.

Jason

[PATCH] Changed type of array notation's struct field

2013-06-20 Thread Iyer, Balaji V

Hello Everyone,
One of the struct field used by array notations (struct inv_list) was 
accidentaly set to the type enum rid, but the correct value should be enum 
tree_code. This was changed in the following patch. This was causing a warning. 
The field is only used by C++ (which patch is still under review). It is 
committed as obvious. I am willing to revert this patch or modify it if anyone 
has objections.

2013-06-20  Balaji V. Iyer  

* array-notation-common.c (find_inv_trees): Removed an unwanted
typecasting.
* c-common.h (struct inv_list::additional_tcodes): Changed type from
enum rid to enum tree_code.


Thanks,

Balaji V. Iyer.


diff --git a/gcc/c-family/ChangeLog b/gcc/c-family/ChangeLog
index 3d8f68f..a0f195b 100644
Binary files a/gcc/c-family/ChangeLog and b/gcc/c-family/ChangeLog differ
diff --git a/gcc/c-family/array-notation-common.c b/gcc/c-family/array-notation-
common.c
index 489b67c..c82d7dc 100644
--- a/gcc/c-family/array-notation-common.c
+++ b/gcc/c-family/array-notation-common.c
@@ -484,7 +484,7 @@ find_inv_trees (tree *tp, int *walk_subtrees, void *data)
 tree codes such as TARGET_EXPR must be eliminated.  These codes are
 passed into additional_tcodes and are walked through and checked.  */
   for (ii = 0; ii < vec_safe_length (i_list->additional_tcodes); ii++)
-   if (TREE_CODE (*tp) == (enum rid)(*(i_list->additional_tcodes))[ii])
+   if (TREE_CODE (*tp) == (*(i_list->additional_tcodes))[ii])
  *walk_subtrees = 0;
 }
   return NULL_TREE;
diff --git a/gcc/c-family/c-common.h b/gcc/c-family/c-common.h
index 8eaf54f..82625d7 100644
--- a/gcc/c-family/c-common.h
+++ b/gcc/c-family/c-common.h
@@ -1157,7 +1157,7 @@ struct inv_list
 {
   vec *list_values;
   vec *replacement;
-  vec *additional_tcodes;
+  vec *additional_tcodes;
 };

 /* In array-notation-common.c.  */

Re: [PATCH] Redirect calls to non-functions to builtin_unreachable

2013-06-20 Thread Marcus Shawcroft

Hi,  I've been looking at an issue in mysql compilation which appears
to be due to this patch.

On 10 May 2013 18:27, Martin Jambor  wrote:
> Hi,
>
> as we discover targets of previously indirect calls in ipa-inline and
> ipa-cp, we sometimes figure out that the targets are not a function.
> One typical example is when NULL is passed as a function pointer

OK, this part makes sense to me.

> parameter, another is when C++ member-pointer points to a virtual
> function and the overloaded field of the structure which can also hold
> pointers to non-virtual methods contains odd integer constants.

I'm struggling to understand why such a member-pointer call would be
illegal in a well formed program.

Attached is a fragment of code that demonstrates the issue I've been
looking at.  When compiled at -O3 the 047i.inline dump tells me that:

ipa-prop: Discovered direct call to non-function in unsigned int
A::foo(unsigned int (H::*)() const) const/11, making it unreachable.
  not inlinable: unsigned int A::foo(unsigned int (H::*)() const)
const/11 -> void __builtin_unreachable()/12, function body not
available

This behavior appears to be the explicit intent of the original patch,
the call to the member function pointer has been replaced with
__builtin_unreachable, but that looks like a legitimate  call to me.
What am I missing?

Thanks
/Marcus

foo.cc
Description: Binary data

Re: [PATCH] ARMv6-M MI thunk fix

2013-06-20 Thread Cesar Philippidis

Ping.

Cesar


On 6/7/13 9:50 AM, Cesar Philippidis wrote:
> On 6/6/13 9:00 AM, Richard Earnshaw wrote:
>> The pipeline offset is 4 for Thumb2 as well.  So at the very least you
>> need to explain why your change doesn't apply then as well.
> 
> Yes some context is lost in that comment.  Thunks are usually emitted in
> ARM mode, except for Thumb-only targets.  Is the new comment OK?  If so,
> please check it in since I do not have SVN write access.
> 
> Thanks,
> Cesar
> 
> 
> 2013-06-07  Julian Brown  
>   Cesar Philippidis  
> 
>   gcc/
>   * config/arm/arm.c (arm_output_mi_thunk): Fix offset for
>   TARGET_THUMB1_ONLY.
>

Fix misoptimization of LTO firefox

2013-06-20 Thread Jan Hubicka

Index: ChangeLog
===
--- ChangeLog   (revision 200246)
+++ ChangeLog   (working copy)
@@ -1,3 +1,7 @@
+2013-06-20  Jan Hubicka  
+
+   * lto-cgraph.c (input_symtab): Do not set cgraph state.
+
 2013-06-20  Joern Rennecke 
 
PR rtl-optimization/57425
Index: lto/ChangeLog
===
--- lto/ChangeLog   (revision 200246)
+++ lto/ChangeLog   (working copy)
@@ -1,3 +1,7 @@
+2013-06-20  Jan Hubicka  
+
+   * lto.c (read_cgraph_and_symbols): Set cgraph state.
+
 2013-06-19  Jan Hubicka  
 
* lto-partition.c (add_references_to_partition): Use
Index: lto/lto.c
===
--- lto/lto.c   (revision 200246)
+++ lto/lto.c   (working copy)
@@ -3362,7 +3362,6 @@ read_cgraph_and_symbols (unsigned nfiles
 {
   unsigned int i, last_file_ix;
   FILE *resolution;
-  struct cgraph_node *node;
   int count = 0;
   struct lto_file_decl_data **decl_data;
   void **res;
@@ -3544,14 +3543,7 @@ read_cgraph_and_symbols (unsigned nfiles
 }
   lto_symtab_merge_symbols ();
   ggc_collect ();
-
-  /* FIXME: ipa_transforms_to_apply holds list of passes that have optimization
- summaries computed and needs to apply changes.  At the moment WHOPR only
- supports inlining, so we can push it here by hand.  In future we need to 
stream
- this field into ltrans compilation.  */
-  if (flag_ltrans)
-FOR_EACH_DEFINED_FUNCTION (node)
-  node->ipa_transforms_to_apply.safe_push ((ipa_opt_pass)&pass_ipa_inline);
+  cgraph_state = CGRAPH_STATE_IPA_SSA;
 
   timevar_pop (TV_IPA_LTO_CGRAPH_MERGE);
 
Index: lto-cgraph.c
===
--- lto-cgraph.c(revision 200246)
+++ lto-cgraph.c(working copy)
@@ -1451,8 +1451,6 @@ input_symtab (void)
   unsigned int j = 0;
   struct cgraph_node *node;
 
-  cgraph_state = CGRAPH_STATE_IPA_SSA;
-
   while ((file_data = file_data_vec[j++]))
 {
   const char *data;

Re: [c++-concepts] code review

2013-06-20 Thread Andrew Sutton

I didn't know it existed! Even better. Except that I'm not a git
expert. I'm reading through the docs on that site while I clone the
repo.

It looks like I should be able to switch directly to the c++-concepts
branch. Or is there some configuration that has to happen on the
remote site?

How will this change the workflow? What's the process for submitting
changes using git?

Andrew

On Thu, Jun 20, 2013 at 8:57 AM, Jason Merrill  wrote:
> On 06/20/2013 09:18 AM, Andrew Sutton wrote:
>>
>> I'll also create a github version of this branch, so can avoid email
>> patches.
>
>
> Why there rather than in the gcc.gnu.org git repository?
>
> http://gcc.gnu.org/wiki/GitMirror
>
> Jason
>

-- 
Andrew Sutton
andrew.n.sut...@gmail.com

Re: [PATCH] libitm testsuite fixes

2013-06-20 Thread Cesar Philippidis

Ping.

Cesar


On 5/30/13 1:02 PM, Cesar Philippidis wrote:
> Here is a patch from our backlog that addresses a libitm issue where 
> setting ENABLE_LTO=1 in site.exp causes the following error with dejagnu:
> 
> ERROR: (DejaGnu) proc "libitm_target_compile linker_plugin19344.c 
> linker_plugin19344.exe executable {{additional_flags=-flto 
> -fuse-linker-plugin}}" does not exist.
> 
> This problem usually does not occur since the default site.exp does not
> contain ENABLE_LTO=1. This patch has been tested both with and without
> our custom site.exp.
> 
> Is this OK for trunk? If so, please check it in since I do not have commit
> rights.
> 
> Cesar Philippidis
> 
> 
> 2013-05-30  Iain Sandoe  
>   Cesar Philippidis  
> 
>   libitm/
>   * testsuite/lib/libitm.exp: Reorder lib loads into dependency order.
>   Do not load_gcc_lib gcc-dg.exp and add a comment as to why.
>   * testsuite/libitm.c/c.exp: load_gcc_lib gcc-dg.exp.
>   * testsuite/libitm.c++/c++.exp: load_gcc_lib gcc-dg.exp.
>

Re: [PATCH] libgomp testsuite fixes

2013-06-20 Thread Cesar Philippidis

Ping.

Cesar


On 5/30/13 12:59 PM, Cesar Philippidis wrote:
> Here is a patch from our backlog at Mentor Graphics that addresses a 
> libgomp issue where setting ENABLE_LTO=1 in site.exp causes the following 
> error with dejagnu:
> 
> ERROR: (DejaGnu) proc "libgomp_target_compile linker_plugin9263.c 
> linker_plugin9263.exe executable {{additional_flags=-flto 
> -fuse-linker-plugin}}" does not exist.
> 
> This problem usually does not occur since the default site.exp does not
> contain ENABLE_LTO=1. I tested both with and without our custom site.exp.
> 
> Is it OK for trunk? If so, please check it in since I do not have commit
> rights.
> 
> Cesar Philippidis
> 
> 
> 2013-05-30  Iain Sandoe  
>   Cesar Philippidis  
> 
>   libgomp/
>   * testsuite/lib/libgomp.exp: Reorder lib loads into dependency order.
>   Do not load_gcc_lib gcc-dg.exp and add a comment as to why.
>   * testsuite/libgomp.c/c.exp: load_gcc_lib gcc-dg.exp.
>   * testsuite/libgomp.fortran/fortran.exp: Likewise.
>   * testsuite/libgomp.graphite/graphite.exp: Likewise.
>   * testsuite/libgomp.c++/c++.exp: load_gcc_lib gcc-dg.exp.
>   Use dg-runtest rather than gfortran-dg-runtest.
>

Re: [c++-concepts] code review

2013-06-20 Thread Jason Merrill


On 06/20/2013 09:18 AM, Andrew Sutton wrote:

I'll also create a github version of this branch, so can avoid email patches.


Why there rather than in the gcc.gnu.org git repository?

http://gcc.gnu.org/wiki/GitMirror

Jason

[PATCH][ARM][9/n] Partial IT block deprecation in ARMv8 AArch32 - bitwise ops

2013-06-20 Thread Kyrylo Tkachov

Hi all,

This patch adjusts the andsi3, iorsi3, xorsi3 patterns for -mrestrict-it. It
is done by adding 16-bit alternatives that are fit be cond_exec'd when
arm_restrict_it is on.

Tested arm-none-eabi on model and qemu with ARMv7 and ARMv8. Also tested as
part of the series with bootstrap on a Cortex-A15.

Ok for trunk?

Thanks,
Kyrill

2013-06-20  Kyrylo Tkachov  

* config/arm/arm.md (arm_andsi3_insn): Add alternatives for 16-bit
encoding.
(iorsi3_insn): Likewise.
(arm_xorsi3): Likewise.

10-bitwise-ops.patch
Description: Binary data

Re: [patch] libitm: Fix handling of reentrancy in the HTM fastpath

2013-06-20 Thread Peter Bergner

On Thu, 2013-06-20 at 11:49 +0200, Torvald Riegel wrote:
> You're right, that was missing for x86 as well.  Please see the updated
> second patch that is attached.  It additionally checks htm_fastpath to
> see whether we are actually using the HTM.  This variable is initialized
> to the value that htm_init() returns.  This should do the right thing I
> suppose.

Of course you're right too.  I totally missed we were already caching
that value.  Ok, I removed the unneeded call to htm_available() in my
htm_transaction_active() and all is well using your second patch.
Thanks again!

Peter

[PATCH][ARM][8/n] Partial IT block deprecation in ARMv8 AArch32 - mov* patterns and splitters

2013-06-20 Thread Kyrylo Tkachov

Hi all,

This patch adjusts the mov* patterns in the arm backend to generate code
appropriate for -mrestrict-it. The rules are:
moves between any two registers are allowed to be in IT blocks. mov immediate
are allowed if the immediate is 8-wide and the register is a low register.


The splitters that generate explicit cond_exec moves are adjusted to conform
to these rules, therefore making this patch self-contained.
The "Ts" constraint is introduced which evaluates to the low regs when
arm_restrict_it is on, and to the general regs otherwise.

Bootstrapped on Cortex-A15 and tested on model and qemu on arm-none-eabi for
ARMv7 and ARMv8.

Ok for trunk?

Thanks,
Kyrill

2013-06-20  Kyrylo Tkachov  

* config/arm/constraints.md (Ts): New constraint.
* config/arm/arm.md (arm_movqi_insn): Add alternatives for
16-bit encodings.
(compare_scc): Use "Ts" constraint for operand 0.
(ior_scc_scc): Likewise.
(and_scc_scc): Likewise.
(and_scc_scc_nodom): Likewise.
(ior_scc_scc_cmp): Likewise for operand 7.
(and_scc_scc_cmp): Likewise.
* config/arm/thumb2.md (thumb2_movsi_insn):
Add alternatives for 16-bit encodings.
(thumb2_movhi_insn): Likewise.
(thumb2_movsicc_insn): Likewise.
(thumb2_and_scc): Take 'and' outside cond_exec.  Use "Ts" constraint.
(thumb2_negscc): Use "Ts" constraint.
Move mvn instruction outside cond_exec block.
* config/arm/vfp.md (thumb2_movsi_vfp): Add alternatives
for 16-bit encodings.

09-mov-patterns.patch
Description: Binary data

[PATCH 2/2] Fix HLE example in manual

2013-06-20 Thread Andi Kleen

From: Andi Kleen 

The HLE example in the manual only commits when using bool
for the flag, because __atomic_clear only writes bool, and
HLE requires the acquire and release to match.

So when the example is copied with e.g. an int variable it
does not commit and causes slower than expected performance.

Some people are running into problems because of this.

Switch it over to use __atomic_store.

Also fix a minor typo nearby.

gcc/:
2013-06-13  Andi Kleen  

* doc/extend.texi: Dont use __atomic_clear in HLE
example.  Fix typo.
---
 gcc/doc/extend.texi | 6 --
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/gcc/doc/extend.texi b/gcc/doc/extend.texi
index aa3abef..b6f786d 100644
--- a/gcc/doc/extend.texi
+++ b/gcc/doc/extend.texi
@@ -7524,18 +7524,20 @@ End lock elision on a lock variable.
 Memory model must be @code{__ATOMIC_RELEASE} or stronger.
 @end table
 
-When a lock acquire fails it's required for good performance to abort
+When a lock acquire fails it is required for good performance to abort
 the transaction quickly. This can be done with a @code{_mm_pause}
 
 @smallexample
 #include  // For _mm_pause
 
+int lockvar;
+
 /* Acquire lock with lock elision */
 while (__atomic_exchange_n(&lockvar, 1, __ATOMIC_ACQUIRE|__ATOMIC_HLE_ACQUIRE))
 _mm_pause(); /* Abort failed transaction */
 ...
 /* Free lock with lock elision */
-__atomic_clear(&lockvar, __ATOMIC_RELEASE|__ATOMIC_HLE_RELEASE);
+__atomic_store(&lockvar, 0, __ATOMIC_RELEASE|__ATOMIC_HLE_RELEASE);
 @end smallexample
 
 @node Object Size Checking
-- 
1.8.3

[PATCH 1/2] Improve __atomic_clear/test_and_set documentation

2013-06-20 Thread Andi Kleen

From: Andi Kleen 

Document that __atomic_clear and __atomic_test_and_set should
only be used with bool.

gcc/:
2013-06-13  Andi Kleen  

* doc/extend.texi: Document that __atomic_clear and
  __atomic_test_and_set should only be used with bool.
---
 gcc/doc/extend.texi | 6 ++
 1 file changed, 6 insertions(+)

diff --git a/gcc/doc/extend.texi b/gcc/doc/extend.texi
index 1e1f8b3..aa3abef 100644
--- a/gcc/doc/extend.texi
+++ b/gcc/doc/extend.texi
@@ -7438,6 +7438,8 @@ This built-in function performs an atomic test-and-set 
operation on
 the byte at @code{*@var{ptr}}.  The byte is set to some implementation
 defined nonzero ``set'' value and the return value is @code{true} if and only
 if the previous contents were ``set''.
+It should be only used for operands of type bool or atomic_flag. For 
+other types only part of the value may be set.
 
 All memory models are valid.
 
@@ -7447,6 +7449,10 @@ All memory models are valid.
 
 This built-in function performs an atomic clear operation on
 @code{*@var{ptr}}.  After the operation, @code{*@var{ptr}} contains 0.
+It should be only used for operands of type bool or atomic_flag and 
+in conjunction with __atomic_test_and_set.
+For other types it may only clear partially. If the type is not bool
+prefer using @code{__atomic_store}.
 
 The valid memory model variants are
 @code{__ATOMIC_RELAXED}, @code{__ATOMIC_SEQ_CST}, and
-- 
1.8.3

Re: [c++-concepts] code review

2013-06-20 Thread Andrew Sutton

That works. I think the current patch addresses all of Jason's comments.

I'll also create a github version of this branch, so can avoid email patches.

On Thu, Jun 20, 2013 at 8:09 AM, Gabriel Dos Reis  wrote:
> Jason Merrill  writes:
>
> | On 06/20/2013 01:30 AM, Gabriel Dos Reis wrote:
> | > As I discussed
> | > with Andrew a couple of weeks ago, I have been holding back the
> | > merge from trunk because he has these patch series in the queue.
> |
> | Incidentally, since the code is going onto a branch, we don't really
> | need to delay checkins based on code review; I'd actually rather
> | review the code from within git than from an email patch.
>
> Makes sense.  Andrew, you can check in what you have now, and refine
> based on all other comments.  I won't have network connection until
> tonight; at that point I will do the merge from trunk.  We don't want to
> have the branch too old compared to the trunk.
>
> -- Gaby



-- 
Andrew Sutton
andrew.n.sut...@gmail.com

Re: [PATCH, x86] Use vector moves in memmove expanding

2013-06-20 Thread Michael Zolotukhin

Hi,
I added two tests to verify we generate vector instructions when
vector_loop is used.
Is the patch ok with that change?

Thanks, Michael

On 5 June 2013 18:10, Michael Zolotukhin  wrote:
> I'll prepare some tests shortly,
> What about the rest questions?
>
> Thanks, Michael
>
> On 15 May 2013 19:45, H.J. Lu  wrote:
>> On Wed, May 15, 2013 at 5:47 AM, Michael Zolotukhin
>>  wrote:
>>> Hi HJ,
 You use Pmode as the largest integer mode.  Is word_mode better
 than Pmode since word_mode is DI and Pmode may be SI for x32.
>>> I updated the patch to use word_mode instead of Pmode.  Bootstrap,
>>> make check and Specs2000 are ok.
>>>
>>> Thanks, Michael
>>>
>>> On 14 May 2013 19:55, H.J. Lu  wrote:
 On Tue, May 14, 2013 at 7:34 AM, Michael Zolotukhin
  wrote:
> Hi,
> I attached an updated version of the patch.  Looks like the 64-bit issue 
> is
> resolved in it (now a vector mode is explicitly chosen instead of TI- or
> another integer mode).  Also, some of remarks are fixed in it - some
> others are still not changed, because I don't know what to do with them 
> right
> now (see my questions in the previous letter).
>
> Could you take a look at the patch?
>
> I checked it on i386/x86_64 bootstrap and make check, and on 
> Specs2000/2006
> (I checked only stability).
>

 You use Pmode as the largest integer mode.  Is word_mode better
 than Pmode since word_mode is DI and Pmode may be SI for x32.


>>
>> You should include tests to verify that it works as
>> expected.
>>
>>
>> --
>> H.J.
>
>
>
> --
> ---
> Best regards,
> Michael V. Zolotukhin,
> Software Engineer
> Intel Corporation.



-- 
---
Best regards,
Michael V. Zolotukhin,
Software Engineer
Intel Corporation.


memmov-4.patch
Description: Binary data

Re: [c++-concepts] code review

2013-06-20 Thread Gabriel Dos Reis

Jason Merrill  writes:

| On 06/20/2013 01:30 AM, Gabriel Dos Reis wrote:
| > As I discussed
| > with Andrew a couple of weeks ago, I have been holding back the
| > merge from trunk because he has these patch series in the queue.
| 
| Incidentally, since the code is going onto a branch, we don't really
| need to delay checkins based on code review; I'd actually rather
| review the code from within git than from an email patch.

Makes sense.  Andrew, you can check in what you have now, and refine
based on all other comments.  I won't have network connection until
tonight; at that point I will do the merge from trunk.  We don't want to
have the branch too old compared to the trunk.

-- Gaby

Re: [c++-concepts] code review

2013-06-20 Thread Jason Merrill


On 06/20/2013 01:30 AM, Gabriel Dos Reis wrote:

As I discussed
with Andrew a couple of weeks ago, I have been holding back the
merge from trunk because he has these patch series in the queue.


Incidentally, since the code is going onto a branch, we don't really 
need to delay checkins based on code review; I'd actually rather review 
the code from within git than from an email patch.


Jason

Re: [PATCH] Improve folding of bitwise ops feeding conditionals for single bit types

2013-06-20 Thread Jeff Law


On 06/20/2013 04:49 AM, Andreas Schwab wrote:

Jeff Law  writes:


+/* { dg-final { scan-tree-dump-times "Replaced" 8 "forwprop1"} } */


$ grep -c Replaced forwprop-28.c.022t.forwprop1
16

;; Function test (test, funcdef_no=0, decl_uid=1388, symbol_order=0)

   Replaced 'rotate_7 == 0' with '_6 == 0'
   Replaced '_6 == 0' with 'code_5(D) != 22'

Target?

Give me enough information and I'll gladly take a look.  Give me junk 
and I'll ignore.


jeff

Re: [PATCH, trunk, PR57358] Avoid IPA-CP analysis if attribute optimize precludes it

2013-06-20 Thread Martin Jambor

Hi,

On Thu, Jun 20, 2013 at 01:32:38PM +0200, Jan Hubicka wrote:
> > 
> > 2013-06-11  Martin Jambor  
> > 
> > PR tree-optimization/57358
> > * ipa-prop.c (ipa_func_spec_opts_forbid_analysis_p): New function.
> > (ipa_compute_jump_functions_for_edge): Bail out if it returns true.
> > (ipa_analyze_params_uses): Generate pessimistic info when true.
> > 
> > testsuite
> > * gcc.dg/ipa/pr57358.c: New test.
> > 
> > Index: src/gcc/ipa-prop.c
> > ===
> > --- src.orig/gcc/ipa-prop.c
> > +++ src/gcc/ipa-prop.c
> > @@ -78,6 +78,21 @@ struct ipa_cst_ref_desc
> >  
> >  static alloc_pool ipa_refdesc_pool;
> >  
> > +/* Return true if DECL_FUNCTION_SPECIFIC_OPTIMIZATION of the decl 
> > associated
> > +   with NODE should prevent us from analyzing it for the purposes of 
> > IPA-CP.  */
> > +
> > +static bool
> > +ipa_func_spec_opts_forbid_analysis_p (struct cgraph_node *node)
> > +{
> > +  tree fs_opts = DECL_FUNCTION_SPECIFIC_OPTIMIZATION (node->symbol.decl);
> > +  struct cl_optimization *os;
> > +
> > +  if (!fs_opts)
> > +return false;
> > +  os = TREE_OPTIMIZATION (fs_opts);
> > +  return !os->x_optimize || !os->x_flag_ipa_cp;
> > +}
> 
> Hmm, somewhat ugly, but I suppose it is as ugly as it needs to be.
> I wonder about the other IPA passes. Do we want to always ignore them?

Well, I suppose that if we wanted to truly support the optimize
attribute in its full strength and generality (or at least make an
effort to do so), all IPA passes would need to check for their flag(s)
their.  In fact, this should be even more ugly if done really
properly.  -fno-indirect-inlinig should be checked in
ipa_analyze_call_uses, for example, -fno-devirtualize at various other
places...  And of course, we should probably never just bail out but
mark the flags in the IPA structures instead because they are used by
both IPA-CP and inlining and perhaps disabling the features in one
should not disable it another...

...so I just decided this has never worked and I was not going to
spend too much time fixing it.  I hesitated about the
os->x_flag_ipa_cp test, but eventually thought it might be useful for
us when debugging so I kept it.  Still, publicly I'd continue saying
that IPA passes cannot be adjusted by the optimize attribute.

> How the ICE happens? Is it because we don't do the analysis?

No, we do, and we ask alias oracle about stuff even though there are
no VDEFs and VUSEs which segfaults.  IPA passes just have to be
careful not to ICE on -O0 functions.

Thanks,

Martin

Re: [patch] set MULTIARCH_DIRNAME for multilib architectures

2013-06-20 Thread Matthias Klose

Am 13.06.2013 11:42, schrieb Richard Sandiford:
> "Bernhard Reutner-Fischer"  writes:
>> On 12 June 2013 20:20:50 Richard Sandiford  
>> wrote:
>>> Matthias Klose  writes:
 Index: config/mips/t-linux64
 ===
 --- config/mips/t-linux64  (revision 200012)
 +++ config/mips/t-linux64  (working copy)
 @@ -24,3 +24,13 @@
../lib32$(call 
>>> if_multiarch,:mips64$(MIPS_EL)-linux-gnuabin32$(MIPS_SOFT)) \
../lib$(call if_multiarch,:mips$(MIPS_EL)-linux-gnu$(MIPS_SOFT)) \
../lib64$(call
 if_multiarch,:mips64$(MIPS_EL)-linux-gnuabi64$(MIPS_SOFT))
 +
 +ifneq (,$(findstring abin32,$(target)))
 +MULTIARCH_DIRNAME = $(call 
>>> if_multiarch,mips64$(MIPS_EL)-linux-gnuabin32$(MIPS_SOFT))
 +else
 +ifneq (,$(findstring abi64,$(target)))
 +MULTIARCH_DIRNAME = $(call 
>>> if_multiarch,mips64$(MIPS_EL)-linux-gnuabi64$(MIPS_SOFT))
 +else
 +MULTIARCH_DIRNAME = $(call 
>>> if_multiarch,mips$(MIPS_EL)-linux-gnu$(MIPS_SOFT))
 +endif
 +endif
>>>
>>> findstring seems a bit fragile for a full triple.  I think it would
>>> be better to have something similar to the current MIPS_SOFT definition:
>>>
>>> MIPS_SOFT = $(if $(strip $(filter MASK_SOFT_FLOAT_ABI, 
>>> $(target_cpu_default)) $(filter soft, $(with_float))),soft)
>>>
>>> but for ABIs.  It could then also take with_abi into account.
>>> Maybe something like:
>>>
>>> MIPS_ABI = $(or $(with_abi), \
>>> $(if $(filter MIPS_ABI_DEFAULT=ABI_N32, \
>>>   $(target_cpu_default)), n32), \
>>> o32)
>>>
>>> (completely untested).
>>
>> Bikeshedding:
>> Doko would know, but ISTR that $(or) did not exist in make-3.80 which is 
>> currently the minimum prerequisite, fwiw.
> 
> Gah, that's a pity.  Thanks for the catch though.  Maybe firstword
> would be OK instead.
> 
> I see I also fell into the usual ABI trap.  It wouldn't have affected
> the use in this patch, but the default ought to be "32" rather than "o32".

the define is in tm_defines, not target_cpu_default.

MIPS_ABI = $(or $(with_abi), \
$(if $(filter MIPS_ABI_DEFAULT=ABI_N32, \
  $(tm_defines)), n32), \
 32)

However I can't see yet how this should be used. Maybe Aurelian could come up
with a tested version of this patch?

  Matthias

Re: [PATCH, trunk, PR57358] Avoid IPA-CP analysis if attribute optimize precludes it

2013-06-20 Thread Jan Hubicka

> 
> 2013-06-11  Martin Jambor  
> 
>   PR tree-optimization/57358
>   * ipa-prop.c (ipa_func_spec_opts_forbid_analysis_p): New function.
>   (ipa_compute_jump_functions_for_edge): Bail out if it returns true.
>   (ipa_analyze_params_uses): Generate pessimistic info when true.
> 
> testsuite
>   * gcc.dg/ipa/pr57358.c: New test.
> 
> Index: src/gcc/ipa-prop.c
> ===
> --- src.orig/gcc/ipa-prop.c
> +++ src/gcc/ipa-prop.c
> @@ -78,6 +78,21 @@ struct ipa_cst_ref_desc
>  
>  static alloc_pool ipa_refdesc_pool;
>  
> +/* Return true if DECL_FUNCTION_SPECIFIC_OPTIMIZATION of the decl associated
> +   with NODE should prevent us from analyzing it for the purposes of IPA-CP. 
>  */
> +
> +static bool
> +ipa_func_spec_opts_forbid_analysis_p (struct cgraph_node *node)
> +{
> +  tree fs_opts = DECL_FUNCTION_SPECIFIC_OPTIMIZATION (node->symbol.decl);
> +  struct cl_optimization *os;
> +
> +  if (!fs_opts)
> +return false;
> +  os = TREE_OPTIMIZATION (fs_opts);
> +  return !os->x_optimize || !os->x_flag_ipa_cp;
> +}

Hmm, somewhat ugly, but I suppose it is as ugly as it needs to be.
I wonder about the other IPA passes. Do we want to always ignore them?
How the ICE happens? Is it because we don't do the analysis?

Honza

[testsuite, android] Disabling thread_local4.C and thread_local4g.C for Android.

2013-06-20 Thread Alexander Ivchenko

Hi,

for Android:

FAIL: g++.dg/tls/thread_local4.C -std=gnu++11 (test for excess errors)
Excess errors:
<>/gcc/testsuite/g++.dg/tls/thread_local4.C:31:26: error:
'pthread_testcancel' was not declared in this scope
<>/gcc/testsuite/g++.dg/tls/thread_local4.C:40:24: error:
'pthread_cancel' was not declared in this scope

the same for thread_local4g.C.

from http://www.kandroid.org/ndk/docs/system/libc/OVERVIEW.html
"pthread_cancel() will not be supported in Bionic, because doing this
would involve making the C library significantly bigger for very
little benefit.
<...>"

The following patch switches those tests off for Android.

diff --git a/gcc/testsuite/ChangeLog b/gcc/testsuite/ChangeLog
index b8073d1..d217e4d 100644
--- a/gcc/testsuite/ChangeLog
+++ b/gcc/testsuite/ChangeLog
@@ -1,3 +1,8 @@
+2013-06-19  Alexander Ivchenko  
+
+   * g++.dg/tls/thread_local4.C: Disable test for Android.
+   * g++.dg/tls/thread_local4g.C: Ditto.
+
 2013-06-11  Tobias Burnus  

PR fortran/57508
diff --git a/gcc/testsuite/g++.dg/tls/thread_local4.C
b/gcc/testsuite/g++.dg/tls/thread_local4.C
index acf1cae..cc1a35a 100644
--- a/gcc/testsuite/g++.dg/tls/thread_local4.C
+++ b/gcc/testsuite/g++.dg/tls/thread_local4.C
@@ -1,6 +1,7 @@
 // Test for cleanups with pthread_cancel.

-// { dg-do run }
+// There is no support of pthread_cancel in Android.
+// { dg-do run { target { ! *-*-android* } } }
 // { dg-require-effective-target c++11 }
 // { dg-require-effective-target tls_runtime }
 // { dg-require-effective-target pthread }
diff --git a/gcc/testsuite/g++.dg/tls/thread_local4g.C
b/gcc/testsuite/g++.dg/tls/thread_local4g.C
index f5bc3ff..756c6a2 100644
--- a/gcc/testsuite/g++.dg/tls/thread_local4g.C
+++ b/gcc/testsuite/g++.dg/tls/thread_local4g.C
@@ -1,6 +1,7 @@
 // Test for cleanups with pthread_cancel.

-// { dg-do run }
+// There is no support of pthread_cancel in Android.
+// { dg-do run { target { ! *-*-android* } } }
 // { dg-require-effective-target c++11 }
 // { dg-require-effective-target tls_runtime }
 // { dg-require-effective-target pthread }


is it ok for trunk?


--Alexander

[PATCH] Speedup streamer_read_uhwi

2013-06-20 Thread Richard Biener


This speeds up streamer_read_uhwi (top in mozilla LTO profile) by
delaying the section overrun check and inlining streamer_read_uchar
manually, performing CSE and optimizing the 1-byte case.

LTO bootstrapped on x86_64-unknown-linux-gnu, applied.

Richard.

2013-06-20  Richard Biener  

* data-streamer-in.c (streamer_read_uhwi): Optimize single
byte case, inline streamer_read_uchar and defer section
overrun check.

Index: gcc/data-streamer-in.c
===
*** gcc/data-streamer-in.c  (revision 200189)
--- gcc/data-streamer-in.c  (working copy)
*** bp_unpack_string (struct data_in *data_i
*** 120,137 
  unsigned HOST_WIDE_INT
  streamer_read_uhwi (struct lto_input_block *ib)
  {
!   unsigned HOST_WIDE_INT result = 0;
!   int shift = 0;
unsigned HOST_WIDE_INT byte;
  
!   while (true)
  {
!   byte = streamer_read_uchar (ib);
!   result |= (byte & 0x7f) << shift;
!   shift += 7;
!   if ((byte & 0x80) == 0)
!   return result;
  }
  }
  
  
--- 120,152 
  unsigned HOST_WIDE_INT
  streamer_read_uhwi (struct lto_input_block *ib)
  {
!   unsigned HOST_WIDE_INT result;
!   int shift;
unsigned HOST_WIDE_INT byte;
+   unsigned int p = ib->p;
+   unsigned int len = ib->len;
  
!   const char *data = ib->data;
!   result = data[p++];
!   if ((result & 0x80) != 0)
  {
!   result &= 0x7f;
!   shift = 7;
!   do
!   {
! byte = data[p++];
! result |= (byte & 0x7f) << shift;
! shift += 7;
!   }
!   while ((byte & 0x80) != 0);
  }
+ 
+   /* We check for section overrun after the fact for performance reason.  */
+   if (p > len)
+ lto_section_overrun (ib);
+ 
+   ib->p = p;
+   return result;
  }

Re: [PATCH, 4.8, PR57358] Check if optimizing in parm_ref_data_preserved_p

2013-06-20 Thread Jan Hubicka

> Hi,
> 
> this is the simplest fix for the PR which happens because there is no
> VDEF on a stmt if a particular function is not optimized.  I'd like to
> fix the bug with it on the branch.  Bootstrapped and tested on
> x86_64-linux.  OK?
> 
> Thanks,
> 
> Martin
> 
> 
> 2013-06-11  Martin Jambor  
> 
>   PR tree-optimization/57358
>   * ipa-prop.c (parm_ref_data_preserved_p): Always return true when
>   not optimizing.

Seems to make sense,
OK,
Honza

Re: [PATCH, PR 57539] Fix refdesc remapping during inlining

2013-06-20 Thread Jan Hubicka

> Hi,
> 
> PR 57539 revealed two problems with remapping reference descriptors
> during cloning of trees of inlined call graph nodes.  First, when
> indirect inlining is involved, we happily remove the reference
> descriptor itself by calling ipa_free_edge_args_substructures in
> ipa_propagate_indirect_call_infos.  Second, the current remapping code
> does not work because the global.inlined_to field of the destination
> caller is not yet set.
> 
> The patch below fixes the first problem by not calling the freeing
> function and the second one by making cgraph_clone_node set the
> required field prior to calling any duplication hooks (which is the
> only place where we have the pair of corresponding source and
> destination edge at our disposal so the duplication/remapping has to
> happen there).  I have also shortened the lists of corresponding
> references and cleared up the search loop a little.
> 
> I'll post a testcase in a separate patch.  This one bootstrapped and
> passed testsuite on x86_64-linux without any issues.  OK for trunk?
> 
> Thanks,
> 
> Martin
> 
> 
> 2013-06-10  Martin Jambor  
> 
>   PR tree-optimization/57539
>   * cgraphclones.c (cgraph_clone_node): Add parameter new_inlined_to, set
>   global.inlined_to of the new node to it.  All callers changed.
>   * ipa-inline-transform.c (clone_inlined_nodes): New variable
>   inlining_into, pass it to cgraph_clone_node.
>   * ipa-prop.c (ipa_propagate_indirect_call_infos): Do not call
>   ipa_free_edge_args_substructures.
>   (ipa_edge_duplication_hook): Only add edges from inlined nodes to
>   rdesc linked list.  Do not assert rdesc edges have inlined caller.
>   Assert we have found an rdesc in the rdesc list.

OK,
thanks
Honza
> 
> Index: src/gcc/cgraph.h
> ===
> --- src.orig/gcc/cgraph.h
> +++ src/gcc/cgraph.h
> @@ -707,7 +707,7 @@ struct cgraph_edge * cgraph_clone_edge (
>   unsigned, gcov_type, int, bool);
>  struct cgraph_node * cgraph_clone_node (struct cgraph_node *, tree, 
> gcov_type,
>   int, bool, vec,
> - bool);
> + bool, struct cgraph_node *);
>  tree clone_function_name (tree decl, const char *);
>  struct cgraph_node * cgraph_create_virtual_clone (struct cgraph_node 
> *old_node,
> vec,
> Index: src/gcc/cgraphclones.c
> ===
> --- src.orig/gcc/cgraphclones.c
> +++ src/gcc/cgraphclones.c
> @@ -167,13 +167,19 @@ cgraph_clone_edge (struct cgraph_edge *e
> function's profile to reflect the fact that part of execution is handled
> by node.  
> When CALL_DUPLICATOIN_HOOK is true, the ipa passes are acknowledged about
> -   the new clone. Otherwise the caller is responsible for doing so later.  */
> +   the new clone. Otherwise the caller is responsible for doing so later.
> +
> +   If the new node is being inlined into another one, NEW_INLINED_TO should 
> be
> +   the outline function the new one is (even indirectly) inlined to.  All 
> hooks
> +   will see this in node's global.inlined_to, when invoked.  Can be NULL if 
> the
> +   node is not inlined.  */
>  
>  struct cgraph_node *
>  cgraph_clone_node (struct cgraph_node *n, tree decl, gcov_type count, int 
> freq,
>  bool update_original,
>  vec redirect_callers,
> -bool call_duplication_hook)
> +bool call_duplication_hook,
> +struct cgraph_node *new_inlined_to)
>  {
>struct cgraph_node *new_node = cgraph_create_empty_node ();
>struct cgraph_edge *e;
> @@ -195,6 +201,7 @@ cgraph_clone_node (struct cgraph_node *n
>new_node->symbol.externally_visible = false;
>new_node->local.local = true;
>new_node->global = n->global;
> +  new_node->global.inlined_to = new_inlined_to;
>new_node->rtl = n->rtl;
>new_node->count = count;
>new_node->frequency = n->frequency;
> @@ -307,7 +314,7 @@ cgraph_create_virtual_clone (struct cgra
>  
>new_node = cgraph_clone_node (old_node, new_decl, old_node->count,
>   CGRAPH_FREQ_BASE, false,
> - redirect_callers, false);
> + redirect_callers, false, NULL);
>/* Update the properties.
>   Make clone visible only within this translation unit.  Make sure
>   that is not weak also.
> Index: src/gcc/ipa-inline-transform.c
> ===
> --- src.orig/gcc/ipa-inline-transform.c
> +++ src/gcc/ipa-inline-transform.c
> @@ -132,6 +132,13 @@ void
>  clone_inlined_nodes (struct cgraph_edge *e, bool duplicate,
>bool update_original, int *overall_size)
>  {
> +  struct cgraph_node *inlining_into;
> +
> +  if (e->caller-

Re: [PATCH] Improve folding of bitwise ops feeding conditionals for single bit types

2013-06-20 Thread Andreas Schwab

Jeff Law  writes:

> +/* { dg-final { scan-tree-dump-times "Replaced" 8 "forwprop1"} } */

$ grep -c Replaced forwprop-28.c.022t.forwprop1
16

;; Function test (test, funcdef_no=0, decl_uid=1388, symbol_order=0)

  Replaced 'rotate_7 == 0' with '_6 == 0'
  Replaced '_6 == 0' with 'code_5(D) != 22'

Andreas.

-- 
Andreas Schwab, SUSE Labs, sch...@suse.de
GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE  1748 E4D4 88E3 0EEA B9D7
"And now for something completely different."

Re: [PATCH, ARM] Reintroduce minipool ranges for zero-extension insn patterns

2013-06-20 Thread Julian Brown

On Tue, 18 Jun 2013 17:10:37 +0100
Richard Earnshaw  wrote:

> On 18/06/13 16:42, Julian Brown wrote:
> > Hi,
> >
> > The following patch removed pool_range/neg_pool_range attributes
> > from several instructions as a cleanup, which I believe to have been
> > incorrect:
> >
> > http://gcc.gnu.org/ml/gcc-patches/2010-07/msg01036.html
> >
> > On a Mentor-local branch, this caused problems with instructions
> > like:
> >
> > (insn 77 53 87 (set (reg:SI 8 r8 [orig:197 s.4 ] [197])
> >  (zero_extend:SI (mem/u/c:HI (symbol_ref/u:SI ("*.LC0")
> > [flags 0x2]) [7 S2 A16]))) [...] 161 {*arm_zero_extendhisi2_v6}
> > (nil))
> >
> > The reasoning behind the cleanup was that the instructions in
> > question have no immediate constraints -- but the minipool code is
> > used for more than just immediates, e.g. in the above case where a
> > symbol reference ("m") is loaded.
> >
> > I don't have a test case for the problem on mainline at present,
> > but I believe it is still a latent bug. Tested with the default
> > multilibs (ARM & Thumb mode) on arm-none-eabi, with no regressions.
> > (The patch has also been tested with more multilibs on our local
> > branches for a while, and I did ensure previously that it did not
> > adversely affect Bernd's patch linked above.)
> >
> > OK to apply?
> >
> 
> Pushing extending loads (particularly sign-extending loads) into the 
> minipools adversely affects freedom of pool placement (since the
> offset ranges are limited).
> 
> And they shouldn't be happening anyway.  I really don't think this is 
> the right solution to the problem you have.

How about this as a fix instead? IIUC, all the instances of this bug
that have been reported relate to insns following the pattern:

(set (reg) (sign/zero_extend:SI (symbol_ref:QI/HI)))

where the symbol_ref is a match_operand with an "m" constraint. These
kinds of addresses are explicitly permitted by
arm_legitimate_address_outer_p and thumb2_legitimate_address_p by the
last clause in the functions, "if (GET_MODE_CLASS (mode) != MODE_FLOAT
&& ...", etc.

So, I think the obvious thing to do is to forbid sub-SImode-sized
quantities in that clause. Perhaps tellingly, the thumb1 version of the
function already forbids sizes other than 4 for the mode size in the
equivalent clause. For non-Thumb1, we probably want to retain DImode
(etc.) minipool entries.

The change to generated code for a re-reduced version of the testcase
from:

https://bugzilla.redhat.com/show_bug.cgi?id=927565

(attached), for an insn which breaks before the attached patch is
applied, 210r.reload:

before:

(insn 7 30 33 2 (set (reg:SI 12 ip [orig:110 D.4194 ] [110])
(zero_extend:SI (mem/u/c:QI (symbol_ref/u:SI ("*.LC0") [flags 0x2]) [0 
S1 A8]))) minipool.c:11 182 {*arm_zero_extendqisi2_v6}
 (nil))

after (with a reload):

(insn 221 30 7 2 (set (reg:SI 0 r0)
(symbol_ref/u:SI ("*.LC0") [flags 0x2])) minipool.c:11 197 
{*arm_movsi_insn}
 (nil))
(insn 7 221 33 2 (set (reg:SI 12 ip [orig:110 D.4194 ] [110])
(zero_extend:SI (mem/u/c:QI (reg:SI 0 r0) [0 S1 A8]))) minipool.c:11 
182 {*arm_zero_extendqisi2_v6}
 (nil))

(Compiled with -march=armv7-a -O2.)

Tested cross to arm-none-eabi, default & mthumb multilibs. I also
removed some additional pool_range/neg_pool_ranges from similar
extension instructions, for consistency.

OK to apply?

Thanks,

Julian

ChangeLog

gcc/
* config/arm/arm.c (arm_legitimate_address_outer_p)
(thumb2_legitimate_address_p): Don't allow symbol refs with mode
size smaller than a word.
* config/arm/arm.md (thumb1_zero_extendqisi2, *arm_extendhisi2)
(*arm_extendhisi2_v6, *arm_extendqihi_insn, *arm_extendqisi)
(*arm_extendqisi_v6): Remove pool_range/neg_pool_range attributes.
Index: gcc/config/arm/arm.c
===
--- gcc/config/arm/arm.c	(revision 200204)
+++ gcc/config/arm/arm.c	(working copy)
@@ -5947,6 +5947,7 @@ arm_legitimate_address_outer_p (enum mac
 #endif
 
   else if (GET_MODE_CLASS (mode) != MODE_FLOAT
+	   && GET_MODE_SIZE (mode) >= UNITS_PER_WORD
 	   && code == SYMBOL_REF
 	   && CONSTANT_POOL_ADDRESS_P (x)
 	   && ! (flag_pic
@@ -6022,6 +6023,7 @@ thumb2_legitimate_address_p (enum machin
 }
 
   else if (GET_MODE_CLASS (mode) != MODE_FLOAT
+	   && GET_MODE_SIZE (mode) >= UNITS_PER_WORD
 	   && code == SYMBOL_REF
 	   && CONSTANT_POOL_ADDRESS_P (x)
 	   && ! (flag_pic
Index: gcc/config/arm/arm.md
===
--- gcc/config/arm/arm.md	(revision 200204)
+++ gcc/config/arm/arm.md	(working copy)
@@ -5432,8 +5432,7 @@
#
ldrb\\t%0, %1"
   [(set_attr "length" "4,2")
-   (set_attr "type" "alu_shift,load_byte")
-   (set_attr "pool_range" "*,32")]
+   (set_attr "type" "alu_shift,load_byte")]
 )
 
 (define_insn "*thumb1_zero_extendqisi2_v6"
@@ -5700,9 +5699,7 @@
ldr%(sh%)\\t%0, %1"
   [(set_attr "length" "8,4")
(set_attr "type" "alu_shift,load_byte")
-   (s

Re: [Patch, Fortran] PR57633 - Fix EOL handling with \r in list-directed I/O

2013-06-20 Thread Janne Blomqvist

On Thu, Jun 20, 2013 at 12:11 PM, Tobias Burnus  wrote:
> gfortran failed to correctly read the file
>line1,1,
>line2
> with DOS (\r\n) line endings. As the code already set EOL for "\r",
> finish_list_read didn't call eat_line. Result: The attempt to read "line2"
> actually accessed the last byte of line one, namely "\n", which it regarded
> as zero-sized string.
>
> That's fixed by the patch to next_char.
>
> eat_separator is a separate issue and unrelated to the PR. I also failed to
> create a test case for it. In any case, I regard the following as wrong:
>
>  case '\r':
>   dtp->u.p.at_eol = 1;
> ...
>if (n != '\n')
> {
>   unget_char (dtp, n);
>   break;
>
> As the code explicitly does not regard "\r" as EOL in this case, I believe
> EOL shouldn't be set here.
>
> (Recall, Unix (MacOS X, Linux, ...) have '\n' while DOS/Windows has "\r\n".
> While '\r' as line break exists (old Macs, pre MacOS X), gfortran does not
> support formatted I/O with "\r" record markers.)
>
>
> Build and regtested on x86-64-gnu-linux.
> OK for the trunk?
>
> Tobias

Ok, thanks.


--
Janne Blomqvist

[PATCH] Fix PR57584

2013-06-20 Thread Richard Biener


We have to be careful to not end up inserting expressions with
abnormal SSA names.  Thus the following makes niter analysis
more careful what it puts into number-of-iteration expressions.

Bootstrapped and tested on x86_64-unknown-linux-gnu, applied to trunk.

Richard.

2013-06-20  Richard Biener  

PR tree-optimization/57584
* tree-ssa-loop-niter.c (expand_simple_operations): Avoid including
SSA names into the expanded expression that take part in
abnormal coalescing.

* gcc.dg/torture/pr57584.c: New testcase.

Index: gcc/tree-ssa-loop-niter.c
===
*** gcc/tree-ssa-loop-niter.c   (revision 200189)
--- gcc/tree-ssa-loop-niter.c   (working copy)
*** expand_simple_operations (tree expr)
*** 1514,1519 
--- 1514,1526 
if (gimple_code (stmt) != GIMPLE_ASSIGN)
  return expr;
  
+   /* Avoid expanding to expressions that contain SSA names that need
+  to take part in abnormal coalescing.  */
+   ssa_op_iter iter;
+   FOR_EACH_SSA_TREE_OPERAND (e, stmt, iter, SSA_OP_USE)
+ if (SSA_NAME_OCCURS_IN_ABNORMAL_PHI (e))
+   return expr;
+ 
e = gimple_assign_rhs1 (stmt);
code = gimple_assign_rhs_code (stmt);
if (get_gimple_rhs_class (code) == GIMPLE_SINGLE_RHS)
Index: gcc/testsuite/gcc.dg/torture/pr57584.c
===
*** gcc/testsuite/gcc.dg/torture/pr57584.c  (revision 0)
--- gcc/testsuite/gcc.dg/torture/pr57584.c  (working copy)
***
*** 0 
--- 1,73 
+ /* { dg-do compile } */
+ 
+ typedef int int32_t __attribute__ ((__mode__ (__SI__)));
+ typedef unsigned char uint8_t;
+ typedef unsigned long int uintptr_t;
+ typedef uint8_t scm_t_uint8;
+ typedef int32_t scm_t_int32;
+ typedef uintptr_t scm_t_uintptr;
+ typedef scm_t_uintptr scm_t_bits;
+ typedef struct scm_unused_struct {
+ } *SCM;
+ enum scm_tc8_tags {
+ scm_tc8_flag = 4 + 0x00,   scm_tc8_char = 4 + 0x08,   scm_tc8_unused_0 = 
4 + 0x10,   scm_tc8_unused_1 = 4 + 0x18 };
+ struct __jmp_buf_tag   {
+ };
+ typedef struct __jmp_buf_tag jmp_buf[1];
+ typedef struct scm_t_cell {
+ } scm_t_cell;
+ struct scm_prompt_registers {
+ jmp_buf regs;
+ };
+ enum {
+ SCM_VM_APPLY_HOOK,   SCM_VM_PUSH_CONTINUATION_HOOK,   
SCM_VM_POP_CONTINUATION_HOOK,   SCM_VM_NEXT_HOOK,   
SCM_VM_ABORT_CONTINUATION_HOOK,   SCM_VM_RESTORE_CONTINUATION_HOOK,   
SCM_VM_NUM_HOOKS, };
+ typedef SCM (*scm_t_vm_engine) (SCM vm, SCM program, SCM *argv, int nargs);
+ struct scm_vm {
+ scm_t_uint8 *ip;
+ SCM *sp;
+ SCM *fp;
+ int engine;
+ int trace_level;
+ };
+ static SCM vm_regular_engine (SCM vm, SCM program, SCM *argv, int nargs) {
+ }
+ static SCM vm_debug_engine (SCM vm, SCM program, SCM *argv, int nargs) {
+ register scm_t_uint8 *ip ;
+ register SCM *sp ;
+ register SCM *fp ;
+ struct scm_vm *vp = ((struct scm_vm *) scm_t_bits) (0? 
(*(SCM*)0=SCM *)((scm_t_cell *) (((scm_t_bits) (0? (*(SCM*)0=vm): 
(((vm [((1))]))): (((SCM *)((scm_t_cell *) (((scm_t_bits) (0? 
(*(SCM*)0=vm): (((vm [((1))]));
+ static const void **jump_table_pointer = ((void *)0);
+ register const void **jump_table asm ("r12");
+ if (__builtin_expect ((!jump_table_pointer), 0)) {
+   jump_table_pointer[0] = &&l_nop;
+ }
+ l_nop:
+   {
+   SCM *old_sp;
+   scm_t_int32 n;
+   old_sp = sp;
+   sp = (fp - 1) + n;
+   if (old_sp < sp) {
+   while (old_sp < sp) *++old_sp = ((SCM) ((9)) << 8) + 
scm_tc8_flag;
+   }
+ {
+ { if (__builtin_expect ((vp->trace_level > 0), 0)) { { vp->ip = 
ip; vp->sp = sp; vp->fp = fp; }; vm_dispatch_hook (vm, SCM_VM_NEXT_HOOK); } };
+ };
+   }
+   {
+   SCM k, prompt;
+   if ((_setjmp (((struct scm_prompt_registers*)scm_t_bits) (0? 
(*(SCM*)0=SCM *)((scm_t_cell *) (((scm_t_bits) (0? 
(*(SCM*)0=prompt): (((prompt [((2))]))): (((SCM *)((scm_t_cell 
*) (((scm_t_bits) (0? (*(SCM*)0=prompt): (((prompt 
[((2))]))->regs))) {
+ { ip = vp->ip; sp = vp->sp; fp = vp->fp; };
+ { { if (__builtin_expect ((vp->trace_level > 0), 0)) { { vp->ip = 
ip; vp->sp = sp; vp->fp = fp; }; vm_dispatch_hook (vm, SCM_VM_NEXT_HOOK); } }; 
; goto *jump_table[(*ip++) & ((1<<8)-1)]; };
+   }
+ 
+   if (__builtin_expect ((vp->trace_level > 0), 0)) { { vp->ip = ip; 
vp->sp = sp; vp->fp = fp; }; vm_dispatch_hook (vm, SCM_VM_NEXT_HOOK); } ;
+ 
+   }
+ }
+ static const scm_t_vm_engine vm_engines[] =   {
+ vm_regular_engine, vm_debug_engine };
+ SCM scm_c_vm_run (SCM vm, SCM program, SCM *argv, int nargs) {
+ struct scm_vm *vp = ((struct scm_vm *) scm_t_bits) (0? 
(*(SCM*)0=SCM *)((scm_t_cell *) (((scm_t_bits) (0? (*(SCM*)0=vm): 
(((vm [((1))]))): (((SCM *)((scm_t_cell *) (((scm_t_bits) (0?

Re: [patch] libitm: Fix handling of reentrancy in the HTM fastpath

2013-06-20 Thread Torvald Riegel

On Wed, 2013-06-19 at 22:13 -0500, Peter Bergner wrote:
> On Thu, 2013-06-20 at 00:51 +0200, Torvald Riegel wrote:
> > On Wed, 2013-06-19 at 14:43 -0500, Peter Bergner wrote: 
> > >> I'm having trouble seeing why/when _ITM_inTransaction() is
> > >> returning something other than inIrrevocableTransaction.  I'll see if I 
> > >> can
> > >> determine why and will report back.
> > > 
> > > Ok, we return outsideTransaction because the nesting level (tx->nesting)
> > > is zero.
> > 
> > That's a second bug in libitm, sorry.  Can you try with the attached
> > patch additionally to the previous one?  Thanks!
> 
> Ok, with this patch, plus adding a powerpc implementation of
> htm_transaction_active(), reentrant.c now executes correctly
> on both HTM and non-HTM hardware for me.  Thanks for all of
> your help with this!
> 
> I'd still like to hear from Andreas, whether the reentrant.c test case
> with both patches, now works on S390.
> 
> I'll note unlike your x86 htm_transaction_active() implementation,
> my implementation has to check for htm_available() first before
> executing the htm instruction which tells me whether I'm in transaction
> state or not, otherwise I'll get a SIGILL on non-HTM hardware.
> Unfortunately, my htm_available() call uses getauxval() to query
> the AUXV for a hwcap bit.  Is there a place I can stash the result
> of the first call, so that subsequent calls use the cached value?
> Normally, I could use a static var, but I doubt that is what we want
> to do in static inline functions.

You're right, that was missing for x86 as well.  Please see the updated
second patch that is attached.  It additionally checks htm_fastpath to
see whether we are actually using the HTM.  This variable is initialized
to the value that htm_init() returns.  This should do the right thing I
suppose.

Torvald
commit c8352d4d9fa5cfa3453a61c581956835de9753e5
Author: Torvald Riegel 
Date:   Thu Jun 20 00:46:59 2013 +0200

libitm: Handle HTM fastpath in status query functions.

diff --git a/libitm/config/x86/target.h b/libitm/config/x86/target.h
index 77b627f..063c09e 100644
--- a/libitm/config/x86/target.h
+++ b/libitm/config/x86/target.h
@@ -125,6 +125,13 @@ htm_abort_should_retry (uint32_t begin_ret)
 {
   return begin_ret & _XABORT_RETRY;
 }
+
+/* Returns true iff a hardware transaction is currently being executed.  */
+static inline bool
+htm_transaction_active ()
+{
+  return _xtest() != 0;
+}
 #endif
 
 
diff --git a/libitm/query.cc b/libitm/query.cc
index 5707321..39a35b3 100644
--- a/libitm/query.cc
+++ b/libitm/query.cc
@@ -43,6 +43,15 @@ _ITM_libraryVersion (void)
 _ITM_howExecuting ITM_REGPARM
 _ITM_inTransaction (void)
 {
+#if defined(USE_HTM_FASTPATH)
+  // If we use the HTM fastpath, we cannot reliably detect whether we are
+  // in a transaction because this function can be called outside of
+  // a transaction and thus we can't deduce this by looking at just the serial
+  // lock.  This function isn't used in practice currently, so the easiest
+  // way to handle it is to just abort.
+  if (htm_fastpath && htm_transaction_active())
+htm_abort();
+#endif
   struct gtm_thread *tx = gtm_thr();
   if (tx && (tx->nesting > 0))
 {
@@ -58,6 +67,11 @@ _ITM_inTransaction (void)
 _ITM_transactionId_t ITM_REGPARM
 _ITM_getTransactionId (void)
 {
+#if defined(USE_HTM_FASTPATH)
+  // See ITM_inTransaction.
+  if (htm_fastpath && htm_transaction_active())
+htm_abort();
+#endif
   struct gtm_thread *tx = gtm_thr();
   return (tx && (tx->nesting > 0)) ? tx->id : _ITM_noTransactionId;
 }

[Patch, Fortran, committed] Don't set FL_VARIABLE twice

2013-06-20 Thread Tobias Burnus

Follow up to http://gcc.gnu.org/ml/fortran/2013-06/msg00027.html - for 
some reason this part of the patch got lost.


Committed as Rev. 200234.

Tobias
Index: gcc/fortran/ChangeLog
===
--- gcc/fortran/ChangeLog	(Revision 200233)
+++ gcc/fortran/ChangeLog	(Arbeitskopie)
@@ -1,14 +1,18 @@
+2013-06-20  Tobias Burnus  
+
+	* resolve.c (get_temp_from_expr): Don't set FL_VARIABLE twice.
+
 2013-06-17  Tobias Burnus  
 
 	* gfortran.h (gfc_option_t): Add fpe_summary.
 	* gfortran.texi (_gfortran_set_options): Update.
 	* invoke.texi (-ffpe-summary): Add doc.
 	* lang.opt (ffpe-summary): Add flag.
 	* options.c (gfc_init_options, gfc_handle_option): Handle it.
 	(gfc_handle_fpe_option): Renamed from gfc_handle_fpe_trap_option,
 	also handle fpe_summary.
 	* trans-decl.c (create_main_function): Update
 	_gfortran_set_options call.
 
 2013-06-15  Mikael Morin  
 
Index: gcc/fortran/resolve.c
===
--- gcc/fortran/resolve.c	(Revision 200233)
+++ gcc/fortran/resolve.c	(Arbeitskopie)
@@ -9300,29 +9300,28 @@ get_temp_from_expr (gfc_expr *e, gfc_nam
   tmp->n.sym->attr.flavor = FL_VARIABLE;
 
   if (as)
 {
   tmp->n.sym->as = gfc_copy_array_spec (as);
   if (!ref)
 	ref = e->ref;
   if (as->type == AS_DEFERRED)
 	tmp->n.sym->attr.allocatable = 1;
 }
   else
 tmp->n.sym->attr.dimension = 0;
 
   gfc_set_sym_referenced (tmp->n.sym);
-  gfc_add_flavor (&tmp->n.sym->attr, FL_VARIABLE, name, NULL);
   gfc_commit_symbol (tmp->n.sym);
   e = gfc_lval_expr_from_sym (tmp->n.sym);
 
   /* Should the lhs be a section, use its array ref for the
  temporary expression.  */
   if (aref && aref->type != AR_FULL)
 {
   gfc_free_ref_list (e->ref);
   e->ref = gfc_copy_ref (ref);
 }
   return e;
 }

[Patch, Fortran] PR57633 - Fix EOL handling with \r in list-directed I/O

2013-06-20 Thread Tobias Burnus


gfortran failed to correctly read the file
   line1,1,
   line2
with DOS (\r\n) line endings. As the code already set EOL for "\r", 
finish_list_read didn't call eat_line. Result: The attempt to read 
"line2" actually accessed the last byte of line one, namely "\n", which 
it regarded as zero-sized string.


That's fixed by the patch to next_char.

eat_separator is a separate issue and unrelated to the PR. I also failed 
to create a test case for it. In any case, I regard the following as wrong:


 case '\r':
  dtp->u.p.at_eol = 1;
...
   if (n != '\n')
{
  unget_char (dtp, n);
  break;

As the code explicitly does not regard "\r" as EOL in this case, I 
believe EOL shouldn't be set here.


(Recall, Unix (MacOS X, Linux, ...) have '\n' while DOS/Windows has 
"\r\n". While '\r' as line break exists (old Macs, pre MacOS X), 
gfortran does not support formatted I/O with "\r" record markers.)



Build and regtested on x86-64-gnu-linux.
OK for the trunk?

Tobias
2013-06-20  Tobias Burnus  

	PR fortran/57633
	* io/list_read.c (next_char, eat_separator): Don't set EOL for \r.

2013-06-20  Tobias Burnus  

	PR fortran/57633
	* gfortran.dg/list_read_11.f90: New.

diff --git a/libgfortran/io/list_read.c b/libgfortran/io/list_read.c
index c8a1bdfc..82a98a5 100644
--- a/libgfortran/io/list_read.c
+++ b/libgfortran/io/list_read.c
@@ -235,21 +235,21 @@ next_char (st_parameter_dt *dtp)
 	}
 	}
 }
   else
 {
   c = fbuf_getc (dtp->u.p.current_unit);
   if (c != EOF && is_stream_io (dtp))
 	dtp->u.p.current_unit->strm_pos++;
 }
 done:
-  dtp->u.p.at_eol = (c == '\n' || c == '\r' || c == EOF);
+  dtp->u.p.at_eol = (c == '\n' || c == EOF);
   return c;
 }
 
 
 /* Push a character back onto the input.  */
 
 static void
 unget_char (st_parameter_dt *dtp, int c)
 {
   dtp->u.p.last_char = c;
@@ -327,21 +327,20 @@ eat_separator (st_parameter_dt *dtp)
 case ';':
   dtp->u.p.comma_flag = 1;
   eat_spaces (dtp);
   break;
 
 case '/':
   dtp->u.p.input_complete = 1;
   break;
 
 case '\r':
-  dtp->u.p.at_eol = 1;
   if ((n = next_char(dtp)) == EOF)
 	return LIBERROR_END;
   if (n != '\n')
 	{
 	  unget_char (dtp, n);
 	  break;
 	}
 /* Fall through.  */
 case '\n':
   dtp->u.p.at_eol = 1;
--- /dev/null	2013-06-20 10:08:42.876937873 +0200
+++ gcc/gcc/testsuite/gfortran.dg/list_read_11.f90	2013-06-20 10:55:13.329549458 +0200
@@ -0,0 +1,38 @@
+! { dg-do run }
+! { dg-options "-fbackslash" }
+!
+! PR fortran/57633
+!
+program teststuff
+  implicit none
+  integer::a
+  character(len=10)::s1,s2
+
+  open(11,file="testcase.txt",form='unformatted',access='stream',status='new')
+  write(11) 'line1,1,\r\nline2'
+  close(11)
+
+  open(11,file="testcase.txt",form='formatted')
+  s1 = repeat('x', len(s1))
+  a = 99
+  read(11,*)s1,a
+  if (s1 /= "line1" .or. a /= 1) call abort()
+
+  s1 = repeat('x', len(s1))
+  read(11,"(a)")s1
+  close(11,status="delete")
+  if (s1 /= "line2") call abort()
+
+
+  open(11,file="testcase.txt",form='unformatted',access='stream',status='new')
+  write(11) 'word1\rword2,\n'
+  close(11)
+
+  open(11,file="testcase.txt",form='formatted')
+  s1 = repeat('x', len(s1))
+  s2 = repeat('x', len(s1))
+  read(11,*)s1,s2
+  close(11,status="delete")
+  if (s1 /= "word1") call abort()
+  if (s2 /= "word2") call abort()
+end program teststuff

Re: RFA: Fix rtl-optimization/57425

2013-06-20 Thread Richard Biener

On Wed, Jun 19, 2013 at 9:58 PM, Joern Rennecke
 wrote:
> Quoting Michael Matz :
>
>> That's not good.  You now have different order of parameters between
>> anti_dependence and canon_anti_dependence.  That will be mightily
>> confusing, please instead change the caller.  Currently these predicates
>> take their arguments in the order of the corresponding instructions, that
>> should better be retained:
>>
>> true_dependence:  write-then-(depending)read
>> anti_dependence:  read-then-(clobbering)write
>> write_dependence: write-then-(clobbering)write
>
>
> All right, attached is the patch with the arguments in instruction-order.
> Again, bootstrapped/regtested on i686-pc-linux-gnu .

Ok.

Thanks,
Richard.

> 2013-06-19  Joern Rennecke 
>
> PR rtl-optimization/57425
> PR rtl-optimization/57569
> * alias.c (write_dependence_p): Remove parameters mem_mode and
> canon_mem_addr.  Add parameters x_mode, x_addr and x_canonicalized.
> Changed all callers.
> (canon_anti_dependence): Get comments and semantics in sync.
> Add parameter mem_canonicalized.  Changed all callers.
> * rtl.h (canon_anti_dependence): Update prototype.
>
> Index: alias.c
> ===
> --- alias.c (revision 200133)
> +++ alias.c (working copy)
> @@ -156,8 +156,9 @@ static int insert_subset_children (splay
>  static alias_set_entry get_alias_set_entry (alias_set_type);
>  static bool nonoverlapping_component_refs_p (const_rtx, const_rtx);
>  static tree decl_for_component_ref (tree);
> -static int write_dependence_p (const_rtx, enum machine_mode, rtx,
> const_rtx,
> -  bool, bool);
> +static int write_dependence_p (const_rtx,
> +  const_rtx, enum machine_mode, rtx,
> +  bool, bool, bool);
>
>  static void memory_modified_1 (rtx, const_rtx, void *);
>
> @@ -2555,20 +2556,22 @@ canon_true_dependence (const_rtx mem, en
>
>  /* Returns nonzero if a write to X might alias a previous read from
> (or, if WRITEP is true, a write to) MEM.
> -   If MEM_CANONCALIZED is nonzero, CANON_MEM_ADDR is the canonicalized
> -   address of MEM, and MEM_MODE the mode for that access.  */
> +   If X_CANONCALIZED is true, then X_ADDR is the canonicalized address of
> X,
> +   and X_MODE the mode for that access.
> +   If MEM_CANONICALIZED is true, MEM is canonicalized.  */
>
>  static int
> -write_dependence_p (const_rtx mem, enum machine_mode mem_mode,
> -   rtx canon_mem_addr, const_rtx x,
> -   bool mem_canonicalized, bool writep)
> +write_dependence_p (const_rtx mem,
> +   const_rtx x, enum machine_mode x_mode, rtx x_addr,
> +   bool mem_canonicalized, bool x_canonicalized, bool
> writep)
>  {
> -  rtx x_addr, mem_addr;
> +  rtx mem_addr;
>rtx base;
>int ret;
>
> -  gcc_checking_assert (mem_canonicalized ? (canon_mem_addr != NULL_RTX)
> -  : (canon_mem_addr == NULL_RTX && mem_mode ==
> VOIDmode));
> +  gcc_checking_assert (x_canonicalized
> +  ? (x_addr != NULL_RTX && x_mode != VOIDmode)
> +  : (x_addr == NULL_RTX && x_mode == VOIDmode));
>
>if (MEM_VOLATILE_P (x) && MEM_VOLATILE_P (mem))
>  return 1;
> @@ -2593,17 +2596,21 @@ write_dependence_p (const_rtx mem, enum
>if (MEM_ADDR_SPACE (mem) != MEM_ADDR_SPACE (x))
>  return 1;
>
> -  x_addr = XEXP (x, 0);
>mem_addr = XEXP (mem, 0);
> -  if (!((GET_CODE (x_addr) == VALUE
> -&& GET_CODE (mem_addr) != VALUE
> -&& reg_mentioned_p (x_addr, mem_addr))
> -   || (GET_CODE (x_addr) != VALUE
> -   && GET_CODE (mem_addr) == VALUE
> -   && reg_mentioned_p (mem_addr, x_addr
> +  if (!x_addr)
>  {
> -  x_addr = get_addr (x_addr);
> -  mem_addr = get_addr (mem_addr);
> +  x_addr = XEXP (x, 0);
> +  if (!((GET_CODE (x_addr) == VALUE
> +&& GET_CODE (mem_addr) != VALUE
> +&& reg_mentioned_p (x_addr, mem_addr))
> +   || (GET_CODE (x_addr) != VALUE
> +   && GET_CODE (mem_addr) == VALUE
> +   && reg_mentioned_p (mem_addr, x_addr
> +   {
> + x_addr = get_addr (x_addr);
> + if (!mem_canonicalized)
> +   mem_addr = get_addr (mem_addr);
> +   }
>  }
>
>base = find_base_term (mem_addr);
> @@ -2619,17 +2626,16 @@ write_dependence_p (const_rtx mem, enum
>   GET_MODE (mem)))
>  return 0;
>
> -  x_addr = canon_rtx (x_addr);
> -  if (mem_canonicalized)
> -mem_addr = canon_mem_addr;
> -  else
> +  if (!x_canonicalized)
>  {
> -  mem_addr = canon_rtx (mem_addr);
> -  mem_mode = GET_MODE (mem);
> +  x_addr = canon_rtx (x_addr);
> +  x_mode = GET_MODE (x);
>  }
> +  if (!mem_canonicalized)
> +mem_addr = canon_rtx (mem_addr);
>
> -  if ((ret = memrefs_conflict_p (GET_MO

Re: [PATCH] PR32219, weak hidden reference segfault [PING]

2013-06-20 Thread Chung-Lin Tang

Ping again?

On 13/6/11 5:20 PM, Bernhard Reutner-Fischer wrote:
> ping, CCing middle-end maintainers for review.
> 
> On 31 May 2013 10:13, Chung-Lin Tang  wrote:
>> On 13/5/15 8:12 PM, Richard Sandiford wrote:
>>> Chung-Lin Tang  writes:
 On 13/5/10 6:37 PM, Richard Sandiford wrote:
> Chung-Lin Tang  writes:
>> +case UNSPEC:
>> +  /* Reach for a contained symbol.  */
>> +  return nonzero_address_p (XVECEXP (x, 0, 0));
>
> I don't think this is safe.  UNSPECs really are unspecified :-),
> so we can't assume that (unspec X) is nonzero simply because X is.

 Attached is a modified patch (not yet tested but just for demonstration)
 with a more specific test, hopefully regarded as more safe.

 The point is in recognizing (const (unspec [symbol] XYZ)) offsets in PIC
 references, which seems quite idiomatic across all targets by now.
>>>
>>> I agree this is safer.  However, there used to be some ports that
>>> use (plus pic_offset_table_rtx (symbol_ref X)) -- i.e. without an unspec --
>>> to represent a PIC reference to X.  That always seemed semantically wrong,
>>> since you're not actually adding the address of X and the PIC register,
>>> but the patch wouldn't handle that case correctly.
>>
>> Well I can't help those targets then, but at least nothing will be
>> changed for them by this patch. It will just continue to return 'true'.
>>
>>> An alternative might be to remove the pic_offset_table_rtx case altogether
>>> and rely on targetm.delegitimize_address instead.  FWIW, I'd prefer that
>>> if it works, but it's not me you need to convince. :-)
>>
>> Like we discussed below, I think the direction should be towards making
>> things more machine-independent, rather then pushing more into the backend.
>>
 I would suggest that this probably means there should be a new, more
 specific construct in RTL to represent relocation values of this kind,
 instead of (const (unspec)) serving an unofficial role; possibly some
 real support for reasoning about PIC references could also be considered.
>>>
>>> Yeah, maybe we could try to introduce some target-independent knowledge
>>> of certain reloc types, a bit like the generic BFD_RELOC_*s in bfd.
>>
>>
>> FWIW, I've ran tests on the newer patch on i686-linux, with no
>> regressions. Testcase has been moved to gcc.dg/torture by recommendation
>> of Bernhard. If any of the RTL maintainers can give an eye of merciful
>> approval, this old PR could be resolved :)
>>
>> Thanks,
>> Chung-Lin
>>

76 matches

Mail list logo