[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-29 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-116848080
  
Merging this in. Thanks @ilganeli!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-29 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/6686





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread davies
Github user davies commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115029251
  
@ilganeli Once you correct the parameter name (make it consistent), this looks 
good to me.

Unfortunately, this PR cannot make the 1.4.1 release, which should only 
include bug fixes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread davies
Github user davies commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115039570
  
LGTM, waiting for tests.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r33203825
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,53 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name, data_type=None, nullable=True, metadata=None):
+        """
+        Construct a StructType by adding new elements to it to define the schema. The method accepts
+        either:
+        a) A single parameter which is a StructField object.
+        b) Between 2 and 4 parameters as (name, data_type, nullable (optional),
+         metadata(optional). The data_type parameter may be either a String or a DataType object
+
+        >>> struct1 = StructType().add("f1", StringType(), True).add("f2", StringType(), True, None)
+        >>> struct2 = StructType([StructField("f1", StringType(), True),\
+         StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+        >>> struct1 = StructType().add(StructField("f1", StringType(), True))
+        >>> struct2 = StructType([StructField("f1", StringType(), True)])
+        >>> struct1 == struct2
+        True
+        >>> struct1 = StructType().add("f1", "string", True)
+        >>> struct2 = StructType([StructField("f1", StringType(), True)])
+        >>> struct1 == struct2
+        True
+
+        :param nameOrStructField: Either the name of the field or a StructField object
--- End diff --

This is different from `name`. I'd like to use `field`, which could be either a 
StructField or a string holding the field name. What do you think?
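
To make the suggestion concrete, here is a minimal sketch of an `add` method whose first parameter, named `field` as proposed, accepts either a ready-made field object or a plain name. `SketchField` and `SketchStruct` are hypothetical stand-ins written for this illustration, not the pyspark classes, and the data type is passed as a plain string for brevity.

```python
class SketchField(object):
    """Hypothetical stand-in for a struct field: name, type and nullable flag."""

    def __init__(self, name, data_type, nullable=True):
        self.name = name
        self.data_type = data_type
        self.nullable = nullable

    def __eq__(self, other):
        return isinstance(other, SketchField) and self.__dict__ == other.__dict__

    def __ne__(self, other):
        return not self.__eq__(other)


class SketchStruct(object):
    """Hypothetical stand-in for a struct type: an ordered list of fields."""

    def __init__(self, fields=None):
        self.fields = list(fields) if fields else []

    def __eq__(self, other):
        return isinstance(other, SketchStruct) and self.fields == other.fields

    def __ne__(self, other):
        return not self.__eq__(other)

    def add(self, field, data_type=None, nullable=True):
        # `field` is either a SketchField or the name of the field to create.
        if not isinstance(field, SketchField):
            if data_type is None:
                raise ValueError("data_type is required when field is a name")
            field = SketchField(field, data_type, nullable)
        self.fields.append(field)
        return self  # returning self allows chained calls


# Both construction styles should describe the same schema.
by_name = SketchStruct().add("f1", "string").add("f2", "string")
by_object = SketchStruct().add(SketchField("f1", "string")).add(SketchField("f2", "string"))
```

With a single `field` parameter the docstring only has to describe one argument that may take two forms, rather than a name that is sometimes not a name.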





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115032140
  
[Test build #35728 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35728/consoleFull) for PR 6686 at commit [`27c1de1`](https://github.com/apache/spark/commit/27c1de1a4905b1e4b6bc315f688f900f3b55332a).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115030576
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115030558
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115053843
  
[Test build #35728 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35728/console) for PR 6686 at commit [`27c1de1`](https://github.com/apache/spark/commit/27c1de1a4905b1e4b6bc315f688f900f3b55332a).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-115053967
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-24 Thread ilganeli
Github user ilganeli commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114973836
  
@davies @rxin Are there any further suggestions? Would love to get this 
into 1.4.1, thanks!





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114186900
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32959777
  
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala ---
@@ -43,24 +43,6 @@ private[r] object SQLUtils {
     StructType(fields)
   }
 
-  def getSQLDataType(dataType: String): DataType = {
-    dataType match {
-      case "byte" => org.apache.spark.sql.types.ByteType
-      case "integer" => org.apache.spark.sql.types.IntegerType
-      case "double" => org.apache.spark.sql.types.DoubleType
-      case "numeric" => org.apache.spark.sql.types.DoubleType
-      case "character" => org.apache.spark.sql.types.StringType
--- End diff --

These are R types, cc @shivaram 





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114187161
  
[Test build #35460 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35460/consoleFull) for PR 6686 at commit [`5fef5a4`](https://github.com/apache/spark/commit/5fef5a479ef3638ce481887b24bf43afa61d8f1d).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114189600
  
[Test build #35460 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35460/console) for PR 6686 at commit [`5fef5a4`](https://github.com/apache/spark/commit/5fef5a479ef3638ce481887b24bf43afa61d8f1d).
 * This patch **fails to build**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32959687
  
--- Diff: python/pyspark/sql/types.py ---
@@ -56,6 +56,32 @@ def __eq__(self, other):
     def __ne__(self, other):
         return not self.__eq__(other)
 
+    def from_string(self, data_type):
--- End diff --

The aliases (`integer`, `raw`, `logical`) are only used for R, so we could 
use `_parse_datatype_json_value` here.
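
As a rough illustration of the trade-off, the sketch below resolves a type name through a single registry keyed by each type's canonical name, which is the general idea behind reusing an existing parser. The classes and the `parse_type_name` helper are hypothetical stand-ins, not pyspark's `_parse_datatype_json_value`; the point is only that R-style aliases such as `raw` or `logical` stop being recognised.

```python
# Hypothetical stand-in type classes; each one carries its canonical name.
class ByteType(object):
    name = "byte"


class StringType(object):
    name = "string"


class BinaryType(object):
    name = "binary"


class BooleanType(object):
    name = "boolean"


# One registry keyed by canonical name, with no per-language aliases.
_TYPES_BY_NAME = dict(
    (t.name, t) for t in (ByteType, StringType, BinaryType, BooleanType)
)


def parse_type_name(type_name):
    """Return a new instance of the type registered under type_name."""
    try:
        return _TYPES_BY_NAME[type_name]()
    except KeyError:
        raise ValueError("Unsupported type name: %s" % type_name)
```

Here `parse_type_name("binary")` yields a `BinaryType` instance, while the R spelling `raw` raises `ValueError` because nothing registers it.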





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114189619
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114186871
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread ilganeli
Github user ilganeli commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32958201
  
--- Diff: python/pyspark/sql/types.py ---
@@ -56,6 +56,32 @@ def __eq__(self, other):
     def __ne__(self, other):
         return not self.__eq__(other)
 
+    def from_string(self, data_type):
--- End diff --

Is it ok to leave it as is to be specific?





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114243273
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114285248
  
[Test build #35482 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35482/console) for PR 6686 at commit [`467d836`](https://github.com/apache/spark/commit/467d8367341f2e53dbc1575c6322b8695801519d).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114285290
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114243212
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-22 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-114243827
  
[Test build #35482 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35482/consoleFull) for PR 6686 at commit [`467d836`](https://github.com/apache/spark/commit/467d8367341f2e53dbc1575c6322b8695801519d).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-19 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32806503
  
--- Diff: python/pyspark/sql/types.py ---
@@ -56,6 +56,32 @@ def __eq__(self, other):
     def __ne__(self, other):
         return not self.__eq__(other)
 
+    def from_string(self, data_type):
--- End diff --

I think we could use `_parse_datatype_json_value` here, but we would lose some 
type aliases.







[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-19 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32806501
  
--- Diff: python/pyspark/sql/types.py ---
@@ -56,6 +56,32 @@ def __eq__(self, other):
     def __ne__(self, other):
         return not self.__eq__(other)
 
+    def from_string(self, data_type):
+        assert(isinstance(data_type, str))
+
+        if data_type == "byte":
+            return ByteType
+        elif data_type == "long":
+            return LongType
+        elif data_type == "float":
+            return FloatType
+        elif data_type == "int" or data_type == "integer":
+            return IntegerType
+        elif data_type == "double" or data_type == "numeric":
+            return DoubleType
+        elif data_type == "string" or data_type == "character":
--- End diff --

This is Python; we don't need the R types (`numeric` and `character`) here.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-19 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32806472
  
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala ---
@@ -43,26 +43,8 @@ private[r] object SQLUtils {
     StructType(fields)
   }
 
-  def getSQLDataType(dataType: String): DataType = {
-    dataType match {
-      case "byte" => org.apache.spark.sql.types.ByteType
-      case "integer" => org.apache.spark.sql.types.IntegerType
-      case "double" => org.apache.spark.sql.types.DoubleType
-      case "numeric" => org.apache.spark.sql.types.DoubleType
-      case "character" => org.apache.spark.sql.types.StringType
-      case "string" => org.apache.spark.sql.types.StringType
-      case "binary" => org.apache.spark.sql.types.BinaryType
-      case "raw" => org.apache.spark.sql.types.BinaryType
-      case "logical" => org.apache.spark.sql.types.BooleanType
-      case "boolean" => org.apache.spark.sql.types.BooleanType
-      case "timestamp" => org.apache.spark.sql.types.TimestampType
-      case "date" => org.apache.spark.sql.types.DateType
-      case _ => throw new IllegalArgumentException(s"Invaid type $dataType")
-    }
-  }
-
   def createStructField(name: String, dataType: String, nullable: Boolean): StructField = {
-    val dtObj = getSQLDataType(dataType)
+    val dtObj = DataTypeParser.parse(dataType)
--- End diff --

These two are different; `getSQLDataType` is used for R.
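
One way to picture the difference: the removed `getSQLDataType` accepts R spellings such as `numeric`, `character`, `raw` and `logical` alongside the SQL-style names, so a generic parser is only a drop-in replacement if those aliases are translated first. The sketch below shows such a translation step in Python. The alias pairs are read off the removed match expression, while `normalize_r_type` and `CANONICAL_NAMES` are hypothetical and say nothing about what `DataTypeParser` actually accepts.

```python
# R spellings and the name they share a Spark SQL type with, as paired in
# the removed getSQLDataType match expression.
R_ALIASES = {
    "numeric": "double",
    "character": "string",
    "raw": "binary",
    "logical": "boolean",
}

# Remaining names from the same match expression (assumed set for this sketch).
CANONICAL_NAMES = frozenset([
    "byte", "integer", "double", "string",
    "binary", "boolean", "timestamp", "date",
])


def normalize_r_type(type_name):
    """Map an R type name to its canonical name, rejecting unknown names."""
    canonical = R_ALIASES.get(type_name, type_name)
    if canonical not in CANONICAL_NAMES:
        raise ValueError("Invalid type %s" % type_name)
    return canonical
```

Keeping the alias table on the R side leaves the shared parser free of language-specific names.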





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-19 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32804117
  
--- Diff: python/pyspark/sql/types.py ---
@@ -56,6 +56,32 @@ def __eq__(self, other):
     def __ne__(self, other):
         return not self.__eq__(other)
 
+    def from_string(self, data_type):
--- End diff --

@davies Why don't we just call the JVM's type-string parser here?





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-19 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32804096
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +393,47 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=None, nullable=True, metadata=None):
--- End diff --

Can we change `name_or_struct_field` to just `name`, and explain in the 
docstring that we also support passing in a StructField?





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113189491
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113189528
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113190209
  
[Test build #35137 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35137/consoleFull) for PR 6686 at commit [`8109e00`](https://github.com/apache/spark/commit/8109e0067b3abce6f4eec937b39c6d7db2eb6b71).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32751352
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,43 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=None, nullable=True, metadata=None):
+        """
+        Construct a StructType by adding new elements to it to define the schema
+
+        >>> struct1 = StructType().add("f1", StringType(), True).add("f2", StringType(), True, None)
+        >>> struct2 = StructType([StructField("f1", StringType(), True),\
+         StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+        >>> struct1 = (StructType().add(StructField("f1", StringType(), True))
+        ...     .add(StructField("f2", StringType(), True, None)))
+        >>> struct2 = StructType([StructField("f1", StringType(), True),
+        ...     StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+
+        :param nameOrStructField: Either the name of the field or a StructField object
+        :param data_type: If present, the DataType of the StructField to create
+        :param nullable: Whether the field to add should be nullable (default True)
+        :param metadata: Any additional metadata (default None)
+        :return: a new updated StructType
+        """
+        if isinstance(name_or_struct_field, StructField):
+            self.fields.append(name_or_struct_field)
+            return self
--- End diff --

Nit: you can save a line by moving `return self` out of the `if` block and 
placing it once at the end.
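
The nit is about control flow: if every branch finishes by returning the same object, one `return self` after the `if`/`else` replaces a `return` per branch. The sketch below contrasts the two shapes on a plain list of `(name, type)` tuples. Both classes are hypothetical, and the sketch assumes the branch not shown in the diff also ends with `return self`.

```python
class TwoReturns(object):
    """Hypothetical builder with one return statement per branch."""

    def __init__(self):
        self.fields = []

    def add(self, field, data_type=None):
        if isinstance(field, tuple):
            self.fields.append(field)
            return self
        else:
            self.fields.append((field, data_type))
            return self


class SingleReturn(object):
    """Same behaviour, with the shared return hoisted out of the branches."""

    def __init__(self):
        self.fields = []

    def add(self, field, data_type=None):
        if isinstance(field, tuple):
            self.fields.append(field)
        else:
            self.fields.append((field, data_type))
        return self  # one exit point shared by both branches


# Chained calls behave identically for both shapes.
first = TwoReturns().add(("f1", "string")).add("f2", "string")
second = SingleReturn().add(("f1", "string")).add("f2", "string")
```

Behaviour is unchanged; the second form just makes it harder to forget the `return` when another branch is added later.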





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32751787
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,43 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=None, nullable=True, metadata=None):
+        """
+        Construct a StructType by adding new elements to it to define the schema
+
+        >>> struct1 = StructType().add("f1", StringType(), True).add("f2", StringType(), True, None)
+        >>> struct2 = StructType([StructField("f1", StringType(), True),\
+            StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+        >>> struct1 = (StructType().add(StructField("f1", StringType(), True))
+        ...     .add(StructField("f2", StringType(), True, None)))
+        >>> struct2 = StructType([StructField("f1", StringType(), True),
+        ...     StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+
+        :param nameOrStructField: Either the name of the field or a StructField object
+        :param data_type: If present, the DataType of the StructField to create
+        :param nullable: Whether the field to add should be nullable (default True)
+        :param metadata: Any additional metadata (default None)
+        :return: a new updated StructType
+        """
+        if isinstance(name_or_struct_field, StructField):
+            self.fields.append(name_or_struct_field)
+            return self
+        else:
+            if isinstance(name_or_struct_field, str) and data_type is None:
+                raise ValueError("Must specify DataType if passing name of struct_field to create.")
+            self.fields.append(StructField(name_or_struct_field, data_type, nullable, metadata))
--- End diff --

Do we need to check the type of `data_type`? In Scala, we support passing 
`data_type` as a string, so it would be good to support that here as well. At 
the very least, we should check the type.
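
One way the check could look, as a rough sketch only. The `DataType` subclasses are simplified stand-ins, and `_TYPES_BY_NAME` and `resolve_data_type` are hypothetical names invented for this illustration; this is not the Scala parser or any existing pyspark helper:

```python
class DataType(object):
    """Simplified stand-in for illustration; not the pyspark class."""


class StringType(DataType):
    pass


class IntegerType(DataType):
    pass


# Hypothetical lookup table, used here in place of a real type-string parser.
_TYPES_BY_NAME = {"string": StringType, "integer": IntegerType}


def resolve_data_type(data_type):
    """Return a DataType instance for either a DataType or a type-name string."""
    if isinstance(data_type, DataType):
        return data_type
    if isinstance(data_type, str):
        if data_type not in _TYPES_BY_NAME:
            raise ValueError("Unknown data type name: %r" % data_type)
        return _TYPES_BY_NAME[data_type]()
    # Anything else is rejected up front instead of failing later.
    raise TypeError("data_type should be a DataType or a string, got %r"
                    % type(data_type))
```

Under these assumptions, `add` would call `resolve_data_type(data_type)` before building the `StructField`, so a bad argument fails at the call site.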





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113354797
  
  [Test build #35201 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35201/console)
 for   PR 6686 at commit 
[`4085489`](https://github.com/apache/spark/commit/4085489efc22de12e9b695392872ee24ad9cd04c).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113354814
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113331823
  
  [Test build #35201 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35201/consoleFull)
 for   PR 6686 at commit 
[`4085489`](https://github.com/apache/spark/commit/4085489efc22de12e9b695392872ee24ad9cd04c).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113331787
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113331773
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113232083
  
  [Test build #35137 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35137/console)
 for   PR 6686 at commit 
[`8109e00`](https://github.com/apache/spark/commit/8109e0067b3abce6f4eec937b39c6d7db2eb6b71).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113232120
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread ilganeli
Github user ilganeli commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32785670
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,43 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=None, nullable=True, metadata=None):
+        """
+        Construct a StructType by adding new elements to it to define the schema
+
+        >>> struct1 = StructType().add("f1", StringType(), True).add("f2", StringType(), True, None)
+        >>> struct2 = StructType([StructField("f1", StringType(), True),\
+            StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+        >>> struct1 = (StructType().add(StructField("f1", StringType(), True))
+        ...     .add(StructField("f2", StringType(), True, None)))
+        >>> struct2 = StructType([StructField("f1", StringType(), True),
+        ...     StructField("f2", StringType(), True, None)])
+        >>> struct1 == struct2
+        True
+
+        :param nameOrStructField: Either the name of the field or a StructField object
+        :param data_type: If present, the DataType of the StructField to create
+        :param nullable: Whether the field to add should be nullable (default True)
+        :param metadata: Any additional metadata (default None)
+        :return: a new updated StructType
+        """
+        if isinstance(name_or_struct_field, StructField):
+            self.fields.append(name_or_struct_field)
+            return self
+        else:
+            if isinstance(name_or_struct_field, str) and data_type is None:
+                raise ValueError("Must specify DataType if passing name of struct_field to create.")
+            self.fields.append(StructField(name_or_struct_field, data_type, nullable, metadata))
--- End diff --

@davies Is there an equivalent of the DataTypeParser in Python?





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113307230
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113307245
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113307627
  
  [Test build #35184 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35184/consoleFull)
 for   PR 6686 at commit 
[`3670cf5`](https://github.com/apache/spark/commit/3670cf5be7d8430b7d661a1f765bc7103b05d21b).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113307870
  
  [Test build #35184 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35184/console)
 for   PR 6686 at commit 
[`3670cf5`](https://github.com/apache/spark/commit/3670cf5be7d8430b7d661a1f765bc7103b05d21b).
 * This patch **fails Python style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113307873
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113063566
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113063564
  
  [Test build #35109 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35109/console)
 for   PR 6686 at commit 
[`e7ba7e0`](https://github.com/apache/spark/commit/e7ba7e03cb11b68ef990945f4c2fa53fbe9ea78d).
 * This patch **fails Python style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113079072
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113079089
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread ilganeli
Github user ilganeli commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113062905
  
@davies Made the changes you suggested. The only thing I didn't know how to 
do is intercept an exception in Python. Is there a cleaner way of doing it than 
what I added? The way I'm doing it right now seems hacky, but I didn't see a 
better example in the other cases.
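
For reference, the standard-library `unittest` module offers `assertRaises` for this. A minimal sketch follows, where `add_field` is a hypothetical helper written only for this example to mirror the check discussed in this thread; it is not code from the PR:

```python
import unittest


def add_field(fields, name, data_type=None):
    """Hypothetical helper: refuse to add a named field without a data type."""
    if data_type is None:
        raise ValueError("Must specify DataType if passing name of struct_field to create.")
    fields.append((name, data_type))
    return fields


class AddFieldTest(unittest.TestCase):
    def test_missing_data_type_raises(self):
        # Callable form: passes only if the call raises ValueError.
        self.assertRaises(ValueError, add_field, [], "f1")

    def test_missing_data_type_message(self):
        # Context-manager form (Python 2.7+) also exposes the exception object.
        with self.assertRaises(ValueError) as ctx:
            add_field([], "f1")
        self.assertIn("Must specify DataType", str(ctx.exception))
```

Either form avoids a hand-written `try`/`except` with a manual failure flag in the test body.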





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113063328
  
  [Test build #35109 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35109/consoleFull)
 for   PR 6686 at commit 
[`e7ba7e0`](https://github.com/apache/spark/commit/e7ba7e03cb11b68ef990945f4c2fa53fbe9ea78d).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113063174
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113063190
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113079303
  
  [Test build #35117 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35117/consoleFull)
 for   PR 6686 at commit 
[`41ab686`](https://github.com/apache/spark/commit/41ab6867faad559902d7c8c92e300959096cd59a).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113102560
  
  [Test build #35117 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/35117/console)
 for   PR 6686 at commit 
[`41ab686`](https://github.com/apache/spark/commit/41ab6867faad559902d7c8c92e300959096cd59a).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-113102620
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-17 Thread ilganeli
Github user ilganeli commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32656127
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,49 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=NullType(), nullable=True, metadata=None):
--- End diff --

Davies - totally agree. This was changed specifically to consolidate to a 
single method as suggested by Reynold. I initially had separate add methods - 
one which accepted a StructField and one which accepted the 4 parameters, the 
first two of which were defined.

What would you suggest? My preference is to break this out into two methods 
for clarity and to avoid the problem you mention.



Thank you,
Ilya Ganelin









[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-17 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32656949
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,49 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=NullType(), nullable=True, metadata=None):
--- End diff --

I think you could use `None` as the default value of `data_type`, and raise an 
exception if `name_or_struct_field` is a string and `data_type` is `None`.
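
A minimal sketch of that suggestion, assuming `None` is used purely as a "not supplied" marker. The classes are simplified stand-ins written for this illustration, not the pyspark classes:

```python
class StructField(object):
    """Simplified stand-in for illustration; not the pyspark class."""
    def __init__(self, name, data_type, nullable=True, metadata=None):
        self.name = name
        self.data_type = data_type
        self.nullable = nullable
        self.metadata = metadata


class StructType(object):
    """Simplified stand-in for illustration; not the pyspark class."""
    def __init__(self, fields=None):
        self.fields = list(fields) if fields else []

    def add(self, name_or_struct_field, data_type=None, nullable=True, metadata=None):
        if isinstance(name_or_struct_field, StructField):
            self.fields.append(name_or_struct_field)
            return self
        # A bare name cannot define a field, so a missing data type is an error
        # rather than being silently replaced by a placeholder type.
        if data_type is None:
            raise ValueError("Must specify DataType if passing name of struct_field to create.")
        self.fields.append(StructField(name_or_struct_field, data_type, nullable, metadata))
        return self
```

Compared with a `NullType()` default, this fails at the point where the schema is built instead of later, when the schema is used.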





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-17 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32650599
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,49 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=NullType(), nullable=True, metadata=None):
+        """
+        Construct a StructType by adding new elements to it to define the schema
+        >>> struct1 = StructType().add("f1", StringType(), True).add("f2", StringType(), True, None)
+        >>> struct2 = StructType([StructField("f1", StringType(), True),\
--- End diff --

The doctests are used as examples in the API docs, so they should be simple. 
We could put the other test cases into sql/tests.py.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-17 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32650869
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,49 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=NullType(), nullable=True, metadata=None):
--- End diff --

What are the use cases for a StructType that does not specify the data type 
of each column?

In `createDataFrame`, if a schema of StructType is provided, it will not 
try to infer the data types, so it does not work with a StructType that has 
NoneType in it.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-17 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/6686#discussion_r32650363
  
--- Diff: python/pyspark/sql/types.py ---
@@ -368,8 +367,49 @@ def __init__(self, fields):
         >>> struct1 == struct2
         False
         """
-        assert all(isinstance(f, DataType) for f in fields), "fields should be a list of DataType"
-        self.fields = fields
+        if not fields:
+            self.fields = []
+        else:
+            self.fields = fields
+            assert all(isinstance(f, StructField) for f in fields),\
+                "fields should be a list of StructField"
+
+    def add(self, name_or_struct_field, data_type=NullType(), nullable=True, metadata=None):
+        """
+        Construct a StructType by adding new elements to it to define the schema
--- End diff --

Add a new line here.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-16 Thread ilganeli
Github user ilganeli commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112480475
  
@rxin Is this good to go? Looking forward to getting this merged!





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112110192
  
  [Test build #34944 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34944/consoleFull) for PR 6686 at commit [`e240040`](https://github.com/apache/spark/commit/e240040cc6d4c206498e8cfc27429f898ba5dab7).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112099273
  
  [Test build #34943 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34943/consoleFull) for PR 6686 at commit [`bab7823`](https://github.com/apache/spark/commit/bab78239afc4c3148f38e7f79b440ed5235a287e).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112098983
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112098935
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112099705
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112099702
  
  [Test build #34943 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34943/console) for PR 6686 at commit [`bab7823`](https://github.com/apache/spark/commit/bab78239afc4c3148f38e7f79b440ed5235a287e).
 * This patch **fails Python style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112108881
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112108953
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112160908
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112160926
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112161079
  
  [Test build #34954 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34954/consoleFull) for PR 6686 at commit [`15868fa`](https://github.com/apache/spark/commit/15868fa5d071de4a5c2fd554455e04c4c904b688).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112141366
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112141325
  
  [Test build #34944 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34944/console) for PR 6686 at commit [`e240040`](https://github.com/apache/spark/commit/e240040cc6d4c206498e8cfc27429f898ba5dab7).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112151316
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112201653
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112201545
  
  [Test build #34954 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34954/console) for PR 6686 at commit [`15868fa`](https://github.com/apache/spark/commit/15868fa5d071de4a5c2fd554455e04c4c904b688).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112182686
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-112182645
  
  [Test build #34952 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34952/console) for PR 6686 at commit [`b79b992`](https://github.com/apache/spark/commit/b79b992ecf5e8ffab34eb12090b9e4b0deab36da).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111980790
  
  [Test build #34918 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34918/console) for PR 6686 at commit [`73d4677`](https://github.com/apache/spark/commit/73d46774b91175e20194739d61e627c848fec4a3).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111980837
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread ilganeli
Github user ilganeli commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111956199
  
retest this please





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111957224
  
  [Test build #34918 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34918/consoleFull) for PR 6686 at commit [`73d4677`](https://github.com/apache/spark/commit/73d46774b91175e20194739d61e627c848fec4a3).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111956606
  
Merged build started.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-15 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111956577
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111805854
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111805844
  
  [Test build #34878 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34878/console) for PR 6686 at commit [`0484d7a`](https://github.com/apache/spark/commit/0484d7ac6f479a6da91f8b8b7f8e1a60576e00d0).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111805171
  
  [Test build #34879 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34879/console) for PR 6686 at commit [`04cbf0c`](https://github.com/apache/spark/commit/04cbf0c3610c1dd21ac5b27b08b89621e49409e7).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111805174
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111796022
  
  [Test build #34879 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34879/consoleFull) for PR 6686 at commit [`04cbf0c`](https://github.com/apache/spark/commit/04cbf0c3610c1dd21ac5b27b08b89621e49409e7).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111794364
  
  [Test build #34870 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34870/console) for PR 6686 at commit [`689e54d`](https://github.com/apache/spark/commit/689e54dc9992ca9218442729cb8b22e12151cc70).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111794368
  
Merged build finished. Test PASSed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111791362
  
  [Test build #34872 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34872/console) for PR 6686 at commit [`6aeb740`](https://github.com/apache/spark/commit/6aeb74018473c5850b7838a0784494ea079c3346).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111791373
  
Merged build finished. Test FAILed.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111794023
  
  [Test build #34878 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34878/consoleFull)
 for   PR 6686 at commit 
[`0484d7a`](https://github.com/apache/spark/commit/0484d7ac6f479a6da91f8b8b7f8e1a60576e00d0).





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111795884
  
 Merged build triggered.





[GitHub] spark pull request: [SPARK-8056][SQL] Design an easier way to cons...

2015-06-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/6686#issuecomment-111795912
  
Merged build started.




