[GitHub] spark issue #13758: [SPARK-16043][SQL] Prepare GenericArrayData implementati...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13758
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60768/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #13746: [SPARK-16030] [SQL] Allow specifying static partitions w...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13746
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13746: [SPARK-16030] [SQL] Allow specifying static partitions w...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13746
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60760/
Test PASSed.





[GitHub] spark issue #13758: [SPARK-16043][SQL] Prepare GenericArrayData implementati...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13758
  
**[Test build #60768 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60768/consoleFull)**
 for PR 13758 at commit 
[`17bdfcf`](https://github.com/apache/spark/commit/17bdfcf7b9c155fe05eb75f73d661c2863cdf2d9).





[GitHub] spark pull request #13758: [SPARK-16043][SQL] Prepare GenericArrayData imple...

2016-06-17 Thread kiszk
GitHub user kiszk opened a pull request:

https://github.com/apache/spark/pull/13758

[SPARK-16043][SQL] Prepare GenericArrayData implementation specialized for 
a primitive array

## What changes were proposed in this pull request?

This PR addresses a ToDo in the ```GenericArrayData``` class. The current 
implementation of ```GenericArrayData``` boxes and unboxes values when the 
array elements are of a primitive type. Eliminating this boxing/unboxing 
would reduce the runtime memory footprint and improve performance.

This PR eliminates boxing/unboxing by adding subclasses of 
```GenericArrayData``` that specialize these operations. It also adds a new 
method, ```GenericArrayData.allocate(...)```, that can return either a 
generic ```GenericArrayData``` instance or a ```GenericArrayData``` 
instance specialized for a primitive type array.

Here are the major improvements:
1. The data is held in a primitive array (previously ```Object[]``` was used 
and boxing happened in the constructor)
2. The method "get``()" reads a value directly from a primitive array 
(previously unboxing happened)
3. The method "to``Array" copies data using 
```System.arraycopy``` (previously unboxing happened)
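
The difference between the two storage strategies can be illustrated with a minimal Java sketch. This is not the code from this PR: ```BoxedArray``` and ```DoubleArray``` are hypothetical names that only stand in for a generic, ```Object[]```-backed holder and a holder specialized for ```double```.

```java
/** Hypothetical sketch, not Spark's GenericArrayData: boxed vs. primitive storage. */
final class PrimitiveArraySketch {

    /** Generic holder: every element is stored as a boxed Object. */
    static final class BoxedArray {
        private final Object[] values;

        BoxedArray(double[] source) {
            values = new Object[source.length];
            for (int i = 0; i < source.length; i++) {
                values[i] = source[i]; // autoboxing: one Double object per element
            }
        }

        double getDouble(int ordinal) {
            return (Double) values[ordinal]; // unboxing on every read
        }

        double[] toDoubleArray() {
            double[] out = new double[values.length];
            for (int i = 0; i < values.length; i++) {
                out[i] = (Double) values[i]; // unboxing per element
            }
            return out;
        }
    }

    /** Specialized holder: the data stays in a double[] from start to finish. */
    static final class DoubleArray {
        private final double[] values;

        DoubleArray(double[] source) {
            values = source; // no per-element allocation
        }

        double getDouble(int ordinal) {
            return values[ordinal]; // plain array read, no unboxing
        }

        double[] toDoubleArray() {
            double[] out = new double[values.length];
            System.arraycopy(values, 0, out, 0, values.length); // bulk copy
            return out;
        }
    }
}
```

Both holders return the same values; the specialized one simply avoids allocating a wrapper object per element and can copy its contents in a single ```System.arraycopy``` call.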

## How was this patch tested?

add unit tests

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/kiszk/spark SPARK-16043

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13758.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13758


commit 17bdfcf7b9c155fe05eb75f73d661c2863cdf2d9
Author: Kazuaki Ishizaki 
Date:   2016-06-18T06:47:11Z

Implementation of GenericArrayData specialized for primitive type array

add unit tests







[GitHub] spark issue #13746: [SPARK-16030] [SQL] Allow specifying static partitions w...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13746
  
**[Test build #60760 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60760/consoleFull)**
 for PR 13746 at commit 
[`b1adc7a`](https://github.com/apache/spark/commit/b1adc7aa8a217d18ac0ab6eb89306e23d82c8f86).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13755: [SPARK-16040][MLlib][DOC]:spark.mllib PIC document extra...

2016-06-17 Thread srowen
Github user srowen commented on the issue:

https://github.com/apache/spark/pull/13755
  
OK





[GitHub] spark issue #13730: [SPARK-16006][SQL] Attemping to write empty DataFrame wi...

2016-06-17 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue:

https://github.com/apache/spark/pull/13730
  
Hi, @rxin.
Could you review this PR?





[GitHub] spark pull request #13724: [SPARK-15973] [PYSPARK] Fix GroupedData Documenta...

2016-06-17 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/13724





[GitHub] spark issue #13744: [SPARK-16025] [CORE] Document OFF_HEAP storage level in ...

2016-06-17 Thread srowen
Github user srowen commented on the issue:

https://github.com/apache/spark/pull/13744
  
Looks OK to me





[GitHub] spark issue #13724: [SPARK-15973] [PYSPARK] Fix GroupedData Documentation

2016-06-17 Thread rxin
Github user rxin commented on the issue:

https://github.com/apache/spark/pull/13724
  
@josh-howes what's your jira id?






[GitHub] spark issue #13724: [SPARK-15973] [PYSPARK] Fix GroupedData Documentation

2016-06-17 Thread rxin
Github user rxin commented on the issue:

https://github.com/apache/spark/pull/13724
  
Thanks - merging in master/2.0.






[GitHub] spark pull request #13742: [SPARK-16023][SQL] Move InMemoryRelation to its o...

2016-06-17 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/13742





[GitHub] spark issue #13742: [SPARK-16023][SQL] Move InMemoryRelation to its own file

2016-06-17 Thread rxin
Github user rxin commented on the issue:

https://github.com/apache/spark/pull/13742
  
Merging in master/2.0.






[GitHub] spark pull request #13746: [SPARK-16030] [SQL] Allow specifying static parti...

2016-06-17 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/13746#discussion_r67596845
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
 ---
@@ -43,11 +44,116 @@ import org.apache.spark.unsafe.types.UTF8String
  * Replaces generic operations with specific variants that are designed to 
work with Spark
  * SQL Data Sources.
  */
-private[sql] object DataSourceAnalysis extends Rule[LogicalPlan] {
+private[sql] case class DataSourceAnalysis(conf: CatalystConf) extends 
Rule[LogicalPlan] {
+
+  def resolver: Resolver = {
+if (conf.caseSensitiveAnalysis) {
+  caseSensitiveResolution
+} else {
+  caseInsensitiveResolution
+}
+  }
+
   override def apply(plan: LogicalPlan): LogicalPlan = plan transform {
+
+// If the InsertIntoTable command is for a partitioned 
HadoopFsRelation and
+// the user has specified static partitions, we add a Project operator 
on top of the query
+// to include those constant column values in the query result.
+//
+// Example:
+// Let's say that we have a table "t", which is created by
+// CREATE TABLE t (a INT, b INT, c INT) USING parquet PARTITIONED BY 
(b, c)
+// The statement of "INSERT INTO TABLE t PARTITION (b=2, c) SELECT 1, 
3"
+// will be converted to "INSERT INTO TABLE t PARTITION (b, c) SELECT 
1, 2, 3".
+//
+// Basically, we will put those partition columns having a assigned 
value back
+// to the SELECT clause. The output of the SELECT clause is organized 
as
+// normal_columns static_partitioning_columns 
dynamic_partitioning_columns.
+// static_partitioning_columns are partitioning columns having assigned
+// values in the PARTITION clause (e.g. b in the above example).
+// dynamic_partitioning_columns are partitioning columns that do not 
assigned
+// values in the PARTITION clause (e.g. c in the above example).
+case i @ logical.InsertIntoTable(
+l @ LogicalRelation(t: HadoopFsRelation, _, _), parts, query, 
overwrite, false)
--- End diff --

and don't use t or i here either ...






[GitHub] spark pull request #13746: [SPARK-16030] [SQL] Allow specifying static parti...

2016-06-17 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/13746#discussion_r67596839
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
 ---
@@ -43,11 +44,116 @@ import org.apache.spark.unsafe.types.UTF8String
  * Replaces generic operations with specific variants that are designed to 
work with Spark
  * SQL Data Sources.
  */
-private[sql] object DataSourceAnalysis extends Rule[LogicalPlan] {
+private[sql] case class DataSourceAnalysis(conf: CatalystConf) extends 
Rule[LogicalPlan] {
+
+  def resolver: Resolver = {
+if (conf.caseSensitiveAnalysis) {
+  caseSensitiveResolution
+} else {
+  caseInsensitiveResolution
+}
+  }
+
   override def apply(plan: LogicalPlan): LogicalPlan = plan transform {
+
+// If the InsertIntoTable command is for a partitioned 
HadoopFsRelation and
+// the user has specified static partitions, we add a Project operator 
on top of the query
+// to include those constant column values in the query result.
+//
+// Example:
+// Let's say that we have a table "t", which is created by
+// CREATE TABLE t (a INT, b INT, c INT) USING parquet PARTITIONED BY 
(b, c)
+// The statement of "INSERT INTO TABLE t PARTITION (b=2, c) SELECT 1, 
3"
+// will be converted to "INSERT INTO TABLE t PARTITION (b, c) SELECT 
1, 2, 3".
+//
+// Basically, we will put those partition columns having a assigned 
value back
+// to the SELECT clause. The output of the SELECT clause is organized 
as
+// normal_columns static_partitioning_columns 
dynamic_partitioning_columns.
+// static_partitioning_columns are partitioning columns having assigned
+// values in the PARTITION clause (e.g. b in the above example).
+// dynamic_partitioning_columns are partitioning columns that do not 
assigned
+// values in the PARTITION clause (e.g. c in the above example).
+case i @ logical.InsertIntoTable(
+l @ LogicalRelation(t: HadoopFsRelation, _, _), parts, query, 
overwrite, false)
--- End diff --

l is a bad name to use





[GitHub] spark issue #13746: [SPARK-16030] [SQL] Allow specifying static partitions w...

2016-06-17 Thread rxin
Github user rxin commented on the issue:

https://github.com/apache/spark/pull/13746
  
Actually, can you try to do it in this PR? I worry that, as with past pull 
requests, we will never revisit it and the code will still look like this 
2 years from now.






[GitHub] spark issue #13746: [SPARK-16030] [SQL] Allow specifying static partitions w...

2016-06-17 Thread rxin
Github user rxin commented on the issue:

https://github.com/apache/spark/pull/13746
  
This is ok for 2.0, but for something like this we should write it in a way 
so the rewriting logic can be unit tested without end-to-end suites.
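
One way to read this request is sketched below in Java. It is only an illustration under assumptions: ```StaticPartitionRewriteSketch``` and ```rewriteSelectList``` are hypothetical names, and the sketch operates on plain strings instead of Catalyst plans. It follows the ordering described in the diff comment (normal columns, then static partitioning columns, then dynamic partitioning columns).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch: the static-partition rewrite as a pure function. */
final class StaticPartitionRewriteSketch {

    /**
     * @param selectList    expressions produced by the SELECT clause
     *                      (normal columns followed by dynamic partition columns)
     * @param partitionSpec partition column -> static value, or null when the
     *                      column is dynamic; iteration order is the order of
     *                      the PARTITION clause
     * @return normal columns, then static partition values, then dynamic columns
     */
    static List<String> rewriteSelectList(List<String> selectList,
                                          Map<String, String> partitionSpec) {
        int dynamicCount =
            (int) partitionSpec.values().stream().filter(v -> v == null).count();
        int normalCount = selectList.size() - dynamicCount;
        if (normalCount < 0) {
            throw new IllegalArgumentException(
                "SELECT provides fewer columns than there are dynamic partitions");
        }
        // Normal (non-partition) columns come first.
        List<String> result = new ArrayList<>(selectList.subList(0, normalCount));
        // Static partition values are put back into the SELECT list.
        for (String value : partitionSpec.values()) {
            if (value != null) {
                result.add(value);
            }
        }
        // Dynamic partition columns keep their original expressions.
        result.addAll(selectList.subList(normalCount, selectList.size()));
        return result;
    }
}
```

With the example from the diff comment, a SELECT list of ```1, 3``` and a spec of ```b=2, c``` yield ```1, 2, 3```, and a unit test can assert this without starting a Spark session.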






[GitHub] spark pull request #13749: [SPARK-16034][SQL] Checks the partition columns w...

2016-06-17 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/13749#discussion_r67596783
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala
 ---
@@ -435,26 +435,25 @@ case class DataSource(
 // If we are appending to a table that already exists, make sure 
the partitioning matches
 // up.  If we fail to load the table for whatever reason, ignore 
the check.
 if (mode == SaveMode.Append) {
-  val existingPartitionColumnSet = try {
-Some(
-  resolveRelation()
-.asInstanceOf[HadoopFsRelation]
-.location
-.partitionSpec()
-.partitionColumns
-.fieldNames
-.toSet)
-  } catch {
-case e: Exception =>
-  None
-  }
-
-  existingPartitionColumnSet.foreach { ex =>
-if (ex.map(_.toLowerCase) != 
partitionColumns.map(_.toLowerCase()).toSet) {
-  throw new AnalysisException(
-s"Requested partitioning does not equal existing 
partitioning: " +
-s"$ex != ${partitionColumns.toSet}.")
-}
+  val existingColumns = Try {
+resolveRelation()
+  .asInstanceOf[HadoopFsRelation]
+  .location
+  .partitionSpec()
+  .partitionColumns
+  .fieldNames
+  .toSeq
+  }.getOrElse(Seq.empty[String])
+  val sameColumns =
+existingColumns.map(_.toLowerCase) == 
partitionColumns.map(_.toLowerCase)
+  if (existingColumns.size > 0 && !sameColumns) {
+throw new AnalysisException(
+  s"""Requested partitioning does not match existing 
partitioning.
--- End diff --

Can you add "Requested partitioning does not match existing partitioning 
for table $table"?
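
A minimal Java sketch of the check with the table name in the message, for illustration only: ```PartitionCheckSketch``` and ```checkPartitioning``` are hypothetical names, and ```IllegalArgumentException``` stands in for Spark's ```AnalysisException```.

```java
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;

/** Hypothetical sketch of the append-mode partitioning check. */
final class PartitionCheckSketch {

    static List<String> lower(List<String> names) {
        return names.stream()
            .map(n -> n.toLowerCase(Locale.ROOT))
            .collect(Collectors.toList());
    }

    /**
     * Compares column names case-insensitively and in order. An empty
     * list of existing columns means the table could not be loaded,
     * so the check is skipped, as in the diff above.
     */
    static void checkPartitioning(String table,
                                  List<String> existing,
                                  List<String> requested) {
        boolean sameColumns = lower(existing).equals(lower(requested));
        if (!existing.isEmpty() && !sameColumns) {
            throw new IllegalArgumentException(
                "Requested partitioning does not match existing partitioning for table "
                    + table + ": existing = " + existing + ", requested = " + requested);
        }
    }
}
```

Because the lists are compared in order, ```(b, c)``` and ```(c, b)``` count as a mismatch here, which matches the switch from ```toSet``` to ```toSeq``` in the diff.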





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13749
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60754/
Test PASSed.





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13749
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13749
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60753/
Test PASSed.





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13749
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13749
  
**[Test build #60754 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60754/consoleFull)**
 for PR 13749 at commit 
[`7a4293b`](https://github.com/apache/spark/commit/7a4293bd68896701e4ca61a9df06133357bc0f8a).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13757: [SPARK-16042][SQL] Eliminate nullcheck code at projectio...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13757
  
**[Test build #60767 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60767/consoleFull)**
 for PR 13757 at commit 
[`3875e1f`](https://github.com/apache/spark/commit/3875e1fe20f7032c3da007bcdccd9f2c627710db).





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13749
  
**[Test build #60753 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60753/consoleFull)**
 for PR 13749 at commit 
[`5224802`](https://github.com/apache/spark/commit/5224802418a2917ae09f64d8bffce80a0c4d5201).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request #13757: [SPARK-16042][SQL] Eliminate nullcheck code at pr...

2016-06-17 Thread kiszk
GitHub user kiszk opened a pull request:

https://github.com/apache/spark/pull/13757

[SPARK-16042][SQL] Eliminate nullcheck code at projection for an array type

## What changes were proposed in this pull request?

This PR eliminates the nullcheck code generated at projection for an array 
type. Currently, a nullcheck is generated around the call that writes each 
element of an array (lines 076-078 in "Before applying this PR"). If we know 
at compilation time that none of the elements is ```null```, we can 
eliminate the nullcheck code.

This PR checks whether ```ArrayType.containsNull``` is ```false``` to 
determine that none of the array elements is ```null```.
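
The idea can be sketched as a small code generator in Java. This is an illustration, not Spark's actual codegen: ```NullCheckCodegenSketch``` and ```genElementWrite``` are hypothetical names, and only the variable names passed in the usage note are taken from the generated code quoted in this message.

```java
/** Hypothetical sketch: emit the per-element write with or without a nullcheck. */
final class NullCheckCodegenSketch {

    /**
     * @param array        name of the generated ArrayData variable
     * @param writer       name of the generated array writer variable
     * @param index        name of the generated loop index variable
     * @param containsNull value of ArrayType.containsNull for the array
     * @return Java source for writing one element of a double array
     */
    static String genElementWrite(String array, String writer, String index,
                                  boolean containsNull) {
        String write =
            writer + ".write(" + index + ", " + array + ".getDouble(" + index + "));";
        if (!containsNull) {
            // No element can be null, so the isNullAt branch is omitted.
            return write;
        }
        return "if (" + array + ".isNullAt(" + index + ")) {\n"
            + "  " + writer + ".setNullAt(" + index + ");\n"
            + "} else {\n"
            + "  " + write + "\n"
            + "}";
    }
}
```

Calling ```genElementWrite("project_value", "project_arrayWriter", "project_index", false)``` yields a single ```write``` statement, while passing ```true``` wraps it in the ```isNullAt``` branch.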



An example program

```
val df = sparkContext.parallelize(Seq(1.0, 2.0), 1).toDF("v")
df.selectExpr("Array(v + 2.2, v + 3.3)").collect
```

Before applying this PR

```
/* 028 */   protected void processNext() throws java.io.IOException {
/* 029 */ while (inputadapter_input.hasNext()) {
/* 030 */   InternalRow inputadapter_row = (InternalRow) inputadapter_input.next();
/* 031 */   double inputadapter_value = inputadapter_row.getDouble(0);
/* 032 */
/* 033 */   final boolean project_isNull = false;
/* 034 */   this.project_values = new Object[2];
/* 035 */   double project_value1 = -1.0;
/* 036 */   project_value1 = inputadapter_value + 2.2D;
/* 037 */   if (false) {
/* 038 */ project_values[0] = null;
/* 039 */   } else {
/* 040 */ project_values[0] = project_value1;
/* 041 */   }
/* 042 */
/* 043 */   double project_value4 = -1.0;
/* 044 */   project_value4 = inputadapter_value + 3.3D;
/* 045 */   if (false) {
/* 046 */ project_values[1] = null;
/* 047 */   } else {
/* 048 */ project_values[1] = project_value4;
/* 049 */   }
/* 050 */
/* 051 */   final ArrayData project_value = new org.apache.spark.sql.catalyst.util.GenericArrayData(project_values);
/* 052 */   this.project_values = null;
/* 053 */   project_holder.reset();
/* 054 */
/* 055 */   project_rowWriter.zeroOutNullBytes();
/* 056 */
/* 057 */   if (project_isNull) {
/* 058 */ project_rowWriter.setNullAt(0);
/* 059 */   } else {
/* 060 */ // Remember the current cursor so that we can calculate how many bytes are
/* 061 */ // written later.
/* 062 */ final int project_tmpCursor = project_holder.cursor;
/* 063 */
/* 064 */ if (project_value instanceof UnsafeArrayData) {
/* 065 */   final int project_sizeInBytes = ((UnsafeArrayData) project_value).getSizeInBytes();
/* 066 */   // grow the global buffer before writing data.
/* 067 */   project_holder.grow(project_sizeInBytes);
/* 068 */   ((UnsafeArrayData) project_value).writeToMemory(project_holder.buffer, project_holder.cursor);
/* 069 */   project_holder.cursor += project_sizeInBytes;
/* 070 */
/* 071 */ } else {
/* 072 */   final int project_numElements = project_value.numElements();
/* 073 */   project_arrayWriter.initialize(project_holder, project_numElements, 8);
/* 074 */
/* 075 */   for (int project_index = 0; project_index < project_numElements; project_index++) {
/* 076 */ if (project_value.isNullAt(project_index)) {
/* 077 */   project_arrayWriter.setNullAt(project_index);
/* 078 */ } else {
/* 079 */   final double project_element = project_value.getDouble(project_index);
/* 080 */   project_arrayWriter.write(project_index, project_element);
/* 081 */ }
/* 082 */
/* 083 */   }
/* 084 */ }
/* 085 */
/* 086 */ project_rowWriter.setOffsetAndSize(0, project_tmpCursor, project_holder.cursor - project_tmpCursor);
/* 087 */ project_rowWriter.alignToWords(project_holder.cursor - project_tmpCursor);
/* 088 */   }
/* 089 */   project_result.setTotalSize(project_holder.totalSize());
/* 090 */   append(project_result);
/* 091 */   if (shouldStop()) return;
/* 092 */ }
```

After applying this PR

```
/* 028 */   protected void processNext() throws java.io.IOException {
/* 029 */ while (inputadapter_input.hasNext()) {
/* 030 */   InternalRow inputadapter_row = (InternalRow) inputadapter_input.next();
/* 031 */   double inputadapter_value = inputadapter_row.getDouble(0);
/* 032 */
/* 033 */   final boolean project_isNull = false;
/* 034 */   this.project_values = new Object[2];
/* 035 */   double project_value1 = -1.0;
/* 036 */   project_value1 = inputadapter_value + 2.2D;
/* 037 */   if (f

[GitHub] spark issue #13734: [SPARK-14995][R] Add `since` tag in Roxygen documentatio...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13734
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60765/
Test PASSed.





[GitHub] spark issue #13734: [SPARK-14995][R] Add `since` tag in Roxygen documentatio...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13734
  
**[Test build #60765 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60765/consoleFull)**
 for PR 13734 at commit 
[`0092967`](https://github.com/apache/spark/commit/0092967df878bbaaf5ad2a007d56cc68146daea2).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13734: [SPARK-14995][R] Add `since` tag in Roxygen documentatio...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13734
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13754: [SPARK-16036][SPARK-16037][SQL] fix various table insert...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13754
  
Merged build finished. Test FAILed.





[GitHub] spark issue #13754: [SPARK-16036][SPARK-16037][SQL] fix various table insert...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13754
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60759/
Test FAILed.





[GitHub] spark issue #13754: [SPARK-16036][SPARK-16037][SQL] fix various table insert...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13754
  
**[Test build #60759 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60759/consoleFull)**
 for PR 13754 at commit 
[`52e67d4`](https://github.com/apache/spark/commit/52e67d48019c5b27cb5f59584a7de15e0a1869c9).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request #13706: [SPARK-15988] [SQL] Implement DDL commands: Creat...

2016-06-17 Thread lianhuiwang
Github user lianhuiwang commented on a diff in the pull request:

https://github.com/apache/spark/pull/13706#discussion_r67596616
  
--- Diff: sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 ---
@@ -97,6 +97,9 @@ statement
 | CREATE TEMPORARY? FUNCTION qualifiedName AS className=STRING
 (USING resource (',' resource)*)?  
#createFunction
 | DROP TEMPORARY? FUNCTION (IF EXISTS)? qualifiedName  
#dropFunction
+| CREATE TEMPORARY MACRO macroName=identifier
--- End diff --

No. Hive currently supports only temporary macros.





[GitHub] spark issue #12268: [SPARK-14480][SQL] Simplify CSV parsing process with a b...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/12268
  
Merged build finished. Test PASSed.





[GitHub] spark issue #12268: [SPARK-14480][SQL] Simplify CSV parsing process with a b...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/12268
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60751/
Test PASSed.





[GitHub] spark issue #10896: [SPARK-12978][SQL] Skip unnecessary final group-by when ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/10896
  
**[Test build #60766 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60766/consoleFull)**
 for PR 10896 at commit 
[`adad55d`](https://github.com/apache/spark/commit/adad55dbaf4adab313f970c82c2b7a45298c36eb).





[GitHub] spark issue #12268: [SPARK-14480][SQL] Simplify CSV parsing process with a b...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/12268
  
**[Test build #60751 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60751/consoleFull)**
 for PR 12268 at commit 
[`7abdfc1`](https://github.com/apache/spark/commit/7abdfc66f2bf275fc4318c0ffe8836dcbb70).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request #13541: [SPARK-15803][PYSPARK] Support with statement syn...

2016-06-17 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/13541





[GitHub] spark issue #13541: [SPARK-15803][PYSPARK] Support with statement syntax for...

2016-06-17 Thread davies
Github user davies commented on the issue:

https://github.com/apache/spark/pull/13541
  
Merging this into master and 2.0, thanks!





[GitHub] spark issue #13541: [SPARK-15803][PYSPARK] Support with statement syntax for...

2016-06-17 Thread davies
Github user davies commented on the issue:

https://github.com/apache/spark/pull/13541
  
LGTM





[GitHub] spark issue #13734: [SPARK-14995][R] Add `since` tag in Roxygen documentatio...

2016-06-17 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue:

https://github.com/apache/spark/pull/13734
  
Hi, @shivaram and @felixcheung .
The document is now updated against master and shows the merged notes correctly. I 
manually checked all the merged notes and used function signatures for 
overloaded functions.
```
Note
sparkR.stop since 1.4.0
sparkR.session.stop since 2.0.0
```

https://home.apache.org/~dongjoon/spark-2.0.0-docs/api/R/sparkR.session.stop.html





[GitHub] spark issue #13592: [SPARK-15863][SQL][DOC] Initial SQL programming guide up...

2016-06-17 Thread maropu
Github user maropu commented on the issue:

https://github.com/apache/spark/pull/13592
  
@liancheng Is it worth adding two parameters, 
`spark.sql.files.maxPartitionBytes` and `spark.sql.files.openCostInBytes`, to 
`Other Configuration Options`? Although they are somewhat internal parameters, 
they seem useful for users who would like to control the number of partitions. 
https://issues.apache.org/jira/browse/SPARK-15894
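
To illustrate why these two settings affect the number of partitions, here is a minimal sketch in plain Java. It assumes the split-size rule that Spark 2.0's file source planning appears to use (cap the split at `maxPartitionBytes`, but never go below `openCostInBytes`, with each file padded by the open cost) and the default values of 128 MB and 4 MB. The class name `SplitSizeSketch` and the method `maxSplitBytes` are hypothetical names for this illustration, not Spark API.

```java
import java.util.Arrays;

public class SplitSizeSketch {
    // Assumed defaults for spark.sql.files.maxPartitionBytes and
    // spark.sql.files.openCostInBytes (128 MB and 4 MB).
    public static final long DEFAULT_MAX_PARTITION_BYTES = 128L * 1024 * 1024;
    public static final long DEFAULT_OPEN_COST_IN_BYTES = 4L * 1024 * 1024;

    // Hypothetical helper: target split size under the assumed rule
    //   min(maxPartitionBytes, max(openCostInBytes, totalBytes / parallelism)),
    // where totalBytes adds openCostInBytes to every file's length.
    public static long maxSplitBytes(long[] fileSizes, int defaultParallelism,
                                     long maxPartitionBytes, long openCostInBytes) {
        long totalBytes = 0L;
        for (long size : fileSizes) {
            totalBytes += size + openCostInBytes;
        }
        long bytesPerCore = totalBytes / defaultParallelism;
        return Math.min(maxPartitionBytes, Math.max(openCostInBytes, bytesPerCore));
    }

    public static void main(String[] args) {
        // Ten files of 64 MB each, read with a default parallelism of 8.
        long[] files = new long[10];
        Arrays.fill(files, 64L * 1024 * 1024);
        System.out.println(maxSplitBytes(files, 8,
                DEFAULT_MAX_PARTITION_BYTES, DEFAULT_OPEN_COST_IN_BYTES));
    }
}
```

Under this rule, lowering `spark.sql.files.maxPartitionBytes` produces smaller splits and therefore more partitions, while raising `spark.sql.files.openCostInBytes` sets a higher floor on the split size so that many small files are packed into fewer partitions.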





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13752
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60763/
Test PASSed.





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13752
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13752
  
**[Test build #60763 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60763/consoleFull)**
 for PR 13752 at commit 
[`f5f401c`](https://github.com/apache/spark/commit/f5f401c42034396a45015cd558f62cbf83af1719).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13753
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60762/
Test PASSed.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13753
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13753
  
**[Test build #60762 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60762/consoleFull)**
 for PR 13753 at commit 
[`d70154c`](https://github.com/apache/spark/commit/d70154cee47ca9fe803e79d673396a833b057fa7).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13734: [SPARK-14995][R] Add `since` tag in Roxygen documentatio...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13734
  
**[Test build #60765 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60765/consoleFull)**
 for PR 13734 at commit 
[`0092967`](https://github.com/apache/spark/commit/0092967df878bbaaf5ad2a007d56cc68146daea2).





[GitHub] spark pull request #13750: [SPARK-16035][PYSPARK] Fix SparseVector parser as...

2016-06-17 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/13750





[GitHub] spark issue #13750: [SPARK-16035][PYSPARK] Fix SparseVector parser assertion...

2016-06-17 Thread mengxr
Github user mengxr commented on the issue:

https://github.com/apache/spark/pull/13750
  
Merged into master and branch-2.0. Thanks!





[GitHub] spark pull request #13641: [SPARK-10258][DOC][ML] Add @Since annotations to ...

2016-06-17 Thread mengxr
Github user mengxr commented on a diff in the pull request:

https://github.com/apache/spark/pull/13641#discussion_r67596152
  
--- Diff: 
mllib/src/main/scala/org/apache/spark/ml/feature/ElementwiseProduct.scala ---
@@ -33,21 +33,26 @@ import org.apache.spark.sql.types.DataType
  * multiplier.
  */
 @Experimental
-class ElementwiseProduct(override val uid: String)
+@Since("1.4.0")
+class ElementwiseProduct @Since("1.4.0") (@Since("1.4.0") override val uid: String)
   extends UnaryTransformer[Vector, Vector, ElementwiseProduct] with DefaultParamsWritable {
 
+  @Since("1.4.0")
   def this() = this(Identifiable.randomUID("elemProd"))
 
   /**
* the vector to multiply with input vectors
* @group param
*/
+  @Since("1.4.0")
   val scalingVec: Param[Vector] = new Param(this, "scalingVec", "vector for hadamard product")
 
   /** @group setParam */
+  @Since("1.4.0")
--- End diff --

ditto





[GitHub] spark pull request #13641: [SPARK-10258][DOC][ML] Add @Since annotations to ...

2016-06-17 Thread mengxr
Github user mengxr commented on a diff in the pull request:

https://github.com/apache/spark/pull/13641#discussion_r67596150
  
--- Diff: 
mllib/src/main/scala/org/apache/spark/ml/feature/ElementwiseProduct.scala ---
@@ -33,21 +33,26 @@ import org.apache.spark.sql.types.DataType
  * multiplier.
  */
 @Experimental
-class ElementwiseProduct(override val uid: String)
+@Since("1.4.0")
+class ElementwiseProduct @Since("1.4.0") (@Since("1.4.0") override val uid: String)
   extends UnaryTransformer[Vector, Vector, ElementwiseProduct] with DefaultParamsWritable {
 
+  @Since("1.4.0")
   def this() = this(Identifiable.randomUID("elemProd"))
 
   /**
* the vector to multiply with input vectors
* @group param
*/
+  @Since("1.4.0")
--- End diff --

This uses the new `Vector`, so ideally it should be marked since `2.0.0`. I'm not sure 
whether we should change the class-level since version to 2.0 as well.





[GitHub] spark issue #13755: [SPARK-16040][MLlib][DOC]:spark.mllib PIC document extra...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13755
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13755: [SPARK-16040][MLlib][DOC]:spark.mllib PIC document extra...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13755
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60761/
Test PASSed.





[GitHub] spark issue #13755: [SPARK-16040][MLlib][DOC]:spark.mllib PIC document extra...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13755
  
**[Test build #60761 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60761/consoleFull)**
 for PR 13755 at commit 
[`4262ad3`](https://github.com/apache/spark/commit/4262ad30ea347f78d42990d0a39bb5db56648821).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13753
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13753
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60757/
Test PASSed.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13753
  
**[Test build #60757 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60757/consoleFull)**
 for PR 13753 at commit 
[`0b2fe6f`](https://github.com/apache/spark/commit/0b2fe6fbb80599a00ab32b00d0ea256739682519).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13751: [SPARK-15159][SPARKR] SparkSession roxygen2 doc, program...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13751
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60756/
Test PASSed.





[GitHub] spark issue #13751: [SPARK-15159][SPARKR] SparkSession roxygen2 doc, program...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13751
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13751: [SPARK-15159][SPARKR] SparkSession roxygen2 doc, program...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13751
  
**[Test build #60756 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60756/consoleFull)**
 for PR 13751 at commit 
[`c434614`](https://github.com/apache/spark/commit/c434614321a3650a18b228f6333ffe245b6ea7ee).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13752
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60755/
Test PASSed.





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13752
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13752
  
**[Test build #60755 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60755/consoleFull)**
 for PR 13752 at commit 
[`179fbdc`](https://github.com/apache/spark/commit/179fbdc7f93a929e00aaeeb575eb404038385252).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13745
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60758/
Test PASSed.





[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13745
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13756: [SPARK-16041][SQL] Disallow Duplicate Columns in partiti...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13756
  
**[Test build #60764 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60764/consoleFull)**
 for PR 13756 at commit 
[`83082ff`](https://github.com/apache/spark/commit/83082ffb9dae922374b2d2f6051cce4bf24cb1de).





[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13745
  
**[Test build #60758 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60758/consoleFull)**
 for PR 13745 at commit 
[`3b01f11`](https://github.com/apache/spark/commit/3b01f111a417b398c659637b7b977df5c387011d).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request #13756: [SPARK-16041][SQL] Disallow Duplicate Columns in ...

2016-06-17 Thread gatorsmile
GitHub user gatorsmile opened a pull request:

https://github.com/apache/spark/pull/13756

[SPARK-16041][SQL] Disallow Duplicate Columns in partitionBy, bucketBy and 
sortBy in DataFrameWriter

## What changes were proposed in this pull request?
Duplicate columns are not allowed in `partitionBy`, `bucketBy`, `sortBy` in 
`DataFrameWriter`. Duplicate columns can cause unpredictable results, for 
example a resolution failure. 

This PR detects the duplicates and raises exceptions with appropriate 
messages.
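
As a rough illustration of the check described above (not the code in this PR), duplicate detection over a column list could be sketched in plain Python as follows. The function names, the error wording, and the case-insensitive default are all assumptions made for this example.

```python
from collections import Counter


def find_duplicates(columns, case_sensitive=False):
    """Return the column names that occur more than once, in first-seen order.

    Hypothetical helper: names are lower-cased before comparison unless
    case_sensitive is True.
    """
    normalize = (lambda c: c) if case_sensitive else str.lower
    counts = Counter(normalize(c) for c in columns)
    duplicates = []
    for c in columns:
        key = normalize(c)
        if counts[key] > 1 and key not in duplicates:
            duplicates.append(key)
    return duplicates


def assert_no_duplicates(columns, clause):
    """Raise ValueError naming the clause and the offending columns."""
    duplicates = find_duplicates(columns)
    if duplicates:
        raise ValueError(
            "Found duplicate column(s) in %s: %s" % (clause, ", ".join(duplicates))
        )
```

Calling `assert_no_duplicates(["year", "month", "year"], "partitionBy")` would raise with a message naming `year`, so the caller sees the problem before any write is attempted.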

## How was this patch tested?
Added test cases in `DataFrameReaderWriterSuite`


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/gatorsmile/spark dedup

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13756.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13756


commit 83082ffb9dae922374b2d2f6051cce4bf24cb1de
Author: gatorsmile 
Date:   2016-06-18T05:13:36Z

fix







[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13745
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13745
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60752/
Test PASSed.





[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13745
  
**[Test build #60752 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60752/consoleFull)**
 for PR 13745 at commit 
[`01e4a08`](https://github.com/apache/spark/commit/01e4a08b3a6154a0e04391fa8299821894109bb3).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13752
  
**[Test build #60763 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60763/consoleFull)**
 for PR 13752 at commit 
[`f5f401c`](https://github.com/apache/spark/commit/f5f401c42034396a45015cd558f62cbf83af1719).





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13753
  
**[Test build #60762 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60762/consoleFull)**
 for PR 13753 at commit 
[`d70154c`](https://github.com/apache/spark/commit/d70154cee47ca9fe803e79d673396a833b057fa7).





[GitHub] spark issue #13755: [SPARK-16040][MLlib][DOC]:spark.mllib PIC document extra...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13755
  
**[Test build #60761 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60761/consoleFull)**
 for PR 13755 at commit 
[`4262ad3`](https://github.com/apache/spark/commit/4262ad30ea347f78d42990d0a39bb5db56648821).





[GitHub] spark pull request #13755: [SPARK-16040][MLlib][DOC]:spark.mllib PIC documen...

2016-06-17 Thread wangmiao1981
GitHub user wangmiao1981 opened a pull request:

https://github.com/apache/spark/pull/13755

[SPARK-16040][MLlib][DOC]:spark.mllib PIC document extra line of reference

## What changes were proposed in this pull request?



In the 2.0 documentation, the line "A full example that produces the experiment 
described in the PIC paper can be found under examples/." is redundant.

The page already says "Find full example code at 
"examples/src/main/scala/org/apache/spark/examples/mllib/PowerIterationClusteringExample.scala"
 in the Spark repo.".

We should remove the first line, which makes this page consistent with the 
other documents.



## How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration 
tests, manual tests)

Manual test

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/wangmiao1981/spark doc

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13755.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13755


commit 4262ad30ea347f78d42990d0a39bb5db56648821
Author: wm...@hotmail.com 
Date:   2016-06-18T05:04:20Z

remove extra line







[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread liancheng
Github user liancheng commented on the issue:

https://github.com/apache/spark/pull/13753
  
LGTM pending Jenkins. Thanks!





[GitHub] spark issue #13635: [SPARK-15159][SPARKR] SparkR SparkSession API

2016-06-17 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue:

https://github.com/apache/spark/pull/13635
  
Oh, great!!





[GitHub] spark issue #13718: [SPARK-16002][SQL]Sleep when no new data arrives to avoi...

2016-06-17 Thread zsxwing
Github user zsxwing commented on the issue:

https://github.com/apache/spark/pull/13718
  
@tdas I fixed the test. Could you take another look?





[GitHub] spark issue #13746: [SPARK-16030] [SQL] Allow specifying static partitions w...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13746
  
**[Test build #60760 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60760/consoleFull)**
 for PR 13746 at commit 
[`b1adc7a`](https://github.com/apache/spark/commit/b1adc7aa8a217d18ac0ab6eb89306e23d82c8f86).





[GitHub] spark issue #13754: [SPARK-16036][SPARK-16037][SQL] fix various table insert...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13754
  
**[Test build #60759 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60759/consoleFull)**
 for PR 13754 at commit 
[`52e67d4`](https://github.com/apache/spark/commit/52e67d48019c5b27cb5f59584a7de15e0a1869c9).





[GitHub] spark pull request #13740: [SPARK-16020][SQL]Fix complete mode aggregation w...

2016-06-17 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/13740





[GitHub] spark pull request #13754: [SPARK-16036][SPARK-16037][SQL] fix various table...

2016-06-17 Thread cloud-fan
GitHub user cloud-fan opened a pull request:

https://github.com/apache/spark/pull/13754

[SPARK-16036][SPARK-16037][SQL] fix various table insertion semantics

## What changes were proposed in this pull request?

The current table insertion has some weird behaviours:

1. inserting into a partitioned table with mismatched columns gives a 
confusing error message for Hive tables, and a wrong result for data source 
tables
2. inserting into a partitioned table without a partition list gives a wrong 
result for Hive tables.

This PR fixes these 2 problems.
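
To make the column-mismatch case more concrete, here is a minimal plain-Python sketch of one possible validation rule. It is not taken from this PR: it assumes the query must supply every data column plus every partition column that is not given a static value, and all function and parameter names are hypothetical.

```python
def expected_query_columns(data_columns, partition_columns, static_partitions):
    """List the columns the inserted query is assumed to have to provide.

    static_partitions maps partition column name -> fixed value; those
    columns are not expected from the query (an assumption of this sketch).
    """
    unknown = set(static_partitions) - set(partition_columns)
    if unknown:
        raise ValueError("Not partition columns: %s" % ", ".join(sorted(unknown)))
    dynamic = [p for p in partition_columns if p not in static_partitions]
    return list(data_columns) + dynamic


def check_insert(query_columns, data_columns, partition_columns, static_partitions):
    """Fail early with a readable message when the column counts differ."""
    expected = expected_query_columns(
        data_columns, partition_columns, static_partitions
    )
    if len(query_columns) != len(expected):
        raise ValueError(
            "Cannot insert: query produces %d column(s) but the table expects %d (%s)"
            % (len(query_columns), len(expected), ", ".join(expected))
        )
```

The point of the sketch is only that a count check performed up front can report which columns were expected, instead of failing later with an unrelated message or silently writing misaligned data.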

## How was this patch tested?

new test in hive `SQLQuerySuite`


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/cloud-fan/spark insert2

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13754.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13754


commit 52e67d48019c5b27cb5f59584a7de15e0a1869c9
Author: Wenchen Fan 
Date:   2016-06-17T21:37:31Z

fix table insertion semantics







[GitHub] spark issue #13754: [SPARK-16036][SPARK-16037][SQL] fix various table insert...

2016-06-17 Thread cloud-fan
Github user cloud-fan commented on the issue:

https://github.com/apache/spark/pull/13754
  
cc @yhuai @marmbrus @rxin 





[GitHub] spark issue #13740: [SPARK-16020][SQL]Fix complete mode aggregation with con...

2016-06-17 Thread zsxwing
Github user zsxwing commented on the issue:

https://github.com/apache/spark/pull/13740
  
The added test passed. I'm going to merge it into master and 2.0 to make 
sure it's in rc1.





[GitHub] spark issue #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and deprec...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13753
  
**[Test build #60757 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60757/consoleFull)**
 for PR 13753 at commit 
[`0b2fe6f`](https://github.com/apache/spark/commit/0b2fe6fbb80599a00ab32b00d0ea256739682519).





[GitHub] spark issue #13745: [Spark 15997][DOC][ML] Update user guide for HashingTF, ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13745
  
**[Test build #60758 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60758/consoleFull)**
 for PR 13745 at commit 
[`3b01f11`](https://github.com/apache/spark/commit/3b01f111a417b398c659637b7b977df5c387011d).





[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13752
  
**[Test build #60755 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60755/consoleFull)**
 for PR 13752 at commit 
[`179fbdc`](https://github.com/apache/spark/commit/179fbdc7f93a929e00aaeeb575eb404038385252).





[GitHub] spark issue #13751: [SPARK-15159][SPARKR] SparkSession roxygen2 doc, program...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13751
  
**[Test build #60756 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60756/consoleFull)**
 for PR 13751 at commit 
[`c434614`](https://github.com/apache/spark/commit/c434614321a3650a18b228f6333ffe245b6ea7ee).





[GitHub] spark pull request #13753: [SPARK-16029][SPARKR] SparkR add dropTempView and...

2016-06-17 Thread felixcheung
GitHub user felixcheung opened a pull request:

https://github.com/apache/spark/pull/13753

[SPARK-16029][SPARKR] SparkR add dropTempView and deprecate dropTempTable

## What changes were proposed in this pull request?

Add dropTempView and deprecate dropTempTable
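
The general pattern of adding a new function and keeping the old name as a deprecated alias can be sketched as below. This is a plain-Python illustration, not SparkR code: the snake_case function names and the in-memory `_temp_views` registry are made up for the example.

```python
import warnings

# Hypothetical in-memory registry standing in for a session's temporary views.
_temp_views = {"people": object()}


def drop_temp_view(name):
    """Remove a temporary view; return True if it existed."""
    return _temp_views.pop(name, None) is not None


def drop_temp_table(name):
    """Deprecated alias kept for compatibility; forwards to drop_temp_view."""
    warnings.warn(
        "drop_temp_table is deprecated; use drop_temp_view instead",
        DeprecationWarning,
        stacklevel=2,  # attribute the warning to the caller, not this wrapper
    )
    return drop_temp_view(name)
```

Existing callers of the old name keep working but see a warning that points them at the replacement, which gives them a release cycle to migrate.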

## How was this patch tested?

unit tests

@shivaram @liancheng


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/felixcheung/spark rdroptempview

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13753.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13753


commit 0b2fe6fbb80599a00ab32b00d0ea256739682519
Author: Felix Cheung 
Date:   2016-06-18T04:50:53Z

Add dropTempView and deprecate dropTempTable







[GitHub] spark issue #13752: [SPARK-16028][SPARKR] spark.lapply can work with active ...

2016-06-17 Thread felixcheung
Github user felixcheung commented on the issue:

https://github.com/apache/spark/pull/13752
  
example update in the other PR.





[GitHub] spark pull request #13752: [SPARK-16028][SPARKR] spark.lapply can work with ...

2016-06-17 Thread felixcheung
GitHub user felixcheung opened a pull request:

https://github.com/apache/spark/pull/13752

[SPARK-16028][SPARKR] spark.lapply can work with active context

## What changes were proposed in this pull request?

spark.lapply and setLogLevel


## How was this patch tested?

unit test

@shivaram @thunterdb 


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/felixcheung/spark rlapply

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13752.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13752


commit 179fbdc7f93a929e00aaeeb575eb404038385252
Author: Felix Cheung 
Date:   2016-06-18T04:48:05Z

lapply should not need spark context







[GitHub] spark pull request #13751: [SPARK-15159][SPARKR] SparkSession roxygen2 doc, ...

2016-06-17 Thread felixcheung
GitHub user felixcheung opened a pull request:

https://github.com/apache/spark/pull/13751

[SPARK-15159][SPARKR] SparkSession roxygen2 doc, programming guide, example 
updates

## What changes were proposed in this pull request?

roxygen2 doc, programming guide, example updates

## How was this patch tested?

manual checks
@shivaram 


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/felixcheung/spark rsparksessiondoc

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/13751.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #13751


commit c434614321a3650a18b228f6333ffe245b6ea7ee
Author: Felix Cheung 
Date:   2016-06-18T04:39:44Z

roxygen2 doc, programming guide, example updates







[GitHub] spark issue #13725: [SPARK-15892][ML] Backport correctly merging AFTAggregat...

2016-06-17 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue:

https://github.com/apache/spark/pull/13725
  
I am closing this as this is merged. Thank you @mengxr again!





[GitHub] spark pull request #13725: [SPARK-15892][ML] Backport correctly merging AFTA...

2016-06-17 Thread HyukjinKwon
Github user HyukjinKwon closed the pull request at:

https://github.com/apache/spark/pull/13725





[GitHub] spark issue #13750: [SPARK-16035][PYSPARK] Fix SparseVector parser assertion...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13750
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60749/
Test PASSed.





[GitHub] spark issue #13750: [SPARK-16035][PYSPARK] Fix SparseVector parser assertion...

2016-06-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/13750
  
Merged build finished. Test PASSed.





[GitHub] spark issue #13749: [SPARK-16034][SQL] Checks the partition columns when cal...

2016-06-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/13749
  
**[Test build #60754 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60754/consoleFull)** for PR 13749 at commit [`7a4293b`](https://github.com/apache/spark/commit/7a4293bd68896701e4ca61a9df06133357bc0f8a).




