date:20180630

[GitHub] spark issue #21684: [SPARK-24470][Core] RestSubmissionClient to be robust ag...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21684
  
Merged build finished. Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #21684: [SPARK-24470][Core] RestSubmissionClient to be robust ag...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21684
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92507/
Test PASSed.


---


[GitHub] spark issue #21684: [SPARK-24470][Core] RestSubmissionClient to be robust ag...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21684
  
**[Test build #92507 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92507/testReport)**
 for PR 21684 at commit 
[`e2eaa1c`](https://github.com/apache/spark/commit/e2eaa1c47d16549713da9cca7cce06552c384c0c).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread huaxingao

Github user huaxingao commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199338975
  
--- Diff: R/pkg/R/DataFrame.R ---
@@ -3905,6 +3905,16 @@ setMethod("rollup",
 groupedData(sgd)
   })
 
+isTypeAllowedForSqlHint <- function(x) {
+  if (is.character(x) | is.numeric(x)) {
+TRUE
+  } else if (is.list(x)) {
+all (sapply(x, (function (y) is.character(y) | is.numeric(y))))
+  } else {
+FALSE
+  }
+}
+
 #' hint
 #'
 #' Specifies execution plan hint and return a new SparkDataFrame.
--- End diff --

Scala does not document which types a hint supports. The signature simply 
takes `Any`: 
```
def hint(name: String, parameters: Any*): Dataset[T]
```
I was trying to follow the Python implementation in #20788, which allows 
`basestring, list, float, int, dict`. Now I am not sure this is the right 
behavior. Should we allow any type? 
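
For comparison, the check in the R diff above could be sketched in Scala roughly as follows. This is only an illustration: `HintTypeCheck` and `isTypeAllowedForSqlHint` are hypothetical names that mirror the R helper and are not part of Spark's `Dataset` API, and treating every `java.lang.Number` as "numeric" is an assumption.

```scala
// Hypothetical helper mirroring the R function in the diff above;
// it is not part of Spark's Dataset API.
object HintTypeCheck {
  // A scalar is allowed when it is a string or a (boxed) numeric value.
  private def isScalarAllowed(x: Any): Boolean = x match {
    case _: String => true
    case _: java.lang.Number => true // boxed Int, Long, Double, ...
    case _ => false
  }

  // Allow scalars, and sequences whose elements are all allowed scalars.
  def isTypeAllowedForSqlHint(x: Any): Boolean = x match {
    case xs: Seq[_] => xs.forall(isScalarAllowed)
    case other => isScalarAllowed(other)
  }
}
```

With this sketch, `HintTypeCheck.isTypeAllowedForSqlHint(Seq("a", 2))` is accepted while a `Map` argument is rejected, which is stricter than the `Any*` signature.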


---


[GitHub] spark issue #21535: [SPARK-23596][SQL][WIP] Test interpreted path on Dataset...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21535
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21535: [SPARK-23596][SQL][WIP] Test interpreted path on Dataset...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21535
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/607/
Test PASSed.


---


[GitHub] spark issue #21535: [SPARK-23596][SQL][WIP] Test interpreted path on Dataset...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21535
  
**[Test build #92508 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92508/testReport)**
 for PR 21535 at commit 
[`250074b`](https://github.com/apache/spark/commit/250074b0377c3fbcf63ebf355b6d61c4f4f9e446).


---


[GitHub] spark issue #21535: [SPARK-23596][SQL][WIP] Test interpreted path on Dataset...

2018-06-30 Thread viirya

Github user viirya commented on the issue:

https://github.com/apache/spark/pull/21535
  
retest this please.


---


[GitHub] spark issue #21535: [SPARK-23596][SQL][WIP] Test interpreted path on Dataset...

2018-06-30 Thread viirya

Github user viirya commented on the issue:

https://github.com/apache/spark/pull/21535
  
ping @hvanhovell @cloud-fan 



---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21073
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92505/
Test PASSed.


---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21073
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21073
  
**[Test build #92505 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92505/testReport)**
 for PR 21073 at commit 
[`4ee7b46`](https://github.com/apache/spark/commit/4ee7b46ef7d337c37b6198104ea37a93a5584e72).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark issue #21667: [SPARK-24691][SQL]Add new API `supportDataType` in FileF...

2018-06-30 Thread gengliangwang

Github user gengliangwang commented on the issue:

https://github.com/apache/spark/pull/21667
  
@hvanhovell 


---


[GitHub] spark issue #21685: [SPARK-24707][DSTREAMS] Enable spark-kafka-streaming to ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21685
  
Can one of the admins verify this patch?


---



[GitHub] spark pull request #21685: [SPARK-24707][DSTREAMS] Enable spark-kafka-stream...

2018-06-30 Thread sidhavratha

GitHub user sidhavratha opened a pull request:

https://github.com/apache/spark/pull/21685

[SPARK-24707][DSTREAMS] Enable spark-kafka-streaming to maintain min …

…buffer using async thread to avoid blocking kafka poll

## What changes were proposed in this pull request?

Currently the Spark Kafka RDD blocks on the Kafka consumer poll. In a 
Spark-Kafka-streaming job in particular, this poll duration is added to the 
batch processing time, which results in:
* Increased batch processing time (on top of the time taken to process 
records)
* Unpredictable batch processing time, since it depends on the poll time.

This PR maintains a minimum number of records in a buffer, so that 
streaming batches do not have to block on the Kafka poll.
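
The general idea of keeping a minimum buffer filled from a background thread can be sketched as below. This is not the code from this PR: `PrefetchBuffer`, `fetch`, and `minRecords` are hypothetical names, and `fetch` merely stands in for a blocking Kafka poll.

```scala
import java.util.concurrent.LinkedBlockingQueue
import java.util.concurrent.atomic.AtomicBoolean

// Hypothetical sketch: `fetch` stands in for a blocking Kafka poll and
// `minRecords` for the minimum buffer size; neither name comes from the PR.
class PrefetchBuffer[T](fetch: () => Seq[T], minRecords: Int) {
  private val queue = new LinkedBlockingQueue[T]()
  private val running = new AtomicBoolean(true)

  // Background thread: refill whenever the buffer drops below the minimum.
  private val filler = new Thread(new Runnable {
    override def run(): Unit = {
      while (running.get()) {
        if (queue.size() < minRecords) {
          val batch = fetch()
          if (batch.isEmpty) Thread.sleep(1) // nothing available yet, back off
          else batch.foreach(r => queue.put(r))
        } else {
          Thread.sleep(1) // buffer is full enough, check again shortly
        }
      }
    }
  })
  filler.setDaemon(true)
  filler.start()

  // The consumer blocks only when the buffer is completely empty.
  def next(): T = queue.take()

  // Ask the background thread to stop after its current iteration.
  def close(): Unit = running.set(false)
}
```

In this sketch the batch-processing side calls `next()` and usually finds records already buffered, so the poll latency is paid by the filler thread rather than by the batch.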

## How was this patch tested?

Unit test / manual test.

[Before_change.pdf](https://github.com/apache/spark/files/2152353/Before_change.pdf)

[After_change_2000_buffer_per_part.pdf](https://github.com/apache/spark/files/2152354/After_change_2000_buffer_per_part.pdf)


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/sidhavratha/spark master

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/21685.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #21685


commit 35d792a83f13291a99cd1bf3ce89f932614da9c0
Author: s0k00rv 
Date:   2018-07-01T03:00:45Z

[SPARK-24707][DSTREAMS] Enable spark-kafka-streaming to maintain min buffer 
using async thread to avoid blocking kafka poll




---


[GitHub] spark issue #21684: [SPARK-24470][Core] RestSubmissionClient to be robust ag...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21684
  
**[Test build #92507 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92507/testReport)**
 for PR 21684 at commit 
[`e2eaa1c`](https://github.com/apache/spark/commit/e2eaa1c47d16549713da9cca7cce06552c384c0c).


---


[GitHub] spark issue #21684: [SPARK-24470][Core] RestSubmissionClient to be robust ag...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21684
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/606/
Test PASSed.


---


[GitHub] spark issue #21684: [SPARK-24470][Core] RestSubmissionClient to be robust ag...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21684
  
Merged build finished. Test PASSed.


---


[GitHub] spark pull request #21684: [SPARK-24470][Core] RestSubmissionClient to be ro...

2018-06-30 Thread rekhajoshm

GitHub user rekhajoshm opened a pull request:

https://github.com/apache/spark/pull/21684

[SPARK-24470][Core] RestSubmissionClient to be robust against 404 & non 
json responses

## What changes were proposed in this pull request?
Added a check for 404 to avoid parsing JSON from a not-found response, and to 
avoid reporting a malformed or bad request when the response was actually an 
HTTP not found. 
I am not sure whether an additional check for non-JSON responses is needed 
[if (connection.getHeaderField("Content-Type").contains("text/html")) then 
exception], since non-JSON is a subset of malformed JSON and is already 
covered in the flow.
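
A minimal sketch of the ordering described above, checking the status code (and optionally the content type) before any JSON parsing. This is not the actual `RestSubmissionClient` code: `ResponseCheck`, `Outcome`, and `classify` are hypothetical names. One detail worth noting is that `getHeaderField` returns `null` when the header is absent, so the sketch wraps it in `Option`.

```scala
import java.net.HttpURLConnection

// Hypothetical sketch, not the actual RestSubmissionClient implementation.
object ResponseCheck {
  sealed trait Outcome
  case object NotFound extends Outcome  // report "not found", skip parsing
  case object NonJson extends Outcome   // e.g. an HTML error page
  case object ParseJson extends Outcome // safe to hand to the JSON parser

  // Decide what to do with a response before any JSON parsing is attempted.
  def classify(statusCode: Int, contentType: Option[String]): Outcome =
    if (statusCode == HttpURLConnection.HTTP_NOT_FOUND) NotFound
    else if (contentType.exists(_.contains("text/html"))) NonJson
    else ParseJson

  // Convenience wrapper over a live connection; a missing header becomes None.
  def classify(connection: HttpURLConnection): Outcome =
    classify(connection.getResponseCode, Option(connection.getHeaderField("Content-Type")))
}
```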

## How was this patch tested?
./dev/run-tests

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/rekhajoshm/spark SPARK-24470

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/21684.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #21684


commit e3677c9fa9697e0d34f9df52442085a6a481c9e9
Author: Rekha Joshi 
Date:   2015-05-05T23:10:08Z

Merge pull request #1 from apache/master

Pulling functionality from apache spark

commit 106fd8eee8f6a6f7c67cfc64f57c1161f76d8f75
Author: Rekha Joshi 
Date:   2015-05-08T21:49:09Z

Merge pull request #2 from apache/master

pull latest from apache spark

commit 0be142d6becba7c09c6eba0b8ea1efe83d649e8c
Author: Rekha Joshi 
Date:   2015-06-22T00:08:08Z

Merge pull request #3 from apache/master

Pulling functionality from apache spark

commit 6c6ee12fd733e3f9902e10faf92ccb78211245e3
Author: Rekha Joshi 
Date:   2015-09-17T01:03:09Z

Merge pull request #4 from apache/master

Pulling functionality from apache spark

commit b123c601e459d1ad17511fd91dd304032154882a
Author: Rekha Joshi 
Date:   2015-11-25T18:50:32Z

Merge pull request #5 from apache/master

pull request from apache/master

commit c73c32aadd6066e631956923725a48d98a18777e
Author: Rekha Joshi 
Date:   2016-03-18T19:13:51Z

Merge pull request #6 from apache/master

pull latest from apache spark

commit 7dbf7320057978526635bed09dabc8cf8657a28a
Author: Rekha Joshi 
Date:   2016-04-05T20:26:40Z

Merge pull request #8 from apache/master

pull latest from apache spark

commit 5e9d71827f8e2e4d07027281b80e4e073e7fecd1
Author: Rekha Joshi 
Date:   2017-05-01T23:00:30Z

Merge pull request #9 from apache/master

Pull apache spark

commit 63d99b3ce5f222d7126133170a373591f0ac67dd
Author: Rekha Joshi 
Date:   2017-09-30T22:26:44Z

Merge pull request #10 from apache/master

pull latest apache spark

commit a7fc787466b71784ff86f9694f617db0f1042da8
Author: Rekha Joshi 
Date:   2018-01-21T00:17:58Z

Merge pull request #11 from apache/master

Apache spark pull latest

commit 3a2d45377ed4397de802badd764bc2588cfd275b
Author: Rekha Joshi 
Date:   2018-02-09T04:55:12Z

Merge pull request #12 from apache/master

Apache spark latest pull

commit dca3a9e6f4c5fb6d34aeac7d2579c24102834fc2
Author: Rekha Joshi 
Date:   2018-05-31T22:27:26Z

Merge pull request #13 from apache/master

apache spark latest pull

commit fe58c3770814543dbe923420f0bf77a74696f8d4
Author: Rekha Joshi 
Date:   2018-06-21T05:59:44Z

Merge remote-tracking branch 'upstream/master'

commit ae51f60e9451b4b6be7200f6f142e871c38a1a8a
Author: Rekha Joshi 
Date:   2018-06-21T06:10:45Z

Merge pull request #14 from apache/master

latest apache spark

commit 1cf4ed2df52fbb1404ca8ddf015ac799989b7d62
Author: Rekha Joshi 
Date:   2018-06-30T17:17:49Z

Merge branch 'master' of https://github.com/rekhajoshm/spark

commit 1c48d4f6e49e5c818609ae1e82ed48ac8426a6d0
Author: Rekha Joshi 
Date:   2018-06-30T17:20:10Z

Merge pull request #15 from apache/master

apache spark latest

commit aa735f5e1680b9abf23ce8e40b0b8436fdb61e2b
Author: Rekha Joshi 
Date:   2018-06-30T17:21:09Z

Merge branch 'master' of https://github.com/rekhajoshm/spark

commit e2eaa1c47d16549713da9cca7cce06552c384c0c
Author: Rekha Joshi 
Date:   2018-07-01T01:39:45Z

[SPARK-24470][Core] RestSubmissionClient to be robust against 404 & non 
json responses




---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread HyukjinKwon

Github user HyukjinKwon commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199336223
  
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala ---
@@ -317,16 +292,52 @@ class JacksonParser(
 row
   }
 
+  private def parseTimestamp(stringValue: String): Long = {
+// This one will lose microseconds parts.
+// See https://issues.apache.org/jira/browse/SPARK-10681.x
+Try(options.timestampFormat.parse(stringValue).getTime * 1000L)
+  .getOrElse {
+// If it fails to parse, then tries the way used in 2.0 and 1.x for backwards
+// compatibility.
+DateTimeUtils.stringToTime(stringValue).getTime * 1000L
+  }
+  }
+
+  private def parseDate(stringValue: String): Int = {
+Try(DateTimeUtils.millisToDays(options.dateFormat.parse(stringValue).getTime))
+  .orElse {
+// If it fails to parse, then tries the way used in 2.0 and 1.x for backwards
+// compatibility.
+Try(DateTimeUtils.millisToDays(DateTimeUtils.stringToTime(stringValue).getTime))
+  }
+  .getOrElse {
+// In Spark 1.5.0, we store the data as number of days since epoch in string.
+// So, we just convert it to Int.
+stringValue.toInt
+  }
+  }
+
   /**
* Parse an object as a Map, preserving all fields.
*/
   private def convertMap(
   parser: JsonParser,
+  keyType: DataType,
   fieldConverter: ValueConverter): MapData = {
-val keys = ArrayBuffer.empty[UTF8String]
+val keys = ArrayBuffer.empty[Any]
 val values = ArrayBuffer.empty[Any]
 while (nextUntil(parser, JsonToken.END_OBJECT)) {
-  keys += UTF8String.fromString(parser.getCurrentName)
+
+  val keyValue = keyType match {
+case DateType =>
+  parseDate(parser.getCurrentName)
+case TimestampType =>
+  parseTimestamp(parser.getCurrentName)
+case _ =>
+  UTF8String.fromString(parser.getCurrentName)
+  }
+
+  keys += keyValue
--- End diff --

Yea, maybe that does not need to block this. However, one thing I am a bit 
hesitant about is whether we can really support other types with a minimal 
fix, and whether it is worth it. Otherwise I should see whether it is worth a 
big change. Let me check again and leave a comment soon.


---


[GitHub] spark pull request #21640: [SPARK-24654][BUILD] Update, fix LICENSE and NOTI...

2018-06-30 Thread asfgit

Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/21640


---


[GitHub] spark issue #21681: Pin tag 210

2018-06-30 Thread srowen

Github user srowen commented on the issue:

https://github.com/apache/spark/pull/21681
  
Close this @zhangchj1990 


---


[GitHub] spark issue #21640: [SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and...

2018-06-30 Thread srowen

Github user srowen commented on the issue:

https://github.com/apache/spark/pull/21640
  
Merged to master


---


[GitHub] spark pull request #21660: [SPARK-24683][K8S] Fix k8s no resource

2018-06-30 Thread rekhajoshm

Github user rekhajoshm commented on a diff in the pull request:

https://github.com/apache/spark/pull/21660#discussion_r199335279
  
--- Diff: resource-managers/kubernetes/integration-tests/src/test/scala/org/apache/spark/deploy/k8s/integrationtest/KubernetesSuite.scala ---
@@ -21,17 +21,17 @@ import java.nio.file.{Path, Paths}
 import java.util.UUID
 import java.util.regex.Pattern
 
-import scala.collection.JavaConverters._
-
 import com.google.common.io.PatternFilenameFilter
 import io.fabric8.kubernetes.api.model.{Container, Pod}
 import org.scalatest.{BeforeAndAfter, BeforeAndAfterAll}
 import org.scalatest.concurrent.{Eventually, PatienceConfiguration}
 import org.scalatest.time.{Minutes, Seconds, Span}
+import scala.collection.JavaConverters._
--- End diff --

This order does not seem correct. It could be an unintentional move. Please 
check.


---


[GitHub] spark issue #21683: [SPARK-24507][Documentation] Update streaming guide

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21683
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21683: [SPARK-24507][Documentation] Update streaming guide

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21683
  
**[Test build #92506 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92506/testReport)**
 for PR 21683 at commit 
[`61bec7b`](https://github.com/apache/spark/commit/61bec7bd9cd0fc6a87eaadbc3336df7274cd373d).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark issue #21683: [SPARK-24507][Documentation] Update streaming guide

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21683
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92506/
Test PASSed.


---


[GitHub] spark issue #21683: [SPARK-24507][Documentation] Update streaming guide

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21683
  
**[Test build #92506 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92506/testReport)**
 for PR 21683 at commit 
[`61bec7b`](https://github.com/apache/spark/commit/61bec7bd9cd0fc6a87eaadbc3336df7274cd373d).


---


[GitHub] spark issue #21683: [SPARK-24507][Documentation] Update streaming guide

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21683
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/605/
Test PASSed.


---


[GitHub] spark issue #21683: [SPARK-24507][Documentation] Update streaming guide

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21683
  
Merged build finished. Test PASSed.


---


[GitHub] spark pull request #21683: [SPARK-24507][Documentation] Update streaming gui...

2018-06-30 Thread rekhajoshm

GitHub user rekhajoshm opened a pull request:

https://github.com/apache/spark/pull/21683

[SPARK-24507][Documentation] Update streaming guide

## What changes were proposed in this pull request?
Updated streaming guide for direct stream and link to integration guide.

## How was this patch tested?
jekyll build


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/rekhajoshm/spark SPARK-24507

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/21683.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #21683


commit e3677c9fa9697e0d34f9df52442085a6a481c9e9
Author: Rekha Joshi 
Date:   2015-05-05T23:10:08Z

Merge pull request #1 from apache/master

Pulling functionality from apache spark

commit 106fd8eee8f6a6f7c67cfc64f57c1161f76d8f75
Author: Rekha Joshi 
Date:   2015-05-08T21:49:09Z

Merge pull request #2 from apache/master

pull latest from apache spark

commit 0be142d6becba7c09c6eba0b8ea1efe83d649e8c
Author: Rekha Joshi 
Date:   2015-06-22T00:08:08Z

Merge pull request #3 from apache/master

Pulling functionality from apache spark

commit 6c6ee12fd733e3f9902e10faf92ccb78211245e3
Author: Rekha Joshi 
Date:   2015-09-17T01:03:09Z

Merge pull request #4 from apache/master

Pulling functionality from apache spark

commit b123c601e459d1ad17511fd91dd304032154882a
Author: Rekha Joshi 
Date:   2015-11-25T18:50:32Z

Merge pull request #5 from apache/master

pull request from apache/master

commit c73c32aadd6066e631956923725a48d98a18777e
Author: Rekha Joshi 
Date:   2016-03-18T19:13:51Z

Merge pull request #6 from apache/master

pull latest from apache spark

commit 7dbf7320057978526635bed09dabc8cf8657a28a
Author: Rekha Joshi 
Date:   2016-04-05T20:26:40Z

Merge pull request #8 from apache/master

pull latest from apache spark

commit 5e9d71827f8e2e4d07027281b80e4e073e7fecd1
Author: Rekha Joshi 
Date:   2017-05-01T23:00:30Z

Merge pull request #9 from apache/master

Pull apache spark

commit 63d99b3ce5f222d7126133170a373591f0ac67dd
Author: Rekha Joshi 
Date:   2017-09-30T22:26:44Z

Merge pull request #10 from apache/master

pull latest apache spark

commit a7fc787466b71784ff86f9694f617db0f1042da8
Author: Rekha Joshi 
Date:   2018-01-21T00:17:58Z

Merge pull request #11 from apache/master

Apache spark pull latest

commit 3a2d45377ed4397de802badd764bc2588cfd275b
Author: Rekha Joshi 
Date:   2018-02-09T04:55:12Z

Merge pull request #12 from apache/master

Apache spark latest pull

commit dca3a9e6f4c5fb6d34aeac7d2579c24102834fc2
Author: Rekha Joshi 
Date:   2018-05-31T22:27:26Z

Merge pull request #13 from apache/master

apache spark latest pull

commit fe58c3770814543dbe923420f0bf77a74696f8d4
Author: Rekha Joshi 
Date:   2018-06-21T05:59:44Z

Merge remote-tracking branch 'upstream/master'

commit ae51f60e9451b4b6be7200f6f142e871c38a1a8a
Author: Rekha Joshi 
Date:   2018-06-21T06:10:45Z

Merge pull request #14 from apache/master

latest apache spark

commit 1cf4ed2df52fbb1404ca8ddf015ac799989b7d62
Author: Rekha Joshi 
Date:   2018-06-30T17:17:49Z

Merge branch 'master' of https://github.com/rekhajoshm/spark

commit 1c48d4f6e49e5c818609ae1e82ed48ac8426a6d0
Author: Rekha Joshi 
Date:   2018-06-30T17:20:10Z

Merge pull request #15 from apache/master

apache spark latest

commit aa735f5e1680b9abf23ce8e40b0b8436fdb61e2b
Author: Rekha Joshi 
Date:   2018-06-30T17:21:09Z

Merge branch 'master' of https://github.com/rekhajoshm/spark

commit 61bec7bd9cd0fc6a87eaadbc3336df7274cd373d
Author: Rekha Joshi 
Date:   2018-06-30T23:45:39Z

[SPARK-24507][Docs] Update streaming guide




---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21073
  
**[Test build #92505 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92505/testReport)**
 for PR 21073 at commit 
[`4ee7b46`](https://github.com/apache/spark/commit/4ee7b46ef7d337c37b6198104ea37a93a5584e72).


---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21073
  
Build finished. Test PASSed.


---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21073
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92502/
Test PASSed.


---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21073
  
**[Test build #92502 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92502/testReport)**
 for PR 21073 at commit 
[`484ac17`](https://github.com/apache/spark/commit/484ac173d253efcf83a86edc02f61cc3cba57d02).
 * This patch passes all tests.
 * This patch **does not merge cleanly**.
 * This patch adds no public classes.


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92504/
Test PASSed.


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21677
  
**[Test build #92504 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92504/testReport)**
 for PR 21677 at commit 
[`ec62e13`](https://github.com/apache/spark/commit/ec62e13b9034ca81002dffe07a5ffbd1b425240a).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark issue #21680: [SPARK-24704][WebUI] Fix the order of stages in the DAG ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21680
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92501/
Test PASSed.


---


[GitHub] spark issue #21680: [SPARK-24704][WebUI] Fix the order of stages in the DAG ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21680
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21680: [SPARK-24704][WebUI] Fix the order of stages in the DAG ...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21680
  
**[Test build #92501 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92501/testReport)**
 for PR 21680 at commit 
[`b3420d6`](https://github.com/apache/spark/commit/b3420d61025f7bb9e17160dfb586bc54fba1a51d).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark issue #21682: [SPARK-24706][SQL] ByteType and ShortType support pushdo...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21682
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92503/
Test FAILed.


---


[GitHub] spark issue #21682: [SPARK-24706][SQL] ByteType and ShortType support pushdo...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21682
  
Merged build finished. Test FAILed.


---


[GitHub] spark issue #21682: [SPARK-24706][SQL] ByteType and ShortType support pushdo...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21682
  
**[Test build #92503 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92503/testReport)**
 for PR 21682 at commit 
[`e9d5625`](https://github.com/apache/spark/commit/e9d56252e6c65f5afa207bc98c8c5e008de57e0c).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread MaxGekk

Github user MaxGekk commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199331278
  
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonParser.scala ---
@@ -317,16 +292,52 @@ class JacksonParser(
     row
   }
 
+  private def parseTimestamp(stringValue: String): Long = {
+    // This one will lose microseconds parts.
+    // See https://issues.apache.org/jira/browse/SPARK-10681.x
+    Try(options.timestampFormat.parse(stringValue).getTime * 1000L)
+      .getOrElse {
+        // If it fails to parse, then tries the way used in 2.0 and 1.x for backwards
+        // compatibility.
+        DateTimeUtils.stringToTime(stringValue).getTime * 1000L
+      }
+  }
+
+  private def parseDate(stringValue: String): Int = {
+    Try(DateTimeUtils.millisToDays(options.dateFormat.parse(stringValue).getTime))
+      .orElse {
+        // If it fails to parse, then tries the way used in 2.0 and 1.x for backwards
+        // compatibility.
+        Try(DateTimeUtils.millisToDays(DateTimeUtils.stringToTime(stringValue).getTime))
+      }
+      .getOrElse {
+        // In Spark 1.5.0, we store the data as number of days since epoch in string.
+        // So, we just convert it to Int.
+        stringValue.toInt
+      }
+  }
+
   /**
    * Parse an object as a Map, preserving all fields.
    */
   private def convertMap(
       parser: JsonParser,
+      keyType: DataType,
       fieldConverter: ValueConverter): MapData = {
-    val keys = ArrayBuffer.empty[UTF8String]
+    val keys = ArrayBuffer.empty[Any]
     val values = ArrayBuffer.empty[Any]
     while (nextUntil(parser, JsonToken.END_OBJECT)) {
-      keys += UTF8String.fromString(parser.getCurrentName)
+
+      val keyValue = keyType match {
+        case DateType =>
+          parseDate(parser.getCurrentName)
+        case TimestampType =>
+          parseTimestamp(parser.getCurrentName)
+        case _ =>
+          UTF8String.fromString(parser.getCurrentName)
+      }
+
+      keys += keyValue
--- End diff --

I don't think that supporting every type as a key type needs to block this PR. 
You can support only date and timestamp keys in this PR, and postpone a generic 
solution to a follow-up. @HyukjinKwon @viirya What do you think of that? 
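
For readers following along, the fallback order in the quoted `parseDate` can be sketched in Python. This is only an illustration under stated assumptions, not the PR's code: `parse_date_key`, `convert_key`, and the `date_format` default are hypothetical names, and `date.fromisoformat` merely stands in for the legacy `DateTimeUtils.stringToTime` path.

```python
from datetime import date, datetime

EPOCH = date(1970, 1, 1)


def parse_date_key(key: str, date_format: str = "%Y-%m-%d") -> int:
    """Return days since the epoch for a JSON object key (hypothetical helper)."""
    # 1. Try the configured format first.
    try:
        return (datetime.strptime(key, date_format).date() - EPOCH).days
    except ValueError:
        pass
    # 2. Fall back to a second parser (stand-in for the legacy path).
    try:
        return (date.fromisoformat(key) - EPOCH).days
    except ValueError:
        pass
    # 3. Last resort: the key already holds the day count as a string.
    return int(key)


def convert_key(key: str, key_type: str):
    """Dispatch on the declared key type; anything but "date" stays a string."""
    if key_type == "date":
        return parse_date_key(key)
    return key
```

Keeping the plain-string branch as the default mirrors the `case _` arm in the diff, so key types other than dates keep their current behaviour.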


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21677
  
**[Test build #92504 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92504/testReport)**
 for PR 21677 at commit 
[`ec62e13`](https://github.com/apache/spark/commit/ec62e13b9034ca81002dffe07a5ffbd1b425240a).


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/604/
Test PASSed.


---


[GitHub] spark issue #21682: [SPARK-24706][SQL] ByteType and ShortType support pushdo...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21682
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/603/
Test PASSed.


---


[GitHub] spark issue #21682: [SPARK-24706][SQL] ByteType and ShortType support pushdo...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21682
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21682: [SPARK-24706][SQL] ByteType and ShortType support pushdo...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21682
  
**[Test build #92503 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92503/testReport)**
 for PR 21682 at commit 
[`e9d5625`](https://github.com/apache/spark/commit/e9d56252e6c65f5afa207bc98c8c5e008de57e0c).


---


[GitHub] spark pull request #21682: [SPARK-24706][SQL] ByteType and ShortType support...

2018-06-30 Thread wangyum

GitHub user wangyum opened a pull request:

https://github.com/apache/spark/pull/21682

[SPARK-24706][SQL] ByteType and ShortType support pushdown to parquet

## What changes were proposed in this pull request?

`ByteType` and `ShortType` support pushdown to parquet data source.

## How was this patch tested?

unit tests


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/wangyum/spark SPARK-24706

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/21682.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #21682


commit e9d56252e6c65f5afa207bc98c8c5e008de57e0c
Author: Yuming Wang 
Date:   2018-06-30T19:13:13Z

ByteType and ShortType pushdown to parquet




---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread patrickmcgloin

Github user patrickmcgloin commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199330240
  
--- Diff: python/pyspark/sql/functions.py ---
@@ -2163,9 +2163,9 @@ def json_tuple(col, *fields):
 @since(2.1)
 def from_json(col, schema, options={}):
     """
-    Parses a column containing a JSON string into a :class:`MapType` with :class:`StringType`
-    as keys type, :class:`StructType` or :class:`ArrayType` of :class:`StructType`\\s with
-    the specified schema. Returns `null`, in the case of an unparseable string.
+    Parses a column containing a JSON string into a :class:`MapType`, :class:`StructType`
+    or :class:`ArrayType` of :class:`StructType`\\s with the specified schema. Returns
+    `null`, in the case of an unparseable string.
--- End diff --

I think that other basic types (e.g. Int, Long, etc.) were already supported 
along with String. As long as a key can be converted directly from String to 
the type specified in the schema, it was OK. What we have added here is parsing 
for DateType and TimestampType. I don't think complex types would ever be 
supported, and I don't think other basic types need the parsing that dates and 
timestamps do.

Do you think we should specify which types are supported as keys?


---


[GitHub] spark issue #21495: [SPARK-24418][Build] Upgrade Scala to 2.11.12 and 2.12.6

2018-06-30 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue:

https://github.com/apache/spark/pull/21495
  
In addition to that, it would be great if we fix `sbt` soon. After this PR, 
`mvn` works correctly, but `sbt` is still hitting `NoSuchMethodError` in 
the `master` branch.
```scala
$ ./build/sbt -Pyarn -Phadoop-2.7 -Phadoop-cloud -Phive -Phive-thriftserver -Psparkr test:package
$ bin/spark-shell
scala> Spark context Web UI available at http://localhost:4040
Spark context available as 'sc' (master = local[*], app id = local-1530385877441).
Spark session available as 'spark'.
Exception in thread "main" java.lang.NoSuchMethodError: jline.console.completer.CandidateListCompletionHandler.setPrintSpaceAfterFullCompletion(Z)V
```


---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread patrickmcgloin

Github user patrickmcgloin commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199330156
  
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonGenerator.scala ---
@@ -43,6 +43,11 @@ private[sql] class JacksonGenerator(
   // we can directly access data in `ArrayData` without the help of `SpecificMutableRow`.
   private type ValueWriter = (SpecializedGetters, Int) => Unit
 
+  // A `ValueReader` is responsible for reading a field of an `InternalRow` to a String.
+  // The only purpose of this is to read the key values for a map so that they can be
+  // written as JSON filed names.
--- End diff --

Fixed


---


[GitHub] spark issue #21672: [SPARK-24694][K8S] Pass all app args to integration test...

2018-06-30 Thread skonto

Github user skonto commented on the issue:

https://github.com/apache/spark/pull/21672
  
@felixcheung we could call a test that takes more than one parameter, such as: 
https://github.com/apache/spark/blob/master/examples/src/main/scala/org/apache/spark/examples/MultiBroadcastTest.scala


---


[GitHub] spark issue #21623: [SPARK-24638][SQL] StringStartsWith support push down

2018-06-30 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue:

https://github.com/apache/spark/pull/21623
  
@cloud-fan AFAIK, ORC doesn't support custom filters yet. I'll follow up 
on that in ORC.


---


[GitHub] spark issue #21073: [SPARK-23936][SQL] Implement map_concat

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21073
  
**[Test build #92502 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92502/testReport)**
 for PR 21073 at commit 
[`484ac17`](https://github.com/apache/spark/commit/484ac173d253efcf83a86edc02f61cc3cba57d02).


---


[GitHub] spark issue #21649: [SPARK-23648][R][SQL]Adds more types for hint in SparkR

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21649
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21649: [SPARK-23648][R][SQL]Adds more types for hint in SparkR

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21649
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92500/
Test PASSed.


---


[GitHub] spark issue #21649: [SPARK-23648][R][SQL]Adds more types for hint in SparkR

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21649
  
**[Test build #92500 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92500/testReport)**
 for PR 21649 at commit 
[`d323dd0`](https://github.com/apache/spark/commit/d323dd00916957abb48a31440c3274c3877e5401).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread felixcheung

Github user felixcheung commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199328283
  
--- Diff: R/pkg/R/DataFrame.R ---
@@ -3905,6 +3905,16 @@ setMethod("rollup",
 groupedData(sgd)
   })
 
+isTypeAllowedForSqlHint <- function(x) {
+  if (is.character(x) | is.numeric(x)) {
--- End diff --

nit: very minor, both is.* calls return a single element, but with `||` you get 
short-circuit evaluation when is.character(x) is TRUE
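
To make the difference concrete, here is a rough analogy in Python rather than R (an assumption made for illustration only): R's `|` is the elementwise operator and evaluates both sides, while `||` stops once the left side is TRUE. Python's `|` and `or` on booleans differ in the same way. The helper names below are hypothetical and only record which checks ran.

```python
calls = []


def is_character(x) -> bool:
    calls.append("is_character")
    return isinstance(x, str)


def is_numeric(x) -> bool:
    calls.append("is_numeric")
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(x, (int, float)) and not isinstance(x, bool)


def allowed_eager(x) -> bool:
    # `|` evaluates both operands before combining them.
    return is_character(x) | is_numeric(x)


def allowed_short_circuit(x) -> bool:
    # `or` skips the right operand once the left one is true.
    return is_character(x) or is_numeric(x)
```

For a string argument, `allowed_eager` runs both checks while `allowed_short_circuit` runs only the first, which is the saving the nit refers to.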



---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread felixcheung

Github user felixcheung commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199328450
  
--- Diff: R/pkg/R/DataFrame.R ---
@@ -3905,6 +3905,16 @@ setMethod("rollup",
 groupedData(sgd)
   })
 
+isTypeAllowedForSqlHint <- function(x) {
+  if (is.character(x) | is.numeric(x)) {
+TRUE
+  } else if (is.list(x)) {
+all (sapply(x, (function (y) is.character(y) | is.numeric(y
--- End diff --

also, no space after `all` - it's really `all()`


---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread felixcheung

Github user felixcheung commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199328269
  
--- Diff: R/pkg/tests/fulltests/test_sparkSQL.R ---
@@ -2370,6 +2370,15 @@ test_that("join(), crossJoin() and merge() on a 
DataFrame", {
   expect_true(any(grepl("BroadcastHashJoin", execution_plan_broadcast)))
 })
 
+test_that("test hint", {
--- End diff --

ok


---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread felixcheung

Github user felixcheung commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199328430
  
--- Diff: R/pkg/R/DataFrame.R ---
@@ -3905,6 +3905,16 @@ setMethod("rollup",
 groupedData(sgd)
   })
 
+isTypeAllowedForSqlHint <- function(x) {
+  if (is.character(x) | is.numeric(x)) {
+TRUE
+  } else if (is.list(x)) {
+all (sapply(x, (function (y) is.character(y) | is.numeric(y
--- End diff --

ditto here


---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread felixcheung

Github user felixcheung commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199328470
  
--- Diff: R/pkg/R/DataFrame.R ---
@@ -3905,6 +3905,16 @@ setMethod("rollup",
 groupedData(sgd)
   })
 
+isTypeAllowedForSqlHint <- function(x) {
+  if (is.character(x) | is.numeric(x)) {
+TRUE
+  } else if (is.list(x)) {
+all (sapply(x, (function (y) is.character(y) | is.numeric(y
+  } else {
+FALSE
+  }
+}
+
 #' hint
 #'
 #' Specifies execution plan hint and return a new SparkDataFrame.
--- End diff --

Does Scala have any docs on the supported hint types?


---


[GitHub] spark issue #21640: [SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21640
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21640: [SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21640
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92499/
Test PASSed.


---


[GitHub] spark issue #21640: [SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21640
  
**[Test build #92499 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92499/testReport)**
 for PR 21640 at commit 
[`198b208`](https://github.com/apache/spark/commit/198b2082019d0a17ea327d03816cc7d496df7443).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark issue #21631: [SPARK-24645][SQL] Skip parsing when csvColumnPruning en...

2018-06-30 Thread MaxGekk

Github user MaxGekk commented on the issue:

https://github.com/apache/spark/pull/21631
  
The bug has already been fixed in uniVocity `2.6.5-SNAPSHOT`.


---


[GitHub] spark issue #21680: [SPARK-24704][WebUI] Fix the order of stages in the DAG ...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21680
  
**[Test build #92501 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92501/testReport)**
 for PR 21680 at commit 
[`b3420d6`](https://github.com/apache/spark/commit/b3420d61025f7bb9e17160dfb586bc54fba1a51d).


---


[GitHub] spark issue #21680: [SPARK-24704][WebUI] Fix the order of stages in the DAG ...

2018-06-30 Thread felixcheung

Github user felixcheung commented on the issue:

https://github.com/apache/spark/pull/21680
  
ok to test


---


[GitHub] spark issue #21649: [SPARK-23648][R][SQL]Adds more types for hint in SparkR

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21649
  
**[Test build #92500 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92500/testReport)**
 for PR 21649 at commit 
[`d323dd0`](https://github.com/apache/spark/commit/d323dd00916957abb48a31440c3274c3877e5401).


---


[GitHub] spark issue #21649: [SPARK-23648][R][SQL]Adds more types for hint in SparkR

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21649
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21649: [SPARK-23648][R][SQL]Adds more types for hint in SparkR

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21649
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/602/
Test PASSed.


---


[GitHub] spark pull request #21649: [SPARK-23648][R][SQL]Adds more types for hint in ...

2018-06-30 Thread huaxingao

Github user huaxingao commented on a diff in the pull request:

https://github.com/apache/spark/pull/21649#discussion_r199327384
  
--- Diff: R/pkg/R/DataFrame.R ---
@@ -3905,6 +3905,18 @@ setMethod("rollup",
 groupedData(sgd)
   })
 
+isTypeAllowed <- function(x) {
+  if (is.character(x)) {
+TRUE
+  } else if (is.list(x)) {
--- End diff --

Thanks. I will make changes. 


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #21678: [SPARK-23461][R]vignettes should include model predictio...

2018-06-30 Thread huaxingao

Github user huaxingao commented on the issue:

https://github.com/apache/spark/pull/21678
  
Here is the output for Linear SVM Classifier in sparkr-vignettes.html.
```
prediction <- predict(model, training)
head(select(prediction, "Class", "Sex", "Age", "Freq", "Survived", "prediction"))
##   Class    Sex   Age Freq Survived prediction
## 1   1st   Male Child    0       No        Yes
## 2   2nd   Male Child    0       No        Yes
## 3   3rd   Male Child   35       No        Yes
## 4  Crew   Male Child    0       No        Yes
## 5   1st Female Child    0       No        Yes
## 6   2nd Female Child    0       No        Yes
```


---


[GitHub] spark issue #21681: Pin tag 210

2018-06-30 Thread wangyum

Github user wangyum commented on the issue:

https://github.com/apache/spark/pull/21681
  
@zhangchj1990 This looks like it was opened by mistake. Would you mind closing it?


---


[GitHub] spark issue #21640: [SPARK-24654][BUILD] Update, fix LICENSE and NOTICE, and...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21640
  
**[Test build #92499 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92499/testReport)**
 for PR 21640 at commit 
[`198b208`](https://github.com/apache/spark/commit/198b2082019d0a17ea327d03816cc7d496df7443).


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92498/
Test PASSed.


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21648: [SPARK-24665][PySpark] Use SQLConf in PySpark to manage ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21648
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21648: [SPARK-24665][PySpark] Use SQLConf in PySpark to manage ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21648
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92497/
Test PASSed.


---


[GitHub] spark issue #21648: [SPARK-24665][PySpark] Use SQLConf in PySpark to manage ...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21648
  
**[Test build #92497 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92497/testReport)**
 for PR 21648 at commit 
[`4fc0ae4`](https://github.com/apache/spark/commit/4fc0ae4410edcf9890a89bb0b47d0d633bfb3dda).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark issue #21648: [SPARK-24665][PySpark] Use SQLConf in PySpark to manage ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21648
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21648: [SPARK-24665][PySpark] Use SQLConf in PySpark to manage ...

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21648
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92496/
Test PASSed.


---


[GitHub] spark issue #21648: [SPARK-24665][PySpark] Use SQLConf in PySpark to manage ...

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21648
  
**[Test build #92496 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92496/testReport)**
 for PR 21648 at commit 
[`b816549`](https://github.com/apache/spark/commit/b8165495f2ad68d54f3f2ba6b4247d58825bd41c).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread patrickmcgloin

Github user patrickmcgloin commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199319281
  
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonGenerator.scala ---
@@ -163,6 +175,22 @@ private[sql] class JacksonGenerator(
           s"with the type of $dataType to JSON.")
   }
 
+  private def makeReader(dataType: DataType): ValueReader = dataType match {
+
+    case TimestampType =>
+      (row: SpecializedGetters, ordinal: Int) =>
+        options.timestampFormat.format(DateTimeUtils.toJavaTimestamp(row.getLong(ordinal)))
+
+    case DateType =>
+      (row: SpecializedGetters, ordinal: Int) =>
+        options.dateFormat.format(DateTimeUtils.toJavaDate(row.getInt(ordinal)))
+
+    case _ =>
+      (row: SpecializedGetters, ordinal: Int) =>
+        row.get(ordinal, dataType).toString
--- End diff --

Actually, now that I consider it a bit more, I think nested types are not 
possible. The result has to be stored as a String, e.g.

"key": "value"

Putting a nested type inside a JSON field name wouldn't be possible.
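
The same restriction is easy to see with Python's standard `json` module, used here purely as an illustration (it is not Spark's Jackson code path, and `can_be_json_key` is a hypothetical helper): scalar keys are coerced to strings on output, while a nested key such as a tuple is rejected.

```python
import json


def can_be_json_key(key) -> bool:
    """Return True if `json.dumps` accepts `key` as an object field name."""
    try:
        json.dumps({key: "value"})
        return True
    except TypeError:
        # Raised for keys json cannot turn into a string (and for
        # unhashable keys, which cannot be put in a dict at all).
        return False


# An integer key is written as the string "1".
encoded = json.dumps({1: "value"})

# On the way back in, the key is a string, not an int.
decoded_keys = list(json.loads(encoded).keys())
```

Here `can_be_json_key((1, 2))` is false while `can_be_json_key(1)` is true, which matches the point that only values with a plain string form can serve as field names.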


---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread patrickmcgloin

Github user patrickmcgloin commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199319246
  
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonGenerator.scala ---
@@ -163,6 +175,22 @@ private[sql] class JacksonGenerator(
           s"with the type of $dataType to JSON.")
   }
 
+  private def makeReader(dataType: DataType): ValueReader = dataType match {
+
+    case TimestampType =>
+      (row: SpecializedGetters, ordinal: Int) =>
+        options.timestampFormat.format(DateTimeUtils.toJavaTimestamp(row.getLong(ordinal)))
+
+    case DateType =>
+      (row: SpecializedGetters, ordinal: Int) =>
+        options.dateFormat.format(DateTimeUtils.toJavaDate(row.getInt(ordinal)))
+
+    case _ =>
+      (row: SpecializedGetters, ordinal: Int) =>
+        row.get(ordinal, dataType).toString
--- End diff --

I considered this the default case and thought it would be covered by 
existing unit tests.  Which types should I add tests for: Int, String, Long, 
and so on?  All of the children of AtomicType?

Nested types are interesting.  I will add some tests for keys of type 
Seq[Int] and Map[String, Int] to see whether they work.  If you were thinking 
of something else, please let me know.
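
As a rough illustration of the dispatch in `makeReader` and of the default case asked about here, the following is a plain-Python sketch, not the Spark code. `render_key`, `to_json`, and the two format strings are hypothetical stand-ins (the formats stand in for `options.dateFormat` and `options.timestampFormat`).

```python
import json
from datetime import date, datetime

# Hypothetical stand-ins for options.dateFormat / options.timestampFormat.
DATE_FORMAT = "%Y-%m-%d"
TIMESTAMP_FORMAT = "%Y-%m-%dT%H:%M:%S"


def render_key(key):
    """Render a map key as the string used for the JSON field name."""
    # datetime is a subclass of date, so it has to be checked first.
    if isinstance(key, datetime):
        return key.strftime(TIMESTAMP_FORMAT)
    if isinstance(key, date):
        return key.strftime(DATE_FORMAT)
    # Default case: fall back to the plain string form of the key.
    return str(key)


def to_json(mapping):
    """Serialize a dict whose keys may be dates, timestamps, or atomic values."""
    return json.dumps({render_key(k): v for k, v in mapping.items()})
```

For example, `to_json({1: "a"})` goes through the default branch, which is the path the unit tests for Int, String, and Long keys would exercise.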


---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread viirya

Github user viirya commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199317878
  
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JacksonGenerator.scala ---
@@ -43,6 +43,11 @@ private[sql] class JacksonGenerator(
   // we can directly access data in `ArrayData` without the help of `SpecificMutableRow`.
   private type ValueWriter = (SpecializedGetters, Int) => Unit
 
+  // A `ValueReader` is responsible for reading a field of an `InternalRow` to a String.
+  // The only purpose of this is to read the key values for a map so that they can be
+  // written as JSON filed names.
--- End diff --

filed -> field?


---


[GitHub] spark pull request #21671: [SPARK-24682] [SQL] from_json / to_json now handl...

2018-06-30 Thread viirya

Github user viirya commented on a diff in the pull request:

https://github.com/apache/spark/pull/21671#discussion_r199317586
  
--- Diff: python/pyspark/sql/functions.py ---
@@ -2163,9 +2163,9 @@ def json_tuple(col, *fields):
 @since(2.1)
 def from_json(col, schema, options={}):
     """
-    Parses a column containing a JSON string into a :class:`MapType` with :class:`StringType`
-    as keys type, :class:`StructType` or :class:`ArrayType` of :class:`StructType`\\s with
-    the specified schema. Returns `null`, in the case of an unparseable string.
+    Parses a column containing a JSON string into a :class:`MapType`, :class:`StructType`
+    or :class:`ArrayType` of :class:`StructType`\\s with the specified schema. Returns
+    `null`, in the case of an unparseable string.
--- End diff --

Do we support all data types as keys in `MapType`? With this change, besides 
`StringType`, we only support `DateType` and `TimestampType`, right?
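
The parsing direction behind this question can be sketched in plain Python; this is not Spark's `from_json`. `parse_map` and its converter table are hypothetical. The sketch assumes only string, date, and timestamp keys are supported and rejects everything else.

```python
import json
from datetime import date, datetime


def parse_map(json_text, key_type):
    """Parse a JSON object, converting its field names to key_type."""
    # Assumed set of supported key types: strings, dates, and timestamps.
    converters = {
        str: lambda s: s,
        date: date.fromisoformat,
        datetime: datetime.fromisoformat,
    }
    if key_type not in converters:
        raise TypeError("unsupported map key type: %s" % key_type.__name__)
    convert = converters[key_type]
    return {convert(k): v for k, v in json.loads(json_text).items()}
```

With this table, `parse_map('{"2018-06-30": 1}', date)` yields a `date` key, while a request for `int` keys raises `TypeError`.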


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread SparkQA

Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21677
  
**[Test build #92498 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92498/testReport)** for PR 21677 at commit [`be5d219`](https://github.com/apache/spark/commit/be5d2197d3d3ff06c5f80ed66d5a0bb20924e130).


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Merged build finished. Test PASSed.


---


[GitHub] spark issue #21677: [SPARK-24692][TESTS] Improvement FilterPushdownBenchmark

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21677
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 

https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/600/
Test PASSed.


---


[GitHub] spark issue #21681: Pin tag 210

2018-06-30 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21681
  
Can one of the admins verify this patch?


---


[GitHub] spark pull request #21681: Pin tag 210

2018-06-30 Thread zhangchj1990

GitHub user zhangchj1990 opened a pull request:

https://github.com/apache/spark/pull/21681

Pin tag 210

## What changes were proposed in this pull request?

(Please fill in changes proposed in this fix)

## How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration 
tests, manual tests)
(If this patch involves UI changes, please attach a screenshot; otherwise, 
remove this)

Please review http://spark.apache.org/contributing.html before opening a 
pull request.


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/zhangchj1990/spark pin-tag-210

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/21681.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #21681


commit 6b6eb4e520d07a27aa68d3450f3c7613b233d928
Author: Zheng RuiFeng 
Date:   2016-11-16T10:46:27Z

[SPARK-18434][ML] Add missing ParamValidations for ML algos

## What changes were proposed in this pull request?
Add missing ParamValidations for ML algos
## How was this patch tested?
existing tests

Author: Zheng RuiFeng 

Closes #15881 from zhengruifeng/arg_checking.

(cherry picked from commit c68f1a38af67957ee28889667193da8f64bb4342)
Signed-off-by: Yanbo Liang 

commit 416bc3dd3db7f7ae2cc7b3ffe395decd0c5b73f9
Author: Zheng RuiFeng 
Date:   2016-11-16T10:53:23Z

[SPARK-18446][ML][DOCS] Add links to API docs for ML algos

## What changes were proposed in this pull request?
Add links to API docs for ML algos
## How was this patch tested?
Manual checking for the API links

Author: Zheng RuiFeng 

Closes #15890 from zhengruifeng/algo_link.

(cherry picked from commit a75e3fe923372c56bc1b2f4baeaaf5868ad28341)
Signed-off-by: Sean Owen 

commit b0ae8712358fc8c07aa5efe4d0bd337e7e452078
Author: Xianyang Liu 
Date:   2016-11-16T11:59:00Z

[SPARK-18420][BUILD] Fix the errors caused by lint check in Java

Small fix, fix the errors caused by lint check in Java

- Clear unused objects and `UnusedImports`.
- Add comments around the method `finalize` of `NioBufferedFileInputStream` to turn off checkstyle.
- Cut the line which is longer than 100 characters into two lines.

Travis CI.
```
$ build/mvn -T 4 -q -DskipTests -Pyarn -Phadoop-2.3 -Pkinesis-asl -Phive -Phive-thriftserver install
$ dev/lint-java
```
Before:
```
Checkstyle checks failed at following occurrences:
[ERROR] src/main/java/org/apache/spark/network/util/TransportConf.java:[21,8] (imports) UnusedImports: Unused import - org.apache.commons.crypto.cipher.CryptoCipherFactory.
[ERROR] src/test/java/org/apache/spark/network/sasl/SparkSaslSuite.java:[516,5] (modifier) RedundantModifier: Redundant 'public' modifier.
[ERROR] src/main/java/org/apache/spark/io/NioBufferedFileInputStream.java:[133] (coding) NoFinalizer: Avoid using finalizer method.
[ERROR] src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeMapData.java:[71] (sizes) LineLength: Line is longer than 100 characters (found 113).
[ERROR] src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java:[112] (sizes) LineLength: Line is longer than 100 characters (found 110).
[ERROR] src/test/java/org/apache/spark/sql/catalyst/expressions/HiveHasherSuite.java:[31,17] (modifier) ModifierOrder: 'static' modifier out of order with the JLS suggestions.
[ERROR] src/main/java/org/apache/spark/examples/ml/JavaLogisticRegressionWithElasticNetExample.java:[64] (sizes) LineLength: Line is longer than 100 characters (found 103).
[ERROR] src/main/java/org/apache/spark/examples/ml/JavaInteractionExample.java:[22,8] (imports) UnusedImports: Unused import - org.apache.spark.ml.linalg.Vectors.
[ERROR] src/main/java/org/apache/spark/examples/ml/JavaInteractionExample.java:[51] (regexp) RegexpSingleline: No trailing whitespace allowed.
```

After:
```
$ build/mvn -T 4 -q -DskipTests -Pyarn -Phadoop-2.3 -Pkinesis-asl -Phive -Phive-thriftserver install
$ dev/lint-java
Using `mvn` from path: /home/travis/build/ConeyLiu/spark/build/apache-maven-3.3.9/bin/mvn
Checkstyle checks passed.
```

Author: Xianyang Liu 

Closes #15865 from ConeyLiu/master.

(cherry picked from commit 7569cf6cb85bda7d0e76d3e75e286d4796e77e08)
Signed-off-by: Sean Owen 

commit c0dbe08d604dea543eb17ccb802a8a20d6c21a69
Author: gatorsmile 
Date:   2016-11-16T16:25:15Z

[SPARK-18415][SQL] Weird Plan Output when CTE used in RunnableCommand

### What changes were proposed in this pull request?
Currently, when CTE is used in RunnableCommand, the Analyzer does not 
replace the logical n
