date:20161203

[GitHub] spark issue #16134: [SPARK-18703] [SQL] Drop Staging Directories and Data Fi...

2016-12-03 Thread cloud-fan

Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/16134 is this PR related to https://github.com/apache/spark/pull/15819? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #15979: [SPARK-18251][SQL] the type of Dataset can't be Option o...

2016-12-03 Thread cloud-fan

Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/15979 ``` val x: Dataset[String, Option[(String, String)]] = ... x.groupByKey(_._1).mapValues(_._2).agg(someAgg) ``` Does it work before? Please see the discussion in the JIRA:

[GitHub] spark issue #16135: [SPARK-18700][SQL] Add ReadWriteLock for each table's re...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16135 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15979: [SPARK-18251][SQL] the type of Dataset can't be Option o...

2016-12-03 Thread koertkuipers

Github user koertkuipers commented on the issue: https://github.com/apache/spark/pull/15979 this means anything that uses an encoder can no longer use Option[_ <: Product]. encoders are not just used for the top level Dataset creation. Dataset.groupByKey[K] requires an

[GitHub] spark issue #16135: [SPARK-18700][SQL] Add ReadWriteLock for each table's re...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16135 **[Test build #69634 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69634/consoleFull)** for PR 16135 at commit

[GitHub] spark issue #16135: [SPARK-18700][SQL] Add ReadWriteLock for each table's re...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16135 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69634/ Test FAILed. ---

[GitHub] spark pull request #15683: [SPARK-18166][MLlib] Fix Poisson GLM bug due to w...

2016-12-03 Thread actuaryzhang

Github user actuaryzhang commented on a diff in the pull request: https://github.com/apache/spark/pull/15683#discussion_r90771932 --- Diff: mllib/src/test/scala/org/apache/spark/ml/regression/GeneralizedLinearRegressionSuite.scala --- @@ -88,6 +89,12 @@ class

[GitHub] spark pull request #16133: [SPARK-18702][SQL] input_file_block_start and inp...

2016-12-03 Thread dongjoon-hyun

Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/16133#discussion_r90771895 --- Diff: core/src/main/scala/org/apache/spark/rdd/NewHadoopRDD.scala --- @@ -132,54 +132,57 @@ class NewHadoopRDD[K, V]( override def

[GitHub] spark issue #16135: SPARK-18700: add ReadWriteLock for each table's relation...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16135 **[Test build #69634 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69634/consoleFull)** for PR 16135 at commit

[GitHub] spark pull request #16135: SPARK-18700: add ReadWriteLock for each table's r...

2016-12-03 Thread xuanyuanking

GitHub user xuanyuanking opened a pull request: https://github.com/apache/spark/pull/16135 SPARK-18700: add ReadWriteLock for each table's relation in cache ## What changes were proposed in this pull request? As the scenario describe in

[GitHub] spark issue #16131: [SPARK-18701][ML] Poisson GLM fails due to wrong initial...

2016-12-03 Thread actuaryzhang

Github user actuaryzhang commented on the issue: https://github.com/apache/spark/pull/16131 Jenkins, add to whitelist --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #16134: [SPARK-18703] [SQL] Drop Staging Directories and Data Fi...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16134 **[Test build #69633 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69633/consoleFull)** for PR 16134 at commit

[GitHub] spark pull request #16104: [SPARK-18675][SQL] CTAS for hive serde table shou...

2016-12-03 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16104#discussion_r90771606 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala --- @@ -121,21 +121,61 @@ case class

[GitHub] spark pull request #16134: [SPARK-18703] [SQL] Drop Staging Directories and ...

2016-12-03 Thread gatorsmile

GitHub user gatorsmile opened a pull request: https://github.com/apache/spark/pull/16134 [SPARK-18703] [SQL] Drop Staging Directories and Data Files After each Insertion/CTAS of Hive serde Tables ### What changes were proposed in this pull request? Below are the

[GitHub] spark issue #16133: [SPARK-18702][SQL] input_file_block_start and input_file...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16133 **[Test build #69632 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69632/consoleFull)** for PR 16133 at commit

[GitHub] spark issue #16133: [SPARK-18702][SQL] input_file_block_start and input_file...

2016-12-03 Thread rxin

Github user rxin commented on the issue: https://github.com/apache/spark/pull/16133 cc @ueshin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #16133: [SPARK-18702][SQL] input_file_block_start and inp...

2016-12-03 Thread rxin

GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/16133 [SPARK-18702][SQL] input_file_block_start and input_file_block_length ## What changes were proposed in this pull request? We currently have function input_file_name to get the path of the input

[GitHub] spark pull request #15979: [SPARK-18251][SQL] the type of Dataset can't be O...

2016-12-03 Thread koertkuipers

Github user koertkuipers commented on a diff in the pull request: https://github.com/apache/spark/pull/15979#discussion_r90770855 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/encoders/ExpressionEncoder.scala --- @@ -47,16 +47,26 @@ object ExpressionEncoder

[GitHub] spark pull request #15979: [SPARK-18251][SQL] the type of Dataset can't be O...

2016-12-03 Thread koertkuipers

Github user koertkuipers commented on a diff in the pull request: https://github.com/apache/spark/pull/15979#discussion_r90770824 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/encoders/ExpressionEncoder.scala --- @@ -47,16 +47,26 @@ object ExpressionEncoder

[GitHub] spark pull request #16132: [MINOR] [README] Correct Markdown link inside rea...

2016-12-03 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/16132 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #16132: [MINOR] [README] Correct Markdown link inside readme

2016-12-03 Thread rxin

Github user rxin commented on the issue: https://github.com/apache/spark/pull/16132 Merging in master. Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #16132: [MINOR] [README] Correct Markdown link inside readme

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16132 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #16132: [MINOR] [README] Correct Markdown link inside rea...

2016-12-03 Thread linbojin

GitHub user linbojin opened a pull request: https://github.com/apache/spark/pull/16132 [MINOR] [README] Correct Markdown link inside readme ## What changes were proposed in this pull request? "Useful Developer Tools" link inside

[GitHub] spark issue #16131: [SPARK-18701][ML] Poisson GLM fails due to wrong initial...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16131 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #16131: [SPARK-18701][ML] Poisson GLM fails due to wrong ...

2016-12-03 Thread actuaryzhang

GitHub user actuaryzhang opened a pull request: https://github.com/apache/spark/pull/16131 [SPARK-18701][ML] Poisson GLM fails due to wrong initialization Poisson GLM fails for many standard data sets (see example in test or JIRA). The issue is incorrect initialization leading to

[GitHub] spark issue #14638: [SPARK-11374][SQL] Support `skip.header.line.count` opti...

2016-12-03 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14638 @rxin Could you give me any advice for this issue? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2016-12-03 Thread kiszk

Github user kiszk commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r90769457 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -118,7 +118,42 @@ object

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69631/ Test PASSed. ---

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69631 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69631/consoleFull)** for PR 16122 at commit

[GitHub] spark issue #16090: [SPARK-18661] [SQL] Creating a partitioned datasource ta...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16090 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16090: [SPARK-18661] [SQL] Creating a partitioned datasource ta...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16090 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69630/ Test PASSed. ---

[GitHub] spark issue #16090: [SPARK-18661] [SQL] Creating a partitioned datasource ta...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16090 **[Test build #69630 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69630/consoleFull)** for PR 16090 at commit

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69631 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69631/consoleFull)** for PR 16122 at commit

[GitHub] spark pull request #15795: [SPARK-18081][ML][DOCS] Add user guide for Locali...

2016-12-03 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/15795 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #15795: [SPARK-18081][ML][DOCS] Add user guide for Locality Sens...

2016-12-03 Thread jkbradley

Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/15795 LGTM merging with master and branch-2.1 Thanks all! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69629/ Test FAILed. ---

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69629 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69629/consoleFull)** for PR 16122 at commit

[GitHub] spark issue #16090: [SPARK-18661] [SQL] Creating a partitioned datasource ta...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16090 **[Test build #69630 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69630/consoleFull)** for PR 16090 at commit

[GitHub] spark issue #16090: [SPARK-18661] [SQL] Creating a partitioned datasource ta...

2016-12-03 Thread ericl

Github user ericl commented on the issue: https://github.com/apache/spark/pull/16090 Not sure I follow - could you explain more on why that would resolve the issue? Btw, I reverted this pr to b405635, which passes all tests. --- If your project is set up for it, you can

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69629 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69629/consoleFull)** for PR 16122 at commit

[GitHub] spark pull request #16125: [SPARK-18694][SS]Add StreamingQuery.explain and e...

2016-12-03 Thread zsxwing

Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/16125#discussion_r90767030 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/StreamingQueryException.scala --- @@ -24,32 +24,42 @@ import

[GitHub] spark pull request #15813: [SPARK-18362][SQL] Use TextFileFormat in implemen...

2016-12-03 Thread rxin

Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/15813#discussion_r90766854 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVFileFormat.scala --- @@ -173,51 +179,37 @@ class CSVFileFormat extends

[GitHub] spark pull request #15813: [SPARK-18362][SQL] Use TextFileFormat in implemen...

2016-12-03 Thread JoshRosen

Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/15813#discussion_r90766169 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVFileFormat.scala --- @@ -173,51 +179,37 @@ class CSVFileFormat

[GitHub] spark issue #15918: [SPARK-18122][SQL][WIP]Fallback to Kryo for unsupported ...

2016-12-03 Thread koertkuipers

Github user koertkuipers commented on the issue: https://github.com/apache/spark/pull/15918 It can be done with shapeless (which perhaps uses macros under hood, I don't know). On Dec 1, 2016 19:56, "Michael Armbrust" wrote: I don't think

[GitHub] spark pull request #16103: [SPARK-18374][ML]Incorrect words in StopWords/eng...

2016-12-03 Thread hhbyyh

Github user hhbyyh commented on a diff in the pull request: https://github.com/apache/spark/pull/16103#discussion_r90765451 --- Diff: mllib/src/main/resources/org/apache/spark/ml/feature/stopwords/english.txt --- @@ -149,5 +149,58 @@ shan shouldn wasn weren -won

[GitHub] spark issue #16130: Update location of Spark YARN shuffle jar

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16130 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69628/ Test PASSed. ---

[GitHub] spark issue #16130: Update location of Spark YARN shuffle jar

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16130 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16130: Update location of Spark YARN shuffle jar

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16130 **[Test build #69628 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69628/consoleFull)** for PR 16130 at commit

[GitHub] spark pull request #16094: [SPARK-18541][Python]Add metadata parameter to py...

2016-12-03 Thread holdenk

Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/16094#discussion_r90764328 --- Diff: python/pyspark/sql/column.py --- @@ -298,19 +299,34 @@ def isin(self, *cols): isNotNull = _unary_op("isNotNull", "True if the current

[GitHub] spark issue #16121: [SPARK-16589][PYTHON] Chained cartesian produces incorre...

2016-12-03 Thread holdenk

Github user holdenk commented on the issue: https://github.com/apache/spark/pull/16121 I was hesistant with the previous PR since it seemed like we didn't fully understand why we were changing what we were at the time, I can try and take a closer look at this over the next few days

[GitHub] spark issue #16119: [SPARK-18687][Pyspark][SQL]Backward compatibility - crea...

2016-12-03 Thread holdenk

Github user holdenk commented on the issue: https://github.com/apache/spark/pull/16119 Since the current tests pass without this change I'd say that we should add a test for the behaviour we are planning to support that isn't currently supported (would also make the purpose of the

[GitHub] spark issue #16130: Update location of Spark YARN shuffle jar

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16130 **[Test build #69628 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69628/consoleFull)** for PR 16130 at commit

[GitHub] spark issue #16130: Update location of Spark YARN shuffle jar

2016-12-03 Thread nchammas

Github user nchammas commented on the issue: https://github.com/apache/spark/pull/16130 cc @vanzin? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark pull request #16130: Update location of Spark YARN shuffle jar

2016-12-03 Thread nchammas

GitHub user nchammas opened a pull request: https://github.com/apache/spark/pull/16130 Update location of Spark YARN shuffle jar Looking at the distributions provided on spark.apache.org, I see that the Spark YARN shuffle jar is under `yarn/` and not `lib/`. You can merge this

[GitHub] spark pull request #16046: [SPARK-18582][SQL] Whitelist LogicalPlan operator...

2016-12-03 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/16046 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #16129: [SPARK-18678][ML] Skewed feature subsampling in Random f...

2016-12-03 Thread srowen

Github user srowen commented on the issue: https://github.com/apache/spark/pull/16129 @felixcheung maybe you can advise me on this. I think this is a correct fix, but ends up changing the results of decision forests a little bit. The SparkR test you wrote fails: ```

[GitHub] spark issue #16046: [SPARK-18582][SQL] Whitelist LogicalPlan operators allow...

2016-12-03 Thread hvanhovell

Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/16046 Merging to master/2.1/2.0. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16129: [SPARK-18678][ML] Skewed feature subsampling in Random f...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16129 **[Test build #3467 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3467/consoleFull)** for PR 16129 at commit

[GitHub] spark pull request #16068: [SPARK-18637][SQL]Stateful UDF should be consider...

2016-12-03 Thread zhzhan

Github user zhzhan commented on a diff in the pull request: https://github.com/apache/spark/pull/16068#discussion_r90763121 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveUDFSuite.scala --- @@ -487,6 +488,29 @@ class HiveUDFSuite extends QueryTest with

[GitHub] spark issue #16068: [SPARK-18637][SQL]Stateful UDF should be considered as n...

2016-12-03 Thread zhzhan

Github user zhzhan commented on the issue: https://github.com/apache/spark/pull/16068 @gatorsmile we cannot use deterministic = true/false, as there are existing udf with deterministic as true, but stateful as true as well. --- If your project is set up for it, you can reply to

[GitHub] spark issue #16068: [SPARK-18637][SQL]Stateful UDF should be considered as n...

2016-12-03 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/16068 https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java#L1373-L1378 Copied the code from Hive `FunctionRegistry.java`: ```JAVA

[GitHub] spark issue #16068: [SPARK-18637][SQL]Stateful UDF should be considered as n...

2016-12-03 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/16068 Found the link: [HIVE-1994: Support new annotation @UDFType(stateful = true)](https://issues.apache.org/jira/browse/HIVE-1994 ) --- If your project is set up for it, you can reply to this email

[GitHub] spark issue #16068: [SPARK-18637][SQL]Stateful UDF should be considered as n...

2016-12-03 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/16068 Could we directly use `@UDFType(deterministic = true/false)`? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #16068: [SPARK-18637][SQL]Stateful UDF should be considered as n...

2016-12-03 Thread zhzhan

Github user zhzhan commented on the issue: https://github.com/apache/spark/pull/16068 My understanding is that the non-deterministic udf does not need to be stageful, but a stateful udf has to be non-deterministic. Here is the comments in hive regarding this property

[GitHub] spark issue #16129: [SPARK-18678][ML] Skewed feature subsampling in Random f...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16129 **[Test build #3467 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3467/consoleFull)** for PR 16129 at commit

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69625/ Test FAILed. ---

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69625 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69625/consoleFull)** for PR 16122 at commit

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16114 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69627/ Test PASSed. ---

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16114 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16114 **[Test build #69627 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69627/consoleFull)** for PR 16114 at commit

[GitHub] spark issue #16129: [SPARK-18678][ML] Skewed feature subsampling in Random f...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16129 **[Test build #3466 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3466/consoleFull)** for PR 16129 at commit

[GitHub] spark issue #16090: [SPARK-18661] [SQL] Creating a partitioned datasource ta...

2016-12-03 Thread cloud-fan

Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/16090 If we are going to hack it, how about this? ``` val dataSource = DataSource(...) if (classOf[FileFormat].isAssignableFrom(dataSource.providingClass)) {

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16114 **[Test build #69627 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69627/consoleFull)** for PR 16114 at commit

[GitHub] spark pull request #16114: [SPARK-18620][Streaming][Kinesis] Flatten input r...

2016-12-03 Thread maropu

Github user maropu commented on a diff in the pull request: https://github.com/apache/spark/pull/16114#discussion_r90758322 --- Diff: external/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis/KinesisRecordProcessor.scala --- @@ -56,6 +56,27 @@ private[kinesis] class

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16114 **[Test build #69626 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69626/consoleFull)** for PR 16114 at commit

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16114 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69626/ Test FAILed. ---

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16114 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15995: [SPARK-18566][SQL] remove OverwriteOptions

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15995 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69623/ Test PASSed. ---

[GitHub] spark issue #15995: [SPARK-18566][SQL] remove OverwriteOptions

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15995 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16114 **[Test build #69626 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69626/consoleFull)** for PR 16114 at commit

[GitHub] spark issue #15995: [SPARK-18566][SQL] remove OverwriteOptions

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15995 **[Test build #69623 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69623/consoleFull)** for PR 15995 at commit

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread maropu

Github user maropu commented on the issue: https://github.com/apache/spark/pull/16114 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #16114: [SPARK-18620][Streaming][Kinesis] Flatten input r...

2016-12-03 Thread maropu

Github user maropu commented on a diff in the pull request: https://github.com/apache/spark/pull/16114#discussion_r90758182 --- Diff: external/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis/KinesisRecordProcessor.scala --- @@ -56,6 +56,27 @@ private[kinesis] class

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69625 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69625/consoleFull)** for PR 16122 at commit

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16114 **[Test build #69624 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69624/consoleFull)** for PR 16114 at commit

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16114 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69624/ Test FAILed. ---

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16114 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16114: [SPARK-18620][Streaming][Kinesis] Flatten input rates in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16114 **[Test build #69624 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69624/consoleFull)** for PR 16114 at commit

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread wangyum

Github user wangyum commented on the issue: https://github.com/apache/spark/pull/16122 This patch fails because hive-0.12 and hive-0.13 doesn't has `getMetaConf` method. see [HIVE-7532](https://issues.apache.org/jira/browse/HIVE-7532), --- If your project is set up for it, you

[GitHub] spark pull request #16043: [SPARK-18601][SQL] Simplify Create/Get complex ex...

2016-12-03 Thread HyukjinKwon

Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/16043#discussion_r90757729 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/ComplexTypes.scala --- @@ -0,0 +1,78 @@ +/* + * Licensed to the

[GitHub] spark issue #16030: [SPARK-18108][SQL] Fix a bug to fail partition schema in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16030 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69621/ Test PASSed. ---

[GitHub] spark issue #16030: [SPARK-18108][SQL] Fix a bug to fail partition schema in...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16030 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16030: [SPARK-18108][SQL] Fix a bug to fail partition schema in...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16030 **[Test build #69621 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69621/consoleFull)** for PR 16030 at commit

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69622/ Test FAILed. ---

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16122 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16122: [SPARK-18681][SQL] Fix filtering to compatible with part...

2016-12-03 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16122 **[Test build #69622 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69622/consoleFull)** for PR 16122 at commit

[GitHub] spark issue #16098: [SPARK-18672][CORE] Close recordwriter in SparkHadoopMap...

2016-12-03 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16098 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

1 2 >

1 - 100 of 160 matches

Mail list logo