[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Description: When fitting a PySpark Pipeline with no stages, it should work as an identity

[jira] [Commented] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428519#comment-15428519 ] Apache Spark commented on SPARK-12868: -- User 'Parth-Brahmbhatt' has created a pull request for this

[jira] [Commented] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428518#comment-15428518 ] Miao Wang commented on SPARK-17157: --- [~felixcheung] Shall we add it to SparkR? I open this JIRA for

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:java} case class

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-19 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428557#comment-15428557 ] Xusen Yin commented on SPARK-14381: --- I believe we can resolve this. > Review spark.ml parity for

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Issue Type: Improvement (was: Bug) > PySpark ML Pipeline fails when no stages set >

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Summary: PySpark ML Pipeline raises unclear error when no stages set (was: PySpark ML Pipeline

[jira] [Created] (SPARK-17156) Add multiclass logistic regression Scala Example

2016-08-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-17156: - Summary: Add multiclass logistic regression Scala Example Key: SPARK-17156 URL: https://issues.apache.org/jira/browse/SPARK-17156 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-17156) Add multiclass logistic regression Scala Example

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428509#comment-15428509 ] Miao Wang commented on SPARK-17156: --- I will submit PR soon. > Add multiclass logistic regression Scala

[jira] [Commented] (SPARK-10401) spark-submit --unsupervise

2016-08-19 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428572#comment-15428572 ] Michael Gummelt commented on SPARK-10401: - This should probably be a separate JIRA, but I'm just

[jira] [Assigned] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17158: Assignee: (was: Apache Spark) > Improve error message for numeric literal parsing >

[jira] [Commented] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428587#comment-15428587 ] Apache Spark commented on SPARK-17158: -- User 'srinathshankar' has created a pull request for this

[jira] [Assigned] (SPARK-13286) JDBC driver doesn't report full exception

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-13286: -- Assignee: Davies Liu > JDBC driver doesn't report full exception >

[jira] [Assigned] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17158: Assignee: Apache Spark > Improve error message for numeric literal parsing >

[jira] [Commented] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428438#comment-15428438 ] Apache Spark commented on SPARK-17154: -- User 'sarutak' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17154: Assignee: (was: Apache Spark) > Wrong result can be returned or AnalysisException can

[jira] [Assigned] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17154: Assignee: Apache Spark > Wrong result can be returned or AnalysisException can be thrown

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Description: When fitting a PySpark Pipeline with no stages, it should work as an identity

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:java} case class

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:scala} case class

[jira] [Updated] (SPARK-16686) Dataset.sample with seed: result seems to depend on downstream usage

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16686: Fix Version/s: 2.0.1 > Dataset.sample with seed: result seems to depend on downstream usage >

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Priority: Minor (was: Major) > PySpark ML Pipeline fails when no stages set >

[jira] [Resolved] (SPARK-16197) Cleanup PySpark status api and example

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-16197. -- Resolution: Won't Fix This minor change is would be better addressed during a QA audit >

[jira] [Updated] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miao Wang updated SPARK-17157: -- Component/s: SparkR > Add multiclass logistic regression SparkR Wrapper >

[jira] [Updated] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15382: Fix Version/s: 2.1.0 2.0.1 > monotonicallyIncreasingId doesn't work when data

[jira] [Updated] (SPARK-17113) Job failure due to Executor OOM in offheap mode

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-17113: --- Assignee: Sital Kedia > Job failure due to Executor OOM in offheap mode >

[jira] [Resolved] (SPARK-17113) Job failure due to Executor OOM in offheap mode

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-17113. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Job failure due to

[jira] [Created] (SPARK-17159) Improve FileInputDStream.findNewFiles list performance

2016-08-19 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-17159: -- Summary: Improve FileInputDStream.findNewFiles list performance Key: SPARK-17159 URL: https://issues.apache.org/jira/browse/SPARK-17159 Project: Spark

[jira] [Closed] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-15382. --- Resolution: Fixed > monotonicallyIncreasingId doesn't work when data is upsampled >

[jira] [Created] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
Mikael Valot created SPARK-17155: Summary: usage of a Dataset inside a Future throws MissingRequirementError Key: SPARK-17155 URL: https://issues.apache.org/jira/browse/SPARK-17155 Project: Spark

[jira] [Closed] (SPARK-16152) `In` predicate does not work with null values

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-16152. - Resolution: Invalid Hi, [~fushar]. This seems to be a SQL question. [~kevinyu98] is right.

[jira] [Created] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-17157: - Summary: Add multiclass logistic regression SparkR Wrapper Key: SPARK-17157 URL: https://issues.apache.org/jira/browse/SPARK-17157 Project: Spark Issue Type: New

[jira] [Created] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Srinath (JIRA)
Srinath created SPARK-17158: --- Summary: Improve error message for numeric literal parsing Key: SPARK-17158 URL: https://issues.apache.org/jira/browse/SPARK-17158 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10746) count ( distinct columnref) over () returns wrong result set

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428617#comment-15428617 ] Dongjoon Hyun commented on SPARK-10746: --- Just as an update, Spark 2.0 now raises an exception for

[jira] [Updated] (SPARK-17146) Add RandomizedSearch to the CrossValidator API

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17146: -- Priority: Critical (was: Major) > Add RandomizedSearch to the CrossValidator API >

[jira] [Comment Edited] (SPARK-16785) dapply doesn't return array or raw columns

2016-08-19 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427788#comment-15427788 ] Clark Fitzgerald edited comment on SPARK-16785 at 8/19/16 7:58 AM: ---

[jira] [Commented] (SPARK-17085) Documentation and actual code differs - Unsupported Operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427825#comment-15427825 ] Apache Spark commented on SPARK-17085: -- User 'jagadeesanas2' has created a pull request for this

[jira] [Assigned] (SPARK-17085) Documentation and actual code differs - Unsupported Operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17085: Assignee: (was: Apache Spark) > Documentation and actual code differs - Unsupported

[jira] [Assigned] (SPARK-17085) Documentation and actual code differs - Unsupported Operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17085: Assignee: Apache Spark > Documentation and actual code differs - Unsupported Operations >

[jira] [Comment Edited] (SPARK-15816) SQL server based on Postgres protocol

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427848#comment-15427848 ] Takeshi Yamamuro edited comment on SPARK-15816 at 8/19/16 8:56 AM: ---

[jira] [Commented] (SPARK-15816) SQL server based on Postgres protocol

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427848#comment-15427848 ] Takeshi Yamamuro commented on SPARK-15816: -- About 1. Yea, we can support them in there; nested

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15018: Shepherd: Yanbo Liang Assignee: Bryan Cutler > PySpark ML Pipeline fails when no stages set >

[jira] [Commented] (SPARK-16785) dapply doesn't return array or raw columns

2016-08-19 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427788#comment-15427788 ] Clark Fitzgerald commented on SPARK-16785: -- Also my proposal above: bq. to treat the rows as a

[jira] [Commented] (SPARK-15816) SQL server based on Postgres protocol

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427786#comment-15427786 ] Reynold Xin commented on SPARK-15816: - [~maropu] a few questions: 1. Can this support structs and

[jira] [Assigned] (SPARK-16822) Support latex in scaladoc with MathJax

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16822: Assignee: Apache Spark (was: Shuai Lin) > Support latex in scaladoc with MathJax >

[jira] [Commented] (SPARK-16994) Filter and limit are illegally permuted.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427711#comment-15427711 ] Apache Spark commented on SPARK-16994: -- User 'rxin' has created a pull request for this issue:

[jira] [Updated] (SPARK-17140) Add initial model to MultinomialLogisticRegression

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-17140: --- Assignee: Seth Hendrickson > Add initial model to MultinomialLogisticRegression >

[jira] [Reopened] (SPARK-16822) Support latex in scaladoc with MathJax

2016-08-19 Thread Jagadeesan A S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jagadeesan A S reopened SPARK-16822: Small modification in _LinearRegression.scala_ {code:java} {{{ L = 1/2n||\sum_i w_i(x_i -

[jira] [Issue Comment Deleted] (SPARK-17047) Spark 2 cannot create ORC table when CLUSTERED.

2016-08-19 Thread Jagadeesan A S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jagadeesan A S updated SPARK-17047: --- Comment: was deleted (was: I would like to work on this issue) > Spark 2 cannot create ORC

[jira] [Updated] (SPARK-17146) Add RandomizedSearch to the CrossValidator API

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17146: -- Priority: Minor (was: Critical) > Add RandomizedSearch to the CrossValidator API >

[jira] [Commented] (SPARK-16785) dapply doesn't return array or raw columns

2016-08-19 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427785#comment-15427785 ] Clark Fitzgerald commented on SPARK-16785: -- Making some slow progress digging into this. Here's

[jira] [Commented] (SPARK-17081) Empty strings not preserved which causes SQLException: mismatching column value count

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427815#comment-15427815 ] Takeshi Yamamuro commented on SPARK-17081: -- At least, the current master works well for

[jira] [Updated] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17141: Priority: Minor (was: Trivial) > MinMaxScaler behaves weird when min and max have the same value

[jira] [Commented] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427846#comment-15427846 ] Apache Spark commented on SPARK-17141: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17141: Assignee: (was: Apache Spark) > MinMaxScaler behaves weird when min and max have the

[jira] [Assigned] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17141: Assignee: Apache Spark > MinMaxScaler behaves weird when min and max have the same value

[jira] [Comment Edited] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427860#comment-15427860 ] Yanbo Liang edited comment on SPARK-17141 at 8/19/16 9:01 AM: -- In the

[jira] [Commented] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427860#comment-15427860 ] Yanbo Liang commented on SPARK-17141: - In the existing code, {{MinMaxScaler}} handle NaN value

[jira] [Assigned] (SPARK-16822) Support latex in scaladoc with MathJax

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16822: Assignee: Shuai Lin (was: Apache Spark) > Support latex in scaladoc with MathJax >

[jira] [Commented] (SPARK-17081) Empty strings not preserved which causes SQLException: mismatching column value count

2016-08-19 Thread Ian Hellstrom (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427770#comment-15427770 ] Ian Hellstrom commented on SPARK-17081: --- Unfortunately I cannot because I don't have access to a

[jira] [Resolved] (SPARK-16961) Utils.randomizeInPlace does not shuffle arrays uniformly

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16961. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull

[jira] [Comment Edited] (SPARK-15816) SQL server based on Postgres protocol

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427671#comment-15427671 ] Takeshi Yamamuro edited comment on SPARK-15816 at 8/19/16 6:07 AM: ---

[jira] [Updated] (SPARK-15816) SQL server based on Postgres protocol

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-15816: - Attachment: New_SQL_Server_for_Spark.pdf > SQL server based on Postgres protocol >

[jira] [Assigned] (SPARK-17072) generate table level stats:stats generation/storing/loading

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17072: Assignee: (was: Apache Spark) > generate table level stats:stats

[jira] [Commented] (SPARK-15816) SQL server based on Postgres protocol

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427671#comment-15427671 ] Takeshi Yamamuro commented on SPARK-15816: -- [~sarutak] I just posted the design doc. and this is

[jira] [Commented] (SPARK-16581) Making JVM backend calling functions public

2016-08-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427687#comment-15427687 ] Felix Cheung commented on SPARK-16581: -- I think JVM<->R is closely related to RBackend? Because we

[jira] [Commented] (SPARK-17072) generate table level stats:stats generation/storing/loading

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427692#comment-15427692 ] Apache Spark commented on SPARK-17072: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17072) generate table level stats:stats generation/storing/loading

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17072: Assignee: Apache Spark > generate table level stats:stats generation/storing/loading >

[jira] [Created] (SPARK-17152) Spark Flume sink fails with begin() called when transaction is OPEN

2016-08-19 Thread Wojciech Sznapka (JIRA)
Wojciech Sznapka created SPARK-17152: Summary: Spark Flume sink fails with begin() called when transaction is OPEN Key: SPARK-17152 URL: https://issues.apache.org/jira/browse/SPARK-17152 Project:

[jira] [Resolved] (SPARK-16994) Filter and limit are illegally permuted.

2016-08-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-16994. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull

[jira] [Updated] (SPARK-16994) Filter and limit are illegally permuted.

2016-08-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-16994: Assignee: Reynold Xin > Filter and limit are illegally permuted. >

[jira] [Assigned] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17090: Assignee: (was: Apache Spark) > Make tree aggregation level in linear/logistic

[jira] [Commented] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428186#comment-15428186 ] Apache Spark commented on SPARK-17090: -- User 'hqzizania' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17090: Assignee: Apache Spark > Make tree aggregation level in linear/logistic regression

[jira] [Commented] (SPARK-14501) spark.ml parity for fpm - frequent items

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428231#comment-15428231 ] Nick Pentreath commented on SPARK-14501: Any update? > spark.ml parity for fpm - frequent items

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread Qian Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428228#comment-15428228 ] Qian Huang commented on SPARK-17134: I could be your backup if you are not available. This task is

[jira] [Issue Comment Deleted] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread Qian Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qian Huang updated SPARK-17134: --- Comment: was deleted (was: I could be your backup if you are not available. This task is sort of

[jira] [Commented] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428238#comment-15428238 ] Nick Pentreath commented on SPARK-14378: Any update? > Review spark.ml parity for regression,

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread Qian Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428227#comment-15428227 ] Qian Huang commented on SPARK-17134: I could be your backup if you are not available. This task is

[jira] [Commented] (SPARK-14503) spark.ml API for FPGrowth

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428234#comment-15428234 ] Nick Pentreath commented on SPARK-14503: Shall we focus first on the porting of what is currently

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428235#comment-15428235 ] Nick Pentreath commented on SPARK-14381: [~yinxusen] is there anything outstanding in parity for

[jira] [Resolved] (SPARK-7159) Support multiclass logistic regression in spark.ml

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-7159. --- Resolution: Fixed Fix Version/s: 2.1.0 > Support multiclass logistic regression in

[jira] [Commented] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

2016-08-19 Thread Qian Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428213#comment-15428213 ] Qian Huang commented on SPARK-17090: Thanks :) > Make tree aggregation level in linear/logistic

[jira] [Updated] (SPARK-16961) Utils.randomizeInPlace does not shuffle arrays uniformly

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16961: -- Assignee: Nicholas > Utils.randomizeInPlace does not shuffle arrays uniformly >

[jira] [Updated] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Alberto Bonsanto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alberto Bonsanto updated SPARK-17141: - Description: When you have a {{DataFrame}} with a column named {{features}}, which is a

[jira] [Updated] (SPARK-16965) Fix bound checking for SparseVector

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16965: -- Assignee: Jeff Zhang > Fix bound checking for SparseVector > --- > >

[jira] [Resolved] (SPARK-16965) Fix bound checking for SparseVector

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16965. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14555

[jira] [Assigned] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-17141: --- Assignee: Yanbo Liang > MinMaxScaler behaves weird when min and max have the same value and

[jira] [Comment Edited] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427965#comment-15427965 ] Takeshi Yamamuro edited comment on SPARK-15382 at 8/19/16 10:23 AM:

[jira] [Commented] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427965#comment-15427965 ] Takeshi Yamamuro commented on SPARK-15382: -- @rxin [~viirya] Seems this ticket has already been

[jira] [Resolved] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17141. - Resolution: Fixed Fix Version/s: 2.1.0 > MinMaxScaler behaves weird when min and max have

[jira] [Commented] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Alberto Bonsanto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427996#comment-15427996 ] Alberto Bonsanto commented on SPARK-17141: -- Just a question, I had to be the pull requester in

[jira] [Commented] (SPARK-17094) provide simplified API for ML pipeline

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428038#comment-15428038 ] Nick Pentreath commented on SPARK-17094: What about input/output columns? We could set the input

[jira] [Commented] (SPARK-7159) Support multiclass logistic regression in spark.ml

2016-08-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428256#comment-15428256 ] Nick Pentreath commented on SPARK-7159: --- SPARK-17133 is a set of follow-ups from this JIRA >

[jira] [Commented] (SPARK-10713) SPARK_DIST_CLASSPATH ignored on Mesos executors

2016-08-19 Thread Wolfgang Buchner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428264#comment-15428264 ] Wolfgang Buchner commented on SPARK-10713: -- i am currently testing spark 2.0 with mesos and it

[jira] [Commented] (SPARK-17148) NodeManager exit because of exception “Executor is not registered”

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428289#comment-15428289 ] Thomas Graves commented on SPARK-17148: --- If this is causing the nodemanager to die this is bad and

[jira] [Created] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-08-19 Thread Dmitri Carpov (JIRA)
Dmitri Carpov created SPARK-17153: - Summary: [Structured streams] readStream ignores partition columns Key: SPARK-17153 URL: https://issues.apache.org/jira/browse/SPARK-17153 Project: Spark

[jira] [Resolved] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-11227. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Spark1.5+ HDFS HA

[jira] [Resolved] (SPARK-16673) New Executor Page displays columns that used to be conditionally hidden

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-16673. --- Resolution: Fixed Fix Version/s: 2.1.0 > New Executor Page displays columns that used

[jira] [Updated] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-11227: -- Assignee: Kousuke Saruta > Spark1.5+ HDFS HA mode throw java.net.UnknownHostException:

  1   2   >