[jira] [Commented] (SPARK-20052) Some InputDStream needs closing processing after processing all batches when graceful shutdown

2017-03-21 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935789#comment-15935789 ] Sasaki Toru commented on SPARK-20052: - My explain is not good, sorry. This ticket is related to

[jira] [Updated] (SPARK-19925) SparkR spark.getSparkFiles fails on executor

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-19925: Fix Version/s: 2.1.1 > SparkR spark.getSparkFiles fails on executor >

[jira] [Resolved] (SPARK-19925) SparkR spark.getSparkFiles fails on executor

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-19925. - Resolution: Fixed > SparkR spark.getSparkFiles fails on executor >

[jira] [Updated] (SPARK-19925) SparkR spark.getSparkFiles fails on executor

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-19925: Target Version/s: 2.2.0 Fix Version/s: 2.2.0 > SparkR spark.getSparkFiles fails on executor

[jira] [Assigned] (SPARK-19925) SparkR spark.getSparkFiles fails on executor

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-19925: --- Assignee: Yanbo Liang > SparkR spark.getSparkFiles fails on executor >

[jira] [Updated] (SPARK-20052) Some InputDStream needs closing processing after processing all batches when graceful shutdown

2017-03-21 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-20052: Summary: Some InputDStream needs closing processing after processing all batches when graceful

[jira] [Commented] (SPARK-20052) Some InputDStream needs closing processing after all batches processed when graceful shutdown

2017-03-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935768#comment-15935768 ] Sean Owen commented on SPARK-20052: --- What do you have in mind? I don't think stopping the stream makes

[jira] [Resolved] (SPARK-20030) Add Event Time based Timeout

2017-03-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-20030. --- Resolution: Fixed Issue resolved by pull request 17361

[jira] [Updated] (SPARK-13947) The error message from using an invalid table reference is not clear

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-13947: Priority: Minor (was: Major) > The error message from using an invalid table reference is not clear >

[jira] [Updated] (SPARK-13947) PySpark DataFrames: The error message from using an invalid table reference is not clear

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-13947: Component/s: (was: PySpark) SQL > PySpark DataFrames: The error message from using an

[jira] [Updated] (SPARK-13947) The error message from using an invalid table reference is not clear

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-13947: Summary: The error message from using an invalid table reference is not clear (was: PySpark DataFrames:

[jira] [Commented] (SPARK-20035) Spark 2.0.2 writes empty file if no record is in the dataset

2017-03-21 Thread Ryan Magnusson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935693#comment-15935693 ] Ryan Magnusson commented on SPARK-20035: I'd like to start looking into this if no one else is

[jira] [Comment Edited] (SPARK-3165) DecisionTree does not use sparsity in data

2017-03-21 Thread Facai Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935646#comment-15935646 ] Facai Yan edited comment on SPARK-3165 at 3/22/17 1:57 AM: --- Do you mean that:

[jira] [Commented] (SPARK-3165) DecisionTree does not use sparsity in data

2017-03-21 Thread Facai Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935646#comment-15935646 ] Facai Yan commented on SPARK-3165: -- Do you mean that: TreePoint.binnedFeatures is Array[int], which

[jira] [Resolved] (SPARK-20051) Fix StreamSuite.recover from v2.1 checkpoint failing with IOException

2017-03-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-20051. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17382

[jira] [Commented] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935629#comment-15935629 ] Xiao Li commented on SPARK-20009: - [~marmbrus] Does it sound OK to you? > Use user-friendly DDL formats

[jira] [Updated] (SPARK-20052) Some InputDStream needs closing processing after all batches processed when graceful shutdown

2017-03-21 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-20052: Description: Some class extend InputDStream needs closing processing after processing all batches

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935620#comment-15935620 ] Hyukjin Kwon commented on SPARK-20008: -- Thank you for your kind explanation. I think you are more

[jira] [Commented] (SPARK-20054) [Mesos] Detectability for resource starvation

2017-03-21 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935605#comment-15935605 ] Michael Gummelt commented on SPARK-20054: - Sounds like this could be solved just by having some

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935603#comment-15935603 ] Xiao Li commented on SPARK-20008: - In the traditional RDBMS, we do not allow users to create a table with

[jira] [Comment Edited] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935599#comment-15935599 ] Takeshi Yamamuro edited comment on SPARK-20009 at 3/22/17 1:02 AM: --- I

[jira] [Commented] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935599#comment-15935599 ] Takeshi Yamamuro commented on SPARK-20009: -- I meant we support both a json-format and a new DDL

subscribe to spark issues

2017-03-21 Thread Yash Sharma
subscribe to spark issues

[jira] [Resolved] (SPARK-19919) Defer input path validation into DataSource in CSV datasource

2017-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19919. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17256

[jira] [Assigned] (SPARK-19919) Defer input path validation into DataSource in CSV datasource

2017-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-19919: --- Assignee: Hyukjin Kwon > Defer input path validation into DataSource in CSV datasource >

[jira] [Updated] (SPARK-19980) Basic Dataset transformation on POJOs does not preserves nulls.

2017-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-19980: Fix Version/s: 2.1.1 > Basic Dataset transformation on POJOs does not preserves nulls. >

[jira] [Created] (SPARK-20054) [Mesos] Detectability for resource starvation

2017-03-21 Thread Kamal Gurala (JIRA)
Kamal Gurala created SPARK-20054: Summary: [Mesos] Detectability for resource starvation Key: SPARK-20054 URL: https://issues.apache.org/jira/browse/SPARK-20054 Project: Spark Issue Type:

[jira] [Commented] (SPARK-20053) Can't select col when the dot (.) in col name

2017-03-21 Thread Xuxiang Mao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935568#comment-15935568 ] Xuxiang Mao commented on SPARK-20053: - This is how my code looks like: String cmdOutputFile

[jira] [Created] (SPARK-20053) Can't select col when the dot (.) in col name

2017-03-21 Thread Xuxiang Mao (JIRA)
Xuxiang Mao created SPARK-20053: --- Summary: Can't select col when the dot (.) in col name Key: SPARK-20053 URL: https://issues.apache.org/jira/browse/SPARK-20053 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20051) Fix StreamSuite.recover from v2.1 checkpoint failing with IOException

2017-03-21 Thread Kunal Khamar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kunal Khamar updated SPARK-20051: - Description: There is a race condition between calling stop on a streaming query and deleting

[jira] [Assigned] (SPARK-20051) Fix StreamSuite.recover from v2.1 checkpoint failing with IOException

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20051: Assignee: (was: Apache Spark) > Fix StreamSuite.recover from v2.1 checkpoint failing

[jira] [Assigned] (SPARK-20051) Fix StreamSuite.recover from v2.1 checkpoint failing with IOException

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20051: Assignee: Apache Spark > Fix StreamSuite.recover from v2.1 checkpoint failing with

[jira] [Commented] (SPARK-20051) Fix StreamSuite.recover from v2.1 checkpoint failing with IOException

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935564#comment-15935564 ] Apache Spark commented on SPARK-20051: -- User 'kunalkhamar' has created a pull request for this

[jira] [Created] (SPARK-20052) Some InputDStream needs closing processing after all batches processed when graceful shutdown

2017-03-21 Thread Sasaki Toru (JIRA)
Sasaki Toru created SPARK-20052: --- Summary: Some InputDStream needs closing processing after all batches processed when graceful shutdown Key: SPARK-20052 URL: https://issues.apache.org/jira/browse/SPARK-20052

[jira] [Created] (SPARK-20051) Fix StreamSuite.recover from v2.1 checkpoint failing with IOException

2017-03-21 Thread Kunal Khamar (JIRA)
Kunal Khamar created SPARK-20051: Summary: Fix StreamSuite.recover from v2.1 checkpoint failing with IOException Key: SPARK-20051 URL: https://issues.apache.org/jira/browse/SPARK-20051 Project: Spark

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935540#comment-15935540 ] Hyukjin Kwon commented on SPARK-20008: -- [~smilegator], it seems the discussion is about deuplicates

[jira] [Updated] (SPARK-20047) Constrained Logistic Regression

2017-03-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-20047: Description: For certain applications, such as stacked regressions, it is important to put non-negative

[jira] [Updated] (SPARK-20047) Constrained Logistic Regression

2017-03-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-20047: Description: For certain applications, such as stacked regressions, it is important to put non-negative

[jira] [Updated] (SPARK-20047) Constrained Logistic Regression

2017-03-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-20047: Description: For certain applications, such as stacked regressions, it is important to put non-negative

[jira] [Updated] (SPARK-20047) Constrained Logistic Regression

2017-03-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-20047: Description: For certain applications, such as stacked regressions, it is important to put non-negative

[jira] [Updated] (SPARK-20050) Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown

2017-03-21 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-20050: Description: I use Kafka 0.10 DirectStream with properties 'enable.auto.commit=false' and call

[jira] [Created] (SPARK-20050) Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown

2017-03-21 Thread Sasaki Toru (JIRA)
Sasaki Toru created SPARK-20050: --- Summary: Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown Key: SPARK-20050 URL: https://issues.apache.org/jira/browse/SPARK-20050

[jira] [Assigned] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20023: Assignee: Apache Spark (was: Xiao Li) > Can not see table comment when describe

[jira] [Assigned] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20023: Assignee: Xiao Li (was: Apache Spark) > Can not see table comment when describe

[jira] [Commented] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935507#comment-15935507 ] Apache Spark commented on SPARK-20023: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-03-21 Thread Ron Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ron Hu updated SPARK-19408: --- Target Version/s: 2.3.0 (was: 2.2.0) > cardinality estimation involving two columns of the same table >

[jira] [Comment Edited] (SPARK-20004) Spark thrift server ovewrites spark.app.name

2017-03-21 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935472#comment-15935472 ] Bo Meng edited comment on SPARK-20004 at 3/21/17 10:32 PM: --- I think you can

[jira] [Commented] (SPARK-20004) Spark thrift server ovewrites spark.app.name

2017-03-21 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935472#comment-15935472 ] Bo Meng commented on SPARK-20004: - I think you can still use --name for your app name. for example,

[jira] [Created] (SPARK-20049) Writing data to Parquet with partitions takes very long after the job finishes

2017-03-21 Thread Jakub Nowacki (JIRA)
Jakub Nowacki created SPARK-20049: - Summary: Writing data to Parquet with partitions takes very long after the job finishes Key: SPARK-20049 URL: https://issues.apache.org/jira/browse/SPARK-20049

[jira] [Comment Edited] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2017-03-21 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935409#comment-15935409 ] Irina Truong edited comment on SPARK-4296 at 3/21/17 10:01 PM: --- I have the

[jira] [Comment Edited] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2017-03-21 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935409#comment-15935409 ] Irina Truong edited comment on SPARK-4296 at 3/21/17 9:59 PM: -- I have the

[jira] [Commented] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2017-03-21 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935409#comment-15935409 ] Irina Truong commented on SPARK-4296: - I'm have the same exception with pyspark when my expression

[jira] [Assigned] (SPARK-19237) SparkR package on Windows waiting for a long time when no java is found launching spark-submit

2017-03-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman reassigned SPARK-19237: - Assignee: Felix Cheung > SparkR package on Windows waiting for a long

[jira] [Resolved] (SPARK-19237) SparkR package on Windows waiting for a long time when no java is found launching spark-submit

2017-03-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-19237. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Issue

[jira] [Assigned] (SPARK-20048) Cloning SessionState does not clone query execution listeners

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20048: Assignee: Apache Spark > Cloning SessionState does not clone query execution listeners >

[jira] [Commented] (SPARK-20048) Cloning SessionState does not clone query execution listeners

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935355#comment-15935355 ] Apache Spark commented on SPARK-20048: -- User 'kunalkhamar' has created a pull request for this

[jira] [Assigned] (SPARK-20048) Cloning SessionState does not clone query execution listeners

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20048: Assignee: (was: Apache Spark) > Cloning SessionState does not clone query execution

[jira] [Created] (SPARK-20048) Cloning SessionState does not clone query execution listeners

2017-03-21 Thread Kunal Khamar (JIRA)
Kunal Khamar created SPARK-20048: Summary: Cloning SessionState does not clone query execution listeners Key: SPARK-20048 URL: https://issues.apache.org/jira/browse/SPARK-20048 Project: Spark

[jira] [Assigned] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-20023: --- Assignee: Xiao Li > Can not see table comment when describe formatted table >

[jira] [Commented] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935320#comment-15935320 ] Xiao Li commented on SPARK-20023: - {{DESC EXTENDED}} works. Obviously, {{DESC FORMATTED}} has a bug >

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-03-21 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935304#comment-15935304 ] Miao Wang commented on SPARK-19634: --- Comments never come to email box. [~timhunter] I can continue with

[jira] [Created] (SPARK-20047) Constrained Logistic Regression

2017-03-21 Thread DB Tsai (JIRA)
DB Tsai created SPARK-20047: --- Summary: Constrained Logistic Regression Key: SPARK-20047 URL: https://issues.apache.org/jira/browse/SPARK-20047 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935278#comment-15935278 ] Xiao Li commented on SPARK-20008: - See the discussion

[jira] [Commented] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935264#comment-15935264 ] Xiao Li commented on SPARK-20009: - Are you suggesting to change the semantics of the parameter of the

[jira] [Commented] (SPARK-20044) Support Spark UI behind front-end reverse proxy using a path prefix

2017-03-21 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935223#comment-15935223 ] Alex Bozarth commented on SPARK-20044: -- I like this idea in theory, but I worried it would take a

[jira] [Assigned] (SPARK-20046) Facilitate loop optimizations in a JIT compiler regarding sqlContext.read.parquet()

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20046: Assignee: Apache Spark > Facilitate loop optimizations in a JIT compiler regarding >

[jira] [Assigned] (SPARK-20046) Facilitate loop optimizations in a JIT compiler regarding sqlContext.read.parquet()

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20046: Assignee: (was: Apache Spark) > Facilitate loop optimizations in a JIT compiler

[jira] [Commented] (SPARK-20046) Facilitate loop optimizations in a JIT compiler regarding sqlContext.read.parquet()

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935162#comment-15935162 ] Apache Spark commented on SPARK-20046: -- User 'kiszk' has created a pull request for this issue:

[jira] [Updated] (SPARK-20046) Facilitate loop optimizations in a JIT compiler regarding sqlContext.read.parquet()

2017-03-21 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-20046: - Issue Type: Improvement (was: Bug) > Facilitate loop optimizations in a JIT compiler

[jira] [Created] (SPARK-20046) Facilitate loop optimizations in a JIT compiler regarding sqlContext.read.parquet()

2017-03-21 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-20046: Summary: Facilitate loop optimizations in a JIT compiler regarding sqlContext.read.parquet() Key: SPARK-20046 URL: https://issues.apache.org/jira/browse/SPARK-20046

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2017-03-21 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935112#comment-15935112 ] Seth Hendrickson commented on SPARK-17136: -- The reason to support setting them in both places

[jira] [Updated] (SPARK-20017) Functions "str_to_map" and "explode" throws NPE exceptioin

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-20017: Labels: (was: correctness) > Functions "str_to_map" and "explode" throws NPE exceptioin >

[jira] [Resolved] (SPARK-20017) Functions "str_to_map" and "explode" throws NPE exceptioin

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20017. - Resolution: Fixed Assignee: roncenzhao Fix Version/s: 2.2.0 2.1.1 >

[jira] [Commented] (SPARK-20016) SparkLauncher submit job failed after setConf with special charaters under windows

2017-03-21 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15935005#comment-15935005 ] Marcelo Vanzin commented on SPARK-20016: This was a long time ago and mostly trial & error, since

[jira] [Updated] (SPARK-20039) Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest

2017-03-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20039: -- Priority: Minor (was: Major) > Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest >

[jira] [Resolved] (SPARK-20039) Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest

2017-03-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-20039. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17368

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2017-03-21 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934960#comment-15934960 ] Seth Hendrickson commented on SPARK-7129: - I don't think anyone is working on it. Though I'm

[jira] [Commented] (SPARK-17121) Support _HOST replacement for principal

2017-03-21 Thread Chris Gianelloni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934908#comment-15934908 ] Chris Gianelloni commented on SPARK-17121: -- I find this useful when configuring Spark

[jira] [Commented] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934905#comment-15934905 ] Nick Pentreath commented on SPARK-20043: I just noticed the error message you put above says

[jira] [Updated] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-20043: --- Docs Text: (was: I saved a CrossValidatorModel with a decision tree and a random forest. I

[jira] [Updated] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-20043: --- Description: I saved a CrossValidatorModel with a decision tree and a random forest. I use

[jira] [Commented] (SPARK-19934) code comments are not very clearly in BlackListTracker.scala

2017-03-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934869#comment-15934869 ] Imran Rashid commented on SPARK-19934: -- technically, you are right, that "another" isn't really

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2017-03-21 Thread Mohamed Baddar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934864#comment-15934864 ] Mohamed Baddar commented on SPARK-7129: --- [~josephkb] [~sethah] [~meihuawu] [~mlnick] If now one is

[jira] [Resolved] (SPARK-19261) Support `ALTER TABLE table_name ADD COLUMNS(..)` statement

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19261. - Resolution: Fixed Assignee: Xin Wu Fix Version/s: 2.2.0 > Support `ALTER TABLE

[jira] [Resolved] (SPARK-20041) Update docs for NaN handling in approxQuantile

2017-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20041. - Resolution: Fixed Assignee: zhengruifeng Fix Version/s: 2.2.0 > Update docs for NaN

[jira] [Comment Edited] (SPARK-17136) Design optimizer interface for ML algorithms

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934736#comment-15934736 ] Yanbo Liang edited comment on SPARK-17136 at 3/21/17 3:22 PM: -- [~sethah]

[jira] [Comment Edited] (SPARK-17136) Design optimizer interface for ML algorithms

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934736#comment-15934736 ] Yanbo Liang edited comment on SPARK-17136 at 3/21/17 3:18 PM: -- [~sethah]

[jira] [Comment Edited] (SPARK-17136) Design optimizer interface for ML algorithms

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934736#comment-15934736 ] Yanbo Liang edited comment on SPARK-17136 at 3/21/17 3:17 PM: -- [~sethah]

[jira] [Comment Edited] (SPARK-17136) Design optimizer interface for ML algorithms

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934736#comment-15934736 ] Yanbo Liang edited comment on SPARK-17136 at 3/21/17 3:17 PM: -- [~sethah]

[jira] [Assigned] (SPARK-19998) BlockRDD block not found Exception add RDD id info

2017-03-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-19998: - Assignee: jianran.tfh > BlockRDD block not found Exception add RDD id info >

[jira] [Resolved] (SPARK-19998) BlockRDD block not found Exception add RDD id info

2017-03-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19998. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17334

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934736#comment-15934736 ] Yanbo Liang commented on SPARK-17136: - [~sethah] Thanks for the design doc. One quick question: In

[jira] [Commented] (SPARK-19950) nullable ignored when df.load() is executed for file-based data source

2017-03-21 Thread Jason White (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934719#comment-15934719 ] Jason White commented on SPARK-19950: - Without something that allows us to read using the nullable as

[jira] [Commented] (SPARK-19949) unify bad record handling in CSV and JSON

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15934710#comment-15934710 ] Apache Spark commented on SPARK-19949: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2017-03-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-12664: --- Assignee: Weichen Xu (was: Yanbo Liang) > Expose raw prediction scores in

[jira] [Assigned] (SPARK-20041) Update docs for NaN handling in approxQuantile

2017-03-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20041: Assignee: Apache Spark > Update docs for NaN handling in approxQuantile >

[jira] [Created] (SPARK-20041) Update docs for NaN handling in approxQuantile

2017-03-21 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-20041: Summary: Update docs for NaN handling in approxQuantile Key: SPARK-20041 URL: https://issues.apache.org/jira/browse/SPARK-20041 Project: Spark Issue Type: