[jira] [Updated] (SPARK-15698) Ability to remove old metadata for structure streaming MetadataLog

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15698: Target Version/s: 2.0.1, 2.1.0 > Ability to remove old metadata for structure streaming MetadataLog

[jira] [Created] (SPARK-17167) Issue Exceptions when Analyze Table on In-Memory Cataloged Tables

2016-08-19 Thread Xiao Li (JIRA)
Xiao Li created SPARK-17167: --- Summary: Issue Exceptions when Analyze Table on In-Memory Cataloged Tables Key: SPARK-17167 URL: https://issues.apache.org/jira/browse/SPARK-17167 Project: Spark Issu

[jira] [Updated] (SPARK-17167) Issue Exceptions when Analyze Table on In-Memory Cataloged Tables

2016-08-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-17167: Description: Currently, `Analyze Table` is only for Hive-serde tables. We should issue exceptions in all th

[jira] [Resolved] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-15018. - Resolution: Fixed Fix Version/s: 2.1.0 > PySpark ML Pipeline raises unclear error when no

[jira] [Commented] (SPARK-17165) FileStreamSource should not track the list of seen files indefinitely

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429261#comment-15429261 ] Apache Spark commented on SPARK-17165: -- User 'petermaxlee' has created a pull reques

[jira] [Assigned] (SPARK-17165) FileStreamSource should not track the list of seen files indefinitely

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17165: Assignee: (was: Apache Spark) > FileStreamSource should not track the list of seen fil

[jira] [Assigned] (SPARK-17165) FileStreamSource should not track the list of seen files indefinitely

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17165: Assignee: Apache Spark > FileStreamSource should not track the list of seen files indefini

[jira] [Commented] (SPARK-17138) Python API for multinomial logistic regression

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429258#comment-15429258 ] Yanbo Liang commented on SPARK-17138: - [~WeichenXu123] Please hold on this task, sinc

[jira] [Commented] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429255#comment-15429255 ] Yanbo Liang commented on SPARK-17137: - Yes, I will do some performance test to weigh

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429253#comment-15429253 ] Yanbo Liang commented on SPARK-17136: - Yes, only first order optimizer can scale well

[jira] [Assigned] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17166: Assignee: (was: Apache Spark) > CTAS lost table properties after conversion to data so

[jira] [Commented] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429244#comment-15429244 ] Apache Spark commented on SPARK-17166: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17166: Assignee: Apache Spark > CTAS lost table properties after conversion to data source tables

[jira] [Created] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Xiao Li (JIRA)
Xiao Li created SPARK-17166: --- Summary: CTAS lost table properties after conversion to data source tables. Key: SPARK-17166 URL: https://issues.apache.org/jira/browse/SPARK-17166 Project: Spark Iss

[jira] [Commented] (SPARK-16757) Set up caller context to HDFS

2016-08-19 Thread Weiqing Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429237#comment-15429237 ] Weiqing Yang commented on SPARK-16757: -- Thanks, [~srowen]. When Spark applications r

[jira] [Created] (SPARK-17165) FileStreamSource should not track the list of seen files indefinitely

2016-08-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-17165: --- Summary: FileStreamSource should not track the list of seen files indefinitely Key: SPARK-17165 URL: https://issues.apache.org/jira/browse/SPARK-17165 Project: Spark

[jira] [Resolved] (SPARK-17150) Support SQL generation for inline tables

2016-08-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17150. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull req

[jira] [Updated] (SPARK-17150) Support SQL generation for inline tables

2016-08-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17150: Assignee: Peter Lee > Support SQL generation for inline tables > --

[jira] [Commented] (SPARK-16862) Configurable buffer size in `UnsafeSorterSpillReader`

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429226#comment-15429226 ] Apache Spark commented on SPARK-16862: -- User 'tejasapatil' has created a pull reques

[jira] [Closed] (SPARK-16264) Allow the user to use operators on the received DataFrame

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-16264. --- Resolution: Won't Fix > Allow the user to use operators on the received DataFrame > -

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429196#comment-15429196 ] Yanbo Liang commented on SPARK-17134: - [~qhuang] Please feel free to take this task a

[jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429195#comment-15429195 ] Reynold Xin commented on SPARK-17164: - I tried in Postgres: {code} rxin=# create tab

[jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429194#comment-15429194 ] Reynold Xin commented on SPARK-17164: - This is actually valid? > Query with colon in

[jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429193#comment-15429193 ] Sital Kedia commented on SPARK-17164: - cc - [~hvanhovell], [~rxin] > Query with colo

[jira] [Created] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-17164: --- Summary: Query with colon in the table name fails to parse in 2.0 Key: SPARK-17164 URL: https://issues.apache.org/jira/browse/SPARK-17164 Project: Spark Issue

[jira] [Resolved] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17158. - Resolution: Fixed Assignee: Srinath Fix Version/s: 2.1.0 2.0.1

[jira] [Resolved] (SPARK-17149) array.sql for testing array related functions

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17149. - Resolution: Fixed Assignee: Peter Lee Fix Version/s: 2.1.0 2.0.

[jira] [Created] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-19 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17163: Summary: Decide on unified multinomial and binary logistic regression interfaces Key: SPARK-17163 URL: https://issues.apache.org/jira/browse/SPARK-17163 Proje

[jira] [Commented] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429069#comment-15429069 ] DB Tsai commented on SPARK-17151: - [~sethah] I think it sort of makes sense that we allow

[jira] [Comment Edited] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429066#comment-15429066 ] DB Tsai edited comment on SPARK-17151 at 8/19/16 11:49 PM: --- Not

[jira] [Commented] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429066#comment-15429066 ] DB Tsai commented on SPARK-17151: - BTW, not only the zero coefficients issues but also th

[jira] [Assigned] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17161: Assignee: (was: Apache Spark) > Add PySpark-ML JavaWrapper convenience function to cre

[jira] [Commented] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429043#comment-15429043 ] Apache Spark commented on SPARK-17161: -- User 'BryanCutler' has created a pull reques

[jira] [Assigned] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17161: Assignee: Apache Spark > Add PySpark-ML JavaWrapper convenience function to create py4j Ja

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429039#comment-15429039 ] DB Tsai commented on SPARK-17136: - Typically, the first order optimizer will take a funct

[jira] [Updated] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-17161: - Summary: Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays (was: Add PyS

[jira] [Comment Edited] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429025#comment-15429025 ] DB Tsai edited comment on SPARK-17137 at 8/19/16 11:16 PM: --- Cur

[jira] [Commented] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429025#comment-15429025 ] DB Tsai commented on SPARK-17137: - Currently, for LiR or BLOR, we always do `Vector.compr

[jira] [Assigned] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17162: Assignee: (was: Apache Spark) > Range does not support SQL generation > --

[jira] [Commented] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429018#comment-15429018 ] Apache Spark commented on SPARK-17162: -- User 'ericl' has created a pull request for

[jira] [Assigned] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17162: Assignee: Apache Spark > Range does not support SQL generation > -

[jira] [Created] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Eric Liang (JIRA)
Eric Liang created SPARK-17162: -- Summary: Range does not support SQL generation Key: SPARK-17162 URL: https://issues.apache.org/jira/browse/SPARK-17162 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-17161) Add PySpark-ML JavaWrapper convienience function to create py4j JavaArrays

2016-08-19 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-17161: Summary: Add PySpark-ML JavaWrapper convienience function to create py4j JavaArrays Key: SPARK-17161 URL: https://issues.apache.org/jira/browse/SPARK-17161 Project: S

[jira] [Commented] (SPARK-17140) Add initial model to MultinomialLogisticRegression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429003#comment-15429003 ] DB Tsai commented on SPARK-17140: - Since we're doing smoothing, the intercepts computed f

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2016-08-19 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428988#comment-15428988 ] Nicholas Chammas commented on SPARK-17025: -- {quote} We'd need to figure out a go

[jira] [Resolved] (SPARK-17128) Schema is not Created for nested Json Array objects

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17128. --- Resolution: Invalid Target Version/s: (was: 2.0.0) This is not a reasonable description o

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2016-08-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428906#comment-15428906 ] Joseph K. Bradley commented on SPARK-17025: --- I'd call this a new API, not a bug

[jira] [Updated] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2016-08-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17025: -- Issue Type: New Feature (was: Bug) > Cannot persist PySpark ML Pipeline model that inc

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the DSE (Datastax enterprise) spark shell

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the DSE (Datastax enterprise) spark shell

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the DSE (Datastax enterprise) spark shell

[jira] [Resolved] (SPARK-16443) ALS wrapper in SparkR

2016-08-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-16443. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14384 [https://g

[jira] [Comment Edited] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428848#comment-15428848 ] DB Tsai edited comment on SPARK-17134 at 8/19/16 9:21 PM: -- It ma

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428848#comment-15428848 ] DB Tsai commented on SPARK-17134: - {code:borderStyle=solid} val margins = Array.ofDim[Dou

[jira] [Closed] (SPARK-16569) Use Cython to speed up Pyspark internals

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-16569. -- Resolution: Won't Fix > Use Cython to speed up Pyspark internals >

[jira] [Commented] (SPARK-16569) Use Cython to speed up Pyspark internals

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428843#comment-15428843 ] Davies Liu commented on SPARK-16569: Agreed to [~robert3005]. Another options could b

[jira] [Commented] (SPARK-13286) JDBC driver doesn't report full exception

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428762#comment-15428762 ] Apache Spark commented on SPARK-13286: -- User 'davies' has created a pull request for

[jira] [Commented] (SPARK-13342) Cannot run INSERT statements in Spark

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428753#comment-15428753 ] Dongjoon Hyun commented on SPARK-13342: --- Hi, All. Just to make this issue up-to-dat

[jira] [Commented] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428751#comment-15428751 ] Xin Ren commented on SPARK-17157: - I guess a lot more ml algorithms are still missing R w

[jira] [Commented] (SPARK-17024) Weird behaviour of the DataFrame when a column name contains dots.

2016-08-19 Thread Iaroslav Zeigerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428745#comment-15428745 ] Iaroslav Zeigerman commented on SPARK-17024: Issue occurs only when reading t

[jira] [Updated] (SPARK-17024) Weird behaviour of the DataFrame when a column name contains dots.

2016-08-19 Thread Iaroslav Zeigerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iaroslav Zeigerman updated SPARK-17024: --- Affects Version/s: (was: 1.6.0) 2.0.0 > Weird behaviour of

[jira] [Reopened] (SPARK-17024) Weird behaviour of the DataFrame when a column name contains dots.

2016-08-19 Thread Iaroslav Zeigerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iaroslav Zeigerman reopened SPARK-17024: The issue occurs in Spark 2.0.0. Now it's even worse. I can't even get an rdd from a D

[jira] [Created] (SPARK-17160) GetExternalRowField does not properly escape field names, causing generated code not to compile

2016-08-19 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-17160: -- Summary: GetExternalRowField does not properly escape field names, causing generated code not to compile Key: SPARK-17160 URL: https://issues.apache.org/jira/browse/SPARK-17160

[jira] [Commented] (SPARK-17159) Improve FileInputDStream.findNewFiles list performance

2016-08-19 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428677#comment-15428677 ] Steve Loughran commented on SPARK-17159: # the most minimal change is to get rid

[jira] [Created] (SPARK-17159) Improve FileInputDStream.findNewFiles list performance

2016-08-19 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-17159: -- Summary: Improve FileInputDStream.findNewFiles list performance Key: SPARK-17159 URL: https://issues.apache.org/jira/browse/SPARK-17159 Project: Spark Is

[jira] [Commented] (SPARK-10746) count ( distinct columnref) over () returns wrong result set

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428617#comment-15428617 ] Dongjoon Hyun commented on SPARK-10746: --- Just as an update, Spark 2.0 now raises an

[jira] [Updated] (SPARK-17113) Job failure due to Executor OOM in offheap mode

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-17113: --- Assignee: Sital Kedia > Job failure due to Executor OOM in offheap mode > ---

[jira] [Resolved] (SPARK-17113) Job failure due to Executor OOM in offheap mode

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-17113. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Job failure due to Executo

[jira] [Assigned] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17158: Assignee: Apache Spark > Improve error message for numeric literal parsing > -

[jira] [Assigned] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17158: Assignee: (was: Apache Spark) > Improve error message for numeric literal parsing > --

[jira] [Commented] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428587#comment-15428587 ] Apache Spark commented on SPARK-17158: -- User 'srinathshankar' has created a pull req

[jira] [Assigned] (SPARK-13286) JDBC driver doesn't report full exception

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-13286: -- Assignee: Davies Liu > JDBC driver doesn't report full exception > ---

[jira] [Created] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Srinath (JIRA)
Srinath created SPARK-17158: --- Summary: Improve error message for numeric literal parsing Key: SPARK-17158 URL: https://issues.apache.org/jira/browse/SPARK-17158 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15382: Fix Version/s: 2.1.0 2.0.1 > monotonicallyIncreasingId doesn't work when data is

[jira] [Updated] (SPARK-16686) Dataset.sample with seed: result seems to depend on downstream usage

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16686: Fix Version/s: 2.0.1 > Dataset.sample with seed: result seems to depend on downstream usage > -

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-19 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428557#comment-15428557 ] Xusen Yin commented on SPARK-14381: --- I believe we can resolve this. > Review spark.ml

[jira] [Commented] (SPARK-10401) spark-submit --unsupervise

2016-08-19 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428572#comment-15428572 ] Michael Gummelt commented on SPARK-10401: - This should probably be a separate JIR

[jira] [Updated] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miao Wang updated SPARK-17157: -- Component/s: SparkR > Add multiclass logistic regression SparkR Wrapper > -

[jira] [Commented] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428519#comment-15428519 ] Apache Spark commented on SPARK-12868: -- User 'Parth-Brahmbhatt' has created a pull r

[jira] [Commented] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428518#comment-15428518 ] Miao Wang commented on SPARK-17157: --- [~felixcheung] Shall we add it to SparkR? I open t

[jira] [Created] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-17157: - Summary: Add multiclass logistic regression SparkR Wrapper Key: SPARK-17157 URL: https://issues.apache.org/jira/browse/SPARK-17157 Project: Spark Issue Type: New F

[jira] [Commented] (SPARK-17156) Add multiclass logistic regression Scala Example

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428509#comment-15428509 ] Miao Wang commented on SPARK-17156: --- I will submit PR soon. > Add multiclass logistic

[jira] [Created] (SPARK-17156) Add multiclass logistic regression Scala Example

2016-08-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-17156: - Summary: Add multiclass logistic regression Scala Example Key: SPARK-17156 URL: https://issues.apache.org/jira/browse/SPARK-17156 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:java} case class A

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:scala} case class

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:java} case class A

[jira] [Created] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
Mikael Valot created SPARK-17155: Summary: usage of a Dataset inside a Future throws MissingRequirementError Key: SPARK-17155 URL: https://issues.apache.org/jira/browse/SPARK-17155 Project: Spark

[jira] [Closed] (SPARK-16152) `In` predicate does not work with null values

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-16152. - Resolution: Invalid Hi, [~fushar]. This seems to be a SQL question. [~kevinyu98] is right. Spark

[jira] [Closed] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-15382. --- Resolution: Fixed > monotonicallyIncreasingId doesn't work when data is upsampled > -

[jira] [Resolved] (SPARK-16197) Cleanup PySpark status api and example

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-16197. -- Resolution: Won't Fix This minor change is would be better addressed during a QA audit > Clean

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Description: When fitting a PySpark Pipeline with no stages, it should work as an identity trans

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Description: When fitting a PySpark Pipeline with no stages, it should work as an identity trans

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Summary: PySpark ML Pipeline raises unclear error when no stages set (was: PySpark ML Pipeline f

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Issue Type: Improvement (was: Bug) > PySpark ML Pipeline fails when no stages set >

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Priority: Minor (was: Major) > PySpark ML Pipeline fails when no stages set > --

[jira] [Assigned] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17154: Assignee: Apache Spark > Wrong result can be returned or AnalysisException can be thrown a

[jira] [Assigned] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17154: Assignee: (was: Apache Spark) > Wrong result can be returned or AnalysisException can

[jira] [Commented] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428438#comment-15428438 ] Apache Spark commented on SPARK-17154: -- User 'sarutak' has created a pull request fo

[jira] [Commented] (SPARK-17135) Consolidate code in linear/logistic regression where possible

2016-08-19 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428401#comment-15428401 ] Gayathri Murali commented on SPARK-17135: - I can work on this > Consolidate code

[jira] [Reopened] (SPARK-13331) Spark network encryption optimization

2016-08-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reopened SPARK-13331: > Spark network encryption optimization > - > >

  1   2   >