[jira] [Commented] (SPARK-20193) Selecting empty struct causes ExpressionEncoder error.

2017-04-03 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954584#comment-15954584 ] Liang-Chi Hsieh commented on SPARK-20193: - Actually I am not sure what {{struct()}} represents.

[jira] [Created] (SPARK-20207) Add ablity to exclude current row in WindowSpec

2017-04-03 Thread Mathew Wicks (JIRA)
Mathew Wicks created SPARK-20207: Summary: Add ablity to exclude current row in WindowSpec Key: SPARK-20207 URL: https://issues.apache.org/jira/browse/SPARK-20207 Project: Spark Issue Type:

[jira] [Commented] (SPARK-20144) spark.read.parquet no long maintains ordering of the data

2017-04-03 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954559#comment-15954559 ] Liang-Chi Hsieh commented on SPARK-20144: - I don't think the API has the guarantee about the data

[jira] [Updated] (SPARK-20079) Re registration of AM hangs spark cluster in yarn-client mode

2017-04-03 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-20079: Description: The ExecutorAllocationManager.reset method is called when re-registering AM, which

[jira] [Commented] (SPARK-11421) Add the ability to add a jar to the current class loader

2017-04-03 Thread Daniel Erenrich (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954533#comment-15954533 ] Daniel Erenrich commented on SPARK-11421: - Is this not basically a duplicate of the much older

[jira] [Commented] (SPARK-20176) Spark Dataframe UDAF issue

2017-04-03 Thread Dinesh Man Amatya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954527#comment-15954527 ] Dinesh Man Amatya commented on SPARK-20176: --- Thanks Kazuaki for the effort. I was able to

[jira] [Updated] (SPARK-20206) spark.ui.killEnabled=false property doesn't reflect on task/stages

2017-04-03 Thread srinivasan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] srinivasan updated SPARK-20206: --- Priority: Minor (was: Major) > spark.ui.killEnabled=false property doesn't reflect on task/stages >

[jira] [Created] (SPARK-20206) spark.ui.killEnabled=false property doesn't reflect on task/stages

2017-04-03 Thread srinivasan (JIRA)
srinivasan created SPARK-20206: -- Summary: spark.ui.killEnabled=false property doesn't reflect on task/stages Key: SPARK-20206 URL: https://issues.apache.org/jira/browse/SPARK-20206 Project: Spark

[jira] [Comment Edited] (SPARK-14726) Support for sampling when inferring schema in CSV data source

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954447#comment-15954447 ] Hyukjin Kwon edited comment on SPARK-14726 at 4/4/17 1:47 AM: -- Actually,

[jira] [Comment Edited] (SPARK-14726) Support for sampling when inferring schema in CSV data source

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954447#comment-15954447 ] Hyukjin Kwon edited comment on SPARK-14726 at 4/4/17 1:40 AM: -- Actually,

[jira] [Resolved] (SPARK-14726) Support for sampling when inferring schema in CSV data source

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14726. -- Resolution: Won't Fix Actually, after re-thinking, it seems we would not need this for now if

[jira] [Resolved] (SPARK-19186) Hash symbol in middle of Sybase database table name causes Spark Exception

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19186. -- Resolution: Not A Problem ^ I agree with this. Also, up to my knowledge, we can deal with the

[jira] [Resolved] (SPARK-10364) Support Parquet logical type TIMESTAMP_MILLIS

2017-04-03 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-10364. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 15332

[jira] [Updated] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-04-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-19408: Description: In SPARK-17075, we estimate cardinality of predicate expression "column (op) literal", where

[jira] [Resolved] (SPARK-19408) cardinality estimation involving two columns of the same table

2017-04-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19408. - Resolution: Fixed Assignee: Ron Hu Fix Version/s: 2.2.0 > cardinality estimation

[jira] [Resolved] (SPARK-20145) "SELECT * FROM range(1)" works, but "SELECT * FROM RANGE(1)" doesn't

2017-04-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-20145. - Resolution: Fixed Assignee: sam elamin Fix Version/s: 2.2.0 > "SELECT * FROM

[jira] [Comment Edited] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954387#comment-15954387 ] Mridul Muralidharan edited comment on SPARK-20205 at 4/4/17 12:15 AM:

[jira] [Commented] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954387#comment-15954387 ] Mridul Muralidharan commented on SPARK-20205: - For history server that will fail - good

[jira] [Resolved] (SPARK-18893) Not support "alter table .. add columns .."

2017-04-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18893. - Resolution: Fixed Fix Version/s: 2.2.0 > Not support "alter table .. add columns .." >

[jira] [Commented] (SPARK-18893) Not support "alter table .. add columns .."

2017-04-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954375#comment-15954375 ] Wenchen Fan commented on SPARK-18893: - https://issues.apache.org/jira/browse/SPARK-19261 > Not

[jira] [Commented] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954357#comment-15954357 ] Marcelo Vanzin commented on SPARK-20205: bq. I was referring to the case where we are persisting

[jira] [Commented] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954348#comment-15954348 ] Mridul Muralidharan commented on SPARK-20205: - bq. I wouldn't say incorrect; at worst it's

[jira] [Commented] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954340#comment-15954340 ] Marcelo Vanzin commented on SPARK-20205: bq. This is nasty ! This means submissionTime will

[jira] [Commented] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954333#comment-15954333 ] Mridul Muralidharan commented on SPARK-20205: - This is nasty ! This means submissionTime will

[jira] [Commented] (SPARK-4899) Support Mesos features: roles and checkpoints

2017-04-03 Thread Kamal Gurala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954312#comment-15954312 ] Kamal Gurala commented on SPARK-4899: - Some performance related concerns

[jira] [Comment Edited] (SPARK-20153) Support Multiple aws credentials in order to access multiple Hive on S3 table in spark application

2017-04-03 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954233#comment-15954233 ] Steve Loughran edited comment on SPARK-20153 at 4/3/17 10:13 PM: - This is

[jira] [Commented] (SPARK-20153) Support Multiple aws credentials in order to access multiple Hive on S3 table in spark application

2017-04-03 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954233#comment-15954233 ] Steve Loughran commented on SPARK-20153: This is fixed in Hadoop 2.8 with [per-bucket

[jira] [Commented] (SPARK-4899) Support Mesos features: roles and checkpoints

2017-04-03 Thread Charles Allen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954212#comment-15954212 ] Charles Allen commented on SPARK-4899: -- It was discussed on the mailing list with [~timchen] that

[jira] [Assigned] (SPARK-20064) Bump the PySpark verison number to 2.2

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20064: Assignee: (was: Apache Spark) > Bump the PySpark verison number to 2.2 >

[jira] [Assigned] (SPARK-20064) Bump the PySpark verison number to 2.2

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20064: Assignee: Apache Spark > Bump the PySpark verison number to 2.2 >

[jira] [Commented] (SPARK-20064) Bump the PySpark verison number to 2.2

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954186#comment-15954186 ] Apache Spark commented on SPARK-20064: -- User 'setjet' has created a pull request for this issue:

[jira] [Commented] (SPARK-4899) Support Mesos features: roles and checkpoints

2017-04-03 Thread Charles Allen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954170#comment-15954170 ] Charles Allen commented on SPARK-4899: --

[jira] [Created] (SPARK-20205) DAGScheduler posts SparkListenerStageSubmitted before updating stage

2017-04-03 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-20205: -- Summary: DAGScheduler posts SparkListenerStageSubmitted before updating stage Key: SPARK-20205 URL: https://issues.apache.org/jira/browse/SPARK-20205 Project:

[jira] [Commented] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954132#comment-15954132 ] Apache Spark commented on SPARK-18278: -- User 'foxish' has created a pull request for this issue:

[jira] [Commented] (SPARK-20176) Spark Dataframe UDAF issue

2017-04-03 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954093#comment-15954093 ] Kazuaki Ishizaki commented on SPARK-20176: -- Thanks. The code seem to work for the master. I am

[jira] [Comment Edited] (SPARK-20176) Spark Dataframe UDAF issue

2017-04-03 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15954093#comment-15954093 ] Kazuaki Ishizaki edited comment on SPARK-20176 at 4/3/17 8:13 PM: --

[jira] [Commented] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-04-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953968#comment-15953968 ] Wenchen Fan commented on SPARK-19659: - What's the smallest unit of fetching remote shuffle blocks?

[jira] [Commented] (SPARK-20204) separate SQLConf into catalyst confs and sql confs

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953930#comment-15953930 ] Apache Spark commented on SPARK-20204: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20204) separate SQLConf into catalyst confs and sql confs

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20204: Assignee: Apache Spark (was: Wenchen Fan) > separate SQLConf into catalyst confs and sql

[jira] [Assigned] (SPARK-20204) separate SQLConf into catalyst confs and sql confs

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20204: Assignee: Wenchen Fan (was: Apache Spark) > separate SQLConf into catalyst confs and sql

[jira] [Created] (SPARK-20204) separate SQLConf into catalyst confs and sql confs

2017-04-03 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-20204: --- Summary: separate SQLConf into catalyst confs and sql confs Key: SPARK-20204 URL: https://issues.apache.org/jira/browse/SPARK-20204 Project: Spark Issue Type:

[jira] [Commented] (SPARK-19979) [MLLIB] Multiple Estimators/Pipelines In CrossValidator

2017-04-03 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953820#comment-15953820 ] Bryan Cutler commented on SPARK-19979: -- >From the discussion in the PR {noformat} val tokenizer =

[jira] [Commented] (SPARK-19712) EXISTS and Left Semi join do not produce the same plan

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953793#comment-15953793 ] Apache Spark commented on SPARK-19712: -- User 'nsyca' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19712) EXISTS and Left Semi join do not produce the same plan

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19712: Assignee: Apache Spark > EXISTS and Left Semi join do not produce the same plan >

[jira] [Assigned] (SPARK-19712) EXISTS and Left Semi join do not produce the same plan

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19712: Assignee: (was: Apache Spark) > EXISTS and Left Semi join do not produce the same

[jira] [Commented] (SPARK-20047) Constrained Logistic Regression

2017-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953728#comment-15953728 ] DB Tsai commented on SPARK-20047: - I changed the target to 2.3.0 Thanks. > Constrained Logistic

[jira] [Updated] (SPARK-20047) Constrained Logistic Regression

2017-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-20047: Affects Version/s: (was: 2.1.0) 2.2.0 Target Version/s: 2.3.0 (was: 2.2.0)

[jira] [Commented] (SPARK-20193) Selecting empty struct causes ExpressionEncoder error.

2017-04-03 Thread Adrian Ionescu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953704#comment-15953704 ] Adrian Ionescu commented on SPARK-20193: cc [~hvanhovell] > Selecting empty struct causes

[jira] [Resolved] (SPARK-20194) Support partition pruning for InMemoryCatalog

2017-04-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20194. - Resolution: Fixed Assignee: Adrian Ionescu Fix Version/s: 2.2.0 > Support partition

[jira] [Issue Comment Deleted] (SPARK-20199) GradientBoostedTreesModel doesn't have Column Sampling Rate Paramenter

2017-04-03 Thread Arush Kharbanda (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arush Kharbanda updated SPARK-20199: Comment: was deleted (was: I will work on this issue.) > GradientBoostedTreesModel

[jira] [Commented] (SPARK-11783) When deployed against remote Hive metastore, HiveContext.executionHive points to wrong metastore

2017-04-03 Thread Jonathan Maron (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953647#comment-15953647 ] Jonathan Maron commented on SPARK-11783: I am running a spark job and, when instantiating a

[jira] [Commented] (SPARK-9272) Persist information of individual partitions when persisting partitioned data source tables to metastore

2017-04-03 Thread Daniel Tomes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953582#comment-15953582 ] Daniel Tomes commented on SPARK-9272: - BUMP This is an important issue. Let's get this resolved. >

[jira] [Commented] (SPARK-20047) Constrained Logistic Regression

2017-04-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953551#comment-15953551 ] Nick Pentreath commented on SPARK-20047: Is this really targeted for 2.2.0? > Constrained

[jira] [Commented] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953489#comment-15953489 ] Sean Owen commented on SPARK-20202: --- Alrighty, you can leave the status for now, but generally

[jira] [Updated] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated SPARK-20202: -- Priority: Blocker (was: Critical) It is against Apache policy to release binaries that aren't

[jira] [Commented] (SPARK-19809) NullPointerException on empty ORC file

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953386#comment-15953386 ] Hyukjin Kwon commented on SPARK-19809: -- Shoudn't it contain footer and schema information or a magic

[jira] [Commented] (SPARK-19809) NullPointerException on empty ORC file

2017-04-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-19809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953341#comment-15953341 ] MichaƂ Dawid commented on SPARK-19809: -- Those empty files have been created while processing with

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953328#comment-15953328 ] Cyril de Vogelaere commented on SPARK-20203: Oh, I thought we were talking about the

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953319#comment-15953319 ] Sean Owen commented on SPARK-20203: --- How can this not have performance implications? you generate more

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953318#comment-15953318 ] Cyril de Vogelaere commented on SPARK-20203: I'm not splitting it, I deleted the other

[jira] [Commented] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953315#comment-15953315 ] Owen O'Malley commented on SPARK-20202: --- I should also say here that the Hive community is willing

[jira] [Comment Edited] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953299#comment-15953299 ] Cyril de Vogelaere edited comment on SPARK-20203 at 4/3/17 11:18 AM: -

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953299#comment-15953299 ] Cyril de Vogelaere commented on SPARK-20203: This cannot have performance implication, we are

[jira] [Commented] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953298#comment-15953298 ] Owen O'Malley commented on SPARK-20202: --- As an Apache member, the Spark project can't release

[jira] [Comment Edited] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953298#comment-15953298 ] Owen O'Malley edited comment on SPARK-20202 at 4/3/17 11:16 AM: As an

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953297#comment-15953297 ] Cyril de Vogelaere commented on SPARK-20203: SPARK-20180 was about adding a special value (0)

[jira] [Closed] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere closed SPARK-20180. -- Resolution: Won't Fix > Add a special value for unlimited max pattern length in Prefix

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953289#comment-15953289 ] Sean Owen commented on SPARK-20203: --- This is again not addressing the point, that doing so has

[jira] [Updated] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20203: --- Description: I think changing the default value to Int.MaxValue would be more user

[jira] [Commented] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953282#comment-15953282 ] Cyril de Vogelaere commented on SPARK-20180: Fine, I thought a TODO left in the code would

[jira] [Commented] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953280#comment-15953280 ] Sean Owen commented on SPARK-20203: --- I don't understand, this is the same as SPARK-20180? > Change

[jira] [Created] (SPARK-20203) Change default maxPatternLength value to Int.MaxValue in PrefixSpan

2017-04-03 Thread Cyril de Vogelaere (JIRA)
Cyril de Vogelaere created SPARK-20203: -- Summary: Change default maxPatternLength value to Int.MaxValue in PrefixSpan Key: SPARK-20203 URL: https://issues.apache.org/jira/browse/SPARK-20203

[jira] [Updated] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20202: -- Priority: Critical (was: Blocker) Fix Version/s: (was: 2.1.1)

[jira] [Commented] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953240#comment-15953240 ] Sean Owen commented on SPARK-20180: --- Surely, the impact is more than an 'if' statement. If you

[jira] [Created] (SPARK-20202) Remove references to org.spark-project.hive

2017-04-03 Thread Owen O'Malley (JIRA)
Owen O'Malley created SPARK-20202: - Summary: Remove references to org.spark-project.hive Key: SPARK-20202 URL: https://issues.apache.org/jira/browse/SPARK-20202 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-19752) OrcGetSplits fails with 0 size files

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19752. -- Resolution: Duplicate It sounds a duplicate of SPARK-19809. Please reopen that if I

[jira] [Resolved] (SPARK-19809) NullPointerException on empty ORC file

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19809. -- Resolution: Invalid I don't think there is 0 byte ORC file. It should have the footer.

[jira] [Comment Edited] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953201#comment-15953201 ] Cyril de Vogelaere edited comment on SPARK-20180 at 4/3/17 9:57 AM:

[jira] [Assigned] (SPARK-19641) JSON schema inference in DROPMALFORMED mode produces incorrect schema

2017-04-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-19641: --- Assignee: Hyukjin Kwon > JSON schema inference in DROPMALFORMED mode produces incorrect

[jira] [Comment Edited] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953201#comment-15953201 ] Cyril de Vogelaere edited comment on SPARK-20180 at 4/3/17 9:45 AM:

[jira] [Resolved] (SPARK-19641) JSON schema inference in DROPMALFORMED mode produces incorrect schema

2017-04-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19641. - Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17492

[jira] [Commented] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953201#comment-15953201 ] Cyril de Vogelaere commented on SPARK-20180: => Why not let the default be Int.MaxValue? I'm

[jira] [Assigned] (SPARK-19969) Doc and examples for Imputer

2017-04-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath reassigned SPARK-19969: -- Assignee: yuhao yang > Doc and examples for Imputer > >

[jira] [Resolved] (SPARK-19969) Doc and examples for Imputer

2017-04-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-19969. Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17324

[jira] [Commented] (SPARK-20090) Add StructType.fieldNames to Python API

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953199#comment-15953199 ] Hyukjin Kwon commented on SPARK-20090: -- [~josephkb], gentle ping. > Add StructType.fieldNames to

[jira] [Commented] (SPARK-20108) Spark query is getting failed with exception

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953196#comment-15953196 ] Hyukjin Kwon commented on SPARK-20108: -- It will help other guys like me to track down the problem

[jira] [Commented] (SPARK-20108) Spark query is getting failed with exception

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953195#comment-15953195 ] Hyukjin Kwon commented on SPARK-20108: -- It seems almost impossible to reproduce to me. Do you mind

[jira] [Updated] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cyril de Vogelaere updated SPARK-20180: --- Summary: Add a special value for unlimited max pattern length in Prefix span, and

[jira] [Comment Edited] (SPARK-20185) csv decompressed incorrectly with extention other than 'gz'

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953183#comment-15953183 ] Hyukjin Kwon edited comment on SPARK-20185 at 4/3/17 9:28 AM: -- {{codec}} or

[jira] [Commented] (SPARK-20185) csv decompressed incorrectly with extention other than 'gz'

2017-04-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953183#comment-15953183 ] Hyukjin Kwon commented on SPARK-20185: -- {{codec}} or {{compression}} is an option for writing out as

[jira] [Resolved] (SPARK-9002) KryoSerializer initialization does not include 'Array[Int]'

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-9002. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17482

[jira] [Commented] (SPARK-20180) Unlimited max pattern length in Prefix span

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953166#comment-15953166 ] Sean Owen commented on SPARK-20180: --- Why not let the default be Int.MaxValue? if that's what this is

[jira] [Assigned] (SPARK-20166) Use XXX for ISO timezone instead of ZZ which is FastDateFormat specific in CSV/JSON time related options

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-20166: - Assignee: Hyukjin Kwon Priority: Minor (was: Trivial) > Use XXX for ISO timezone instead

[jira] [Resolved] (SPARK-20166) Use XXX for ISO timezone instead of ZZ which is FastDateFormat specific in CSV/JSON time related options

2017-04-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20166. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17489

[jira] [Commented] (SPARK-20180) Unlimited max pattern length in Prefix span

2017-04-03 Thread Cyril de Vogelaere (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953160#comment-15953160 ] Cyril de Vogelaere commented on SPARK-20180: Can you not just set a very large max, like

[jira] [Commented] (SPARK-15352) Topology aware block replication

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15953153#comment-15953153 ] Apache Spark commented on SPARK-15352: -- User 'lins05' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15352) Topology aware block replication

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15352: Assignee: Shubham Chopra (was: Apache Spark) > Topology aware block replication >

[jira] [Assigned] (SPARK-15352) Topology aware block replication

2017-04-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15352: Assignee: Apache Spark (was: Shubham Chopra) > Topology aware block replication >

[jira] [Assigned] (SPARK-19985) Some ML Models error when copy or do not set parent

2017-04-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath reassigned SPARK-19985: -- Assignee: Bryan Cutler > Some ML Models error when copy or do not set parent >

[jira] [Resolved] (SPARK-19985) Some ML Models error when copy or do not set parent

2017-04-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-19985. Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17326

  1   2   >