[jira] [Updated] (SPARK-14425) SQL/dataframe join error: mixes up the columns

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Commented] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build as documented

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227773#comment-15227773 ] Apache Spark commented on SPARK-14424: -- User 'holdenk' has created a pull request for this issue:

[jira] [Updated] (SPARK-14425) SQL/dataframe join error: mixes up the columns

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14425) SQL/dataframe join error: mixes up the columns

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Summary: SQL/dataframe join error: mixes up the columns (was: spark SQL/dataframe join

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes up the columns

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Summary: spark SQL/dataframe join error: mixes up the columns (was: spark

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14410) SessionCatalog needs to check function existence

2016-04-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14410: -- Summary: SessionCatalog needs to check function existence (was: SessionCatalog needs to check

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14410) SessionCatalog needs to check function existence

2016-04-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14410: -- Description: Right now, operations for an existing functions in SessionCatalog do not really check if

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Description: I am running this on databricks cloud. I am running a join operation and

[jira] [Assigned] (SPARK-14132) [Table related commands] Alter partition

2016-04-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or reassigned SPARK-14132: - Assignee: Andrew Or > [Table related commands] Alter partition >

[jira] [Updated] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build as documented

2016-04-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-14424: Description: SPARK-13579 disabled building the assembly artifacts. Our shell scripts (specifically

[jira] [Updated] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] venu k tangirala updated SPARK-14425: - Priority: Blocker (was: Major) > spark SQL/dataframe join error: mixes the columns up >

[jira] [Updated] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build as documented

2016-04-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-14424: Summary: spark-class and related (spark-shell, etc.) no longer work with sbt build as documented (was:

[jira] [Created] (SPARK-14425) spark SQL/dataframe join error: mixes the columns up

2016-04-05 Thread venu k tangirala (JIRA)
venu k tangirala created SPARK-14425: Summary: spark SQL/dataframe join error: mixes the columns up Key: SPARK-14425 URL: https://issues.apache.org/jira/browse/SPARK-14425 Project: Spark

[jira] [Resolved] (SPARK-14252) Executors do not try to download remote cached blocks

2016-04-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14252. --- Resolution: Fixed Assignee: Eric Liang Fix Version/s: 2.0.0 > Executors do not try

[jira] [Commented] (SPARK-8338) Ganglia fails to start

2016-04-05 Thread Leonardo Apolonio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227757#comment-15227757 ] Leonardo Apolonio commented on SPARK-8338: -- I am also seeing the error that [~mikeyreilly]

[jira] [Resolved] (SPARK-14416) Add thread-safe comments for CoarseGrainedSchedulerBackend's fields

2016-04-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-14416. --- Resolution: Fixed Assignee: Shixiong Zhu Fix Version/s: 2.0.0 > Add thread-safe

[jira] [Updated] (SPARK-14416) Add thread-safe comments for CoarseGrainedSchedulerBackend's fields

2016-04-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-14416: -- Target Version/s: 2.0.0 > Add thread-safe comments for CoarseGrainedSchedulerBackend's fields >

[jira] [Commented] (SPARK-14409) Investigate adding a RankingEvaluator to ML

2016-04-05 Thread Yong Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227743#comment-15227743 ] Yong Tang commented on SPARK-14409: --- [~mlnick] I can work on this issue if no one has started yet.

[jira] [Commented] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-04-05 Thread Jason Piper (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227722#comment-15227722 ] Jason Piper commented on SPARK-14393: - If this isn't a bug, I guess it needs to be clear in the

[jira] [Assigned] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14424: Assignee: (was: Apache Spark) > spark-class and related (spark-shell, etc.) no longer

[jira] [Assigned] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14424: Assignee: Apache Spark > spark-class and related (spark-shell, etc.) no longer work with

[jira] [Commented] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227707#comment-15227707 ] Apache Spark commented on SPARK-14424: -- User 'holdenk' has created a pull request for this issue:

[jira] [Created] (SPARK-14424) spark-class and related (spark-shell, etc.) no longer work with sbt build

2016-04-05 Thread holdenk (JIRA)
holdenk created SPARK-14424: --- Summary: spark-class and related (spark-shell, etc.) no longer work with sbt build Key: SPARK-14424 URL: https://issues.apache.org/jira/browse/SPARK-14424 Project: Spark

[jira] [Commented] (SPARK-12741) DataFrame count method return wrong size.

2016-04-05 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227688#comment-15227688 ] Stephane Maarek commented on SPARK-12741: - Hi, May be related to:

[jira] [Commented] (SPARK-14249) Change MLReader.read to be a property for PySpark

2016-04-05 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227686#comment-15227686 ] Miao Wang commented on SPARK-14249: --- [~josephkb]I will take a look. Now, I am working on a the

[jira] [Resolved] (SPARK-14413) For data source tables, we should not allow users to set/change partition locations

2016-04-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14413. -- Resolution: Fixed Fix Version/s: 2.0.0 > For data source tables, we should not allow users to

[jira] [Commented] (SPARK-14413) For data source tables, we should not allow users to set/change partition locations

2016-04-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227683#comment-15227683 ] Yin Huai commented on SPARK-14413: -- It has been resolved by https://github.com/apache/spark/pull/12186.

[jira] [Updated] (SPARK-14413) For data source tables, we should not allow users to set/change partition locations

2016-04-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14413: - Summary: For data source tables, we should not allow users to set/change partition locations (was: For

[jira] [Resolved] (SPARK-14296) whole stage codegen support for Dataset.map

2016-04-05 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-14296. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12087

[jira] [Commented] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-04-05 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227671#comment-15227671 ] Takeshi Yamamuro commented on SPARK-13184: -- Yeah, lowering `maxPartitionBytes` increases

[jira] [Commented] (SPARK-14423) Handle jar conflict issue when uploading to distributed cache

2016-04-05 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227664#comment-15227664 ] Saisai Shao commented on SPARK-14423: - I will fix it soon. > Handle jar conflict issue when

[jira] [Created] (SPARK-14423) Handle jar conflict issue when uploading to distributed cache

2016-04-05 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-14423: --- Summary: Handle jar conflict issue when uploading to distributed cache Key: SPARK-14423 URL: https://issues.apache.org/jira/browse/SPARK-14423 Project: Spark

[jira] [Updated] (SPARK-14344) saveAsParquetFile creates _metadata file even when disabled

2016-04-05 Thread Kashish Jain (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kashish Jain updated SPARK-14344: - Summary: saveAsParquetFile creates _metadata file even when disabled (was: saveAsParquetFile

[jira] [Commented] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-04-05 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227657#comment-15227657 ] koert kuipers commented on SPARK-13184: --- i am not familiar with those settings. are they respected

[jira] [Assigned] (SPARK-14300) Scala MLlib examples code merge and clean up

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14300: Assignee: (was: Apache Spark) > Scala MLlib examples code merge and clean up >

[jira] [Commented] (SPARK-14300) Scala MLlib examples code merge and clean up

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227655#comment-15227655 ] Apache Spark commented on SPARK-14300: -- User 'keypointt' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14300) Scala MLlib examples code merge and clean up

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14300: Assignee: Apache Spark > Scala MLlib examples code merge and clean up >

[jira] [Commented] (SPARK-14373) PySpark ml RandomForestClassifier, Regressor support export/import

2016-04-05 Thread Kai Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227646#comment-15227646 ] Kai Jiang commented on SPARK-14373: --- I would work on this one. > PySpark ml RandomForestClassifier,

[jira] [Commented] (SPARK-14412) spark.ml ALS prefered storage level Params

2016-04-05 Thread Rishabh Bhardwaj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227639#comment-15227639 ] Rishabh Bhardwaj commented on SPARK-14412: -- I can take this up if no one has started on it. >

[jira] [Resolved] (SPARK-13211) StreamingContext throws NoSuchElementException when created from non-existent checkpoint directory

2016-04-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-13211. -- Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.0.0 > StreamingContext

[jira] [Updated] (SPARK-12469) Consistent Accumulators for Spark

2016-04-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12469: Target Version/s: 2.0.0 > Consistent Accumulators for Spark > - >

[jira] [Created] (SPARK-14422) Improve handling of optional configs in SQLConf

2016-04-05 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-14422: -- Summary: Improve handling of optional configs in SQLConf Key: SPARK-14422 URL: https://issues.apache.org/jira/browse/SPARK-14422 Project: Spark Issue

[jira] [Resolved] (SPARK-14359) Improve user experience for typed aggregate functions in Java

2016-04-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14359. - Resolution: Fixed Assignee: Eric Liang Fix Version/s: 2.0.0 > Improve user

[jira] [Comment Edited] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-04-05 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227551#comment-15227551 ] Takeshi Yamamuro edited comment on SPARK-14393 at 4/6/16 2:08 AM: -- Seems

[jira] [Assigned] (SPARK-14400) ScriptTransformation does not fail the job for bad user command

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14400: Assignee: Apache Spark > ScriptTransformation does not fail the job for bad user command

[jira] [Assigned] (SPARK-14400) ScriptTransformation does not fail the job for bad user command

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14400: Assignee: (was: Apache Spark) > ScriptTransformation does not fail the job for bad

[jira] [Commented] (SPARK-14400) ScriptTransformation does not fail the job for bad user command

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227566#comment-15227566 ] Apache Spark commented on SPARK-14400: -- User 'tejasapatil' has created a pull request for this

[jira] [Commented] (SPARK-14252) Executors do not try to download remote cached blocks

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227562#comment-15227562 ] Apache Spark commented on SPARK-14252: -- User 'ericl' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14252) Executors do not try to download remote cached blocks

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14252: Assignee: Apache Spark > Executors do not try to download remote cached blocks >

[jira] [Assigned] (SPARK-14252) Executors do not try to download remote cached blocks

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14252: Assignee: (was: Apache Spark) > Executors do not try to download remote cached blocks

[jira] [Commented] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-04-05 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227551#comment-15227551 ] Takeshi Yamamuro commented on SPARK-14393: -- Seems different `MonotonicallyIncreasingID`

[jira] [Assigned] (SPARK-14402) initcap UDF doesn't match Hive/Oracle behavior in lowercasing rest of string

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14402: Assignee: Apache Spark (was: Dongjoon Hyun) > initcap UDF doesn't match Hive/Oracle

[jira] [Assigned] (SPARK-14402) initcap UDF doesn't match Hive/Oracle behavior in lowercasing rest of string

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14402: Assignee: Dongjoon Hyun (was: Apache Spark) > initcap UDF doesn't match Hive/Oracle

[jira] [Reopened] (SPARK-14402) initcap UDF doesn't match Hive/Oracle behavior in lowercasing rest of string

2016-04-05 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski reopened SPARK-14402: - There's a compilation error after the change. > initcap UDF doesn't match Hive/Oracle

[jira] [Commented] (SPARK-14402) initcap UDF doesn't match Hive/Oracle behavior in lowercasing rest of string

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227536#comment-15227536 ] Apache Spark commented on SPARK-14402: -- User 'jaceklaskowski' has created a pull request for this

[jira] [Created] (SPARK-14421) Kinesis deaggregation with PySpark

2016-04-05 Thread Brian ONeill (JIRA)
Brian ONeill created SPARK-14421: Summary: Kinesis deaggregation with PySpark Key: SPARK-14421 URL: https://issues.apache.org/jira/browse/SPARK-14421 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-04-05 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227498#comment-15227498 ] Sun Rui commented on SPARK-12922: - cool:) > Implement gapply() on DataFrame in SparkR >

[jira] [Commented] (SPARK-14252) Executors do not try to download remote cached blocks

2016-04-05 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227493#comment-15227493 ] Eric Liang commented on SPARK-14252: I'm going to take a look at fixing this > Executors do not try

[jira] [Commented] (SPARK-13687) Cleanup pyspark temporary files

2016-04-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227482#comment-15227482 ] holdenk commented on SPARK-13687: - I'll take this one :) > Cleanup pyspark temporary files >

[jira] [Created] (SPARK-14420) keepLastCheckpoint Param for Python LDA with EM

2016-04-05 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14420: - Summary: keepLastCheckpoint Param for Python LDA with EM Key: SPARK-14420 URL: https://issues.apache.org/jira/browse/SPARK-14420 Project: Spark

[jira] [Assigned] (SPARK-14398) Audit non-reserved keyword list in ANTLR4 parser.

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14398: Assignee: Apache Spark > Audit non-reserved keyword list in ANTLR4 parser. >

[jira] [Commented] (SPARK-14398) Audit non-reserved keyword list in ANTLR4 parser.

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227451#comment-15227451 ] Apache Spark commented on SPARK-14398: -- User 'bomeng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14398) Audit non-reserved keyword list in ANTLR4 parser.

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14398: Assignee: (was: Apache Spark) > Audit non-reserved keyword list in ANTLR4 parser. >

[jira] [Updated] (SPARK-4591) Algorithm/model parity in spark.ml (Scala)

2016-04-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Description: This is an umbrella JIRA for porting spark.mllib implementations to use the

[jira] [Updated] (SPARK-4591) Algorithm/model parity in spark.ml (Scala)

2016-04-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Description: This is an umbrella JIRA for porting spark.mllib implementations to use the

[jira] [Commented] (SPARK-4591) Algorithm/model parity in spark.ml (Scala)

2016-04-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227445#comment-15227445 ] Joseph K. Bradley commented on SPARK-4591: -- We will; eventually, we should support everything. I

[jira] [Commented] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-04-05 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227435#comment-15227435 ] Takeshi Yamamuro commented on SPARK-13184: -- Seems we can handle this by

[jira] [Created] (SPARK-14419) Improve the HashedRelation for key fit within Long

2016-04-05 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14419: -- Summary: Improve the HashedRelation for key fit within Long Key: SPARK-14419 URL: https://issues.apache.org/jira/browse/SPARK-14419 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13842) Consider __iter__ and __getitem__ methods for pyspark.sql.types.StructType

2016-04-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227392#comment-15227392 ] holdenk commented on SPARK-13842: - cc [~davies] any thoughts? > Consider __iter__ and __getitem__

[jira] [Closed] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2016-04-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas closed SPARK-3821. --- Resolution: Won't Fix I'm resolving this as "Won't Fix" due to lack of interest, both on my

[jira] [Commented] (SPARK-14418) Broadcast.unpersist() in PySpark is not consistent with that in Scala

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227382#comment-15227382 ] Apache Spark commented on SPARK-14418: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14418) Broadcast.unpersist() in PySpark is not consistent with that in Scala

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14418: Assignee: Apache Spark (was: Davies Liu) > Broadcast.unpersist() in PySpark is not

[jira] [Assigned] (SPARK-14418) Broadcast.unpersist() in PySpark is not consistent with that in Scala

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14418: Assignee: Davies Liu (was: Apache Spark) > Broadcast.unpersist() in PySpark is not

[jira] [Created] (SPARK-14418) Broadcast.unpersist() in PySpark is not consistent with that in Scala

2016-04-05 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14418: -- Summary: Broadcast.unpersist() in PySpark is not consistent with that in Scala Key: SPARK-14418 URL: https://issues.apache.org/jira/browse/SPARK-14418 Project: Spark

[jira] [Commented] (SPARK-14392) CountVectorizer Estimator should include binary toggle Param

2016-04-05 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227371#comment-15227371 ] Miao Wang commented on SPARK-14392: --- I moved the code from CountVectorizerModel to

[jira] [Created] (SPARK-14417) Cleanup Scala deprecation warnings once we drop 2.10.X

2016-04-05 Thread holdenk (JIRA)
holdenk created SPARK-14417: --- Summary: Cleanup Scala deprecation warnings once we drop 2.10.X Key: SPARK-14417 URL: https://issues.apache.org/jira/browse/SPARK-14417 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-14416) Add thread-safe comments for CoarseGrainedSchedulerBackend's fields

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14416: Assignee: Apache Spark > Add thread-safe comments for CoarseGrainedSchedulerBackend's

[jira] [Commented] (SPARK-14416) Add thread-safe comments for CoarseGrainedSchedulerBackend's fields

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227354#comment-15227354 ] Apache Spark commented on SPARK-14416: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14416) Add thread-safe comments for CoarseGrainedSchedulerBackend's fields

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14416: Assignee: (was: Apache Spark) > Add thread-safe comments for

[jira] [Created] (SPARK-14416) Add thread-safe comments for CoarseGrainedSchedulerBackend's fields

2016-04-05 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-14416: Summary: Add thread-safe comments for CoarseGrainedSchedulerBackend's fields Key: SPARK-14416 URL: https://issues.apache.org/jira/browse/SPARK-14416 Project: Spark

[jira] [Commented] (SPARK-14128) [Table related commands] For a table related commands, it should be able to distinguish data source tables and hive tables

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227284#comment-15227284 ] Apache Spark commented on SPARK-14128: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14415) Add ExpressionDescription annotation for SQL expressions

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14415: Assignee: (was: Apache Spark) > Add ExpressionDescription annotation for SQL

[jira] [Commented] (SPARK-14415) Add ExpressionDescription annotation for SQL expressions

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227269#comment-15227269 ] Apache Spark commented on SPARK-14415: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-14415) Add ExpressionDescription annotation for SQL expressions

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14415: Assignee: Apache Spark > Add ExpressionDescription annotation for SQL expressions >

[jira] [Created] (SPARK-14415) Add ExpressionDescription annotation for SQL expressions

2016-04-05 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-14415: - Summary: Add ExpressionDescription annotation for SQL expressions Key: SPARK-14415 URL: https://issues.apache.org/jira/browse/SPARK-14415 Project: Spark

[jira] [Commented] (SPARK-14057) sql time stamps do not respect time zones

2016-04-05 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227251#comment-15227251 ] Vijay Parmar commented on SPARK-14057: -- Hi Andrew, Any updates? Sorry the github link is working.

[jira] [Resolved] (SPARK-529) Have a single file that controls the environmental variables and spark config options

2016-04-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-529. -- Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.0.0 The new

[jira] [Resolved] (SPARK-14411) Add a note to warn that onQueryProgress is asynchronous

2016-04-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-14411. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12180

[jira] [Created] (SPARK-14414) Make error messages consistent across DDLs

2016-04-05 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14414: - Summary: Make error messages consistent across DDLs Key: SPARK-14414 URL: https://issues.apache.org/jira/browse/SPARK-14414 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-14372) Dataset.randomSplit() needs a Java version

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14372: Assignee: (was: Apache Spark) > Dataset.randomSplit() needs a Java version >

[jira] [Commented] (SPARK-14372) Dataset.randomSplit() needs a Java version

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227232#comment-15227232 ] Apache Spark commented on SPARK-14372: -- User 'rekhajoshm' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14372) Dataset.randomSplit() needs a Java version

2016-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14372: Assignee: Apache Spark > Dataset.randomSplit() needs a Java version >

[jira] [Comment Edited] (SPARK-14392) CountVectorizer Estimator should include binary toggle Param

2016-04-05 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227209#comment-15227209 ] Miao Wang edited comment on SPARK-14392 at 4/5/16 10:05 PM: [~mlnick]Do you

[jira] [Commented] (SPARK-14392) CountVectorizer Estimator should include binary toggle Param

2016-04-05 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15227209#comment-15227209 ] Miao Wang commented on SPARK-14392: --- [~mlnick]Do you mean moving the val binary from class

  1   2   3   >