[jira] [Assigned] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15616: Assignee: Apache Spark > Metastore relation should fallback to HDFS size of partitions

[jira] [Commented] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305191#comment-15305191 ] Apache Spark commented on SPARK-15616: -- User 'lianhuiwang' has created a pull request for this

[jira] [Assigned] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15616: Assignee: (was: Apache Spark) > Metastore relation should fallback to HDFS size of

[jira] [Assigned] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15585: Assignee: Apache Spark > Don't use null in data source options to indicate default value

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305185#comment-15305185 ] Apache Spark commented on SPARK-15585: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15585: Assignee: (was: Apache Spark) > Don't use null in data source options to indicate

[jira] [Assigned] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15639: Assignee: (was: Apache Spark) > Try to push down filter at RowGroups level for

[jira] [Commented] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305180#comment-15305180 ] Apache Spark commented on SPARK-15639: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15639: Assignee: Apache Spark > Try to push down filter at RowGroups level for parquet reader >

[jira] [Created] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-15639: --- Summary: Try to push down filter at RowGroups level for parquet reader Key: SPARK-15639 URL: https://issues.apache.org/jira/browse/SPARK-15639 Project: Spark

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305178#comment-15305178 ] Takeshi Yamamuro commented on SPARK-15585: -- okay, I'll push soon. > Don't use null in data

[jira] [Updated] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15638: Description: See the attached pull request for details. > Audit Dataset, SparkSession, and

[jira] [Assigned] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15638: Assignee: Reynold Xin (was: Apache Spark) > Audit Dataset, SparkSession, and SQLContext

[jira] [Commented] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305170#comment-15305170 ] Apache Spark commented on SPARK-15638: -- User 'rxin' has created a pull request for this issue:

[jira] [Created] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15638: --- Summary: Audit Dataset, SparkSession, and SQLContext functions and documentations Key: SPARK-15638 URL: https://issues.apache.org/jira/browse/SPARK-15638 Project:

[jira] [Assigned] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15611: Assignee: Apache Spark > Got the same sequence random number in every forked worker. >

[jira] [Commented] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305169#comment-15305169 ] Apache Spark commented on SPARK-15611: -- User 'ThomasLau' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15611: Assignee: (was: Apache Spark) > Got the same sequence random number in every forked

[jira] [Updated] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Summary: Got the same sequence random number in every forked worker. (was: Each forked worker in

[jira] [Resolved] (SPARK-15553) Dataset.createTempView should use CreateViewCommand

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15553. - Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.0.0 >

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below:

[jira] [Resolved] (SPARK-15597) Add SparkSession.emptyDataset

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15597. - Resolution: Fixed Fix Version/s: 2.0.0 > Add SparkSession.emptyDataset >

[jira] [Updated] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13184: Target Version/s: 2.1.0 > Support minPartitions parameter for JSON and CSV datasources as options

[jira] [Resolved] (SPARK-15633) Make package name for Java tests consistent

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15633. - Resolution: Fixed Fix Version/s: 2.0.0 > Make package name for Java tests consistent >

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below:

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305166#comment-15305166 ] Reynold Xin commented on SPARK-15585: - Feel free to create a pr with python changes and then we can

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below: {code:python} from random import random from

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below: {quote} from random import random from operator

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Summary: Each forked worker in daemon.py keep the parent's random state (was: each forked worker

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305152#comment-15305152 ] Takeshi Yamamuro commented on SPARK-15585: -- okay > Don't use null in data source options to

[jira] [Commented] (SPARK-15528) conv function returns inconsistent result for the same data

2016-05-27 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305150#comment-15305150 ] Takeshi Yamamuro commented on SPARK-15528: -- I tried this in master and I could reproduce;

[jira] [Commented] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar

2016-05-27 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305143#comment-15305143 ] Dilip Biswal commented on SPARK-15634: -- I would like to work on this issue. > SQL repl is bricked

[jira] [Resolved] (SPARK-15610) update error message for k in pca

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15610. --- Resolution: Fixed Assignee: zhengruifeng Fix Version/s: 2.0.0 Resolved by

[jira] [Commented] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305119#comment-15305119 ] Sean Owen commented on SPARK-12550: --- This is not from the Spark project. I mean, what docs _from the

[jira] [Commented] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305117#comment-15305117 ] Sean Owen commented on SPARK-15619: --- Interesting, looks like it's related to the lz4 library, and I see

[jira] [Updated] (SPARK-15562) Temp directory is not deleted after program exit in DataFrameExample

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15562: -- Assignee: ding > Temp directory is not deleted after program exit in DataFrameExample >

[jira] [Resolved] (SPARK-15562) Temp directory is not deleted after program exit in DataFrameExample

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15562. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13328

[jira] [Updated] (SPARK-15449) MLlib NaiveBayes example in Java uses wrong data format

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15449: -- Assignee: Miao Wang > MLlib NaiveBayes example in Java uses wrong data format >

[jira] [Resolved] (SPARK-15449) MLlib NaiveBayes example in Java uses wrong data format

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15449. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13301

[jira] [Reopened] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-15607: --- > Remove redundant toArray in ml.linalg > - > > Key:

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15610: -- Priority: Trivial (was: Minor) Component/s: Documentation > update error message for k in pca

[jira] [Resolved] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15607. --- Resolution: Not A Problem > Remove redundant toArray in ml.linalg >

[jira] [Updated] (SPARK-15549) Disable bucketing when the output doesn't contain all bucketing columns

2016-05-27 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yadong Qi updated SPARK-15549: -- Summary: Disable bucketing when the output doesn't contain all bucketing columns (was: Bucket column

[jira] [Comment Edited] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-27 Thread Greg Silverman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305095#comment-15305095 ] Greg Silverman edited comment on SPARK-12550 at 5/28/16 1:45 AM: - I am

[jira] [Commented] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-27 Thread Greg Silverman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305095#comment-15305095 ] Greg Silverman commented on SPARK-12550: I am having the same exact issue on Debian 7.10 wheezy.

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305089#comment-15305089 ] zhengruifeng commented on SPARK-15617: -- I can work on this > Clarify that fMeasure in

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305086#comment-15305086 ] zhengruifeng commented on SPARK-15617: --

[jira] [Commented] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305077#comment-15305077 ] Apache Spark commented on SPARK-15637: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15637: Assignee: (was: Apache Spark) > SparkR tests failing on R 3.2.2 >

[jira] [Assigned] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15637: Assignee: Apache Spark > SparkR tests failing on R 3.2.2 >

[jira] [Created] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-15637: Summary: SparkR tests failing on R 3.2.2 Key: SPARK-15637 URL: https://issues.apache.org/jira/browse/SPARK-15637 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15557: - Target Version/s: 2.0.0 Description: expression "select (cast(99 as decimal(19,6))+ '3')*'2.3'

[jira] [Resolved] (SPARK-15594) ALTER TABLE ... SERDEPROPERTIES does not respect partition spec

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15594. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13343

[jira] [Updated] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-14343: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-15631 > Dataframe operations on a partitioned

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Description: error message for {{k}} should match the bound (was: Vector size must be greater

[jira] [Updated] (SPARK-15610) PCA should not support k == numFeatures

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Priority: Minor (was: Major) > PCA should not support k == numFeatures >

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Summary: update error message for k in pca (was: PCA should not support k == numFeatures) >

[jira] [Updated] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15632: --- Description: Filter operations should never change query plan schema. However, Dataset typed filter

[jira] [Updated] (SPARK-9876) Upgrade parquet-mr to 1.8.1

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9876: -- Assignee: Ryan Blue > Upgrade parquet-mr to 1.8.1 > --- > >

[jira] [Resolved] (SPARK-9876) Upgrade parquet-mr to 1.8.1

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-9876. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13280

[jira] [Closed] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-15607. Resolution: Won't Fix > Remove redundant toArray in ml.linalg >

[jira] [Closed] (SPARK-15291) Remove redundant codes in SVD++

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-15291. Resolution: Won't Fix > Remove redundant codes in SVD++ > --- > >

[jira] [Assigned] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15557: Assignee: (was: Apache Spark) > expression ((cast(99 as decimal) + '3') * '2.3' )

[jira] [Commented] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304994#comment-15304994 ] Apache Spark commented on SPARK-15557: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15557: Assignee: Apache Spark > expression ((cast(99 as decimal) + '3') * '2.3' ) return null >

[jira] [Updated] (SPARK-9576) DataFrame API improvement umbrella ticket (Spark 2.0 and 2.1)

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9576: --- Target Version/s: 2.1.0 (was: 2.0.0) > DataFrame API improvement umbrella ticket (Spark 2.0 and 2.1)

[jira] [Updated] (SPARK-9576) DataFrame API improvement umbrella ticket (Spark 2.0 and 2.1)

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9576: --- Summary: DataFrame API improvement umbrella ticket (Spark 2.0 and 2.1) (was: DataFrame API

[jira] [Updated] (SPARK-15636) Make aggregate expressions more concise in explain

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15636: Description: Aggregate expressions have very long string representations in explain outputs. For

[jira] [Commented] (SPARK-15636) Make aggregate expressions more concise in explain

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304934#comment-15304934 ] Apache Spark commented on SPARK-15636: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15636) Make aggregate expressions more concise in explain

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15636: Assignee: Reynold Xin (was: Apache Spark) > Make aggregate expressions more concise in

[jira] [Assigned] (SPARK-15636) Make aggregate expressions more concise in explain

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15636: Assignee: Apache Spark (was: Reynold Xin) > Make aggregate expressions more concise in

[jira] [Created] (SPARK-15636) Make aggregate expressions more concise in explain

2016-05-27 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15636: --- Summary: Make aggregate expressions more concise in explain Key: SPARK-15636 URL: https://issues.apache.org/jira/browse/SPARK-15636 Project: Spark Issue Type:

[jira] [Created] (SPARK-15635) ALTER TABLE RENAME doesn't work for datasource tables

2016-05-27 Thread Andrew Or (JIRA)
Andrew Or created SPARK-15635: - Summary: ALTER TABLE RENAME doesn't work for datasource tables Key: SPARK-15635 URL: https://issues.apache.org/jira/browse/SPARK-15635 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304911#comment-15304911 ] shane knapp edited comment on SPARK-15619 at 5/27/16 10:33 PM: --- next time

[jira] [Resolved] (SPARK-15450) Clean up SparkSession builder for python

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15450. --- Resolution: Fixed Fix Version/s: 2.0.0 > Clean up SparkSession builder for python >

[jira] [Resolved] (SPARK-15534) TRUNCATE TABLE should throw exceptions, not logError

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15534. --- Resolution: Fixed Fix Version/s: 2.0.0 > TRUNCATE TABLE should throw exceptions, not logError

[jira] [Resolved] (SPARK-15535) Remove code for TRUNCATE TABLE ... COLUMN

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15535. --- Resolution: Fixed Fix Version/s: 2.0.0 > Remove code for TRUNCATE TABLE ... COLUMN >

[jira] [Updated] (SPARK-15450) Clean up SparkSession builder for python

2016-05-27 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15450: -- Assignee: Eric Liang (was: Andrew Or) > Clean up SparkSession builder for python >

[jira] [Commented] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304911#comment-15304911 ] shane knapp commented on SPARK-15619: - next time we have a maintenance, i will wipe /tmp completely

[jira] [Commented] (SPARK-15622) Janino's classloader has an unexpected behavior when its parent classloader throws an ClassNotFoundException with a cause set

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304905#comment-15304905 ] Apache Spark commented on SPARK-15622: -- User 'yhuai' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15622) Janino's classloader has an unexpected behavior when its parent classloader throws an ClassNotFoundException with a cause set

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15622: Assignee: Apache Spark (was: Yin Huai) > Janino's classloader has an unexpected behavior

[jira] [Assigned] (SPARK-15622) Janino's classloader has an unexpected behavior when its parent classloader throws an ClassNotFoundException with a cause set

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15622: Assignee: Yin Huai (was: Apache Spark) > Janino's classloader has an unexpected behavior

[jira] [Updated] (SPARK-15489) Dataset kryo encoder won't load custom user settings

2016-05-27 Thread Amit Sela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Sela updated SPARK-15489: -- Description: When setting a custom "spark.kryo.registrator" (or any other configuration for that

[jira] [Updated] (SPARK-15489) Dataset kryo encoder won't load custom user settings

2016-05-27 Thread Amit Sela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Sela updated SPARK-15489: -- Description: When setting a custom "spark.kryo.registrator" (or any other configuration for that

[jira] [Comment Edited] (SPARK-15623) 2.0 python coverage ml.feature

2016-05-27 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304898#comment-15304898 ] Bryan Cutler edited comment on SPARK-15623 at 5/27/16 10:11 PM: I was

[jira] [Commented] (SPARK-15623) 2.0 python coverage ml.feature

2016-05-27 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304898#comment-15304898 ] Bryan Cutler commented on SPARK-15623: -- I was only able to quickly go though the user guide and api

[jira] [Commented] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-27 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304897#comment-15304897 ] Wenchen Fan commented on SPARK-15632: - good catch! we should not implement typed filter in this way,

[jira] [Updated] (SPARK-15489) Dataset kryo encoder won't load custom user settings

2016-05-27 Thread Amit Sela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amit Sela updated SPARK-15489: -- Summary: Dataset kryo encoder won't load custom user settings (was: Dataset kryo encoder fails on

[jira] [Commented] (SPARK-15489) Dataset kryo encoder fails on Collections$UnmodifiableCollection

2016-05-27 Thread Amit Sela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304893#comment-15304893 ] Amit Sela commented on SPARK-15489: --- The issue here is the fact that setting the SparkConf does not

[jira] [Assigned] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15618: Assignee: Apache Spark (was: Dongjoon Hyun) > Use SparkSession.builder.sparkContext(...)

[jira] [Commented] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304891#comment-15304891 ] Apache Spark commented on SPARK-15618: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15618: Assignee: Dongjoon Hyun (was: Apache Spark) > Use SparkSession.builder.sparkContext(...)

[jira] [Commented] (SPARK-15489) Dataset kryo encoder fails on Collections$UnmodifiableCollection

2016-05-27 Thread Amit Sela (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304886#comment-15304886 ] Amit Sela commented on SPARK-15489: --- Got it! So I wasn't using the custom registrator correctly, it

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-27 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304884#comment-15304884 ] Seth Hendrickson commented on SPARK-15581: -- [~BenFradet] See

[jira] [Commented] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304874#comment-15304874 ] Sean Owen commented on SPARK-15619: --- Although I think we've cleaned up this over time, and even seen a

[jira] [Comment Edited] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar

2016-05-27 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304872#comment-15304872 ] Eric Liang edited comment on SPARK-15634 at 5/27/16 9:57 PM: - Note that

[jira] [Commented] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar

2016-05-27 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304872#comment-15304872 ] Eric Liang commented on SPARK-15634: Note that adding jars in the repl also doesn't work currently,

[jira] [Commented] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304873#comment-15304873 ] Cheng Lian commented on SPARK-15632: cc [~cloud_fan] [~marmbrus] > Dataset typed filter operation

[jira] [Updated] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15632: --- Description: Filter operations should never changes query plan schema. However, Dataset typed

[jira] [Created] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar

2016-05-27 Thread Eric Liang (JIRA)
Eric Liang created SPARK-15634: -- Summary: SQL repl is bricked if a function is registered with a non-existent jar Key: SPARK-15634 URL: https://issues.apache.org/jira/browse/SPARK-15634 Project: Spark

  1   2   3   >