[jira] [Commented] (SPARK-13382) Update PySpark testing notes

2016-02-19 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154375#comment-15154375 ] holdenk commented on SPARK-13382: - I'll make a PR for the in repo fix but I don't have wiki write access

[jira] [Commented] (SPARK-12583) spark shuffle fails with mesos after 2mins

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154262#comment-15154262 ] Apache Spark commented on SPARK-12583: -- User 'bbossy' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-13394) What is "java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.security.SimpleUserGroupsMapping not found"

2016-02-19 Thread Minwoo Kang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154251#comment-15154251 ] Minwoo Kang edited comment on SPARK-13394 at 2/19/16 2:09 PM: -- I am so sorry

[jira] [Commented] (SPARK-13394) What is "java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.security.SimpleUserGroupsMapping not found"

2016-02-19 Thread Minwoo Kang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154251#comment-15154251 ] Minwoo Kang commented on SPARK-13394: - I am so sorry for not reading the URL before posting this issue.

[jira] [Resolved] (SPARK-13394) What is "java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.security.SimpleUserGroupsMapping not found"

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13394. --- Resolution: Invalid Target Version/s: (was: 1.6.0) Read

[jira] [Created] (SPARK-13394) What is "java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.security.SimpleUserGroupsMapping not found"

2016-02-19 Thread Minwoo Kang (JIRA)
Minwoo Kang created SPARK-13394: --- Summary: What is "java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.security.SimpleUserGroupsMapping not found" Key: SPARK-13394 URL:

[jira] [Created] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-02-19 Thread Varadharajan (JIRA)
Varadharajan created SPARK-13393: Summary: Column mismatch issue in left_outer join using Spark DataFrame Key: SPARK-13393 URL: https://issues.apache.org/jira/browse/SPARK-13393 Project: Spark

[jira] [Commented] (SPARK-13391) Use Apache Arrow as In-memory columnar store implementation

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154145#comment-15154145 ] Sean Owen commented on SPARK-13391: --- Cc [~nongli] [~wesmckinn] yes I'm pretty sure others have this in

[jira] [Commented] (SPARK-13392) KafkaSink for Metrics

2016-02-19 Thread UTKARSH BHATNAGAR (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154085#comment-15154085 ] UTKARSH BHATNAGAR commented on SPARK-13392: --- Please assign me this issue. I have already

[jira] [Created] (SPARK-13392) KafkaSink for Metrics

2016-02-19 Thread UTKARSH BHATNAGAR (JIRA)
UTKARSH BHATNAGAR created SPARK-13392: - Summary: KafkaSink for Metrics Key: SPARK-13392 URL: https://issues.apache.org/jira/browse/SPARK-13392 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-13391) Use Apache Arrow as In-memory columnar store implementation

2016-02-19 Thread JIRA
Maciej Bryński created SPARK-13391: -- Summary: Use Apache Arrow as In-memory columnar store implementation Key: SPARK-13391 URL: https://issues.apache.org/jira/browse/SPARK-13391 Project: Spark

[jira] [Resolved] (SPARK-13339) Clarify commutative / associative operator requirements for reduce, fold

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13339. --- Resolution: Fixed Fix Version/s: 2.0.0 Target Version/s: 2.0.0 (was: 1.6.1, 2.0.0)

[jira] [Commented] (SPARK-13241) add long--formatted timestamps to org.apache.spark.status.api.v1.ApplicationAttemptInfo

2016-02-19 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154052#comment-15154052 ] Steve Loughran commented on SPARK-13241: The REST API compatibility is described in

[jira] [Commented] (SPARK-13379) MLlib LogisticRegressionWithLBFGS set regularization incorrectly

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154047#comment-15154047 ] Sean Owen commented on SPARK-13379: --- (You can make these more descriptive -- it swaps L1 and L2

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154044#comment-15154044 ] Takeshi Yamamuro commented on SPARK-8000: - Oh, it is a simple and good approach to detect formats

[jira] [Assigned] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8000: --- Assignee: Apache Spark > SQLContext.read.load() should be able to auto-detect input data >

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154043#comment-15154043 ] Apache Spark commented on SPARK-8000: - User 'HyukjinKwon' has created a pull request for this issue:

[jira] [Assigned] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8000: --- Assignee: (was: Apache Spark) > SQLContext.read.load() should be able to auto-detect

[jira] [Assigned] (SPARK-13363) Aggregator not working with DataFrame

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13363: Assignee: (was: Apache Spark) > Aggregator not working with DataFrame >

[jira] [Commented] (SPARK-13363) Aggregator not working with DataFrame

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154034#comment-15154034 ] Apache Spark commented on SPARK-13363: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13363) Aggregator not working with DataFrame

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13363: Assignee: Apache Spark > Aggregator not working with DataFrame >

[jira] [Updated] (SPARK-13377) binaryFileRDD preferredLocations issue

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13377: -- Priority: Minor (was: Blocker) Issue Type: Improvement (was: Bug) [~zhaozhang] never set

[jira] [Resolved] (SPARK-13378) Add tee method to RDD

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13378. --- Resolution: Not A Problem No, we don't want to do this. You'd end up making hundreds of methods to

[jira] [Commented] (SPARK-13375) PySpark API Utils missing item: kFold

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15154008#comment-15154008 ] Apache Spark commented on SPARK-13375: -- User 'davies' has created a pull request for this issue:

[jira] [Updated] (SPARK-13387) Add support for SPARK_DAEMON_JAVA_OPTS with MesosClusterDispatcher.

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13387: -- Priority: Minor (was: Major) Component/s: Mesos [~tnachen] please set component/priority. >

[jira] [Updated] (SPARK-13386) ConnectedComponents should support maxIteration option

2016-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13386: -- Priority: Minor (was: Major) > ConnectedComponents should support maxIteration option >

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2016-02-19 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153988#comment-15153988 ] Emlyn Corrin commented on SPARK-8480: - This would be really useful. We have a fairly large Spark

[jira] [Created] (SPARK-13390) Java Spark createDataFrame with List parameter bug

2016-02-19 Thread mike niemaz (JIRA)
mike niemaz created SPARK-13390: --- Summary: Java Spark createDataFrame with List parameter bug Key: SPARK-13390 URL: https://issues.apache.org/jira/browse/SPARK-13390 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13386) ConnectedComponents should support maxIteration option

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153974#comment-15153974 ] Apache Spark commented on SPARK-13386: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-13386) ConnectedComponents should support maxIteration option

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13386: Assignee: (was: Apache Spark) > ConnectedComponents should support maxIteration

[jira] [Assigned] (SPARK-13386) ConnectedComponents should support maxIteration option

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13386: Assignee: Apache Spark > ConnectedComponents should support maxIteration option >

[jira] [Assigned] (SPARK-13389) SparkR support first/last with ignore NAs

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13389: Assignee: (was: Apache Spark) > SparkR support first/last with ignore NAs >

[jira] [Commented] (SPARK-13389) SparkR support first/last with ignore NAs

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153955#comment-15153955 ] Apache Spark commented on SPARK-13389: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13389) SparkR support first/last with ignore NAs

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13389: Assignee: Apache Spark > SparkR support first/last with ignore NAs >

[jira] [Commented] (SPARK-13388) PySpark Pipeline and PipelineModel should take advantages of its Scala companion

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153950#comment-15153950 ] Apache Spark commented on SPARK-13388: -- User 'yinxusen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13388) PySpark Pipeline and PipelineModel should take advantages of its Scala companion

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13388: Assignee: Apache Spark > PySpark Pipeline and PipelineModel should take advantages of its

[jira] [Assigned] (SPARK-13038) PySpark ml.pipeline support export/import

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13038: Assignee: (was: Apache Spark) > PySpark ml.pipeline support export/import >

[jira] [Assigned] (SPARK-13038) PySpark ml.pipeline support export/import

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13038: Assignee: Apache Spark > PySpark ml.pipeline support export/import >

[jira] [Commented] (SPARK-13038) PySpark ml.pipeline support export/import

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153949#comment-15153949 ] Apache Spark commented on SPARK-13038: -- User 'yinxusen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13388) PySpark Pipeline and PipelineModel should take advantages of its Scala companion

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13388: Assignee: (was: Apache Spark) > PySpark Pipeline and PipelineModel should take

[jira] [Created] (SPARK-13389) SparkR support first/last with ignore NAs

2016-02-19 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13389: --- Summary: SparkR support first/last with ignore NAs Key: SPARK-13389 URL: https://issues.apache.org/jira/browse/SPARK-13389 Project: Spark Issue Type:

[jira] [Created] (SPARK-13388) PySpark Pipeline and PipelineModel should take advantages of its Scala companion

2016-02-19 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-13388: - Summary: PySpark Pipeline and PipelineModel should take advantages of its Scala companion Key: SPARK-13388 URL: https://issues.apache.org/jira/browse/SPARK-13388 Project:

[jira] [Commented] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153933#comment-15153933 ] Apache Spark commented on SPARK-12363: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-12449) Pushing down arbitrary logical plans to data sources

2016-02-19 Thread Stephan Kessler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153931#comment-15153931 ] Stephan Kessler commented on SPARK-12449: - To keep the interfaces stable, the possibility of

[jira] [Commented] (SPARK-12449) Pushing down arbitrary logical plans to data sources

2016-02-19 Thread Stephan Kessler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153920#comment-15153920 ] Stephan Kessler commented on SPARK-12449: - In our github repo

[jira] [Commented] (SPARK-12343) Remove YARN Client / ClientArguments

2016-02-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153910#comment-15153910 ] Saisai Shao commented on SPARK-12343: - Hi guys, Do we still want to support users who directly

[jira] [Commented] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2016-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153903#comment-15153903 ] Apache Spark commented on SPARK-12363: -- User 'viirya' has created a pull request for this issue:

[jira] [Created] (SPARK-13387) Add support for SPARK_DAEMON_JAVA_OPTS with MesosClusterDispatcher.

2016-02-19 Thread Timothy Chen (JIRA)
Timothy Chen created SPARK-13387: Summary: Add support for SPARK_DAEMON_JAVA_OPTS with MesosClusterDispatcher. Key: SPARK-13387 URL: https://issues.apache.org/jira/browse/SPARK-13387 Project: Spark

[jira] [Created] (SPARK-13386) ConnectedComponents should support maxIteration option

2016-02-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13386: Summary: ConnectedComponents should support maxIteration option Key: SPARK-13386 URL: https://issues.apache.org/jira/browse/SPARK-13386 Project: Spark Issue
