[jira] [Commented] (SPARK-4430) Apache RAT Checks fail spuriously on test files

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290750#comment-14290750 ] Apache Spark commented on SPARK-4430: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-5393) Flood of util.RackResolver log messages after SPARK-1714

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290851#comment-14290851 ] Apache Spark commented on SPARK-5393: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-1714) Take advantage of AMRMClient APIs to simplify logic in YarnAllocationHandler

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290852#comment-14290852 ] Apache Spark commented on SPARK-1714: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-4964) Exactly-once semantics for Kafka

2015-01-24 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290747#comment-14290747 ] Cody Koeninger commented on SPARK-4964: --- Design doc at

[jira] [Commented] (SPARK-2285) Give various TaskEndReason subclass more descriptive names

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290785#comment-14290785 ] Apache Spark commented on SPARK-2285: - User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-4831) Current directory always on classpath with spark-submit

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4831. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Daniel Darabos Looks like this was

[jira] [Commented] (SPARK-4147) Reduce log4j dependency

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290752#comment-14290752 ] Apache Spark commented on SPARK-4147: - User 'srowen' has created a pull request for

[jira] [Updated] (SPARK-2280) Java Scala reference docs should describe function reference behavior.

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-2280: - Priority: Minor (was: Major) Assignee: Sean Owen I'd like to work on this, but, would a change to

[jira] [Resolved] (SPARK-1960) EOFException when file size 0 exists when use sc.sequenceFile[K,V](path)

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1960. -- Resolution: Not a Problem An empty {{SequenceFile}} will still contain some header info. For example

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway

2015-01-24 Thread Dale Richardson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290847#comment-14290847 ] Dale Richardson commented on SPARK-5388: Hi Andrew, I think the idea is well worth

[jira] [Resolved] (SPARK-4697) System properties should override environment variables

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4697. -- Resolution: Fixed Fix Version/s: 1.3.0 This looks like it was fixed in

[jira] [Resolved] (SPARK-1029) spark Window shell script errors regarding shell script location reference

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1029. -- Resolution: Fixed Fix Version/s: 1.0.0 Looks like this was fixed in

[jira] [Updated] (SPARK-4147) Reduce log4j dependency

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4147: - Summary: Reduce log4j dependency (was: Remove log4j dependency) Reduce log4j dependency

[jira] [Commented] (SPARK-4147) Remove log4j dependency

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290751#comment-14290751 ] Sean Owen commented on SPARK-4147: -- [~tgpfeiffer] Yeah that's a good change, since all

[jira] [Resolved] (SPARK-4491) Using sbt assembly with spark as dep requires Phd in sbt

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4491. -- Resolution: Won't Fix I don't see a reason here that indicates the Spark build should change its

[jira] [Created] (SPARK-5398) Support the eu-central-1 region for spark-ec2

2015-01-24 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5398: --- Summary: Support the eu-central-1 region for spark-ec2 Key: SPARK-5398 URL: https://issues.apache.org/jira/browse/SPARK-5398 Project: Spark Issue

[jira] [Created] (SPARK-5399) tree Losses strings should match loss names

2015-01-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5399: Summary: tree Losses strings should match loss names Key: SPARK-5399 URL: https://issues.apache.org/jira/browse/SPARK-5399 Project: Spark Issue

[jira] [Updated] (SPARK-3359) `sbt/sbt unidoc` doesn't work with Java 8

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3359: - Assignee: (was: Sean Owen) I spent more time on this tonight, mostly looking at the {{genjavadoc}}

[jira] [Commented] (SPARK-5400) Rename GaussianMixtureEM to GaussianMixture

2015-01-24 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290971#comment-14290971 ] Travis Galoppo commented on SPARK-5400: --- Hmm. This has me thinking in a different

[jira] [Commented] (SPARK-5401) Executor ID should be set before MetricsSystem is created

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14291007#comment-14291007 ] Apache Spark commented on SPARK-5401: - User 'ryan-williams' has created a pull request

[jira] [Commented] (SPARK-5402) Log executor ID at executor-construction time

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14291010#comment-14291010 ] Apache Spark commented on SPARK-5402: - User 'ryan-williams' has created a pull request

[jira] [Updated] (SPARK-5401) Executor ID should be set before MetricsSystem is created

2015-01-24 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-5401: - Description: MetricsSystem construction [attempts to namespace metrics from each executor using

[jira] [Created] (SPARK-5402) Log executor ID at executor-construction time

2015-01-24 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5402: Summary: Log executor ID at executor-construction time Key: SPARK-5402 URL: https://issues.apache.org/jira/browse/SPARK-5402 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-5235) Determine serializability of SQLContext

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5235. -- Resolution: Fixed Fix Version/s: 1.3.0 This was merged in

[jira] [Commented] (SPARK-4452) Shuffle data structures can starve others on the same thread for memory

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290874#comment-14290874 ] Sean Owen commented on SPARK-4452: -- Can this JIRA be resolved now that its children are

[jira] [Commented] (SPARK-4452) Shuffle data structures can starve others on the same thread for memory

2015-01-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290885#comment-14290885 ] Sandy Ryza commented on SPARK-4452: --- I think there's more to this one, the subtasks

[jira] [Commented] (SPARK-3359) `sbt/sbt unidoc` doesn't work with Java 8

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290934#comment-14290934 ] Apache Spark commented on SPARK-3359: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-01-24 Thread ding (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290961#comment-14290961 ] ding commented on SPARK-4105: - I hit this error when using pagerank(It cannot be consistent

[jira] [Commented] (SPARK-3489) support rdd.zip(rdd1, rdd2,...) with variable number of rdds as params

2015-01-24 Thread Mohit Jaggi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290967#comment-14290967 ] Mohit Jaggi commented on SPARK-3489: pull request does exist here:

[jira] [Resolved] (SPARK-4642) Documents about running-on-YARN needs update

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4642. -- Resolution: Fixed Fix Version/s: 1.2.1 1.1.2 1.3.0

[jira] [Resolved] (SPARK-5028) Add total received and processed records metrics to Streaming UI

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5028. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Saisai Shao Also one that was merged

[jira] [Comment Edited] (SPARK-5235) Determine serializability of SQLContext

2015-01-24 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290905#comment-14290905 ] Reynold Xin edited comment on SPARK-5235 at 1/25/15 1:00 AM: -

[jira] [Commented] (SPARK-3298) [SQL] registerAsTable / registerTempTable overwrites old tables

2015-01-24 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290962#comment-14290962 ] Imran Rashid commented on SPARK-3298: - If {{allowOverwrite}} defaulted to {{true}},

[jira] [Resolved] (SPARK-4934) Connection key is hard to read

2015-01-24 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen resolved SPARK-4934. -- Resolution: Not a Problem Connection key is hard to read --

[jira] [Resolved] (SPARK-5038) Add explicit return type for all implicit functions

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5038. -- Resolution: Fixed Looks like this was merged in

[jira] [Resolved] (SPARK-5074) Fix a non-deterministic test in org.apache.spark.scheduler.DAGSchedulerSuite

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5074. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Shixiong Zhu This was merged in

[jira] [Resolved] (SPARK-5131) A typo in configuration doc

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5131. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: uncleGen The PR was resolved for

[jira] [Reopened] (SPARK-5235) Determine serializability of SQLContext

2015-01-24 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-5235: Sean - this was not done. We merged a patch to make it serializable again, but for 1.3 we should

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290921#comment-14290921 ] Sean Owen commented on SPARK-5235: -- Sounds good, keep it open. This particular change was

[jira] [Created] (SPARK-5400) Rename GaussianMixtureEM to GaussianMixture

2015-01-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5400: Summary: Rename GaussianMixtureEM to GaussianMixture Key: SPARK-5400 URL: https://issues.apache.org/jira/browse/SPARK-5400 Project: Spark Issue

[jira] [Commented] (SPARK-3185) SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting JOURNAL_FOLDER

2015-01-24 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290923#comment-14290923 ] Florian Verhein commented on SPARK-3185: Sure [~grzegorz-dubicki]. You need to

[jira] [Commented] (SPARK-5400) Rename GaussianMixtureEM to GaussianMixture

2015-01-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290924#comment-14290924 ] Joseph K. Bradley commented on SPARK-5400: -- [~mengxr] [~tgaloppo] What do you

[jira] [Commented] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290744#comment-14290744 ] Apache Spark commented on SPARK-4267: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290742#comment-14290742 ] Sean Owen commented on SPARK-4267: -- The warning is from YARN, I believe, rather than

[jira] [Resolved] (SPARK-2105) SparkUI doesn't remove active stages that failed

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2105. -- Resolution: Fixed Fix Version/s: 1.1.0 It appears this is considered fixed by that commit, for

[jira] [Updated] (SPARK-5395) Large number of Python workers causing resource depletion

2015-01-24 Thread Sven Krasser (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sven Krasser updated SPARK-5395: Description: During job execution a large number of Python worker accumulates eventually causing

[jira] [Created] (SPARK-5395) Large number of Python workers causing resource depletion

2015-01-24 Thread Sven Krasser (JIRA)
Sven Krasser created SPARK-5395: --- Summary: Large number of Python workers causing resource depletion Key: SPARK-5395 URL: https://issues.apache.org/jira/browse/SPARK-5395 Project: Spark Issue

[jira] [Commented] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-24 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290523#comment-14290523 ] Muhammad-Ali A'rabi commented on SPARK-5226: That's right. For very huge data,

[jira] [Updated] (SPARK-2285) Give various TaskEndReason subclass more descriptive names

2015-01-24 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2285: --- Assignee: (was: Reynold Xin) Give various TaskEndReason subclass more descriptive names

[jira] [Updated] (SPARK-2285) Give various TaskEndReason subclass more descriptive names

2015-01-24 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2285: --- Component/s: Spark Core Give various TaskEndReason subclass more descriptive names

[jira] [Commented] (SPARK-2285) Give various TaskEndReason subclass more descriptive names

2015-01-24 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290502#comment-14290502 ] Reynold Xin commented on SPARK-2285: Hey Sean - I was thinking TaskSuccess,

[jira] [Created] (SPARK-5396) Syntax error in spark scripts on windows.

2015-01-24 Thread Vladimir Protsenko (JIRA)
Vladimir Protsenko created SPARK-5396: - Summary: Syntax error in spark scripts on windows. Key: SPARK-5396 URL: https://issues.apache.org/jira/browse/SPARK-5396 Project: Spark Issue

[jira] [Resolved] (SPARK-3471) Automatic resource manager for SparkContext in Scala?

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3471. -- Resolution: Not a Problem This is about adding some kind of try-with-resources equivalent for Scala?

[jira] [Updated] (SPARK-5383) support alias for udfs with multi output columns

2015-01-24 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-5383: --- Summary: support alias for udfs with multi output columns (was: Multi alias names support) support alias

[jira] [Updated] (SPARK-5383) support alias for udfs with multi output columns

2015-01-24 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-5383: --- Description: when a udf output multi columns, now we can not use alias for them in spark-sql, see this

[jira] [Resolved] (SPARK-3430) Introduce ValueIncrementableHashMapAccumulator to compute Histogram and other statistical metrics

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3430. -- Resolution: Won't Fix PR says this is WontFix Introduce ValueIncrementableHashMapAccumulator to

[jira] [Updated] (SPARK-5396) Syntax error in spark scripts on windows.

2015-01-24 Thread Vladimir Protsenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vladimir Protsenko updated SPARK-5396: -- Attachment: windows8.1.png windows7.png Syntax error in spark scripts

[jira] [Updated] (SPARK-5396) Syntax error in spark scripts on windows.

2015-01-24 Thread Vladimir Protsenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vladimir Protsenko updated SPARK-5396: -- Description: I made the following steps: 1. downloaded and installed Scala 2.11.5 2.

[jira] [Updated] (SPARK-3489) support rdd.zip(rdd1, rdd2,...) with variable number of rdds as params

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3489: - Priority: Minor (was: Major) Target Version/s: (was: 1.2.0) This should be a pull request

[jira] [Resolved] (SPARK-3621) Provide a way to broadcast an RDD (instead of just a variable made of the RDD) so that a job can access

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3621. -- Resolution: Not a Problem Given the discussion, this is best solved by reading data directly at the

[jira] [Resolved] (SPARK-3195) Can you add some statistics to do logistic regression better in mllib?

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3195. -- Resolution: Invalid Can you add some statistics to do logistic regression better in mllib?

[jira] [Resolved] (SPARK-2442) Add a Hadoop Writable serializer

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2442. -- Resolution: Duplicate Add a Hadoop Writable serializer

[jira] [Created] (SPARK-5397) Assigning aliases to several return values of an UDF

2015-01-24 Thread Max (JIRA)
Max created SPARK-5397: -- Summary: Assigning aliases to several return values of an UDF Key: SPARK-5397 URL: https://issues.apache.org/jira/browse/SPARK-5397 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3852) Document spark.driver.extra* configs

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290597#comment-14290597 ] Apache Spark commented on SPARK-3852: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-3859) Use consistent config names for duration (with units!)

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290600#comment-14290600 ] Sean Owen commented on SPARK-3859: -- I double-checked that all of the config properties

[jira] [Commented] (SPARK-3875) Add TEMP DIRECTORY configuration

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290602#comment-14290602 ] Sean Owen commented on SPARK-3875: -- You can already set {{java.io.tmpdir}}, without

[jira] [Commented] (SPARK-5383) support alias for udfs with multi output columns

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290630#comment-14290630 ] Apache Spark commented on SPARK-5383: - User 'scwf' has created a pull request for this

[jira] [Resolved] (SPARK-4283) Spark source code does not correctly import into eclipse

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4283. -- Resolution: Won't Fix I suggest resolving this as WontFix since the Maven build is correct and

[jira] [Commented] (SPARK-3439) Add Canopy Clustering Algorithm

2015-01-24 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290634#comment-14290634 ] Muhammad-Ali A'rabi commented on SPARK-3439: Possible implementation:

[jira] [Commented] (SPARK-3782) Direct use of log4j in AkkaUtils interferes with certain logging configurations

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290594#comment-14290594 ] Sean Owen commented on SPARK-3782: -- Aha, I think there's a good point here. Looks like

[jira] [Commented] (SPARK-3782) Direct use of log4j in AkkaUtils interferes with certain logging configurations

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290595#comment-14290595 ] Apache Spark commented on SPARK-3782: - User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-3148) Update global variables of HttpBroadcast so that multiple SparkContexts can coexist

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3148. -- Resolution: Won't Fix PR says this is WontFix Update global variables of HttpBroadcast so that

[jira] [Commented] (SPARK-2348) In Windows having a enviorinment variable named 'classpath' gives error

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290632#comment-14290632 ] Sean Owen commented on SPARK-2348: -- [~chiragtodarka] [~Xierqi] The resolution proposed

[jira] [Updated] (SPARK-5297) JavaStreamingContext.fileStream won't work because type info isn't propagated

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5297: - Summary: JavaStreamingContext.fileStream won't work because type info isn't propagated (was: File

[jira] [Comment Edited] (SPARK-3439) Add Canopy Clustering Algorithm

2015-01-24 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290634#comment-14290634 ] Muhammad-Ali A'rabi edited comment on SPARK-3439 at 1/24/15 2:41 PM:

[jira] [Commented] (SPARK-3754) Spark Streaming fileSystem API is not callable from Java

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290637#comment-14290637 ] Sean Owen commented on SPARK-3754: -- Is this the same as the issue reported in

[jira] [Resolved] (SPARK-4289) Creating an instance of Hadoop Job fails in the Spark shell when toString() is called on the instance.

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4289. -- Resolution: Not a Problem I suggest this is NotAProblem, at least not something I can see Spark can

[jira] [Commented] (SPARK-4368) Ceph integration?

2015-01-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290644#comment-14290644 ] Sean Owen commented on SPARK-4368: -- I don't think Spark does anything in particular to

[jira] [Commented] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290655#comment-14290655 ] Apache Spark commented on SPARK-5309: - User 'MickDavies' has created a pull request