[jira] [Commented] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-10-02 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157679#comment-14157679 ] Masayoshi TSUZUKI commented on SPARK-2630: -- The issue I reported here looks to be

[jira] [Resolved] (SPARK-3654) Implement all extended HiveQL statements/commands with a separate parser combinator

2014-10-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3654. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2590 [https:/

[jira] [Commented] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157617#comment-14157617 ] Josh Rosen commented on SPARK-3772: --- Actually, I'm surprised that nobody reported this i

[jira] [Commented] (SPARK-3755) Do not bind port 1 - 1024 to server in spark

2014-10-02 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157607#comment-14157607 ] wangfei commented on SPARK-3755: Hi, Andrew Or, I cannot change the title now > Do not b

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157605#comment-14157605 ] Guoqiang Li commented on SPARK-1405: This should be the checkpoint without work. You

[jira] [Commented] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157604#comment-14157604 ] Josh Rosen commented on SPARK-3772: --- The reason that we never hit this before is that se

[jira] [Commented] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157602#comment-14157602 ] Josh Rosen commented on SPARK-3772: --- Ah, I see the problem: PythonWorkerFactory also pa

[jira] [Commented] (SPARK-3774) typo comment in bin/utils.sh

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157593#comment-14157593 ] Apache Spark commented on SPARK-3774: - User 'tsudukim' has created a pull request for

[jira] [Created] (SPARK-3774) typo comment in bin/utils.sh

2014-10-02 Thread Masayoshi TSUZUKI (JIRA)
Masayoshi TSUZUKI created SPARK-3774: Summary: typo comment in bin/utils.sh Key: SPARK-3774 URL: https://issues.apache.org/jira/browse/SPARK-3774 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-3771) AppendingParquetOutputFormat should use reflection to prevent from breaking binary-compatibility.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin updated SPARK-3771: - Summary: AppendingParquetOutputFormat should use reflection to prevent from breaking binary-compat

[jira] [Commented] (SPARK-3773) Sphinx build warnings

2014-10-02 Thread cocoatomo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157590#comment-14157590 ] cocoatomo commented on SPARK-3773: -- Using Sphinx to generate API docs for PySpark > Sphi

[jira] [Updated] (SPARK-3773) Sphinx build warnings

2014-10-02 Thread cocoatomo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cocoatomo updated SPARK-3773: - Description: When building Sphinx documents for PySpark, we have 12 warnings. Their causes are almost docs

[jira] [Commented] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread cocoatomo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157586#comment-14157586 ] cocoatomo commented on SPARK-3772: -- Thank you for the advice. I added the commit hash on

[jira] [Updated] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread cocoatomo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] cocoatomo updated SPARK-3772: - Description: To reproduce this issue, we should execute following commands on the commit: 6e27cb630de69fa

[jira] [Commented] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157548#comment-14157548 ] Josh Rosen commented on SPARK-3772: --- Can you post the SHA of the commit that you were us

[jira] [Closed] (SPARK-3759) SparkSubmitDriverBootstrapper should return exit code of driver process

2014-10-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3759. Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 Assignee: Eric Ei

[jira] [Created] (SPARK-3773) Sphinx build warnings

2014-10-02 Thread cocoatomo (JIRA)
cocoatomo created SPARK-3773: Summary: Sphinx build warnings Key: SPARK-3773 URL: https://issues.apache.org/jira/browse/SPARK-3773 Project: Spark Issue Type: Bug Components: PySpark

[jira] [Closed] (SPARK-3755) Do not bind port 1 - 1024 to server in spark

2014-10-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3755. Resolution: Fixed Fix Version/s: 1.1.1 > Do not bind port 1 - 1024 to server in spark > -

[jira] [Updated] (SPARK-2066) Better error message for non-aggregated attributes with aggregates

2014-10-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2066: Priority: Critical (was: Major) > Better error message for non-aggregated attributes with a

[jira] [Updated] (SPARK-3572) Support register UserType in SQL

2014-10-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3572: - Assignee: Joseph K. Bradley > Support register UserType in SQL >

[jira] [Created] (SPARK-3772) RDD operation on IPython REPL failed with an illegal port number

2014-10-02 Thread cocoatomo (JIRA)
cocoatomo created SPARK-3772: Summary: RDD operation on IPython REPL failed with an illegal port number Key: SPARK-3772 URL: https://issues.apache.org/jira/browse/SPARK-3772 Project: Spark Issue

[jira] [Closed] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin closed SPARK-3764. Resolution: Fixed > Invalid dependencies of artifacts in Maven Central Repository. > ---

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157447#comment-14157447 ] Takuya Ueshin commented on SPARK-3764: -- I filed a new issue SPARK-3771 and close this

[jira] [Commented] (SPARK-3771) AppendingParquetOutputFormat should use reflection to prevent breaking binary-compatibility.

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157441#comment-14157441 ] Apache Spark commented on SPARK-3771: - User 'ueshin' has created a pull request for th

[jira] [Created] (SPARK-3771) AppendingParquetOutputFormat should use reflection to prevent breaking binary-compatibility.

2014-10-02 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-3771: Summary: AppendingParquetOutputFormat should use reflection to prevent breaking binary-compatibility. Key: SPARK-3771 URL: https://issues.apache.org/jira/browse/SPARK-3771

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157418#comment-14157418 ] Takuya Ueshin commented on SPARK-3764: -- {{AppendingParquetOutputFormat}} is using {{T

[jira] [Assigned] (SPARK-1671) Cached tables should follow write-through policy

2014-10-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-1671: --- Assignee: Michael Armbrust > Cached tables should follow write-through policy > -

[jira] [Resolved] (SPARK-3769) SparkFiles.get gives me the wrong fully qualified path

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3769. --- Resolution: Not a Problem > SparkFiles.get gives me the wrong fully qualified path > -

[jira] [Commented] (SPARK-3769) SparkFiles.get gives me the wrong fully qualified path

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157389#comment-14157389 ] Josh Rosen commented on SPARK-3769: --- I think that {{SparkFiles.get()}} can be called fro

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-10-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157351#comment-14157351 ] Marcelo Vanzin commented on SPARK-3633: --- Hey [~pwendell] [~matei], is anyone activel

[jira] [Commented] (SPARK-3769) SparkFiles.get gives me the wrong fully qualified path

2014-10-02 Thread Tom Weber (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157263#comment-14157263 ] Tom Weber commented on SPARK-3769: -- Thanks for the quick turnaround! I can see that it w

[jira] [Commented] (SPARK-3770) The userFeatures RDD from MatrixFactorizationModel isn't accessible from the python bindings

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157242#comment-14157242 ] Apache Spark commented on SPARK-3770: - User 'mdagost' has created a pull request for t

[jira] [Resolved] (SPARK-1284) pyspark hangs after IOError on Executor

2014-10-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-1284. --- Resolution: Fixed Fix Version/s: 1.1.0 I think this is a logging issue, should be fixed by ht

[jira] [Created] (SPARK-3770) The userFeatures RDD from MatrixFactorizationModel isn't accessible from the python bindings

2014-10-02 Thread Michelangelo D'Agostino (JIRA)
Michelangelo D'Agostino created SPARK-3770: -- Summary: The userFeatures RDD from MatrixFactorizationModel isn't accessible from the python bindings Key: SPARK-3770 URL: https://issues.apache.org/jira/brows

[jira] [Resolved] (SPARK-3632) ConnectionManager can run out of receive threads with authentication on

2014-10-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3632. Resolution: Fixed Fix Version/s: 1.2.0 > ConnectionManager can run out of receive threads wit

[jira] [Updated] (SPARK-3496) Block replication can by mistake choose driver BlockManager as a peer for replication

2014-10-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-3496: --- Target Version/s: 1.2.0 (was: 1.1.1, 1.2.0) > Block replication can by mistake choose driver BlockMan

[jira] [Resolved] (SPARK-3496) Block replication can by mistake choose driver BlockManager as a peer for replication

2014-10-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3496. Resolution: Fixed Fix Version/s: 1.2.0 > Block replication can by mistake choose driver Block

[jira] [Resolved] (SPARK-3495) Block replication fails continuously when the replication target node is dead

2014-10-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3495. Resolution: Fixed Fix Version/s: 1.2.0 > Block replication fails continuously when the replic

[jira] [Updated] (SPARK-3495) Block replication fails continuously when the replication target node is dead

2014-10-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-3495: --- Target Version/s: 1.2.0 (was: 1.1.1, 1.2.0) > Block replication fails continuously when the replicati

[jira] [Commented] (SPARK-3769) SparkFiles.get gives me the wrong fully qualified path

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157156#comment-14157156 ] Sean Owen commented on SPARK-3769: -- My understanding is that you execute: {code} sc.addF

[jira] [Resolved] (SPARK-3766) Snappy is also the default compression codec for broadcast variables

2014-10-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3766. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: wangfei > Snappy is also the defaul

[jira] [Created] (SPARK-3769) SparkFiles.get gives me the wrong fully qualified path

2014-10-02 Thread Tom Weber (JIRA)
Tom Weber created SPARK-3769: Summary: SparkFiles.get gives me the wrong fully qualified path Key: SPARK-3769 URL: https://issues.apache.org/jira/browse/SPARK-3769 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157114#comment-14157114 ] Apache Spark commented on SPARK-3219: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-3424) KMeans Plus Plus is too slow

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157116#comment-14157116 ] Apache Spark commented on SPARK-3424: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-3261) KMeans clusterer can return duplicate cluster centers

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157115#comment-14157115 ] Apache Spark commented on SPARK-3261: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-3218) K-Means clusterer can fail on degenerate data

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157113#comment-14157113 ] Apache Spark commented on SPARK-3218: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-1473) Feature selection for high dimensional datasets

2014-10-02 Thread David Martinez Rego (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157047#comment-14157047 ] David Martinez Rego commented on SPARK-1473: Sorry for having my name incomple

[jira] [Resolved] (SPARK-3768) Modify default YARN memory_overhead-- from an additive constant to a multiplier

2014-10-02 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-3768. -- Resolution: Fixed > Modify default YARN memory_overhead-- from an additive constant to a > mult

[jira] [Created] (SPARK-3768) Modify default YARN memory_overhead-- from an additive constant to a multiplier

2014-10-02 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-3768: Summary: Modify default YARN memory_overhead-- from an additive constant to a multiplier Key: SPARK-3768 URL: https://issues.apache.org/jira/browse/SPARK-3768 Project

[jira] [Commented] (SPARK-1270) An optimized gradient descent implementation

2014-10-02 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156964#comment-14156964 ] Peng Cheng commented on SPARK-1270: --- Yo, any follow up story on this one? I'm curious to

[jira] [Commented] (SPARK-3105) Calling cache() after RDDs are pipelined has no effect in PySpark

2014-10-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156936#comment-14156936 ] Nicholas Chammas commented on SPARK-3105: - I think it's definitely important for t

[jira] [Resolved] (SPARK-3706) Cannot run IPython REPL with IPYTHON set to "1" and PYSPARK_PYTHON unset

2014-10-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3706. --- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2554 [https://github.com/

[jira] [Commented] (SPARK-2870) Thorough schema inference directly on RDDs of Python dictionaries

2014-10-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156904#comment-14156904 ] Nicholas Chammas commented on SPARK-2870: - [~marmbrus] - A related feature that I

[jira] [Created] (SPARK-3767) Support wildcard in Spark properties

2014-10-02 Thread Andrew Or (JIRA)
Andrew Or created SPARK-3767: Summary: Support wildcard in Spark properties Key: SPARK-3767 URL: https://issues.apache.org/jira/browse/SPARK-3767 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-2447) Add common solution for sending upsert actions to HBase (put, deletes, and increment)

2014-10-02 Thread Ted Malaska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156865#comment-14156865 ] Ted Malaska commented on SPARK-2447: Hey Norman, Yes the github project has been used

[jira] [Commented] (SPARK-2447) Add common solution for sending upsert actions to HBase (put, deletes, and increment)

2014-10-02 Thread Norman He (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156840#comment-14156840 ] Norman He commented on SPARK-2447: -- HI Ted, I am very glad to see the hbase RDD work. I

[jira] [Commented] (SPARK-3706) Cannot run IPython REPL with IPYTHON set to "1" and PYSPARK_PYTHON unset

2014-10-02 Thread cocoatomo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156776#comment-14156776 ] cocoatomo commented on SPARK-3706: -- Thank you for the comment and modification, [~joshros

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156723#comment-14156723 ] Sean Owen commented on SPARK-3764: -- I'm not sure what you mean. Spark compiles versus mos

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-10-02 Thread Evan Sparks (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156699#comment-14156699 ] Evan Sparks commented on SPARK-1405: Hi Guoqiang - is it correct that your runtimes ar

[jira] [Commented] (SPARK-3766) Snappy is also the default compression codec for broadcast variables

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156680#comment-14156680 ] Apache Spark commented on SPARK-3766: - User 'scwf' has created a pull request for this

[jira] [Updated] (SPARK-3765) add testing with sbt to doc

2014-10-02 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-3765: --- Component/s: Documentation > add testing with sbt to doc > --- > > Key

[jira] [Updated] (SPARK-3766) Snappy is also the default compression codec for broadcast variables

2014-10-02 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei updated SPARK-3766: --- Component/s: Documentation > Snappy is also the default compression codec for broadcast variables > --

[jira] [Created] (SPARK-3766) Snappy is also the default compression codec for broadcast variables

2014-10-02 Thread wangfei (JIRA)
wangfei created SPARK-3766: -- Summary: Snappy is also the default compression codec for broadcast variables Key: SPARK-3766 URL: https://issues.apache.org/jira/browse/SPARK-3766 Project: Spark Issue

[jira] [Commented] (SPARK-3625) In some cases, the RDD.checkpoint does not work

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156663#comment-14156663 ] Apache Spark commented on SPARK-3625: - User 'witgo' has created a pull request for thi

[jira] [Commented] (SPARK-3623) Graph should support the checkpoint operation

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156664#comment-14156664 ] Apache Spark commented on SPARK-3623: - User 'witgo' has created a pull request for thi

[jira] [Updated] (SPARK-2811) update algebird to 0.8.1

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2811: --- Summary: update algebird to 0.8.1 (was: update algebird to 0.8) > update algebird to 0.8.1 >

[jira] [Updated] (SPARK-2811) update algebird to 0.8

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2811: --- Description: First algebird_2.11 0.8.1 has to be released (was: First algebird_2.11 0.7.0 has to be r

[jira] [Updated] (SPARK-2811) update algebird to 0.8

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2811: --- Summary: update algebird to 0.8 (was: update algebird to 0.7) > update algebird to 0.8 >

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156623#comment-14156623 ] Guoqiang Li commented on SPARK-1405: Hi everyone [The PR 2388|https://github.com/apach

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156599#comment-14156599 ] Takuya Ueshin commented on SPARK-3764: -- Ah, I see that {{context.getTaskAttemptID}} a

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156588#comment-14156588 ] Takuya Ueshin commented on SPARK-3764: -- But there are some codes using binary-incompa

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156581#comment-14156581 ] Sean Owen commented on SPARK-3764: -- The artifacts themselves don't contain any Hadoop cod

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156578#comment-14156578 ] Takuya Ueshin commented on SPARK-3764: -- Now I found the instruction [here|http://spa

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156576#comment-14156576 ] Takuya Ueshin commented on SPARK-3764: -- Ah, so, these artifacts are only for hadoop-2

[jira] [Commented] (SPARK-3765) add testing with sbt to doc

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156569#comment-14156569 ] Apache Spark commented on SPARK-3765: - User 'scwf' has created a pull request for this

[jira] [Created] (SPARK-3765) add testing with sbt to doc

2014-10-02 Thread wangfei (JIRA)
wangfei created SPARK-3765: -- Summary: add testing with sbt to doc Key: SPARK-3765 URL: https://issues.apache.org/jira/browse/SPARK-3765 Project: Spark Issue Type: Improvement Affects Versions: 1

[jira] [Comment Edited] (SPARK-1834) NoSuchMethodError when invoking JavaPairRDD.reduce() in Java

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156477#comment-14156477 ] Sean Owen edited comment on SPARK-1834 at 10/2/14 12:46 PM: We

[jira] [Commented] (SPARK-1834) NoSuchMethodError when invoking JavaPairRDD.reduce() in Java

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156477#comment-14156477 ] Sean Owen commented on SPARK-1834: -- Weird, I can reproduce this. I have a new test case f

[jira] [Commented] (SPARK-1834) NoSuchMethodError when invoking JavaPairRDD.reduce() in Java

2014-10-02 Thread Alexis Seigneurin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156464#comment-14156464 ] Alexis Seigneurin commented on SPARK-1834: -- Same issue here with Spark 1.1.0: red

[jira] [Commented] (SPARK-2809) update chill to version 0.5.0

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156440#comment-14156440 ] Guoqiang Li commented on SPARK-2809: The related work. https://github.com/apache/spark

[jira] [Commented] (SPARK-3761) Class anonfun$1 not found exception / sbt 13.x / Scala 2.10.4

2014-10-02 Thread Igor Tkachenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156373#comment-14156373 ] Igor Tkachenko commented on SPARK-3761: --- Created the same bug in Cloudera Jira: htt

[jira] [Commented] (SPARK-3761) Class anonfun$1 not found exception / sbt 13.x / Scala 2.10.4

2014-10-02 Thread Igor Tkachenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156365#comment-14156365 ] Igor Tkachenko commented on SPARK-3761: --- I've tried sbt 12.4, but unfortunately with

[jira] [Comment Edited] (SPARK-3761) Class anonfun$1 not found exception / sbt 13.x / Scala 2.10.4

2014-10-02 Thread Igor Tkachenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156365#comment-14156365 ] Igor Tkachenko edited comment on SPARK-3761 at 10/2/14 10:55 AM: ---

[jira] [Commented] (SPARK-2809) update chill to version 0.5.0

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156362#comment-14156362 ] Sean Owen commented on SPARK-2809: -- PS chill 0.5.0 is the first to support Scala 2.11, so

[jira] [Commented] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156311#comment-14156311 ] Sean Owen commented on SPARK-3764: -- This is correct and as intended. Without any addition

[jira] [Updated] (SPARK-2809) update chill to version 0.5.0

2014-10-02 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-2809: --- Summary: update chill to version 0.5.0 (was: update chill to version 0.4) > update chill to version 0

[jira] [Created] (SPARK-3764) Invalid dependencies of artifacts in Maven Central Repository.

2014-10-02 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-3764: Summary: Invalid dependencies of artifacts in Maven Central Repository. Key: SPARK-3764 URL: https://issues.apache.org/jira/browse/SPARK-3764 Project: Spark

[jira] [Commented] (SPARK-3763) The example of building with sbt should be "sbt assembly" instead of "sbt compile"

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156238#comment-14156238 ] Apache Spark commented on SPARK-3763: - User 'sarutak' has created a pull request for t

[jira] [Commented] (SPARK-3759) SparkSubmitDriverBootstrapper should return exit code of driver process

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156239#comment-14156239 ] Apache Spark commented on SPARK-3759: - User 'ericeijkelenboom' has created a pull requ

[jira] [Created] (SPARK-3763) The example of building with sbt should be "sbt assembly" instead of "sbt compile"

2014-10-02 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-3763: - Summary: The example of building with sbt should be "sbt assembly" instead of "sbt compile" Key: SPARK-3763 URL: https://issues.apache.org/jira/browse/SPARK-3763 Pr

[jira] [Commented] (SPARK-3687) Spark hang while processing more than 100 sequence files

2014-10-02 Thread Ziv Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156236#comment-14156236 ] Ziv Huang commented on SPARK-3687: -- The following is the jstack dump of one CoarseGrained

[jira] [Commented] (SPARK-3007) Add "Dynamic Partition" support to Spark Sql hive

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156225#comment-14156225 ] Apache Spark commented on SPARK-3007: - User 'liancheng' has created a pull request for

[jira] [Resolved] (SPARK-1767) Prefer HDFS-cached replicas when scheduling data-local tasks

2014-10-02 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1767. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Colin Patrick McCabe Fixed

[jira] [Commented] (SPARK-3731) RDD caching stops working in pyspark after some time

2014-10-02 Thread Milan Straka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156174#comment-14156174 ] Milan Straka commented on SPARK-3731: - I have attached reproducible program, input fil

[jira] [Commented] (SPARK-2461) Add a toString method to GeneralizedLinearModel

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156175#comment-14156175 ] Apache Spark commented on SPARK-2461: - User 'davies' has created a pull request for th

[jira] [Updated] (SPARK-3731) RDD caching stops working in pyspark after some time

2014-10-02 Thread Milan Straka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Milan Straka updated SPARK-3731: Attachment: spark-3731.log > RDD caching stops working in pyspark after some time >

[jira] [Updated] (SPARK-3731) RDD caching stops working in pyspark after some time

2014-10-02 Thread Milan Straka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Milan Straka updated SPARK-3731: Attachment: spark-3731.txt.bz2 > RDD caching stops working in pyspark after some time >

[jira] [Updated] (SPARK-3731) RDD caching stops working in pyspark after some time

2014-10-02 Thread Milan Straka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Milan Straka updated SPARK-3731: Attachment: spark-3731.py > RDD caching stops working in pyspark after some time > -

[jira] [Commented] (SPARK-3762) clear all SparkEnv references after stop

2014-10-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156159#comment-14156159 ] Apache Spark commented on SPARK-3762: - User 'davies' has created a pull request for th

[jira] [Commented] (SPARK-3759) SparkSubmitDriverBootstrapper should return exit code of driver process

2014-10-02 Thread Eric Eijkelenboom (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14156157#comment-14156157 ] Eric Eijkelenboom commented on SPARK-3759: -- Yes, no problem! > SparkSubmitDriver
