[GitHub] spark pull request: [WIP][SPARK-1875]:NoClassDefFoundError: String...

2014-05-19 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/820 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-19 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43581291 @tdas CheckpointRDD is not properly cleaned.

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-19 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43581755 @mateiz Why must the checkpoint data be written to the file system?

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-19 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43583620 @mateiz It is not necessary to write it to the file system. After all, no other RDD is reading it. I think the checkpoint data should be put into blockManager, so

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-19 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43589674 [The code](https://github.com/witgo/spark/commit/6d7f2408a40bf4bb2889bf66fa61bced782cdefc#diff-2b593e0b4bd6eddab37f04968baa826c) will make the checkpoint directory larger

[GitHub] spark pull request: Convert spark.cleaner.ttl.* to lowercase

2014-05-20 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/811

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-20 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43608181 @mateiz @mengxr I added a new RDD operation, `cachePoint`.

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-20 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43656940 Another [solution](https://github.com/witgo/spark/compare/cachePoint).

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-21 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43790944 @mateiz, @mengxr I am using [the code](https://github.com/witgo/spark/compare/cachePoint) to test ALS. A brief description of the test: | Item

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-21 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-43840745 @tdas You're right. The code breaks the fault-tolerance properties of RDDs. The ideal solution is automatic cleanup and rebuilding of shuffle data.

[GitHub] spark pull request: Automatically cleanup checkpoint date

2014-05-22 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/855 Automatically cleanup checkpoint date You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark cleanup_checkpoint_date Alternatively you can

[GitHub] spark pull request: Automatically cleanup checkpoint date

2014-05-22 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/855#issuecomment-43969051 @tdas Optional? Default is off?

[GitHub] spark pull request: Automatically cleanup checkpoint date

2014-05-22 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/855#issuecomment-43971489 @mridulm @tdas The code has been updated. Automatic cleanup of checkpoint data is now optional.

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-25 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-44122991 I am using [the code](https://github.com/witgo/spark/compare/cleanup_checkpoint_date_als) to test ALS. A brief description of the test: | Item | Description

[GitHub] spark pull request: Fix scalastyle warnings in yarn alpha

2014-05-26 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/884 Fix scalastyle warnings in yarn alpha You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark scalastyle Alternatively you can review and

[GitHub] spark pull request: SPARK-1935: Explicitly add commons-codec 1.4 a...

2014-05-26 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/889#issuecomment-44235434 spark-hive => commons-codec 1.4 spark-sql => commons-codec 1.5 ``` [INFO]

[GitHub] spark pull request: [SPARK-1930] Container memory beyond limit, we...

2014-05-27 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/894 [SPARK-1930] Container memory beyond limit, were killed You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark SPARK-1930 Alternatively you

[GitHub] spark pull request: [WIP][SPARK-1930] The Container is running bey...

2014-05-28 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/894#discussion_r13131123 --- Diff: yarn/stable/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocationHandler.scala --- @@ -90,6 +90,12 @@ private[yarn] class YarnAllocationHandler

[GitHub] spark pull request: [WIP][SPARK-1930] The Container is running bey...

2014-05-28 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/894#issuecomment-44424182 I agree with @sryza. It is better for Spark to handle these automatically. Of course, we can allow users to specify a special value manually.

[GitHub] spark pull request: Pluggable Diskstore for BlockManager

2014-05-28 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/907#issuecomment-44487759 @colorant This is a big change. Can you explain the reason for it?

[GitHub] spark pull request: Improve maven plugin configuration

2014-05-28 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/786#issuecomment-44499013 @pwendell Do you have time to review this PR?

[GitHub] spark pull request: SPARK-1714. Take advantage of AMRMClient APIs ...

2014-05-30 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/655#discussion_r13227671 --- Diff: yarn/stable/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocationHandler.scala --- @@ -105,278 +96,222 @@ private[yarn] class

[GitHub] spark pull request: In some cases, yarn does not automatically res...

2014-05-30 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/921 In some cases, yarn does not automatically restart the container You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark allocateExecutors

[GitHub] spark pull request: In some cases, yarn does not automatically res...

2014-05-30 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/921#issuecomment-44719589 @sryza When `yarnAllocator.getNumExecutorsFailed` returns a value greater than zero, `yarnAllocator.getNumExecutorsRunning < args.numExecutors` is true fore

[GitHub] spark pull request: [WIP][SPARK-1930] The Container is running bey...

2014-05-31 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/894#discussion_r13259895 --- Diff: yarn/alpha/src/main/scala/org/apache/spark/deploy/yarn/ExecutorLauncher.scala --- @@ -92,21 +92,22 @@ class ExecutorLauncher(args

[GitHub] spark pull request: Improve ALS algorithm resource usage

2014-05-31 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/929 Improve ALS algorithm resource usage You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark improve_als Alternatively you can review and

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-31 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/828

[GitHub] spark pull request: [WIP]Improve ALS algorithm resource usage

2014-05-31 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/828#issuecomment-44742037 This solution is not perfect, so I am closing it temporarily. The new PR is #929.

[GitHub] spark pull request: [WIP][SPARK-1930] The Container is running bey...

2014-06-01 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/894#issuecomment-4410 @mridulm Is the following code in line with your thoughts? https://github.com/witgo/spark/compare/SPARK-1930_different

[GitHub] spark pull request: update breeze to version 0.8.1

2014-06-02 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/940 update breeze to version 0.8.1 `breeze 0.8.1` depends on `scala-logging-slf4j 2.1.1`. The relevant code is in #332. You can merge this pull request into a Git repository by running: $ git pull

[GitHub] spark pull request: [SPARK-1997] update breeze to version 0.8.1

2014-06-02 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/940#issuecomment-44911857 @markhamstra, `breeze 0.7` does not support `scala 2.11`.

[GitHub] spark pull request: SPARK-6846 [WEBUI] Stage kill URL easy to acci...

2015-04-27 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/5528#issuecomment-96947031 @srowen This PR seems to have a bug in yarn-client: ``` HTTP ERROR 405 Problem accessing /proxy/application_1429108701044_0316/stages/stage/kill

[GitHub] spark pull request: SPARK-6846 [WEBUI] Stage kill URL easy to acci...

2015-04-28 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/5528#issuecomment-97126406 The hadoop version of my test cluster is `2.3.0-cdh5.0.1`. I'm not sure; I'll test what you said tomorrow.

[GitHub] spark pull request: SPARK-6846 [WEBUI] Stage kill URL easy to acci...

2015-04-28 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/5528#issuecomment-97298740 `curl -d "id=3&terminate=true" "http://host:4040/stages/stage/kill/"` does not work. Is there a better way?

[GitHub] spark pull request: SPARK-6846 [WEBUI] Stage kill URL easy to acci...

2015-04-29 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/5528#issuecomment-97619733 No, I get a blank response, but `curl -d "id=3&terminate=true" "http://host:9082/proxy/application_1429108701044_0377/stages/stage/kill/"` gets a 405

[GitHub] spark pull request: [SPARK-2491] Don't handle uncaught exceptions ...

2015-04-30 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/1482#discussion_r29440041 --- Diff: core/src/main/scala/org/apache/spark/executor/Executor.scala --- @@ -280,13 +280,18 @@ private[spark] class Executor( m

[GitHub] spark pull request: [SPARK-6846] [WEBUI] [HOTFIX] return to GET fo...

2015-05-01 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/5837#issuecomment-98146620 The kill link works in yarn-client. LGTM.

[GitHub] spark pull request #14662: [WIP][SPARK-17082][CORE]Replace ByteBuffer with C...

2016-08-23 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/14662

[GitHub] spark pull request #14751: [WIP][SPARK-17184][[CORE]]Replace ByteBuf with In...

2016-08-23 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/14751

[GitHub] spark pull request #14664: [WIP][SPARK-6236][SPARK-6237][CORE]Support cachin...

2016-08-23 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/14664

[GitHub] spark pull request #14647: [WIP][Test only][DEMO][SPARK-6235]Address various...

2016-08-23 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/14647

[GitHub] spark pull request #14658: [WIP][SPARK-5928][SPARK-6238] Remote Shuffle Bloc...

2016-08-23 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/14658

[GitHub] spark issue #14311: [SPARK-16550] [SPARK-17042] [core] Certain classes fail ...

2016-08-31 Thread witgo
Github user witgo commented on the issue: https://github.com/apache/spark/pull/14311 @rxin @ericl This PR may cause the following code to throw an exception ```scala private def getRemoteValues(blockId: BlockId): Option[BlockResult] = { getRemoteBytes

[GitHub] spark pull request #14977: [Test Only][not ready for review][SPARK-6235][COR...

2016-09-06 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/14977 [Test Only][not ready for review][SPARK-6235][CORE]Address various 2G limits ## What changes were proposed in this pull request? ### Design Setup for eliminating the various 2G

[GitHub] spark pull request #14977: [Test Only][not ready for review][SPARK-6235][COR...

2016-09-07 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/14977

[GitHub] spark pull request #14995: [Test Only][not ready for review][SPARK-6235][COR...

2016-09-07 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/14995 [Test Only][not ready for review][SPARK-6235][CORE]Address various 2G limits ## What changes were proposed in this pull request? ### motivation The various 2G limit in Spark

[GitHub] spark issue #14995: [Test Only][not ready for review][SPARK-6235][CORE]Addre...

2016-09-08 Thread witgo
Github user witgo commented on the issue: https://github.com/apache/spark/pull/14995 retest please.

[GitHub] spark issue #14995: [Test Only][SPARK-6235][CORE]Address various 2G limits

2016-09-08 Thread witgo
Github user witgo commented on the issue: https://github.com/apache/spark/pull/14995 Jenkins, retest this please

[GitHub] spark issue #14995: [Test Only][SPARK-6235][CORE]Address various 2G limits

2016-09-27 Thread witgo
Github user witgo commented on the issue: https://github.com/apache/spark/pull/14995 Jenkins, retest this please

[GitHub] spark pull request: [SPARK-2491] Don't handle uncaught exceptions ...

2015-11-03 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/1482

[GitHub] spark pull request: [SPARK-7483] [MLlib] Upgrade Chill to 0.7.2 to...

2016-02-14 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/11041#issuecomment-184034549 @srowen 0.8.0 is the latest.

[GitHub] spark pull request: [SPARK-7483] [MLlib] Upgrade Chill to 0.7.2 to...

2016-02-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/11041#issuecomment-184175637 @srowen I've run some simple Spark SQL cases, and it doesn't seem to have any issues.

[GitHub] spark pull request: [SPARK-10350] [Minor] [Doc] Fix SQL Programmin...

2015-08-29 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/8520 [SPARK-10350] [Minor] [Doc] Fix SQL Programming Guide You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark SPARK-10350 Alternatively you

[GitHub] spark pull request: [SPARK-9424] [SQL] Parquet programming guide u...

2015-08-30 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/8467#discussion_r38283165 --- Diff: docs/sql-programming-guide.md --- @@ -1371,6 +1380,26 @@ Configuration of Parquet can be done using the `setConf` method on `SQLContext

[GitHub] spark pull request: [SPARK-2491] Don't handle uncaught exceptions ...

2015-09-02 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1482#issuecomment-136971893 I think it is necessary to merge the PR into master.

[GitHub] spark pull request: SPARK-1470: Use the scala-logging wrapper inst...

2014-07-10 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/1208

[GitHub] spark pull request: SPARK-1470: Use the scala-logging wrapper inst...

2014-07-10 Thread witgo
GitHub user witgo reopened a pull request: https://github.com/apache/spark/pull/1208 SPARK-1470: Use the scala-logging wrapper instead of the directly sfl4j api You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark SPARK

[GitHub] spark pull request: SPARK-1470: Use the scala-logging wrapper inst...

2014-07-10 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/1208

[GitHub] spark pull request: Use the scala-logging wrapper instead of the d...

2014-07-10 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1369 Use the scala-logging wrapper instead of the directly sfl4j api You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark SPARK-1470_new

[GitHub] spark pull request: Use the scala-logging wrapper instead of the d...

2014-07-10 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1369#issuecomment-48696680 Jenkins, test this please.

[GitHub] spark pull request: Use the scala-logging wrapper instead of the d...

2014-07-11 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1369#issuecomment-48708268 #332 can't be tested automatically. #1208 got messed up and I do not know how to fix it. :sweat:

[GitHub] spark pull request: [SPARK-1470] Use the scala-logging wrapper ins...

2014-07-11 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/332#issuecomment-48708675 It can't be tested automatically. I submitted a new PR, #1369.

[GitHub] spark pull request: [WIP]SPARK-2098: All Spark processes should su...

2014-07-11 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/1256#discussion_r14850885 --- Diff: core/src/main/scala/org/apache/spark/deploy/master/MasterArguments.scala --- @@ -38,19 +39,24 @@ private[spark] class MasterArguments(args: Array

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-12 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1387 [WIP]When the executor is thrown OutOfMemoryError exception driver run garbage collection You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-13 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-48841019 Currently, `SparkContext.cleaner` does not consider executor memory usage. This can cause Spark to fail when memory is short.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-13 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-48841151 @srowen [Executor.scala#L253](https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/executor/Executor.scala#L253) handles exceptions. But the

[GitHub] spark pull request: SPARK-2465. Use long as user / item ID for ALS

2014-07-13 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1393#issuecomment-48843123 How much does memory usage increase overall? Is there a detailed comparison?

[GitHub] spark pull request: [WIP][SQL] By default does not run hive compat...

2014-07-14 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1403 [WIP][SQL] By default does not run hive compatibility tests You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark hive_compatibility

[GitHub] spark pull request: Remove "NOTE: SPARK_YARN is deprecated, please...

2014-07-14 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1404 Remove "NOTE: SPARK_YARN is deprecated, please use -Pyarn flag" You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark

[GitHub] spark pull request: SPARK-2480: Remove "NOTE: SPARK_YARN is deprec...

2014-07-14 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1404#issuecomment-48981721 Jenkins, test this please.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-14 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-48985219 I agree with your point. But when an out-of-memory exception is thrown, the error Spark gives is: ``` org.apache.spark.SparkException: Job aborted due to stage

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-14 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-48985713 ``` # # java.lang.OutOfMemoryError: Java heap space # -XX:OnOutOfMemoryError="kill %p" # Executing /bin/sh -c "kill 44942"... 14/

[GitHub] spark pull request: SPARK-1291: Link the spark UI to RM ui in yarn...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1112#issuecomment-49045150 @tgravescs The code has been submitted. Because I don't have a Hadoop 0.23.x cluster, the code has not been strictly tested.

[GitHub] spark pull request: SPARK-2480: Resolve sbt warnings "NOTE: SPARK_...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1404#issuecomment-49048633 Jenkins, test this please.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49061792 `SparkContext.cleaner` will clean up unreferenced RDDs, shuffles, and broadcasts.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49064468 Explicit clearing means keeping references to all the objects, which is very unfriendly for Java programmers.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49067472 Yes, `System.gc()` is only a hint and may not actually free resources. But RDD has no close method, so it can only be cleaned up by `ContextCleaner`.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49069644 This involves a bug: https://issues.apache.org/jira/browse/SPARK-2491

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49072476 Yes, this solution is not perfect. I have been thinking about this problem. BTW, the `runGC` method runs GC and makes sure it has actually run. Reference: https

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/1387#discussion_r14953652 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskEventListener.scala --- @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49074742 I'm sorry, my English is poor. The problem now is that we do not have a reliable way to ensure the RDD is cleared. Should I close this for now?

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49075444 The main problem with the `runGC` method is that it may run for a long time and still not work.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49076040 In my tests, the `runGC` method works normally on jdk7_45.

[GitHub] spark pull request: [WIP]When the executor is thrown OutOfMemoryEr...

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1387#issuecomment-49077845 OK, I will try it the way you suggested tomorrow or the day after. I have only tested the default GC configuration and will test the others.

[GitHub] spark pull request: SPARK-2482: Resolve sbt warnings during build

2014-07-15 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1330#issuecomment-49117596 As a result of #772, master has fixed this problem. But we should remove the line `-language:postfixOps` in [pom.xml#L807](https://github.com/apache/spark/blob

[GitHub] spark pull request: SPARK-1719: spark.*.extraLibraryPath isn't app...

2014-07-16 Thread witgo
Github user witgo closed the pull request at: https://github.com/apache/spark/pull/1022

[GitHub] spark pull request: SPARK-1719: spark.*.extraLibraryPath isn't app...

2014-07-16 Thread witgo
GitHub user witgo reopened a pull request: https://github.com/apache/spark/pull/1022 SPARK-1719: spark.*.extraLibraryPath isn't applied on yarn Fix: spark.executor.extraLibraryPath isn't applied on yarn You can merge this pull request into a Git repository by running:

[GitHub] spark pull request: SPARK-2491: Fix When an OOM is thrown,the exec...

2014-07-18 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1482 SPARK-2491: Fix When an OOM is thrown,the executor does not stop properly. You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark SPARK-2491

[GitHub] spark pull request: SPARK-1097: Do not introduce deadlock while fi...

2014-07-19 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1409#issuecomment-49513420 @aarondav @pwendell In my tests, it seems that there is still a deadlock. A possible cause is here: [Executor.scala#L189] (https://github.com

[GitHub] spark pull request: [YARN]In some cases, pages display incorrect i...

2014-07-20 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1501#issuecomment-49546564 cc @tgravescs

[GitHub] spark pull request: [YARN]In some cases, pages display incorrect i...

2014-07-20 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1501 [YARN]In some cases, pages display incorrect in WebUI The issue is caused by #1112 . You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark

[GitHub] spark pull request: [YARN]In some cases, pages display incorrect i...

2014-07-20 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1501#issuecomment-49549682 Jenkins, test this please.

[GitHub] spark pull request: Fix NPE for JsonProtocol

2014-07-21 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1511 Fix NPE for JsonProtocol You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark JsonProtocol Alternatively you can review and apply these

[GitHub] spark pull request: [WIP]Spark 2037: yarn client mode doesn't supp...

2014-07-22 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1180#issuecomment-49828369 @tgravescs Done.

[GitHub] spark pull request: [SPARK-2644] Hive should not be enabled by def...

2014-07-23 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1546#issuecomment-49852813 [HiveFromSpark](https://github.com/apache/spark/blob/master/examples/src/main/scala/org/apache/spark/examples/sql/hive/HiveFromSpark.scala#L22) class is dependent on the

[GitHub] spark pull request: [WIP]Spark 2037: yarn client mode doesn't supp...

2014-07-23 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1180#issuecomment-49874821 If we do not add this, then when Spark fails on YARN, the SparkContext's progress will hang.

[GitHub] spark pull request: Spark 2037: yarn client mode doesn't support s...

2014-07-23 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1180#issuecomment-49881861 There is a small error; I will fix it right away.

[GitHub] spark pull request: Spark 2037: yarn client mode doesn't support s...

2014-07-23 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1180#issuecomment-49886617 Done

[GitHub] spark pull request: SPARK-2662: Fix NPE for JsonProtocol

2014-07-23 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/1511#issuecomment-49959000 @mateiz Done.

[GitHub] spark pull request: Build should not run hive tests by default.

2014-07-23 Thread witgo
GitHub user witgo opened a pull request: https://github.com/apache/spark/pull/1565 Build should not run hive tests by default. cc @pwendell @ScrapCodes You can merge this pull request into a Git repository by running: $ git pull https://github.com/witgo/spark SPARK-2484
