[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225960#comment-14225960 ] zzc commented on SPARK-2468: Ah, Aaron Davidson, with patch #3465 I can successfully run the previously failed application, and my configuration is the same as before. It's great.

Netty-based block server / client module
Key: SPARK-2468
URL: https://issues.apache.org/jira/browse/SPARK-2468
Project: Spark
Issue Type: Improvement
Components: Shuffle, Spark Core
Reporter: Reynold Xin
Assignee: Reynold Xin
Priority: Critical
Fix For: 1.2.0

Right now shuffle send goes through the block manager. This is inefficient because it requires loading a block from disk into a kernel buffer, then into a user-space buffer, and then back into a kernel send buffer before it reaches the NIC. It makes multiple copies of the data and context switches between kernel and user space. It also creates unnecessary buffers in the JVM that increase GC pressure. Instead, we should use FileChannel.transferTo, which handles this in kernel space with zero copy. See http://www.ibm.com/developerworks/library/j-zerocopy/ One potential solution is to use Netty. Spark already has a Netty-based network module implemented (org.apache.spark.network.netty). However, it lacks some functionality and is turned off by default.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
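As a minimal illustration of the zero-copy path described above (a standalone java.nio sketch, not Spark's actual transfer code), FileChannel.transferTo lets the kernel move file bytes to a destination channel without staging them in a user-space buffer:

{code}
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.nio.channels.FileChannel;

public class ZeroCopyDemo {
    // Copy src to dst via FileChannel.transferTo. The kernel can move the
    // bytes directly (e.g. from the page cache), avoiding the extra
    // kernel -> user -> kernel copies of a read()/write() loop.
    public static void transferFile(File src, File dst) throws IOException {
        try (FileChannel in = new FileInputStream(src).getChannel();
             FileChannel out = new FileOutputStream(dst).getChannel()) {
            long pos = 0;
            long size = in.size();
            // transferTo may transfer fewer bytes than requested, so loop.
            while (pos < size) {
                pos += in.transferTo(pos, size - pos, out);
            }
        }
    }

    public static void main(String[] args) throws IOException {
        File src = File.createTempFile("zerocopy-src", ".bin");
        File dst = File.createTempFile("zerocopy-dst", ".bin");
        src.deleteOnExit();
        dst.deleteOnExit();
        try (FileOutputStream fos = new FileOutputStream(src)) {
            fos.write("shuffle block bytes".getBytes("UTF-8"));
        }
        transferFile(src, dst);
        System.out.println(src.length() == dst.length()); // prints "true"
    }
}
{code}

When the destination is a socket channel rather than a file, the same call lets shuffle blocks bypass the JVM heap entirely, which is the motivation stated in the issue description.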
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225681#comment-14225681 ] Aaron Davidson commented on SPARK-2468: --- Hey guys, I finally got a chance to run a more comprehensive set of tests with constrained containers. In doing so, I found a critical issue which caused us to allocate direct byte buffers proportional to the number of executors times the number of cores, rather than just proportional to the number of cores. With patch [#3465|https://github.com/apache/spark/pull/3465], I was able to run a shuffle with [~lianhuiwang]'s configuration of a 7GB container with a 6GB heap and 2 cores -- prior to the patch, it exceeded the container's limits. If you guys get a chance, please let me know if this is sufficient to fix the issues with your initial overhead configurations. (Note that while the memory usage was greatly decreased, we still allocate a significant amount of off-heap memory, so you may need to shift some of the heap to off-heap if your off-heap was previously very constrained.)
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225773#comment-14225773 ] Lianhui Wang commented on SPARK-2468: - [~adav] That's great. With patch #3465 I can successfully run the previously failed application, and my configuration is the same as before. Thanks.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225778#comment-14225778 ] Reynold Xin commented on SPARK-2468: Glad that we are able to resolve this!
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225790#comment-14225790 ] zzc commented on SPARK-2468: Great. I will test it later.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14212064#comment-14212064 ] zzc commented on SPARK-2468: Hi Aaron Davidson, I sent you an email about the shuffle data performance test. Looking forward to your reply. Thanks.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14209477#comment-14209477 ] zzc commented on SPARK-2468: Aaron Davidson, thank you for your reply. I will try it again. Can you describe your spark.* configuration?
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14209489#comment-14209489 ] zzc commented on SPARK-2468: Aaron Davidson, I see that #3155 has been merged into master, but spark.shuffle.io.maxUsableCores is not found?
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14210074#comment-14210074 ] Aaron Davidson commented on SPARK-2468: --- Here is my spark configuration (note 32 cores total):

spark.shuffle.io.clientThreads = 16
spark.shuffle.io.serverThreads = 16
spark.serializer = org.apache.spark.serializer.KryoSerializer
spark.shuffle.blockTransferService = netty
spark.shuffle.compress = false
spark.shuffle.io.maxRetries = 0
spark.reducer.maxMbInFlight = 512

Forgot to mention, but #3155 now automatically sets spark.shuffle.io.clientThreads and spark.shuffle.io.serverThreads based on the number of cores the Executor has allotted to it. You can override it by setting those properties by hand, but ideally the default behavior is sufficient.
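For anyone who does want to pin the thread pools by hand, the two properties can be set like any other Spark config, e.g. in spark-defaults.conf (the values below are placeholders, not recommendations):

{code}
# Manual override of the Netty transfer thread pools; with #3155 merged,
# these otherwise default to the number of cores allotted to the executor.
spark.shuffle.io.clientThreads   8
spark.shuffle.io.serverThreads   8
{code}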
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207826#comment-14207826 ] zzc commented on SPARK-2468: Hi, Aaron Davidson, I am sure that I ran my last test with the patch #3155 applied.

configuration:
spark.shuffle.consolidateFiles true
spark.storage.memoryFraction 0.2
spark.shuffle.memoryFraction 0.2
spark.shuffle.file.buffer.kb 100
spark.reducer.maxMbInFlight 48
spark.shuffle.blockTransferService netty
spark.shuffle.io.mode nio
spark.shuffle.io.connectionTimeout 120
spark.shuffle.manager SORT
spark.shuffle.io.preferDirectBufs true
spark.shuffle.io.maxRetries 3
spark.shuffle.io.retryWaitMs 5000
spark.shuffle.io.maxUsableCores 3

command: --num-executors 17 --executor-memory 12g --executor-cores 3

If spark.shuffle.io.preferDirectBufs=false, it's OK.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207839#comment-14207839 ] zzc commented on SPARK-2468: Hi, Aaron Davidson, can you describe your test, including the environment, configuration, and data volume?
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14209407#comment-14209407 ] Aaron Davidson commented on SPARK-2468: --- My test was significantly less strict in its memory requirements, which may be the difference with respect to OOMs. I used two 28GB containers on different machines, with 24GB of that given to Spark's heap. Due to the networking of the containers, the maximum throughput was around 7Gb/s (combined directionally), which I was able to saturate using Netty but could only achieve around 3.5Gb/s (combined) using NIO. My test was a sort of 50GB of generated data shuffled between the two machines. I tested the sort as a whole as well as a different version where I injected a deserializer which immediately EOFs (this causes us to still read all data but do no computation on the reducer side, maximizing network throughput). Here is my full test, including the no-op deserializer:

{code}
import org.apache.spark.SparkConf
import org.apache.spark.serializer.{Serializer, SerializerInstance, SerializationStream, DeserializationStream}
import java.io._
import java.nio.ByteBuffer
import scala.reflect.ClassTag

class NoOpReadSerializer(conf: SparkConf) extends Serializer with Serializable {
  override def newInstance(): SerializerInstance = {
    new NoOpReadSerializerInstance()
  }
}

class NoOpReadSerializerInstance() extends SerializerInstance {
  override def serialize[T: ClassTag](t: T): ByteBuffer = {
    val bos = new ByteArrayOutputStream()
    val out = serializeStream(bos)
    out.writeObject(t)
    out.close()
    ByteBuffer.wrap(bos.toByteArray)
  }

  override def deserialize[T: ClassTag](bytes: ByteBuffer): T = {
    null.asInstanceOf[T]
  }

  override def deserialize[T: ClassTag](bytes: ByteBuffer, loader: ClassLoader): T = {
    null.asInstanceOf[T]
  }

  override def serializeStream(s: OutputStream): SerializationStream = {
    new NoOpSerializationStream(s, 100)
  }

  override def deserializeStream(s: InputStream): DeserializationStream = {
    new NoOpDeserializationStream(s, Thread.currentThread().getContextClassLoader)
  }

  def deserializeStream(s: InputStream, loader: ClassLoader): DeserializationStream = {
    new NoOpDeserializationStream(s, loader)
  }
}

class NoOpDeserializationStream(in: InputStream, loader: ClassLoader) extends DeserializationStream {
  def readObject[T: ClassTag](): T = throw new EOFException()
  def close() { }
}

class NoOpSerializationStream(out: OutputStream, counterReset: Int) extends SerializationStream {
  private val objOut = new ObjectOutputStream(out)
  private var counter = 0

  def writeObject[T: ClassTag](t: T): SerializationStream = {
    objOut.writeObject(t)
    counter += 1
    // Reset the ObjectOutputStream periodically so it does not retain a
    // reference to every object it has written.
    if (counterReset > 0 && counter >= counterReset) {
      objOut.reset()
      counter = 0
    }
    this
  }

  def flush() { objOut.flush() }
  def close() { objOut.close() }
}

// Test code below:
implicit val arrayOrdering = Ordering.by((_: Array[Byte]).toIterable)

def createSort() = sc.parallelize(0 until 500, 320).map { x: Int =>
  val rand = new scala.util.Random(System.nanoTime())
  val bytes = new Array[Byte](1)
  rand.nextBytes(bytes)
  (bytes, 1)
}.sortByKey(true, 333)

val x = createSort()
x.count() // does shuffle + sorting on reduce side

val y = createSort().asInstanceOf[org.apache.spark.rdd.ShuffledRDD[_, _, _]]
  .setSerializer(new NoOpReadSerializer(sc.getConf))
y.count() // does shuffle with no read-side computation (warning: causes FD leak in Spark!)
{code}

Note that if you run that with less memory, you may have to tune the number of partitions or the size of the data to avoid invoking the ExternalSorter. I observed very little GC and no significant heap/process growth in memory after the first run. I will try another test where the memory is more constrained to further investigate the OOM problem.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207774#comment-14207774 ] Aaron Davidson commented on SPARK-2468: --- [~zzcclp] Ah, man, that's not good. Just to be certain, you ran your last test with the patch #3155 applied? We shouldn't be able to allocate more than 96MB of off-heap memory if so, which should be well within the 1GB you had left over between the 12GB Spark heap and 13GB YARN container. [~lianhuiwang] Were you able to re-run the test at any point? I ran a simple benchmark on ec2 and did not see any regressions from earlier, so if you're still seeing perf being worse than NIO, that suggests it may be workload-specific, making it harder to reproduce.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14204222#comment-14204222 ] zzc commented on SPARK-2468: @Aaron Davidson, thank you for the suggestion. The configuration and code for the Hadoop vs. Spark performance comparison are not shown above. It just runs a wordcount on 240G of snappy files and writes 500G of shuffle files. The configuration is as follows:

command: --driver-memory 10g --num-executors 17 --executor-memory 12g --executor-cores 3 --driver-library-path :/usr/local/hadoop/lib/native/ /opt/wsspark.jar 24G_10_20g_1c 1 100 hdfs://wscluster/zzc_test/in/snappy8/ 100 100 hdfs://wscluster/zzc_test/out/i007

configuration:
spark.default.parallelism 204
spark.shuffle.consolidateFiles false
spark.shuffle.spill.compress true
spark.shuffle.compress true
spark.storage.memoryFraction 0.3
spark.shuffle.memoryFraction 0.5
spark.shuffle.file.buffer.kb 100
spark.reducer.maxMbInFlight 48
spark.shuffle.blockTransferService nio
spark.shuffle.manager HASH
spark.scheduler.mode FIFO
spark.akka.frameSize 10
spark.akka.timeout 100
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14203316#comment-14203316 ] Lianhui Wang commented on SPARK-2468: - OK, thanks [~adav], I will try to do as you say.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14201765#comment-14201765 ] zzc commented on SPARK-2468: @Lianhui Wang, how can I view the associated logs when YARN still kills the executor's container because its physical memory goes beyond the allocated memory? I can't find them.
[jira] [Commented] (SPARK-2468) Netty-based block server / client module
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14201766#comment-14201766 ] Aaron Davidson commented on SPARK-2468: --- [~zzcclp] Yes, please do. What's the memory of your YARN executors/containers? With preferDirectBufs off, we should allocate little to no off-heap memory, so these results are surprising.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201772#comment-14201772 ] zzc commented on SPARK-2468: aa...@databricks.com?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201774#comment-14201774 ] Aaron Davidson commented on SPARK-2468: --- Yup, that would work.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201778#comment-14201778 ] Lianhui Wang commented on SPARK-2468: - [~zzcclp] In the AM's log you can find this line: Exit status: 143. Diagnostics: Container [container-id] is running beyond physical memory limits. Current usage: 8.3 GB of 8 GB physical memory used; 11.0 GB of 16.8 GB virtual memory used. Killing container. I had already set spark.yarn.executor.memoryOverhead=1024 and the executor's memory is 7G, so from the log above I can confirm that the executor uses a large amount of non-heap JVM memory.
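As a sanity check on those numbers, plain arithmetic using only the figures quoted in the comment shows how much memory must be living outside the heap:

```python
heap_mb = 7 * 1024                          # executor memory: 7G
overhead_mb = 1024                          # spark.yarn.executor.memoryOverhead
container_limit_mb = heap_mb + overhead_mb  # YARN kills the container above this
reported_usage_mb = 8.3 * 1024              # "8.3 GB of 8 GB physical memory used"

# The container limit is 8192 MB and the reported usage exceeds it, so at
# least reported_usage_mb - heap_mb (over 1.3 GB) sits outside the Java heap.
over_limit = reported_usage_mb > container_limit_mb
non_heap_mb = reported_usage_mb - heap_mb
```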
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201783#comment-14201783 ] Aaron Davidson commented on SPARK-2468: --- Thanks a lot for those diagnostics. Can you confirm that spark.shuffle.io.preferDirectBufs does show up in the UI as being set properly? Does your workload mainly involve a large shuffle? How big is each partition/how many are there? In addition to the netty buffers (which _should_ be disabled by the config), we also memory map shuffle blocks larger than 2MB.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201812#comment-14201812 ] Aaron Davidson commented on SPARK-2468: --- Looking at the netty code a bit more, it seems that it might unconditionally allocate direct buffers for IO, whether or not direct is preferred. Additionally, it allocates more memory based on the number of cores in your system. The default settings would be roughly 16MB per core, and this might be multiplied by 2 in our current setup since we have independent client and server pools in the same JVM. I'm not certain how executors running in YARN report availableProcessors, but is it possible your machines have 32 or more cores? That could cause an extra allocation of around 1GB of direct buffers.
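Aaron's estimate works out as follows. This is a back-of-the-envelope sketch: 16MB per core is his rough figure for the default pool settings, not an exact Netty constant.

```python
mb_per_core = 16        # rough direct-buffer allocation per core (default pools)
pools = 2               # independent client and server pools in the same JVM
reported_cores = 32     # what availableProcessors() might report on a big box

direct_buffer_mb = mb_per_core * pools * reported_cores
# 16 * 2 * 32 = 1024 MB, i.e. about the 1GB of extra direct buffers
# mentioned in the comment above.
```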
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201832#comment-14201832 ] Aaron Davidson commented on SPARK-2468: --- [~lianhuiwang] I have created [#3155|https://github.com/apache/spark/pull/3155/files], which makes the preferDirectBufs config forcefully disable direct byte buffers for both the server and client pools; I will clean it up and try to get it in tomorrow. Additionally, I have added the conf spark.shuffle.io.maxUsableCores, which lets you tell the executor how many cores you're actually using, so it will avoid allocating enough memory for all of the machine's cores. I hope that simply specifying maxUsableCores is sufficient to fix this issue for you, but the combination should give a higher chance of success.
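Taken together, the knobs discussed in this thread would be set roughly like this in spark-defaults.conf. The property names are as they appear in the comments; the values are illustrative for a 2-core container.

```
spark.shuffle.blockTransferService     netty
# forcefully disable direct byte buffers (effective once #3155 is in)
spark.shuffle.io.preferDirectBufs      false
# tell Netty how many cores the container actually has
spark.shuffle.io.maxUsableCores        2
```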
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201844#comment-14201844 ] zzc commented on SPARK-2468: By the way, my test code:
{code}
val mapR = textFile.map(line => {
  ..
  ((value(1) + "_" + date.toString(), url), (flow, 1))
}).reduceByKey((pair1, pair2) => {
  (pair1._1 + pair2._1, pair1._2 + pair2._2)
}, 100)
mapR.persist(StorageLevel.MEMORY_AND_DISK_SER)

val mapR1 = mapR.groupBy(_._1._1)
  .mapValues(pairs => { pairs.toList.sortBy(_._2._1).reverse })
  .flatMap(values => { values._2 })
  .map(values => { values._1._1 + "\t" + values._1._2 + "\t" + values._2._1.toString() + "\t" + values._2._2.toString() })
  .saveAsTextFile(outputPath + "_1/")

val mapR2 = mapR.groupBy(_._1._1)
  .mapValues(pairs => { pairs.toList.sortBy(_._2._2).reverse })
  .flatMap(values => { values._2 })
  .map(values => { values._1._1 + "\t" + values._1._2 + "\t" + values._2._1.toString() + "\t" + values._2._2.toString() })
  .saveAsTextFile(outputPath + "_2/")
{code}
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201849#comment-14201849 ] Aaron Davidson commented on SPARK-2468: --- [~zzcclp] Thank you for the writeup. Is it really the case that each of your executors is only using 1 core for its 20GB of RAM? It seems like 5 would be in line with the portion of memory you're using. Also, the sum of your storage and shuffle memory fractions exceeds 1, so if you're caching any data and then performing a reduction/groupBy, you could actually see an OOM even without this other issue. I would recommend keeping the shuffle fraction relatively low unless you have a good reason not to, as a high value can lead to increased instability. The numbers are relatively close to my expectations, which would estimate netty allocating around 750MB of direct buffer space, thinking that it has 24 cores. With #3155 and maxUsableCores set to 1 (or 5), I hope this issue will be resolved.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201865#comment-14201865 ] zzc commented on SPARK-2468: Hi Aaron Davidson, what do you mean by "Is it really the case that each of your executors is only using 1 core for its 20GB of RAM? It seems like 5 would be in line with the portion of memory you're using"? I tried setting spark.storage.memoryFraction and spark.shuffle.memoryFraction from 0.2 up to 0.5 before, and the OOM still occurred.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14202056#comment-14202056 ] Lianhui Wang commented on SPARK-2468: - [~adav] Yes, with https://github.com/apache/spark/pull/3155/ the problem no longer happens in my test. But I found that Netty's performance is worse than NioBlockTransferService's, so I need to find out why Netty performs worse than NioBlockTransferService in my test. Can you give me some suggestions? Thanks. And how about your test, [~zzcclp]?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14202136#comment-14202136 ] zzc commented on SPARK-2468: The performance of Netty is worse than NIO in my test. Why, @Aaron Davidson? I want to improve the performance of shuffle; with 500G of shuffle data, the performance is much worse than Hadoop's.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14202490#comment-14202490 ] Aaron Davidson commented on SPARK-2468: --- [~lianhuiwang] Can you try again with preferDirectBufs set to true, and just setting maxUsableCores down to the number of cores each container actually has? It's possible the performance discrepancy you're seeing is simply due to heap byte buffers not being as fast as direct ones. You might also decrease the Java heap size a bit while keeping the container size the same, if _any_ direct memory allocation is causing the container to be killed. [~zzcclp] Same suggestion for you about setting preferDirectBufs to true and setting maxUsableCores down, but I will also perform another round of benchmarking -- it's possible we accidentally introduced a performance regression in the last few patches. Comparing Hadoop vs Spark performance is a different matter. A few suggestions on your setup: You should set executor-cores to 5, so that each executor actually uses 5 cores instead of just 1. You're losing significant parallelism because of this setting, as Spark will only launch 1 task per core on an executor at any given time. Second, groupBy() is inefficient (its doc was changed recently to reflect this) and should be avoided. I would recommend changing your job to sort the whole RDD using something similar to {code}mapR.map { x => ((x._1._1, x._2._1), x) }.sortByKey(){code}, which would not require that all values for a single group fit in memory. This would still effectively group by x._1._1, but would sort within each group by x._2._1, and would utilize Spark's efficient sorting machinery.
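The reshaping Aaron suggests, replacing groupBy with a sort on a composite key, can be illustrated outside Spark. Plain Python stands in for the RDD here, with records shaped like zzc's ((key, url), (flow, count)) pairs; the field names are only for illustration.

```python
# Records shaped like mapR's output: ((group_key, url), (flow, count))
records = [
    (("a", "u1"), (10, 1)),
    (("b", "u3"), (5, 1)),
    (("a", "u2"), (30, 2)),
]

# groupBy-then-sort must hold every value of a group in memory at once.
# Sorting by (group_key, -flow) instead ranks records within each group
# as part of one global sort, which is what sortByKey over the composite
# key ((x._1._1, x._2._1), x) achieves in Spark.
ranked = sorted(records, key=lambda kv: (kv[0][0], -kv[1][0]))
```

Within group "a" the record with the larger flow comes first, matching the `.sortBy(_._2._1).reverse` ordering of the original job without ever materializing a whole group.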
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14200037#comment-14200037 ] zzc commented on SPARK-2468: Hi Aaron Davidson, I set spark.shuffle.blockTransferService=netty and spark.shuffle.io.mode=nio and ran successfully on CentOS 5.8 with 12G of files, but when I set spark.shuffle.blockTransferService=netty and spark.shuffle.io.mode=epoll, there is an error: Exception in thread main java.lang.UnsatisfiedLinkError: /tmp/libnetty-transport-native-epoll7072694982027222413.so: /lib64/libc.so.6: version `GLIBC_2.10' not found. I only have GLIBC_2.5 on CentOS 5.8 and cannot upgrade; how can I resolve this?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14200096#comment-14200096 ] Lianhui Wang commented on SPARK-2468: - [~adav] I used your branch, and the memory overhead on YARN still exists. [~zzcclp] How about your test?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14200519#comment-14200519 ] Aaron Davidson commented on SPARK-2468: --- [~zzcclp] Use of epoll mode is highly dependent on your environment, and I personally would not recommend it due to known netty bugs which may cause it to be less stable. We have found nio mode to be sufficiently performant in our testing (and netty actually still tries to use epoll if it's available as its selector). [~lianhuiwang] Could you please elaborate on what you mean?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201378#comment-14201378 ] zzc commented on SPARK-2468: @Aaron Davidson, thank you for your recommendation. By the way, can PR #3101 be merged into master today? @Lianhui Wang, I haven't tested it yet.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201432#comment-14201432 ] Aaron Davidson commented on SPARK-2468: --- [~zzcclp] I believe it is close to merging. [~rxin] is finishing up his review.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201482#comment-14201482 ] zzc commented on SPARK-2468: Thank you.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201483#comment-14201483 ] Reynold Xin commented on SPARK-2468: It's been merged.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201487#comment-14201487 ] zzc commented on SPARK-2468: Yes, I see it, and I've started to compile and test.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201684#comment-14201684 ] zzc commented on SPARK-2468: @Aaron Davidson, the error still occurs even after setting spark.shuffle.io.preferDirectBufs=false. Can I email you the error logs and environment details?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14201754#comment-14201754 ] Lianhui Wang commented on SPARK-2468: [~adav] Even when I set spark.shuffle.io.preferDirectBufs=false, YARN still kills the executor's container because its physical memory exceeds the allocated limit, so I think Netty is still using a large amount of off-heap JVM memory.
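For readers hitting the same container kills, the two knobs discussed in this thread can be combined in spark-defaults.conf. The overhead value below is purely illustrative, not a recommendation; tune it to your container size:

```properties
# Ask the Netty transfer service to prefer heap buffers over direct
# (off-heap) ones, as discussed in this thread.
spark.shuffle.io.preferDirectBufs    false

# Headroom YARN grants on top of the executor heap; raise it if containers
# are killed for exceeding physical memory (the value here is illustrative).
spark.yarn.executor.memoryOverhead   1024
```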
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14199911#comment-14199911 ] Aaron Davidson commented on SPARK-2468: This could be due to the netty transfer service allocating more off-heap byte buffers, which perhaps is accounted for differently by YARN. [PR #3101|https://github.com/apache/spark/pull/3101/files#diff-d2ce9b38bdc38ca9d7119f9c2cf79907R33], which should go in tomorrow, will include a way to avoid allocating off-heap buffers, which should either solve your problem or at least produce the more typical OutOfMemoryError.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14199945#comment-14199945 ] zzc commented on SPARK-2468: Thank you for your reply. I will focus on it.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14195884#comment-14195884 ] Reynold Xin commented on SPARK-2468: Are you running on YARN? It seems like YARN just killed your application.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14195952#comment-14195952 ] zzc commented on SPARK-2468: Yes, running in yarn-client mode, but the application was not killed; it kept running and then failed repeatedly. Why? I cannot find any other error.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14196251#comment-14196251 ] zzc commented on SPARK-2468: With less data, such as 24GB of Snappy files, it runs successfully, but with 240GB of Snappy files the above error occurs.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14189825#comment-14189825 ] zzc commented on SPARK-2468: Hi Reynold Xin, [SPARK-3453] (Netty-based BlockTransferService, extracted from Spark core) was committed yesterday. I compiled the latest code from the GitHub master. When I set spark.shuffle.blockTransferService=netty, there is an error: ERROR - org.apache.spark.Logging$class.logError(Logging.scala:75) - sparkDriver-akka.actor.default-dispatcher-14 - Lost executor 17 on np05: remote Akka client disassociated. When I set spark.shuffle.blockTransferService=nio, it runs successfully. In addition, when will the shuffle performance improvement issue be resolved?
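The two transfer services being compared in this thread are selected by a single setting; the property name and values are the ones quoted in the comments, e.g. in spark-defaults.conf:

```properties
# New Netty-based transfer service (the subject of this issue):
spark.shuffle.blockTransferService   netty

# Fall back to the older NIO-based service if problems appear:
# spark.shuffle.blockTransferService nio
```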
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14186910#comment-14186910 ] zzc commented on SPARK-2468: Hi Reynold Xin, when can this issue be resolved? I need to improve shuffle performance as soon as possible.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187183#comment-14187183 ] Reynold Xin commented on SPARK-2468: Scheduled to go in in 1.2.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187832#comment-14187832 ] zzc commented on SPARK-2468: Hi Reynold Xin, approximately when will version 1.2 be released?
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187837#comment-14187837 ] Reynold Xin commented on SPARK-2468: Take a look here https://cwiki.apache.org/confluence/display/SPARK/Wiki+Homepage
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187881#comment-14187881 ] zzc commented on SPARK-2468: Thanks.
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14098805#comment-14098805 ] Apache Spark commented on SPARK-2468: User 'rxin' has created a pull request for this issue: https://github.com/apache/spark/pull/1971
[ https://issues.apache.org/jira/browse/SPARK-2468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14094449#comment-14094449 ] Apache Spark commented on SPARK-2468: User 'rxin' has created a pull request for this issue: https://github.com/apache/spark/pull/1907