Re: GSoC 2019: Off-heap memory management JIRA issue available

John Yang Thu, 23 May 2019 21:04:50 -0700

Awesome. That makes sense, thanks!

John


On Fri, May 24, 2019 at 11:11 AM Haeyoon Cho <chohy0...@gmail.com> wrote:

> Hi John,
>
> Garbage-collecting the MemoryChunks in (5. a. iv.) means that the objects
> that refer to the off-heap buffers to be garbage-collected, as the
> MemoryChunks itself is a JVM object that points to the native memory. Now
> that I think of it, it wouldn't be necessary to explicitly do this since
> shutting down the executor means the java process to terminate, which
> implies that garbage collections are not needed in the first place. I guess
> it is better to delete (5. a. iv) since it is too obvious. Thanks :)
>
> For the off-heap sorting, it is expected to be feasible with the current
> design plan just like you mentioned. The MemoryChunk is intended to be a
> representation of off-heap buffers (i.e. java.nio.ByteBuffer) and also a
> specialized class for operations on off-heap data at the same time (swap,
> compare, etc.). I will propose another design plan on this issue later for
> further discussion.
>
> Thank you for the review! It was a really big help.
>
> Best,
> Haeyoon
>
>
>
> 2019년 5월 24일 (금) 오전 10:03, John Yang <johnya...@gmail.com>님이 작성:
>
> > Hi Haeyoon,
> >
> > I like the overall design. I have minor questions about some details.
> >
> > In (5. a. iv.), what do you mean by freeing the MemoryChunks to be
> > garbage-collected? My understanding was that a MemoryChunk is outside of
> > the JVM heap, and thus is never accessed by the JVM garbage collector.
> > Also, why does the garbage collection happen "when the executor shuts
> > down"?
> >
> > I like the interfaces in (6. b.), and am curious to know whether they can
> > be also used in performance-critical places other than
> > DirectByteBufferOutputstream/SerializedMemoryStore. For example, can we
> use
> > the interfaces to sort data off-heap as in Apache Crail and Apache Spark
> > [1.2]?
> >
> > [1] Sorting on a 100Gbit/s Cluster using Spark/Crail
> > https://crail.incubator.apache.org/blog/2017/01/sorting.html
> > [2] Project Tungsten: Bringing Apache Spark Closer to Bare Metal
> >
> >
> https://databricks.com/blog/2015/04/28/project-tungsten-bringing-spark-closer-to-bare-metal.html
> >
> > Thanks,
> > John
> >
> >
> > On Thu, May 23, 2019 at 11:33 AM Gyewon Lee <strayyy...@gmail.com>
> wrote:
> >
> > > Hi, Haeyoon.
> > >
> > > Thanks a lot for your working. The initial design seems reasonable.
> > >
> > > Folks, could you give some feedbacks on the design document, so that
> > > Haeyoon can quickly move onto implementation? She will welcome any
> > > feedbacks, as she told us :)
> > >
> > > Best,
> > > Gyewon
> > >
> > > 2019년 5월 22일 (수) 오후 7:45, Haeyoon Cho <chohy0...@gmail.com>님이 작성:
> > >
> > > > Hi all,
> > > >
> > > > Just as I mentioned in the previous email, we are about to implement
> > > > components for off-heap memory management. Before that, we wrote an
> > > initial
> > > > design document and reported a JIRA issue for this(NEMO-388
> > > > <https://issues.apache.org/jira/browse/NEMO-388>). We welcome any
> > > > feedbacks
> > > > regarding the design.
> > > >
> > > >
> > > > Best,
> > > > Haeyoon
> > > >
> > >
> >
>

Re: GSoC 2019: Off-heap memory management JIRA issue available

Reply via email to