[ https://issues.apache.org/jira/browse/TEZ-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rajesh Balamohan updated TEZ-1593: ---------------------------------- Attachment: TEZ-1593.2.patch > Refactor PipelinedSorter to remove all MMAP based ByteBuffer references > ----------------------------------------------------------------------- > > Key: TEZ-1593 > URL: https://issues.apache.org/jira/browse/TEZ-1593 > Project: Apache Tez > Issue Type: Bug > Affects Versions: 0.6.0 > Reporter: Gopal V > Assignee: Gopal V > Labels: Performance > Attachments: TEZ-1593.1.patch, TEZ-1593.2-WIP.patch, TEZ-1593.2.patch > > > The current implementation of PipelinedSorter has a slow section which > revolves around key comparisons - this was relevant when the implementation > used direct byte buffers to back the kvbuffer. > {code} > kvbuffer.position(istart); > kvbuffer.get(ki, 0, ilen); > kvbuffer.position(jstart); > kvbuffer.get(kj, 0, jlen); > // sort by key > final int cmp = comparator.compare(ki, 0, ilen, kj, 0, jlen); > {code} > The kvbuffer.get into the arrays ki and kj are the slowest part of the > comparator operation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)