[jira] [Commented] (TEZ-1733) TezMerger should sort FileChunks on decompressed size

2014-11-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14195330#comment-14195330 ] Gopal V commented on TEZ-1733: -- [~rajesh.balamohan]: Can you take a look at this change-set?

[jira] [Assigned] (TEZ-1733) TezMerger should sort FileChunks on decompressed size

2014-11-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V reassigned TEZ-1733: Assignee: Gopal V TezMerger should sort FileChunks on decompressed size

[jira] [Updated] (TEZ-1733) TezMerger should sort FileChunks on decompressed size

2014-11-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1733: - Attachment: TEZ-1733.2.patch TezMerger should sort FileChunks on decompressed size

[jira] [Commented] (TEZ-1733) TezMerger should sort FileChunks on decompressed size

2014-11-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14195734#comment-14195734 ] Gopal V commented on TEZ-1733: -- Thought I'd hit long/long overflow issues there - some of those

[jira] [Commented] (TEZ-1733) TezMerger should sort FileChunks on decompressed size

2014-11-04 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14196366#comment-14196366 ] Gopal V commented on TEZ-1733: -- [~pramachandran]: that would be a problem. Can you take a

[jira] [Commented] (TEZ-1738) tez tfile parser for log parsing

2014-11-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14199724#comment-14199724 ] Gopal V commented on TEZ-1738: -- As an added note, if this proves to be widely used, we need to

[jira] [Updated] (TEZ-1738) tez tfile parser for log parsing

2014-11-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1738: - Issue Type: Improvement (was: Bug) tez tfile parser for log parsing

[jira] [Updated] (TEZ-1738) tez tfile parser for log parsing

2014-11-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1738: - Affects Version/s: 0.6.0 tez tfile parser for log parsing Key:

[jira] [Commented] (TEZ-1738) tez tfile parser for log parsing

2014-11-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14199884#comment-14199884 ] Gopal V commented on TEZ-1738: -- Content pending from me on

[jira] [Created] (TEZ-1777) Explore distributed tracing of Tez tasks

2014-11-13 Thread Gopal V (JIRA)
Gopal V created TEZ-1777: Summary: Explore distributed tracing of Tez tasks Key: TEZ-1777 URL: https://issues.apache.org/jira/browse/TEZ-1777 Project: Apache Tez Issue Type: Bug

[jira] [Updated] (TEZ-1777) Explore distributed tracing of Tez tasks

2014-11-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1777: - Issue Type: Wish (was: Bug) Explore distributed tracing of Tez tasks -

[jira] [Commented] (TEZ-1781) Configurations view ~ New design

2014-11-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14217651#comment-14217651 ] Gopal V commented on TEZ-1781: -- [~rajesh.balamohan]: has this been pushed to the git repos?

[jira] [Commented] (TEZ-1780) tez-api is missing jersey dependencies

2014-11-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14221752#comment-14221752 ] Gopal V commented on TEZ-1780: -- Would it be better to remove the excludes from the main pom.xml

[jira] [Resolved] (TEZ-1250) Prevent timeouts for pre-warmed containers

2014-11-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V resolved TEZ-1250. -- Resolution: Not a Problem min-held containers fix is more recent development, which offers a better alternative

[jira] [Resolved] (TEZ-1086) Allow Tez sessions to decay smoothly instead of timeouts

2014-11-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V resolved TEZ-1086. -- Resolution: Not a Problem min-held containers fix provides a cleaner fix to the this problem. Allow Tez

[jira] [Commented] (TEZ-1248) Reduce slow-start should special case 1 reducer runs

2014-12-08 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14238790#comment-14238790 ] Gopal V commented on TEZ-1248: -- bq. More details would help. For a case with 100 mappers and 1

[jira] [Comment Edited] (TEZ-1993) Implement a pluggable InputSizeEstimator for grouping fairly

2015-01-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14292192#comment-14292192 ] Gopal V edited comment on TEZ-1993 at 1/26/15 6:32 PM: --- No, because

[jira] [Commented] (TEZ-1993) Implement a pluggable InputSizeEstimator for grouping fairly

2015-01-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14292192#comment-14292192 ] Gopal V commented on TEZ-1993: -- No, because inheritance is not shimmable. And because of that

[jira] [Commented] (TEZ-2021) Tez tool to analyze shuffle performance in large clusters by mining task logs

2015-02-03 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14304683#comment-14304683 ] Gopal V commented on TEZ-2021: -- The axis names need to be swapped around - the src-machine is

[jira] [Updated] (TEZ-1993) Implement a pluggable InputSizeEstimator for grouping fairly

2015-02-08 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1993: - Attachment: TEZ-1993.2.patch Rebasing to trunk. Without native MixIn support in the language, there is no better

[jira] [Created] (TEZ-1949) Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

2015-01-14 Thread Gopal V (JIRA)
Gopal V created TEZ-1949: Summary: Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges Key: TEZ-1949 URL: https://issues.apache.org/jira/browse/TEZ-1949 Project: Apache Tez Issue Type:

[jira] [Updated] (TEZ-1949) Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

2015-01-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1949: - Affects Version/s: 0.7.0 Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

[jira] [Updated] (TEZ-1949) Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

2015-01-14 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1949: - Attachment: TEZ-1949.1.patch Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

[jira] [Updated] (TEZ-1949) Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

2015-01-15 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1949: - Attachment: TEZ-1949.2.patch Test-case to confirm that we can set these parameters successfully. Whitelist

[jira] [Updated] (TEZ-1949) Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast edges

2015-01-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1949: - Attachment: TEZ-1949.3.patch Added the extra test. Whitelist TEZ_RUNTIME_OPTIMIZE_SHARED_FETCH for broadcast

[jira] [Commented] (TEZ-1593) Refactor PipelinedSorter to remove all MMAP based ByteBuffer references

2015-01-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14281032#comment-14281032 ] Gopal V commented on TEZ-1593: -- +1. This made the sorted data movement scale tests go from

[jira] [Updated] (TEZ-1593) Refactor PipelinedSorter to remove all MMAP based ByteBuffer references

2015-01-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1593: - Assignee: Rajesh Balamohan (was: Gopal V) Refactor PipelinedSorter to remove all MMAP based ByteBuffer references

[jira] [Commented] (TEZ-1803) Support 2gb sort buffer in pipelinedsorter

2015-01-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284691#comment-14284691 ] Gopal V commented on TEZ-1803: -- +1 - looks good. Only one detail, there's a perItem size reset

[jira] [Updated] (TEZ-1987) Tez UI non-standalone mode uses invalid protocol

2015-01-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1987: - Assignee: Sreenath Somarajapuram Tez UI non-standalone mode uses invalid protocol

[jira] [Commented] (TEZ-1987) Tez UI non-standalone mode uses invalid protocol

2015-01-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287021#comment-14287021 ] Gopal V commented on TEZ-1987: -- No, [~hitesh]. This is just a missing : in the URL generation

[jira] [Commented] (TEZ-1803) Support 2gb sort buffer in pipelinedsorter

2015-01-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287043#comment-14287043 ] Gopal V commented on TEZ-1803: -- Looks good - +1 Support 2gb sort buffer in pipelinedsorter

[jira] [Created] (TEZ-1993) Implement a pluggable InputSizeEstimator for grouping fairly

2015-01-22 Thread Gopal V (JIRA)
Gopal V created TEZ-1993: Summary: Implement a pluggable InputSizeEstimator for grouping fairly Key: TEZ-1993 URL: https://issues.apache.org/jira/browse/TEZ-1993 Project: Apache Tez Issue Type: Bug

[jira] [Comment Edited] (TEZ-2001) Support pipelined data transfer for ordered output

2015-02-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14326419#comment-14326419 ] Gopal V edited comment on TEZ-2001 at 2/18/15 7:24 PM: --- bq. Are we

[jira] [Commented] (TEZ-2001) Support pipelined data transfer for ordered output

2015-02-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14326419#comment-14326419 ] Gopal V commented on TEZ-2001: -- bq. Are we looking to address the memory overhead in this jira,

[jira] [Commented] (TEZ-2085) PipelinedSorter should bail out (on BufferOverflowException) instead of retrying continuously

2015-02-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14326450#comment-14326450 ] Gopal V commented on TEZ-2085: -- +1. For reference, to trigger this needs a single key bigger

[jira] [Commented] (TEZ-2085) PipelinedSorter should bail out (on BufferOverflowException) instead of retrying continuously

2015-02-12 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318983#comment-14318983 ] Gopal V commented on TEZ-2085: -- Looks good, I suggest making all the PipelinedSorter parameters

[jira] [Commented] (TEZ-2076) Tez tool to analyze data stored in ATS for specific dag

2015-02-12 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319564#comment-14319564 ] Gopal V commented on TEZ-2076: -- The DAG structure in-memory looks very useful even outside of

[jira] [Commented] (TEZ-2001) Support pipelined data transfer for ordered output

2015-02-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328628#comment-14328628 ] Gopal V commented on TEZ-2001: -- bq. Isnt straggler mitigation one of the primary motivations

[jira] [Comment Edited] (TEZ-2001) Support pipelined data transfer for ordered output

2015-02-19 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328628#comment-14328628 ] Gopal V edited comment on TEZ-2001 at 2/20/15 6:56 AM: --- bq. Isnt

[jira] [Commented] (TEZ-1997) Remove synchronization DefaultSorter::isRLENeeded() (Causes sorter to hang indefinitely in large jobs)

2015-01-26 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14293135#comment-14293135 ] Gopal V commented on TEZ-1997: -- Clear deadlock between object lock spill lock in

[jira] [Updated] (TEZ-1993) Implement a pluggable InputSizeEstimator for grouping fairly

2015-01-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1993: - Attachment: TEZ-1993.1.patch Implement a pluggable InputSizeEstimator for grouping fairly

[jira] [Updated] (TEZ-1913) Reduce deserialize cost in ValuesIterator

2015-01-05 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1913: - Description: When TezRawKeyValueIterator-isSameKey() is added, it should be possible to reduce the number of

[jira] [Updated] (TEZ-1593) Refactor PipelinedSorter to remove all MMAP based ByteBuffer references

2015-01-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1593: - Attachment: TEZ-1593.2-WIP.patch Refactor PipelinedSorter to remove all MMAP based ByteBuffer references

[jira] [Commented] (TEZ-1993) Implement a pluggable InputSizeEstimator for grouping fairly

2015-02-09 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14312960#comment-14312960 ] Gopal V commented on TEZ-1993: -- The FIleSplit::getLen() along with offset is used to

[jira] [Updated] (TEZ-2103) Implement a Partial completion VertexManagerPlugin

2015-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2103: - Issue Type: New Feature (was: Improvement) Implement a Partial completion VertexManagerPlugin

[jira] [Commented] (TEZ-2103) Implement a Partial completion VertexManagerPlugin

2015-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14321162#comment-14321162 ] Gopal V commented on TEZ-2103: -- The handling of short-circuit success + exit in an out-of-order

[jira] [Commented] (TEZ-2104) A CrossProductEdge which produces synthetic cross-product parallelism

2015-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14321182#comment-14321182 ] Gopal V commented on TEZ-2104: -- The cross-product edge has special affinity scheduling

[jira] [Comment Edited] (TEZ-2104) A CrossProductEdge which produces synthetic cross-product parallelism

2015-02-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14321182#comment-14321182 ] Gopal V edited comment on TEZ-2104 at 2/14/15 2:47 AM: --- The

[jira] [Moved] (TEZ-2202) Fix LocalTaskExecutionThread ID to the standard thread numbering

2015-03-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V moved HIVE-9973 to TEZ-2202: Target Version/s: 0.7.0 Key: TEZ-2202 (was: HIVE-9973)

[jira] [Updated] (TEZ-2198) Fix sorter spill counts

2015-03-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2198: - Description: Prior to pipelined shuffle, tez merged all spilled data into a single file. This ended up creating

[jira] [Updated] (TEZ-2202) Fix LocalTaskExecutionThread ID to the standard thread numbering

2015-03-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2202: - Attachment: TEZ-2202.1.patch Fix LocalTaskExecutionThread ID to the standard thread numbering

[jira] [Commented] (TEZ-2198) Fix sorter spill counts

2015-03-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363219#comment-14363219 ] Gopal V commented on TEZ-2198: -- [~rajesh.balamohan]: patch looks good - will test it today

[jira] [Commented] (TEZ-2217) The min-held-containers constraint is not enforced during query runtime

2015-03-20 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14372361#comment-14372361 ] Gopal V commented on TEZ-2217: -- The performance bug occurs within the queue, that is a

[jira] [Created] (TEZ-2217) The min-held-containers constraint is not enforced during query runtime

2015-03-20 Thread Gopal V (JIRA)
Gopal V created TEZ-2217: Summary: The min-held-containers constraint is not enforced during query runtime Key: TEZ-2217 URL: https://issues.apache.org/jira/browse/TEZ-2217 Project: Apache Tez

[jira] [Updated] (TEZ-1167) Statistics infrastructure and API for Tez

2015-03-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1167: - Attachment: tez-reducer-skew.png We need leaky bucket stats collection for some of the more expensive metrics. Let

[jira] [Updated] (TEZ-1167) Statistics infrastructure and API for Tez

2015-03-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1167: - Attachment: tez-reducer-skew.png Statistics infrastructure and API for Tez

[jira] [Commented] (TEZ-2198) Fix sorter spill counts

2015-03-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14360173#comment-14360173 ] Gopal V commented on TEZ-2198: -- That exact update means that a clear recommendation can be made

[jira] [Commented] (TEZ-2198) Fix sorter spill counts

2015-03-13 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14360170#comment-14360170 ] Gopal V commented on TEZ-2198: -- [~rajesh.balamohan]: The example seems to not match the

[jira] [Updated] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-02-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2076: - Summary: Tez framework to extract/analyze data stored in ATS for specific dag (was: Tez tool to analyze data

[jira] [Commented] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-02-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14335484#comment-14335484 ] Gopal V commented on TEZ-2076: -- {code} scala import java.io._; import java.io._ scala import

[jira] [Commented] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-02-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14335481#comment-14335481 ] Gopal V commented on TEZ-2076: -- [~rajesh.balamohan]: The approach looks good - +1. The exposed

[jira] [Commented] (TEZ-2076) Tez framework to extract/analyze data stored in ATS for specific dag

2015-02-25 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14336999#comment-14336999 ] Gopal V commented on TEZ-2076: -- bq. Essentially, the patch is enabling a library to download

[jira] [Commented] (TEZ-2001) Support pipelined data transfer for ordered output

2015-02-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14335762#comment-14335762 ] Gopal V commented on TEZ-2001: -- The Q17 plan is not IO bound today on the network pipe (i.e all

[jira] [Created] (TEZ-2244) PipelinedSorter: Progressive allocation for sort-buffers

2015-03-27 Thread Gopal V (JIRA)
Gopal V created TEZ-2244: Summary: PipelinedSorter: Progressive allocation for sort-buffers Key: TEZ-2244 URL: https://issues.apache.org/jira/browse/TEZ-2244 Project: Apache Tez Issue Type:

[jira] [Commented] (TEZ-145) Support a combiner processor that can run non-local to map/reduce nodes

2015-03-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367813#comment-14367813 ] Gopal V commented on TEZ-145: - This is a question for [~bikassaha]. There is an combiner edge

[jira] [Commented] (TEZ-145) Support a combiner processor that can run non-local to map/reduce nodes

2015-03-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14367762#comment-14367762 ] Gopal V commented on TEZ-145: - [~ozawa]: the CombineProcessor patch looks good. This will help

[jira] [Commented] (TEZ-2217) The min-held-containers constraint is not enforced during query runtime

2015-03-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14377072#comment-14377072 ] Gopal V commented on TEZ-2217: -- [~bikassaha]: any suggestions on more logging in the code to

[jira] [Commented] (TEZ-2217) The min-held-containers constraint is not enforced during query runtime

2015-03-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14377068#comment-14377068 ] Gopal V commented on TEZ-2217: -- I quickly cross-checked, this - it seems to be still letting go

[jira] [Commented] (TEZ-2217) The min-held-containers constraint is not enforced during query runtime

2015-03-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14377086#comment-14377086 ] Gopal V commented on TEZ-2217: -- Yes, the LOG does not say delay expired or is new. - which

[jira] [Updated] (TEZ-2217) The min-held-containers constraint is not enforced during query runtime

2015-03-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2217: - Attachment: TEZ-2217-debug.txt.bz2 Debug logs attached. {code} $ grep Releasing unused app-log.txt | wc -l 111

[jira] [Commented] (TEZ-2251) Enabling auto reduce parallelism in certain jobs causes DAG to hang

2015-04-01 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14391832#comment-14391832 ] Gopal V commented on TEZ-2251: -- [~rajesh.balamohan]: I think the ideal interpretation of what

[jira] [Updated] (TEZ-2270) TezClient.create() with AM localresources CLASSPATH order (Rolling Upgrade)

2015-04-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2270: - Labels: RollingUpgrade (was: ) TezClient.create() with AM localresources CLASSPATH order (Rolling Upgrade)

[jira] [Created] (TEZ-2270) TezClient.create() with AM localresources CLASSPATH order

2015-04-02 Thread Gopal V (JIRA)
Gopal V created TEZ-2270: Summary: TezClient.create() with AM localresources CLASSPATH order Key: TEZ-2270 URL: https://issues.apache.org/jira/browse/TEZ-2270 Project: Apache Tez Issue Type: Bug

[jira] [Updated] (TEZ-2270) TezClient.create() with AM localresources CLASSPATH order (Rolling Upgrade)

2015-04-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2270: - Summary: TezClient.create() with AM localresources CLASSPATH order (Rolling Upgrade) (was: TezClient.create() with

[jira] [Commented] (TEZ-2270) TezClient.create() with AM localresources CLASSPATH order (Rolling Upgrade)

2015-04-02 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14394029#comment-14394029 ] Gopal V commented on TEZ-2270: -- Filed as a place-holder until more analysis next week.

[jira] [Commented] (TEZ-145) Support a combiner processor that can run non-local to map/reduce nodes

2015-04-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490860#comment-14490860 ] Gopal V commented on TEZ-145: - [~bikassaha]: The figure 7 is identical to the runtime expansion I

[jira] [Comment Edited] (TEZ-145) Support a combiner processor that can run non-local to map/reduce nodes

2015-04-11 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14490860#comment-14490860 ] Gopal V edited comment on TEZ-145 at 4/11/15 8:13 AM: -- [~bikassaha]: The

[jira] [Commented] (TEZ-2358) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510894#comment-14510894 ] Gopal V commented on TEZ-2358: -- [~rajesh.balamohan]: The patch is looking good in tests. The

[jira] [Updated] (TEZ-2363) Counters: off by 1 error for REDUCE_INPUT_GROUPS counter

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2363: - Attachment: TEZ-2363.1.patch Counters: off by 1 error for REDUCE_INPUT_GROUPS counter

[jira] [Commented] (TEZ-2363) Counters: off by 1 error for REDUCE_INPUT_GROUPS counter

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511137#comment-14511137 ] Gopal V commented on TEZ-2363: -- [~zjffdu]/[~rajesh.balamohan]: was a missing TODO: item - can

[jira] [Commented] (TEZ-2363) Counters: off by 1 error for REDUCE_INPUT_GROUPS counter

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511056#comment-14511056 ] Gopal V commented on TEZ-2363: -- Yes, I have a fix - resolving that as duplicate. Counters:

[jira] [Resolved] (TEZ-2208) Counter of REDUCE_INPUT_GROUPS is incorrect

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V resolved TEZ-2208. -- Resolution: Duplicate Counter of REDUCE_INPUT_GROUPS is incorrect ---

[jira] [Commented] (TEZ-2348) EOF exception during UnorderedKVReader.next()

2015-04-21 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14506418#comment-14506418 ] Gopal V commented on TEZ-2348: -- [~rajesh.balamohan]: this needs a different exception with

[jira] [Commented] (TEZ-2363) Counters: off by 1 error for REDUCE_INPUT_GROUPS counter

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14511173#comment-14511173 ] Gopal V commented on TEZ-2363: -- The warning diffs are due to the code shifting lines -

[jira] [Updated] (TEZ-2358) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

2015-04-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2358: - Summary: Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task (was: Fix MergeManager

[jira] [Commented] (TEZ-2358) Fix MergeManager assumptions about 1 merge per source-task

2015-04-23 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14508976#comment-14508976 ] Gopal V commented on TEZ-2358: -- [~rajesh.balamohan]: I have more logs for this issue, but they

[jira] [Updated] (TEZ-2161) Support CRDT aggregation models for counters

2015-04-22 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2161: - Summary: Support CRDT aggregation models for counters (was: Support different aggregation models for counters )

[jira] [Commented] (TEZ-2161) Support CRDT aggregation models for counters

2015-04-22 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14506741#comment-14506741 ] Gopal V commented on TEZ-2161: -- The aggregations can be done consistently if and only if any

[jira] [Commented] (TEZ-2348) EOF exception during UnorderedKVReader.next()

2015-04-24 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14510615#comment-14510615 ] Gopal V commented on TEZ-2348: -- [~sseth]: Admittedly, I'd like it to throw exceptions for all

[jira] [Commented] (TEZ-2383) release sort buffers on close

2015-04-28 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518728#comment-14518728 ] Gopal V commented on TEZ-2383: -- [~rajesh.balamohan]: This is likely to be a fix limited to the

[jira] [Commented] (TEZ-2390) tez-tools swimlane tool fails to parse large jobs 8K containers

2015-04-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14521136#comment-14521136 ] Gopal V commented on TEZ-2390: -- [~jeagles]: the patch looks good, except for the debug print

[jira] [Commented] (TEZ-2358) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

2015-04-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14514797#comment-14514797 ] Gopal V commented on TEZ-2358: -- [~hitesh]: marking this as blocker for 0.7.x, because it causes

[jira] [Updated] (TEZ-2358) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

2015-04-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-2358: - Priority: Blocker (was: Major) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

[jira] [Commented] (TEZ-2358) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

2015-04-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14513650#comment-14513650 ] Gopal V commented on TEZ-2358: -- The patch was again tested at 10Tb scale over the weekend -

[jira] [Commented] (TEZ-2358) Pipelined Shuffle: MergeManager assumptions about 1 merge per source-task

2015-04-27 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14513742#comment-14513742 ] Gopal V commented on TEZ-2358: -- [~rajesh.balamohan]: minor nit on the checkargument (not or

[jira] [Created] (TEZ-2371) Upgrade hive branch to latest Tez

2015-04-27 Thread Gopal V (JIRA)
Gopal V created TEZ-2371: Summary: Upgrade hive branch to latest Tez Key: TEZ-2371 URL: https://issues.apache.org/jira/browse/TEZ-2371 Project: Apache Tez Issue Type: Bug Reporter: Gopal

[jira] [Commented] (TEZ-2390) tez-tools swimlane tool fails to parse large jobs 8K containers

2015-04-30 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14522123#comment-14522123 ] Gopal V commented on TEZ-2390: -- LGTM - +1. tez-tools swimlane tool fails to parse large jobs

[jira] [Commented] (TEZ-2405) PipelinedSorter can throw NPE with custom compartor

2015-05-04 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14526314#comment-14526314 ] Gopal V commented on TEZ-2405: -- [~rajesh.balamohan]: the patch looks good - +1 But the code

[jira] [Commented] (TEZ-2407) Drop references to the old DataInputBuffer impl in PipelinedSorter

2015-05-04 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14527112#comment-14527112 ] Gopal V commented on TEZ-2407: -- This is code-cleanliness refactoring - this does not add

[jira] [Commented] (TEZ-2405) PipelinedSorter can throw NPE with custom compartor

2015-05-04 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14527124#comment-14527124 ] Gopal V commented on TEZ-2405: -- [~hitesh]: nope, this was introduced during 0.7 release cycle

<    1   2   3   4   5   6   >