[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-4087:
    Fix Version/s:     (was: 2.1.0-beta)
                       (was: 3.0.0)

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
>                 Key: MAPREDUCE-4087
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>             Fix For: 1.1.0
>
>         Attachments: 4087.patch, 4087.trunk.patch
>
>
> In map() method of GenerateDistCacheData job of Gridmix, val.setSize() is
> done every time based on the bytes to be written to a distributed cache file.
> When we try to write data to next distributed cache file in the same map
> task, the size of random data generated in each iteration can become small
> based on the particular case. This can make this dist cache data generation
> slow.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
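The slowdown described in the issue can be illustrated with a small stand-alone simulation. This is only a sketch under stated assumptions, not the Gridmix source or the committed patch: the class name, the method names, and the 8 KiB buffer size are all hypothetical, and a plain int stands in for the logical size of the reused buffer. It compares a loop that shrinks the shared size in place (as the report says val.setSize() does) with one that keeps the per-write size in a local variable, which is one possible way to avoid the problem.

```java
// Hypothetical stand-alone simulation; names and sizes are assumptions,
// and this is not the actual Gridmix code.
class DistCacheWriteSketch {
    // Assumed size of the reusable random-data buffer.
    static final int BUFFER_SIZE = 8 * 1024;

    /**
     * Pattern described in the report: the buffer's logical size is shrunk
     * in place and is never restored before the next file is written.
     * Returns the number of write calls needed for all files.
     */
    static long writesWithShrinkingBuffer(long[] fileSizes) {
        int logicalSize = BUFFER_SIZE; // persists across files, like a reused value object
        long writes = 0;
        for (long fileSize : fileSizes) {
            long remaining = fileSize;
            while (remaining > 0) {
                // Stands in for setting the buffer size from the bytes left to write.
                logicalSize = (int) Math.min(logicalSize, remaining);
                remaining -= logicalSize;
                writes++;
            }
        }
        return writes;
    }

    /**
     * Alternative: compute the size of each write in a local variable so the
     * buffer keeps its full capacity for the next file.
     */
    static long writesWithLocalSize(long[] fileSizes) {
        long writes = 0;
        for (long fileSize : fileSizes) {
            long remaining = fileSize;
            while (remaining > 0) {
                int size = (int) Math.min(BUFFER_SIZE, remaining);
                remaining -= size;
                writes++;
            }
        }
        return writes;
    }

    public static void main(String[] args) {
        // The first file leaves a 1-byte tail; the second file then inherits that size.
        long[] fileSizes = {BUFFER_SIZE + 1L, BUFFER_SIZE};
        System.out.println(writesWithShrinkingBuffer(fileSizes)); // prints 8194
        System.out.println(writesWithLocalSize(fileSizes));       // prints 3
    }
}
```

In this simulation the first file ends with a 1-byte write, so the second file is written one byte at a time when the size is shared, which matches the "size of random data generated in each iteration can become small" behaviour in the description.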
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Graves updated MAPREDUCE-4087:
    Fix Version/s: 2.0.5-beta

I merged this to branch-2
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Foley updated MAPREDUCE-4087:
    Fix Version/s: 3.0.0
                   1.1.0

Based on @Ravi: 31/Mar/12 09:29 "I just committed this to trunk and branch-1."
Marking this fixed in 3.0.0 and 1.1.0.
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
      Resolution: Fixed
    Release Note: Fixes the issue of GenerateDistCacheData job slowness.  (was: Fixes the issue of Generate Dist Cache Data generation job slowness.)
          Status: Resolved  (was: Patch Available)

As this fix improves runtime of GenerateDistCacheData job, adding unit test for this seems to be not simple. So not adding unit test.
Gridmix unit tests passed on my local machine.
I just committed this to trunk and branch-1.
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
    Hadoop Flags: Reviewed
          Status: Patch Available  (was: Open)
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
    Attachment: 4087.trunk.patch

Attaching patch for trunk.
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
    Status: Open  (was: Patch Available)
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
    Release Note: Fixes the issue of Generate Dist Cache Data generation job slowness.
          Status: Patch Available  (was: Open)
[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
[ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
    Attachment: 4087.patch

Attaching patch with the fix.