[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2014-09-03 Thread Allen Wittenauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated MAPREDUCE-4087:

Fix Version/s: (was: 2.1.0-beta)
               (was: 3.0.0)

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Fix For: 1.1.0
>
> Attachments: 4087.patch, 4087.trunk.patch
>
> In the map() method of Gridmix's GenerateDistCacheData job, val.setSize() is
> called on every iteration with a size based on the bytes still to be written
> to the current distributed cache file. When the same map task goes on to
> write data to the next distributed cache file, the size of the random data
> generated in each iteration can become small in some cases. This can make
> distributed cache data generation slow.
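
The slowdown described in the issue can be illustrated with a small standalone sketch. It is not the attached patch and uses no Hadoop classes; the class name, the method names, the 64 KB buffer size, and the file sizes are all assumptions chosen for illustration. The first method models a chunk size that is derived from the previous chunk size, so it can only shrink once the tail of a file has been written. The second models a chunk size that is recomputed from the full buffer size on every iteration.

```java
// Standalone illustration only; every name and constant here is hypothetical.
final class ChunkSizeSketch {
  // Assumed capacity of the reusable buffer.
  static final int BUFFER_SIZE = 64 * 1024;

  /** Counts writes when the chunk size carries over from one iteration to the next. */
  static long writesWhenSizeIsSticky(long[] fileSizes) {
    int size = BUFFER_SIZE; // shared across files, like a reused value object
    long writes = 0;
    for (long fileSize : fileSizes) {
      long remaining = fileSize;
      while (remaining > 0) {
        // The new size is bounded by the old size, so it never grows back.
        size = (int) Math.min(size, remaining);
        remaining -= size;
        writes++;
      }
    }
    return writes;
  }

  /** Counts writes when the chunk size is recomputed from the full buffer size. */
  static long writesWhenSizeIsReset(long[] fileSizes) {
    long writes = 0;
    for (long fileSize : fileSizes) {
      long remaining = fileSize;
      while (remaining > 0) {
        int size = (int) Math.min(BUFFER_SIZE, remaining);
        remaining -= size;
        writes++;
      }
    }
    return writes;
  }

  public static void main(String[] args) {
    // First file ends with a 1-byte tail; second file is 1 MB.
    long[] fileSizes = {BUFFER_SIZE + 1, 1024 * 1024};
    System.out.println("sticky size: " + writesWhenSizeIsSticky(fileSizes) + " writes");
    System.out.println("reset size:  " + writesWhenSizeIsReset(fileSizes) + " writes");
  }
}
```

With these assumed inputs, the 1-byte tail of the first file leaves the sticky variant writing the second file one byte at a time (1048578 writes in total), while the reset variant needs 18 writes.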



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2013-03-26 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated MAPREDUCE-4087:

Fix Version/s: 2.0.5-beta

I merged this to branch-2.

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Fix For: 1.1.0, 3.0.0, 2.0.5-beta
>
> Attachments: 4087.patch, 4087.trunk.patch

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-08-23 Thread Matt Foley (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Foley updated MAPREDUCE-4087:

Fix Version/s: 3.0.0
               1.1.0

Based on Ravi's comment of 31/Mar/12 09:29 ("I just committed this to trunk
and branch-1."), marking this fixed in 3.0.0 and 1.1.0.

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Fix For: 1.1.0, 3.0.0
>
> Attachments: 4087.patch, 4087.trunk.patch

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-03-31 Thread Ravi Gummadi (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-4087:


  Resolution: Fixed
Release Note: Fixes the issue of GenerateDistCacheData job slowness.
              (was: Fixes the issue of Generate Dist Cache Data generation job slowness.)
      Status: Resolved (was: Patch Available)

Since this fix improves the runtime of the GenerateDistCacheData job, adding a
unit test for it does not seem simple, so I am not adding one.

Gridmix unit tests passed on my local machine.

I just committed this to trunk and branch-1.

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Attachments: 4087.patch, 4087.trunk.patch





[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-03-30 Thread Ravi Gummadi (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-4087:


Hadoop Flags: Reviewed
      Status: Patch Available (was: Open)

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Attachments: 4087.patch, 4087.trunk.patch





[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-03-30 Thread Ravi Gummadi (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-4087:


Attachment: 4087.trunk.patch

Attaching patch for trunk.

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Attachments: 4087.patch, 4087.trunk.patch





[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-03-30 Thread Ravi Gummadi (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-4087:


Status: Open  (was: Patch Available)

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Attachments: 4087.patch, 4087.trunk.patch





[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-03-30 Thread Ravi Gummadi (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-4087:


Release Note: Fixes the issue of Generate Dist Cache Data generation job slowness.
      Status: Patch Available (was: Open)

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Attachments: 4087.patch





[jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases

2012-03-30 Thread Ravi Gummadi (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-4087:


Attachment: 4087.patch

Attaching patch with the fix.

> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
>
> Key: MAPREDUCE-4087
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
> Attachments: 4087.patch
