[ 
https://issues.apache.org/jira/browse/HADOOP-3770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Douglas updated HADOOP-3770:
----------------------------------

    Status: Open  (was: Patch Available)

I haven't been over the details, but had a few general suggestions after a 
first pass:
* Most of this doesn't conform to the coding guidelines. Converting tabs to 
spaces, removing commented-out code, putting constants and defaults in a 
reasonable place, etc. should be done before this can be committed.
* Temporary directories should be configurable and default off of a single, 
configurable temp dir rather than being hard-coded off /tmp
* If independent from the original, the configuration and drivers of gridmix2 
should not be in the same place. If this is intended as a replacement for 
gridmix, it should modify the existing benchmark rather than creating _file_, 
_file2_ pairs. If it's a new benchmark, it should be in src/benchmarks/gridmix2.
* Would it be possible to split the pig benchmarks into a separate JIRA? This 
is simply too large to review well.
* GridMixRunner is unnecessarily enormous. Most of the methods are setting 
defaults and performing work best encapsulated in the \*Creator classes that 
currently do trivial work. This class would also benefit from utility methods 
converting results from Configuration::getStrings to int[] (instead of 
subclassing Configuration), abstracting out the creation of unique Strings for 
runs (the use of Calendar/Date may not be the correct choice), javadoc, and 
general cleanup
* Exceptions are almost always ignored; most probably should not be.

> improve composition, submission and result collection of gridmix
> ----------------------------------------------------------------
>
>                 Key: HADOOP-3770
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3770
>             Project: Hadoop Core
>          Issue Type: Improvement
>            Reporter: Lingyun Yang
>            Assignee: Runping Qi
>         Attachments: patch-3770.txt, patch-3770.txt, patch-3770.v2.txt
>
>
> Current gridmix submits jobs using a set of scripts, which is inconvenient 
> and the results are difficult to collect.  To improve the gridmix submission 
> and results collection, we implemented a new program  using JobControl to 
> submit and collect the results of jobs 
> Also the new gridmix allows to have more different types of jobs such as, pig 
> jobs, jobs with combiners etc. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to