longterm plan for MRUNIT development?
-------------------------------------

                 Key: MRUNIT-69
                 URL: https://issues.apache.org/jira/browse/MRUNIT-69
             Project: MRUnit
          Issue Type: Brainstorming
            Reporter: Jim Donofrio
            Priority: Minor


So I am curious what the plan is for the longterm future of MRUNIT?

I think currently MRUNIT is useful for just unit testing a single mapper or 
reducer but currently there is a void for testing more complicated features 
such as MultipleInputs, MultipleOutputs, a driver class, counters, among other 
things. I wonder if instead of adding support to the current MRUNIT framework 
for these extra features it would more useful to add in hooks to the existing 
LocalJobRunner and MiniMRCluster classes to provide methods to more easily 
verify file output from text files, sequence files, etc. This would allow 
MRUNIT to test driver classes, MultipleInputs, MultipleOutputs, etc. MRUNIT 
would also then test against the real hadoop code instead of an implementation 
that mimics hadoop which can miss some bugs such as the ReduceDriver that did 
not reuse the same object until 0.8.0. MRUNIT would also keep up with new map 
reduce features instead of us having to implement fake versions of them

I understand that performance would be an issue due to the file I/O but I 
wonder how fast the LocalJobRunner would be if we wrote a new class that 
extending FileSystem to allow users to write out fake files to memory and make 
the LocalJobRunner read from them

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to