[
https://issues.apache.org/jira/browse/HIVE-4531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13768648#comment-13768648
]
Eugene Koifman commented on HIVE-4531:
--------------------------------------
Review Comments:
Should this include e2e tests in addition (or instead of unit tests). If (when
:) Hadoop changes the log file format this will break, but Unit tests won't
catch this since the data that the tests parse is "static".
Here is a bunch of little things/nits:
o Server.java has “ if (enablelog == true &&
!TempletonUtils.isset(statusdir)) throw new BadParam("enablelog is only
applicable when statusdir is set");” in 4 different places. Can this be a
method?
o What is the purpose of Server#misc()?
o TempletonControllerJob: import org.apache.hive.hcatalog.templeton.Main; -
unused import
oo Line 173 - indentation is off
oo Line 295 - writer.close() - This writer is connected to System.err. What
are the implications of closing this? What if something tries to write to it
later?
o TempletonUtils has unused imports - checkstyle needs to be run on the whole
patch.
o TestJobIDParser mixes JUnit3 and JUnit4. It should either not extend
TestCase (I vote for this) or not use @Test annotations
o Can JobIDParser (and all subclasses) be made package scoped since they are
not used outside templeton pacakge? Similarly, can methods be made as private
as possible?
o JobIDParser#parseJobID() has “fname” param which is not used. What is the
intent? Should it be used in openStatusFile() call? If not, better to remove
it.
o JobIDParser#openStatusFile() creas a Reader. Where/when is it being closed?
o Could the 2 member variables in JobIDParser be made private (even final)?
o Why is TestJobIDParser using findJobID() directly? Could it not use
parseJobID()?
o Can JobIDParser have 1 line of class level javadoc about the purpose of this
class?
> [WebHCat] Collecting task logs to hdfs
> --------------------------------------
>
> Key: HIVE-4531
> URL: https://issues.apache.org/jira/browse/HIVE-4531
> Project: Hive
> Issue Type: New Feature
> Components: HCatalog
> Reporter: Daniel Dai
> Assignee: Daniel Dai
> Fix For: 0.12.0
>
> Attachments: HIVE-4531-1.patch, HIVE-4531-2.patch, HIVE-4531-3.patch,
> HIVE-4531-4.patch, HIVE-4531-5.patch, HIVE-4531-6.patch, HIVE-4531-7.patch,
> HIVE-4531-8.patch, samplestatusdirwithlist.tar.gz
>
>
> It would be nice we collect task logs after job finish. This is similar to
> what Amazon EMR does.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira