[ https://issues.apache.org/jira/browse/TEZ-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14199724#comment-14199724 ]
Gopal V commented on TEZ-1738: ------------------------------ As an added note, if this proves to be widely used, we need to move this to either YARN or PIG, but for now this is a lodged here as the Tez log analytics scripts depend on this. > tez tfile parser for log parsing > -------------------------------- > > Key: TEZ-1738 > URL: https://issues.apache.org/jira/browse/TEZ-1738 > Project: Apache Tez > Issue Type: Bug > Reporter: Rajesh Balamohan > Assignee: Rajesh Balamohan > Attachments: TEZ-1738.1.patch, TEZ-1738.2.patch > > > It can be time consuming to download logs via "yarn logs -applicationId > <appId> | grep something". Also mining large volumes of logs can be time > consuming on single node. > A simple pigloader would be useful to have in tez-tools which can parse > TFiles and provide line by line format (tuple of (machine, key, line)) for > distributed processing of logs. -- This message was sent by Atlassian JIRA (v6.3.4#6332)