[ https://issues.apache.org/jira/browse/TEZ-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14196439#comment-14196439 ]
Rajesh Balamohan edited comment on TEZ-1738 at 11/4/14 5:46 PM: ---------------------------------------------------------------- [~gopalv], [~sseth] - Can you please review? was (Author: rajesh.balamohan): [~gopalv], [~sseth] - Please reivew. > tez tfile parser for log parsing > -------------------------------- > > Key: TEZ-1738 > URL: https://issues.apache.org/jira/browse/TEZ-1738 > Project: Apache Tez > Issue Type: Bug > Reporter: Rajesh Balamohan > Assignee: Rajesh Balamohan > Attachments: TEZ-1738.1.patch > > > It can be time consuming to download logs via "yarn logs -applicationId > <appId> | grep something". Also mining large volumes of logs can be time > consuming on single node. > A simple pigloader would be useful to have in tez-tools which can parse > TFiles and provide line by line format (tuple of (machine, key, line)) for > distributed processing of logs. -- This message was sent by Atlassian JIRA (v6.3.4#6332)