[ https://issues.apache.org/jira/browse/PIG-2924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13459167#comment-13459167 ]
Bill Graham commented on PIG-2924:
----------------------------------

We ran into similar issues with HCatalog and reducer estimation (PIG-2573, PIG-2574), since an HDFS path was assumed. For this issue we could register different classes that know how to look up (or not support) stats based on the URI prefix of the data location (hdfs, hbase, s3, etc).

> PigStats should not be assuming all Storage classes to be file-based storage
> ----------------------------------------------------------------------------
>
>                 Key: PIG-2924
>                 URL: https://issues.apache.org/jira/browse/PIG-2924
>             Project: Pig
>          Issue Type: Bug
>          Components: tools
>    Affects Versions: 0.9.2
>            Reporter: Harsh J
>
> Using PigStatsUtil (like Oozie does) to collect JobStats for jobs that use a
> HBaseStorage blows up when the stats are asked to be accumulated.
> This is because JobStats (which adds stuff up) is assuming all storages are
> file based and that it can do listStatus/etc. operations on their
> filespec-provided filename. For HBaseStorage, this is set to the tablename
> and there's no such file, leading to an exception (FileNotFound or Invalid
> URI - depending on using 'tablename' or 'hbase://tablename').
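
For illustration only, the per-prefix lookup suggested in the comment could be sketched roughly as below. Every name here (StorageStatsRegistry, StatsProvider, sizeInBytes, schemeOf) is hypothetical and not part of Pig. The sketch assumes the data location is a string whose URI scheme identifies the storage, and that "not supported" can be modelled as an empty result rather than an exception.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.OptionalLong;

// Hypothetical sketch: none of these types exist in Pig.
final class StorageStatsRegistry {

    /** Returns the size of a location in bytes, or empty if stats are unsupported. */
    interface StatsProvider {
        OptionalLong sizeInBytes(String location);
    }

    private final Map<String, StatsProvider> providersByScheme = new HashMap<>();
    private final StatsProvider fallback;

    /** The fallback handles locations with no scheme or an unregistered scheme. */
    StorageStatsRegistry(StatsProvider fallback) {
        this.fallback = fallback;
    }

    /** Associates a URI scheme such as "hdfs" or "hbase" with a provider. */
    void register(String scheme, StatsProvider provider) {
        providersByScheme.put(scheme.toLowerCase(Locale.ROOT), provider);
    }

    /** Dispatches to the provider registered for the location's scheme. */
    OptionalLong sizeInBytes(String location) {
        String scheme = schemeOf(location);
        StatsProvider provider = (scheme == null)
                ? fallback
                : providersByScheme.getOrDefault(scheme, fallback);
        return provider.sizeInBytes(location);
    }

    /** Extracts the lower-cased URI scheme, or null if absent or unparsable. */
    static String schemeOf(String location) {
        try {
            String scheme = new URI(location).getScheme();
            return scheme == null ? null : scheme.toLowerCase(Locale.ROOT);
        } catch (URISyntaxException e) {
            return null;
        }
    }
}
```

With a registry like this, a file-based provider could be registered for hdfs and a provider that always returns empty for hbase, so the stats code skips the size instead of throwing. A bare 'tablename' carries no scheme and would still reach the fallback, so this approach only helps when the location includes a prefix such as 'hbase://'.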