[
https://issues.apache.org/jira/browse/CRUNCH-408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Wills updated CRUNCH-408:
------------------------------
Attachment: CRUNCH-408b.patch
[~stepinto] had to tweak this to make it work on hadoop2.
> HFileSource does not estimate the size of input correctly when there is a
> wildcard in path
> ------------------------------------------------------------------------------------------
>
> Key: CRUNCH-408
> URL: https://issues.apache.org/jira/browse/CRUNCH-408
> Project: Crunch
> Issue Type: Bug
> Affects Versions: 0.8.2, 0.10.0
> Reporter: Chao Shi
> Fix For: 0.10.0, 0.8.3
>
> Attachments: CRUNCH-408b.patch, crunch-408.patch
>
>
> The cause is that it calls FileSystem#listStatus rather than
> FileSystem#globStatus to retrieve the list of files under the given path. So
> the fix is straight forward.
--
This message was sent by Atlassian JIRA
(v6.2#6252)