[
https://issues.apache.org/jira/browse/HADOOP-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12532547
]
Tsz Wo (Nicholas), SZE commented on HADOOP-1968:
------------------------------------------------
+1
Codes looks good. Below are some thoughts.
- Since this is a single thread situation, StringBuilder is more efficient than
StringBuffer.
- For this problem, using some parser generators (e.g. yacc) might be better
than Java Regex.
> Wildcard input syntax (glob) should support {}
> ----------------------------------------------
>
> Key: HADOOP-1968
> URL: https://issues.apache.org/jira/browse/HADOOP-1968
> Project: Hadoop
> Issue Type: Improvement
> Affects Versions: 0.14.1
> Reporter: eric baldeschwieler
> Assignee: Hairong Kuang
> Fix For: 0.15.0
>
> Attachments: curlyGlob.patch
>
>
> We have users who have organized data by day and would like to select several
> days in a single input specification. For example they would like to be able
> to say:
> '/data/2007{0830,0831,0901}/typeX/'
> To input 3 days data into map-reduce (or Pig in this case).
> (Also the use of regexp to resolve glob paterns looks like it might introduce
> some other bugs. I'd appreciate it if someone took another look at the code
> to see if there are any file characters that could
> be interpreted as regexp "instructions").
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.