[ 
https://issues.apache.org/jira/browse/CRUNCH-97?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13530269#comment-13530269
 ] 

Gabriel Reid commented on CRUNCH-97:
------------------------------------

I don't have any direct use or need for it right now, but I do have the feeling 
that something like this would be a really useful addition to Crunch, so I think 
it would be a shame to close this out now. I'm pretty indifferent about the 
Scanner vs Tokenizer discussion, as I don't have much context to base an 
opinion on for now.

In any case, even if this would primarily be a help for prototyping, I think 
that is more than enough reason to include it. It's a similar situation to 
reflection-based Avro -- it might not be what you want to use in production, 
but it's incredibly useful for quick iterations in development. So I'm very 
much in favour of adding this (in one of its incarnations) to Crunch.

                
> Add helpers for parsing PCollection<String> instances
> -----------------------------------------------------
>
>                 Key: CRUNCH-97
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-97
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Josh Wills
>            Assignee: Josh Wills
>             Fix For: 0.5.0
>
>         Attachments: CRUNCH-97.patch, CRUNCH-97-take2.patch, 
> CRUNCH-97-Tokenizer-v1.patch, CRUNCH-97v3.patch, CRUNCH-97v4.patch
>
>
> We should make it a bit easier to parse delimited text files into specific 
> data types (e.g., ints, floats) or combinations of types, such as pairs 
> of strings and ints or a Tuple3 of booleans.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
