[ 
https://issues.apache.org/jira/browse/PIG-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720259#action_12720259
 ] 

Alan Gates commented on PIG-842:
--------------------------------

I'm concerned about the performance hit of supporting multi-byte comparators.  
Before we commit to doing this in PigStorage, we should test how much it slows 
down reading data.  If it is significant, we should consider having a 
PigMultiByteStorage or something that handles multi-byte delimiter characters.  
It could extend PigStorage and only differ in how it parses the records.

> PigStorage should support multi-byte delimiters
> -----------------------------------------------
>
>                 Key: PIG-842
>                 URL: https://issues.apache.org/jira/browse/PIG-842
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>    Affects Versions: 0.3.0
>            Reporter: Santhosh Srinivasan
>             Fix For: 0.3.0
>
>
> Currently, PigStorage supports single byte delimiters. Users have requested 
> mult-byte delimiters. There are performance implications with multi-byte 
> delimiters. i.e., instead of looking for a single byte, PigStorage should 
> look for a pattern ala BinStorage.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to