[ 
https://issues.apache.org/jira/browse/PIG-1271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1271:
----------------------------

    Description: 
With [PIG-613|https://issues.apache.org/jira/browse/PIG-613], we are able to 
load text files containing complex data types (map/bag/tuple) according to a 
schema. However, the format of a complex data field is very strict. Users have 
to use predetermined special characters to mark the beginning and end of each 
field, and those special characters cannot be used in the content. The goals 
of this issue are:

1. Provide a way for users to escape special characters
2. Make it easy for users to customize Utf8StorageConverter when they have 
their own data format
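
To illustrate goal 1, here is a minimal sketch of escape-aware field 
splitting. It is not Pig code: the class EscapeAwareSplitter, its split 
method, and the choice of backslash as the escape character are all 
assumptions made for this example, and it handles only one nesting level.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Pig: splits one level of a delimited
// field while honoring an escape character.
final class EscapeAwareSplitter {
    private EscapeAwareSplitter() {}

    /**
     * Splits input on delimiter. An occurrence of escape makes the next
     * character literal, so an escaped delimiter stays inside its field.
     */
    static List<String> split(String input, char delimiter, char escape) {
        List<String> fields = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean escaped = false;
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (escaped) {
                current.append(c);      // take the character literally
                escaped = false;
            } else if (c == escape) {
                escaped = true;         // drop the escape, keep what follows
            } else if (c == delimiter) {
                fields.add(current.toString());
                current.setLength(0);   // start the next field
            } else {
                current.append(c);
            }
        }
        if (escaped) {
            current.append(escape);     // a trailing escape is kept as-is
        }
        fields.add(current.toString());
        return fields;
    }
}
```

With these assumptions, calling EscapeAwareSplitter.split on the text 
a\,b,c with ',' as the delimiter and '\' as the escape yields two fields, 
"a,b" and "c", so the delimiter can appear inside the content.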

This is a candidate project for Google Summer of Code 2012. More information 
about the program can be found at 
https://cwiki.apache.org/confluence/display/PIG/GSoc2012

  was:
With [PIG-613|https://issues.apache.org/jira/browse/PIG-613], we are able to 
load text files containing complex data types (map/bag/tuple) according to a 
schema. However, the format of a complex data field is very strict. Users have 
to use predetermined special characters to mark the beginning and end of each 
field, and those special characters cannot be used in the content. The goals 
of this issue are:

1. Provide a way for users to escape special characters
2. Make it easy for users to customize Utf8StorageConverter when they have 
their own data format



    
> Provide a more flexible data format to load complex field (bag/tuple/map) in 
> PigStorage
> ---------------------------------------------------------------------------------------
>
>                 Key: PIG-1271
>                 URL: https://issues.apache.org/jira/browse/PIG-1271
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.7.0
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>              Labels: gsoc2012
>
> With [PIG-613|https://issues.apache.org/jira/browse/PIG-613], we are able to 
> load text files containing complex data types (map/bag/tuple) according to a 
> schema. However, the format of a complex data field is very strict. Users 
> have to use predetermined special characters to mark the beginning and end 
> of each field, and those special characters cannot be used in the content. 
> The goals of this issue are:
> 1. Provide a way for users to escape special characters
> 2. Make it easy for users to customize Utf8StorageConverter when they have 
> their own data format
> This is a candidate project for Google Summer of Code 2012. More information 
> about the program can be found at 
> https://cwiki.apache.org/confluence/display/PIG/GSoc2012

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
