Update to my previous request:

I was able to solve that by using '\u00FE' as the delimiter.

But I have another issue. When I try to load my time field as the datetime 
type, I get empty fields. My time strings look like this:
05-01-2014-00:43:52


data = LOAD '/home/hadoop/data/test.log' USING 
org.apache.pig.piggybank.storage.CSVExcelStorage('\u00FE', 'YES_MULTILINE', 
'NOCHANGE', 'SKIP_INPUT_HEADER') AS (time:datetime, userid:chararray, 
visit:int);
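
One workaround I am considering, as a rough sketch only: as far as I understand, 
Pig's implicit cast to datetime expects ISO 8601 style strings, so a value like 
the one above would come back empty. Loading the field as chararray and 
converting it with the built-in ToDate function and an explicit pattern might 
avoid that. The relation name "raw" and the day-month-year order in the pattern 
are assumptions, not something confirmed by the data.

```pig
-- Sketch only: load the timestamp as text first, then convert it explicitly.
raw = LOAD '/home/hadoop/data/test.log' USING
    org.apache.pig.piggybank.storage.CSVExcelStorage('\u00FE', 'YES_MULTILINE',
    'NOCHANGE', 'SKIP_INPUT_HEADER')
    AS (time:chararray, userid:chararray, visit:int);

-- Assumption: 05-01-2014 means day-month-year ('dd-MM-yyyy').
-- If the month comes first, use 'MM-dd-yyyy-HH:mm:ss' instead.
data = FOREACH raw GENERATE
    ToDate(time, 'dd-MM-yyyy-HH:mm:ss') AS time,
    userid,
    visit;
```

Running DESCRIBE data; afterwards should show whether the time column now has 
the datetime type.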


Thanks in advance,
Hanif
On Monday, August 18, 2014 2:34 PM, hanif mahboobi 
<[email protected]> wrote:
 


Hi All,

I want to load data from a file which uses the Latin small letter thorn 'þ' as 
the delimiter. However, Pig cannot detect this separator. In Hive I could solve 
this issue using '\-2', but that did not work for Pig.

data = LOAD '/home/hadoop/data/test.log' USING 
org.apache.pig.piggybank.storage.CSVExcelStorage('þ', 'YES_MULTILINE', 
'NOCHANGE', 'SKIP_INPUT_HEADER') AS (time:chararray, userid:chararray, 
visit:int);

Thanks for your help,
Hanif
