Update to my previous request:
I was able to handle the delimiter by using '\u00FE'.
However, I now have another issue: when I load my time field as a datetime,
I get empty fields. My time strings look like this:
05-01-2014-00:43:52
data = LOAD '/home/hadoop/data/test.log' USING
org.apache.pig.piggybank.storage.CSVExcelStorage('\u00FE', 'YES_MULTILINE',
'NOCHANGE', 'SKIP_INPUT_HEADER') AS (time:datetime, userid:chararray,
visit:int);
Thanks in advance,
Hanif
On Monday, August 18, 2014 2:34 PM, hanif mahboobi wrote:
Hi All,
I want to load data from a file that uses the Latin small letter thorn 'þ' as
its delimiter. However, Pig does not detect this separator. In Hive I could
solve this issue by using '\-2', but that did not work in Pig.
data = LOAD '/home/hadoop/data/test.log' USING
org.apache.pig.piggybank.storage.CSVExcelStorage('þ', 'YES_MULTILINE',
'NOCHANGE', 'SKIP_INPUT_HEADER') AS (time:chararray, userid:chararray,
visit:int);
Thanks for your help,
Hanif