Ramon,

If all the data is in one line, then you would need to preprocess the data,
but from your explanation below it sounds like the lines terminated by a
newline character after the | ?

Thanks,Gireesh
vivarasystems.com

On Wed, May 30, 2012 at 9:27 AM, <[email protected]> wrote:

>  Hi,
>
>
>
>    I’m trying to define a table over an external file. My file has 12
> fixed columns followed by a varying amount of columns that depends on some
> of the fixed ones. I tried to define the table as:
>
>
>
> CREATE EXTERNAL TABLE IF NOT EXISTS log_array (
>
> dt              string,
>
> txOperOpciResto string,
>
> idRegPerf       string,
>
> oper            string,
>
> opcion          string,
>
> accion          string,
>
> servc           string,
>
> canal           string,
>
> platf           string,
>
> codIdioma       string,
>
> pais            string,
>
> lacre           string,
>
> dirIP           string,
>
> restoMsg        array<string>
>
> )
>
> ROW FORMAT DELIMITED
>
>   FIELDS TERMINATED BY '|'
>
>   COLLECTION ITEMS TERMINATED BY '|'
>
> STORED AS SEQUENCEFILE
>
> LOCATION '/user/hadoop-user/uc3/seq/';
>
>
>
> So what I tried was to get all varing part on an array field (restoMsg).
> The trick is not working because both delimiters, fields and collections,
> are the same. My restoMsg field only gets one column and the rest are
> omitted.
>
>
>
> Is there any way to get that last part without custom code? If not, what
> classes should I create to this and how can I define the table then?
>
>
>
> Thx,
>
>    Ramón Pin
>
>
>
> ------------------------------
> Subject to local law, communications with Accenture and its affiliates
> including telephone calls and emails (including content), may be monitored
> by our systems for the purposes of security and the assessment of internal
> compliance with Accenture policy.
>
> ______________________________________________________________________________________
>
> www.accenture.com
>

Reply via email to