Re: how to load data

Kumar Jayapal Fri, 01 May 2015 06:06:03 -0700

106,"2003-02-03",20,2,"A","2","2","037"
106,"2003-02-03",20,3,"A","2","2","037"
106,"2003-02-03",8,2,"A","2","2","037"











Thanks
Jay

On Fri, May 1, 2015 at 12:10 AM, Nitin Pawar <nitinpawar...@gmail.com>
wrote:

> Jay can you give first 3 lines of your gz file
>
> On Fri, May 1, 2015 at 10:53 AM, Kumar Jayapal <kjayapa...@gmail.com>
> wrote:
>
>> Alex,
>>
>>
>> I followed the same steps as mentioned in the site. Once I load data into
>> table which is create below
>>
>>
>>
>> Created table  CREATE TABLE raw (line STRING) PARTITIONED BY
>> (FISCAL_YEAR  smallint, FISCAL_PERIOD smallint)
>> STORED AS TEXTFILE;
>>
>> and loaded it with data.
>>
>> LOAD DATA LOCAL INPATH '/tmp/weblogs/20090603-access.log.gz' INTO TABLE
>> raw;
>>
>>
>>
>> when I say select * from raw it shows all null values.
>>
>>
>> NULLNULLNULLNULLNULLNULLNULLNULL
>> NULLNULLNULLNULLNULLNULLNULLNULL
>> NULLNULLNULLNULLNULLNULLNULLNULL
>> NULLNULLNULLNULLNULLNULLNULLNULL
>> Why is not show showing the actual data in file. will it show once I load
>> it to parque table?
>>
>> Please let me know if I am doing anything wrong.
>>
>> I appreciate your help.
>>
>>
>> Thanks
>> jay
>>
>>
>>
>> Thank you very much for you help Alex,
>>
>>
>> On Wed, Apr 29, 2015 at 3:43 PM, Alexander Pivovarov <
>> apivova...@gmail.com> wrote:
>>
>>> 1. Create external textfile hive table pointing to /extract/DBCLOC and
>>> specify CSVSerde
>>>
>>> if using hive-0.14 and newer use this
>>> https://cwiki.apache.org/confluence/display/Hive/CSV+Serde
>>> if hive-0.13 and older use https://github.com/ogrodnek/csv-serde
>>>
>>> You do not even need to unzgip the file. hive automatically unzgip data
>>> on select.
>>>
>>> 2. run simple query to load data
>>> insert overwrite table <orc_table>
>>> select * from <csv_table>
>>>
>>> On Wed, Apr 29, 2015 at 3:26 PM, Kumar Jayapal <kjayapa...@gmail.com>
>>> wrote:
>>>
>>>> Hello All,
>>>>
>>>>
>>>> I have this table
>>>>
>>>>
>>>> CREATE  TABLE DBCLOC(
>>>>    BLwhse int COMMENT 'DECIMAL(5,0) Whse',
>>>>    BLsdat string COMMENT 'DATE Sales Date',
>>>>    BLreg_num smallint COMMENT 'DECIMAL(3,0) Reg#',
>>>>    BLtrn_num int COMMENT 'DECIMAL(5,0) Trn#',
>>>>    BLscnr string COMMENT 'CHAR(1) Scenario',
>>>>    BLareq string COMMENT 'CHAR(1) Act Requested',
>>>>    BLatak string COMMENT 'CHAR(1) Act Taken',
>>>>    BLmsgc string COMMENT 'CHAR(3) Msg Code')
>>>> PARTITIONED BY (FSCAL_YEAR  smallint, FSCAL_PERIOD smallint)
>>>> STORED AS PARQUET;
>>>>
>>>> have to load from hdfs location  /extract/DBCLOC/DBCL0301P.csv.gz to
>>>> the table above
>>>>
>>>>
>>>> Can any one tell me what is the most efficient way of doing it.
>>>>
>>>>
>>>> Thanks
>>>> Jay
>>>>
>>>
>>>
>>
>
>
> --
> Nitin Pawar
>

Re: how to load data

Reply via email to