First we need to create a table 1. Use XMLInputFormat as InputFormat, Use TextOutputFormat as OutputFormat. 2. Use RegexSerDe: http://wiki.apache.org/hadoop/Hive/GettingStarted#Apache_Weblog_Data 3. See http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL for create table syntax.
Then we need to load the data: 4. See http://wiki.apache.org/hadoop/Hive/LanguageManual/DML for load data command syntax. Zheng On Wed, Jan 20, 2010 at 2:00 PM, ankit bhatnagar <abhatna...@gmail.com> wrote: > Mahout project has the XMLInputFormat. > > Could you please brief me how would I load data into hive > > > > Ankit > -- Yours, Zheng