Help with implementing a Storm topology to stream tweets

2015-05-09 Thread mani kandan
Hi I'm new to Storm, and I would like to create a Storm topology to stream tweets, do analysis and store on hdfs. Is there a tutorial to do the exact same thing? How do I start? Thanks for the help. PS: I have already installed Storm in HDP using Ambari. -- Thanks, *Manikandan Ramakrishnan*

Hive LOAD DATA INPATH imports all records twice

2015-05-08 Thread mani kandan
Hi, I'm trying to load JSON data into Hive using hcatalog JsonSerDe. I have created the table, but when I use LOAD DATA INPATH command to load 8 records into the table. However, SELECT * shows 16 records in the table, each record duplicated. Why is this happening? -- Thanks, *Manikandan

Re: Hive JSON Create Table query error: ParseException line 3:14 cannot recognize input near ':' 'string' ',' in column type

2015-05-08 Thread mani kandan
the issue. This should work: CREATE EXTERNAL TABLE twitter_data( userdata struct userlocation:string,id:bigint,name:string,screenname:string, tweetmessage string, createddate string) ROW FORMAT SERDE 'org.apache.hcatalog.JsonSerDe'; Best, Udit On Thu, May 7, 2015 at 6:01 PM, mani

Hive JSON Create Table query error: ParseException line 3:14 cannot recognize input near ':' 'string' ',' in column type

2015-05-07 Thread mani kandan
I'm trying to import a JSON file into a hive table, and trying to execute the below query: CREATE EXTERNAL TABLE twitter_data( userdata struct userlocation:string,id:bigint,name:string,screenname:string, tweetmessage:string, createddate:string) ROW FORMAT SERDE

What is the most interesting problem you came across in your hadoop projects? How did you resolve it?

2015-03-31 Thread mani kandan
-- Thanks, *Manikandan Ramakrishnan*

Planning to propose Hadoop initiative to company. Need some inputs please.

2014-10-01 Thread mani kandan
First off, I'm a mainframe developer, so I don't know much about Java or Web technology. I work at an insurance company as a software developer. We have a website to get quotes, raise and view claims and such. I came to know about Hadoop and that I might be able to ​ leverage it to the advantage

New to hadoop/java. How do I write map reduce programs in Cloudera VM? I can't import org.Apache.* like in tutorials.

2014-08-27 Thread mani kandan
Thanks, Mani

Started learning Hadoop. Which distribution is best for native install in pseudo distributed mode?

2014-08-12 Thread mani kandan
Which distribution are you people using? Cloudera vs Hortonworks vs Biginsights?