Thanks Sean, that is exactly what I want.
On Mon, Sep 30, 2013 at 3:09 PM, Sean Busbey wrote:
> S,
>
> Check out these presentations from Data Science Maryland back in May[1].
>
> 1. working with Tweets in Hive:
>
>
> http://www.slideshare.net/JoeyEcheverria/analyzing-twitter-data-with-hadoop-2
S,
Check out these presentations from Data Science Maryland back in May[1].
1. working with Tweets in Hive:
http://www.slideshare.net/JoeyEcheverria/analyzing-twitter-data-with-hadoop-20929978
2. then pulling stuff out of Hive to use with Mahout:
http://files.meetup.com/6195792/Working%20With%
Hi Nitin,
No offense taken. Thank you for your response. Part of this is also trying
to find the right tool for the job.
I am running queries to select the cuts of tweets that I want, then doing
some modest normalization (through a Python script), and then I want to
create SequenceFiles from that
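
For illustration, a minimal sketch of the kind of normalization script described above. It assumes the script runs as a streaming step (for example via Hive's TRANSFORM clause), reading one tweet per line on stdin and writing one normalized tweet per line on stdout. The name normalize_tweet and the rules it applies (lowercasing, collapsing whitespace) are hypothetical and not taken from the poster's actual script.

```python
import sys


def normalize_tweet(text):
    """Lowercase a tweet and collapse whitespace so it stays on one line.

    Hypothetical rules, chosen only for illustration.
    """
    # split() with no arguments splits on any run of whitespace, including
    # tabs and newlines, which would otherwise break one-record-per-line output.
    return " ".join(text.lower().split())


def main(stdin=sys.stdin, stdout=sys.stdout):
    # Streaming contract: one input row per line, one output row per line.
    for line in stdin:
        cleaned = normalize_tweet(line)
        if cleaned:  # skip rows that are empty after normalization
            stdout.write(cleaned + "\n")


if __name__ == "__main__":
    main()
```

Keeping each tweet on a single line matters here because the downstream step treats every output line as its own record.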
Are you using Hive just to convert your text files to sequence files?
If that's the case, then you may want to look at the purpose for which Hive
was developed.
If you want to modify data or process data which does not involve any kind
of analytics functions on a routine basis.
If you want to do a data m
Hi,
I have a lot of tweets saved as text. I created an external table on top of
it to access it as textfile. I need to convert these to sequencefiles with
each tweet as its own record. To do this, I created another table as a
sequencefile table like so -
CREATE EXTERNAL TABLE tweetseq(
tweet ST