Hi,
I want to save the reducers' output like other files in Hadoop. Does the NameNode
keep any information about it? How can I do this?
Or can I add a new component to Hadoop, like the NameNode, and make the JobTracker
consult it too (I mean I want the JobTracker to consult the NameNode
AND myNewCo
neo21 zerro,
Hadoop may or may not be able to help you based on your problem
specification. Based on your statement "processes big data files" it
should be able to help. On that point, I am unclear why HDFS is an
issue. As for queues and databases, can you describe what you have in mind?
Luiz,
Does your code work locally? I need a couple more details to help.
Ron
On Sat, Jan 28, 2012 at 4:51 PM, Luiz wrote:
> Hi people,
>
> I wrote this code to implement per-term indexing (Ivory), like figure 4
> of paper http://www.dcs.gla.ac.uk/~richardm/papers/IPM_MapReduce.pdf but
> i
Hi people,
I wrote this code to implement per-term indexing (Ivory), as in Figure 4 of the
paper http://www.dcs.gla.ac.uk/~richardm/papers/IPM_MapReduce.pdf but it doesn't
print anything into the part-0 file.
Does anybody know why it doesn't print anything?
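One way to check the indexing logic locally, independent of Hadoop, is a plain-Java sketch along the following lines. It is not the attached code and uses no Hadoop API. The class name PerTermIndexSketch and the methods map and buildIndex are hypothetical names chosen for illustration, and the tokenization (lowercase, split on non-word characters) is an assumption. It reflects one reading of the per-term strategy: the map step emits one (term, docId, tf) triple per distinct term in a document, and the reduce step collects the postings for each term.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PerTermIndexSketch {

    // "Map" step (hypothetical helper): for one document, count term
    // frequencies and emit one {term, docId, tf} triple per distinct term.
    static List<String[]> map(String docId, String text) {
        Map<String, Integer> tf = new TreeMap<>();
        // Assumed tokenization: lowercase, split on runs of non-word characters.
        for (String token : text.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        List<String[]> out = new ArrayList<>();
        for (Map.Entry<String, Integer> e : tf.entrySet()) {
            out.add(new String[] {e.getKey(), docId, String.valueOf(e.getValue())});
        }
        return out;
    }

    // "Shuffle + reduce" step (hypothetical helper): group the emitted triples
    // by term and build a posting list (docId -> tf) for each term.
    static Map<String, Map<String, Integer>> buildIndex(Map<String, String> docs) {
        Map<String, Map<String, Integer>> index = new TreeMap<>();
        for (Map.Entry<String, String> doc : docs.entrySet()) {
            for (String[] triple : map(doc.getKey(), doc.getValue())) {
                index.computeIfAbsent(triple[0], k -> new TreeMap<>())
                     .put(triple[1], Integer.parseInt(triple[2]));
            }
        }
        return index;
    }

    public static void main(String[] args) {
        Map<String, String> docs = new TreeMap<>();
        docs.put("d1", "hadoop map reduce map");
        docs.put("d2", "hadoop index");
        // Print one line per term: the term, a tab, then its postings.
        buildIndex(docs).forEach((term, postings) ->
                System.out.println(term + "\t" + postings));
    }
}
```

If a local run like this produces postings but the Hadoop job still writes an empty part file, the problem is more likely in the job wiring (key/value types, the reducer actually being invoked, output being written) than in the indexing logic itself.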
Attachment: WordCount.java
So basically, if I don't have the data in HDFS, Hadoop's MapReduce
will not help me?
Because I need to build a tool that processes big data files and a lot of
messages from queues or databases,
and I thought that by using Hadoop's MapReduce my life would be easier
:
Crux reporting for HBase can also be included.
Sonal
On 28-Jan-2012, at 11:40 PM, Chris K Wensel wrote:
> PyCascading
> Scalding
> Cascading.JRuby
> Bixo
>
> Strictly speaking, those plus Cascalog (below) are on top of Cascading, which
> is of course on top of Hadoop, but
PyCascading
Scalding
Cascading.JRuby
Bixo
Strictly speaking, those plus Cascalog (below) are on top of Cascading, which
is of course on top of Hadoop, but all of them have independent developer
teams (at Twitter, Scale Unlimited, Etsy, etc.).
On Jan 28, 2012, at 7:59 AM, Ayad Al-Qershi wrote:
>
I'd add Crunch (https://github.com/cloudera/crunch) and remove Hoop, as
it's integrated with Hadoop in 0.23.1+.
-Joey
On Sat, Jan 28, 2012 at 10:59 AM, Ayad Al-Qershi wrote:
> I'm compiling a list of all Hadoop ecosystem/sub projects ordered
> alphabetically and I need your help if I missed somet
Same with Solr and Lily.
On Sat, Jan 28, 2012 at 8:09 AM, Ted Yu wrote:
> I think Bookkeeper should be included as well.
>
>
> On Sat, Jan 28, 2012 at 7:59 AM, Ayad Al-Qershi wrote:
>
>> I'm compiling a list of all Hadoop ecosystem/sub projects ordered
>> alphabetically and I need your help if I
I think BookKeeper should be included as well.
On Sat, Jan 28, 2012 at 7:59 AM, Ayad Al-Qershi wrote:
> I'm compiling a list of all Hadoop ecosystem/sub projects ordered
> alphabetically and I need your help if I missed something.
>
> 1. Ambari
> 2. Avro
> 3. Cascading
> 4. Cascalog
I'm compiling a list of all Hadoop ecosystem/sub projects ordered
alphabetically and I need your help if I missed something.
1. Ambari
2. Avro
3. Cascading
4. Cascalog
5. Cassandra
6. Chukwa
7. Elastic MapReduce
8. Flume
9. Hadoop Common
10. Hama
11. HBase
12.