reducers outputs

2012-01-28 Thread aliyeh saeedi
Hi, I want to save reducers' outputs like other files in Hadoop. Does the NameNode keep any information about them? How can I do this? Or can I add a new component to Hadoop, like the NameNode, and make the JobTracker consult it too (I mean I want to make the JobTracker consult the NameNode AND myNewCo

Re: Question about mapReduce.

2012-01-28 Thread Ronald Petty
neo21 zerro, Hadoop may or may not be able to help you, depending on your problem specification. Based on your statement "processes big data files" it should be able to help. Relating to that, I am unclear why HDFS is an issue. As for queues and databases, can you describe what you have in mind?

Re: Why it don't print anything into part-00000 file

2012-01-28 Thread Ronald Petty
Luiz, Does your code work locally? I need a couple more details to help. Ron On Sat, Jan 28, 2012 at 4:51 PM, Luiz wrote: > Hi people, > > I wrote this code to implement per-term indexing (Ivory), like figure 4 > of paper http://www.dcs.gla.ac.uk/~richardm/papers/IPM_MapReduce.pdf but > i

Why it don't print anything into part-00000 file

2012-01-28 Thread Luiz
Hi people, I wrote this code to implement per-term indexing (Ivory), like figure 4 of the paper http://www.dcs.gla.ac.uk/~richardm/papers/IPM_MapReduce.pdf, but it doesn't print anything into the part-00000 file. Does anybody know why it doesn't print anything? WordCount.java Description: Binary data

Re: Question about mapReduce.

2012-01-28 Thread neo21 zerro
So basically, if I don't have the data in the HDFS system, MapReduce from Hadoop will not help me? I need to build a tool that processes big data files and a lot of messages from queues or databases, and I thought that using MapReduce from Hadoop would make my life easier :

Re: hadoop ecosystem

2012-01-28 Thread Sonal Goyal
Crux reporting for HBase can also be included. Sonal On 28-Jan-2012, at 11:40 PM, Chris K Wensel wrote: > PyCascading > Scalding > Cascading.JRuby > Bixo > > Strictly speaking, those plus Cascalog (below) are on top of Cascading, which > is of course on top of Hadoop, but

Re: hadoop ecosystem

2012-01-28 Thread Chris K Wensel
PyCascading Scalding Cascading.JRuby Bixo Strictly speaking, those plus Cascalog (below) are on top of Cascading, which is of course on top of Hadoop, but all of them have independent developer teams (at Twitter, Scale Unlimited, Etsy, etc). On Jan 28, 2012, at 7:59 AM, Ayad Al-Qershi wrote: >

Re: hadoop ecosystem

2012-01-28 Thread Joey Echeverria
I'd add Crunch (https://github.com/cloudera/crunch) and remove Hoop, as it's integrated with Hadoop in 0.23.1+. -Joey On Sat, Jan 28, 2012 at 10:59 AM, Ayad Al-Qershi wrote: > I'm compiling a list of all Hadoop ecosystem/sub projects ordered > alphabetically and I need your help if I missed somet

Re: hadoop ecosystem

2012-01-28 Thread Ted Yu
Same with Solr and Lily. On Sat, Jan 28, 2012 at 8:09 AM, Ted Yu wrote: > I think Bookkeeper should be included as well. > > > On Sat, Jan 28, 2012 at 7:59 AM, Ayad Al-Qershi wrote: > >> I'm compiling a list of all Hadoop ecosystem/sub projects ordered >> alphabetically and I need your help if I

Re: hadoop ecosystem

2012-01-28 Thread Ted Yu
I think Bookkeeper should be included as well. On Sat, Jan 28, 2012 at 7:59 AM, Ayad Al-Qershi wrote: > I'm compiling a list of all Hadoop ecosystem/sub projects ordered > alphabetically and I need your help if I missed something. > >1. Ambari >2. Avro >3. Cascading >4. Cascalog

hadoop ecosystem

2012-01-28 Thread Ayad Al-Qershi
I'm compiling a list of all Hadoop ecosystem projects and subprojects, ordered alphabetically, and I need your help if I missed something. 1. Ambari 2. Avro 3. Cascading 4. Cascalog 5. Cassandra 6. Chukwa 7. Elastic MapReduce 8. Flume 9. Hadoop Common 10. Hama 11. HBase 12.