From a newbie: Questions and will MapReduce fit our needs

2011-08-26 Thread Per Steffensen
he essence of my questions. I hope you will try to answer all the questions, even though some of them might seem stupid to you. Remember that I am a newbie :-) I have been running through the FAQ, but didn't find any answers to my questions (maybe because they are stupid :-) ). I wasn't able to search the archives of the mailing list, so I quickly gave up finding my answers in "old threads". Can someone point me to a way of searching the archives? Regards, Per Steffensen

Re: From a newbie: Questions and will MapReduce fit our needs

2011-08-29 Thread Per Steffensen
version numbers (0.20 and 0.21), that doesn't make it sound like a production-ready tool. Regards, Per Steffensen MONTMORY Alain wrote: Hi, I am going to try to respond to your response inline in the text. I am not a Hadoop expert, but we are facing the same kind of problem (dealing with files which

Re: From a newbie: Questions and will MapReduce fit our needs

2011-08-29 Thread Per Steffensen
which are external to HDFS) in our project and we use Hadoop. [@@THALES GROUP RESTRICTED@@] -Original Message- From: Per Steffensen [mailto:st...@designware.dk] Sent: Friday, 26 August 2011 13:13 To: mapreduce-user@hadoop.apache.org

Re: From a newbie: Questions and will MapReduce fit our needs

2011-08-29 Thread Per Steffensen
Can you point me to a good place to read about Sqoop? I only find http://incubator.apache.org/projects/sqoop.html and https://cwiki.apache.org/confluence/display/SQOOP. There is really not much to find about what Sqoop can do, how to use it, etc. Regards, Per Steffensen Peyman Mohajerian

How does FileInputFormat sub-classes handle many small files

2011-09-01 Thread Per Steffensen
gards, Per Steffensen

Re: How does FileInputFormat sub-classes handle many small files

2011-09-01 Thread Per Steffensen
Harsh J wrote: Hello Per, On Thu, Sep 1, 2011 at 2:27 PM, Per Steffensen wrote: Hi, FileInputFormat sub-classes (TextInputFormat and SequenceFileInputFormat) are able to take all files in a folder and split the work of handling them into several sub-jobs (map-jobs). I know it can split a
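
For readers following the small-files question, here is a minimal sketch of how per-file splitting behaves. It is a simplified model and not Hadoop's actual code: the class name SplitSketch and the method computeSplits are hypothetical, it uses no Hadoop classes, and it ignores details such as block locations, zero-length files and the slack Hadoop allows on the last split of a file. The one property it is meant to show is that a split never spans two files, so every non-empty file yields at least one split and therefore at least one map task.

```java
import java.util.ArrayList;
import java.util.List;

public class SplitSketch {

    /**
     * Simplified model of per-file splitting (hypothetical helper, not a Hadoop API).
     * Each returned entry is {fileIndex, offset, length}.
     */
    static List<long[]> computeSplits(long[] fileSizes, long splitSize) {
        List<long[]> splits = new ArrayList<>();
        for (int i = 0; i < fileSizes.length; i++) {
            long remaining = fileSizes[i];
            long offset = 0;
            // Cut this file into chunks of at most splitSize bytes.
            // A chunk never continues into the next file.
            while (remaining > 0) {
                long length = Math.min(splitSize, remaining);
                splits.add(new long[] {i, offset, length});
                offset += length;
                remaining -= length;
            }
        }
        return splits;
    }

    public static void main(String[] args) {
        long kb = 1024L;
        long mb = 1024L * kb;
        long splitSize = 64 * mb; // assumed split size for this illustration

        // 1000 small files of 10 KB each: one split per file.
        long[] manySmall = new long[1000];
        java.util.Arrays.fill(manySmall, 10 * kb);
        System.out.println("small files: " + computeSplits(manySmall, splitSize).size() + " splits");

        // One 200 MB file: cut into several splits.
        long[] oneLarge = {200 * mb};
        System.out.println("large file:  " + computeSplits(oneLarge, splitSize).size() + " splits");
    }
}
```

In this model, the number of splits for many small files grows with the file count rather than with the total data volume, which is why small files are a concern in the first place.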