Re: Submit Hadoop Job Remotely (without creating a jar)

2014-06-27 Thread Tsuyoshi OZAWA
Some tools provide CLIs that don't require creating a jar. For example, if you'd like to use Pig, you can use its interactive mode: http://pig.apache.org/docs/r0.12.1/start.html#interactive-mode The Hive CLI is another: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Cli

Re: Partitioning and setup errors

2014-06-27 Thread Chris Mawata
The new Configuration() is suspicious. Are you setting configuration information manually? Chris On Jun 27, 2014 5:16 AM, Chris MacKenzie stu...@chrismackenziephotography.co.uk wrote: Hi, I realise my previous question may have been a bit naïve and I also realise I am asking an awful lot

Re: group similar items using pairwise similar items

2014-06-27 Thread Chris Mawata
Since you say "mutually similar", aren't you really looking for maximal cliques rather than connected components? Hi, I have a set of items and a set of pairwise similar items. I want to group together items that are mutually similar. For example: if *A B C D E F G* are the items I have the following
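To illustrate the distinction raised in this reply, here is a minimal Java sketch that computes both groupings on a small graph. The similarity pairs are hypothetical (the original list is truncated in this archive), and the class and method names are made up for illustration. Connected components are found with a depth-first search; maximal cliques use the basic Bron-Kerbosch recursion without pivoting, which is fine for small inputs but exponential in the worst case.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

final class SimilarityGroups {

    /** Builds an undirected adjacency map from pairs of similar items. */
    static Map<String, Set<String>> buildGraph(String[][] pairs) {
        Map<String, Set<String>> adj = new TreeMap<>();
        for (String[] p : pairs) {
            adj.computeIfAbsent(p[0], k -> new TreeSet<>()).add(p[1]);
            adj.computeIfAbsent(p[1], k -> new TreeSet<>()).add(p[0]);
        }
        return adj;
    }

    /** Connected components: items linked by any chain of similar pairs. */
    static List<Set<String>> connectedComponents(Map<String, Set<String>> adj) {
        List<Set<String>> out = new ArrayList<>();
        Set<String> seen = new HashSet<>();
        for (String start : adj.keySet()) {
            if (!seen.add(start)) continue;          // already placed in a component
            Set<String> comp = new TreeSet<>();
            Deque<String> stack = new ArrayDeque<>();
            stack.push(start);
            while (!stack.isEmpty()) {
                String v = stack.pop();
                comp.add(v);
                for (String w : adj.get(v)) {
                    if (seen.add(w)) stack.push(w);
                }
            }
            out.add(comp);
        }
        return out;
    }

    /** Maximal cliques: groups in which every pair of items is similar. */
    static List<Set<String>> maximalCliques(Map<String, Set<String>> adj) {
        List<Set<String>> out = new ArrayList<>();
        bronKerbosch(new TreeSet<>(), new TreeSet<>(adj.keySet()), new TreeSet<>(), adj, out);
        return out;
    }

    // r = current clique, p = candidates that can extend it, x = already explored vertices.
    private static void bronKerbosch(Set<String> r, Set<String> p, Set<String> x,
                                     Map<String, Set<String>> adj, List<Set<String>> out) {
        if (p.isEmpty() && x.isEmpty()) {
            out.add(new TreeSet<>(r));               // r cannot be extended: it is maximal
            return;
        }
        for (String v : new ArrayList<>(p)) {        // iterate over a copy, p is mutated below
            Set<String> r2 = new TreeSet<>(r);
            r2.add(v);
            Set<String> p2 = new TreeSet<>(p);
            p2.retainAll(adj.get(v));
            Set<String> x2 = new TreeSet<>(x);
            x2.retainAll(adj.get(v));
            bronKerbosch(r2, p2, x2, adj, out);
            p.remove(v);
            x.add(v);
        }
    }

    public static void main(String[] args) {
        // Hypothetical pairs: A, B, C are mutually similar; D is similar only to C.
        String[][] pairs = {{"A", "B"}, {"B", "C"}, {"A", "C"}, {"C", "D"}, {"E", "F"}};
        Map<String, Set<String>> graph = buildGraph(pairs);
        System.out.println("components: " + connectedComponents(graph));
        System.out.println("cliques:    " + maximalCliques(graph));
    }
}
```

With these pairs, D lands in the same connected component as A and B even though it is similar only to C, whereas the clique grouping keeps {A, B, C} and {C, D} apart. Note that an item can belong to more than one maximal clique (C here), so cliques do not partition the items the way components do.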

RE: persistent services in Hadoop

2014-06-27 Thread John Lilley
Thanks Arun! I do think we are on the bleeding edge of YARN, because everyone else in our application space generates MapReduce (Pig, Hive), or they have overlaid their legacy server-grid on Hadoop. I will explore both resources you mentioned to see where the development community is headed.

Re: Partitioning and setup errors

2014-06-27 Thread Chris MacKenzie
Hi Chris, Thanks for your response. I deeply appreciate it. I don't know what you mean by that question. I use configuration:
* In the driver: Job job = Job.getInstance(new Configuration());
* In the CustomLineRecordReader: Configuration job = context.getConfiguration();
One of the biggest

Re: Partitioning and setup errors

2014-06-27 Thread Chris Mawata
Probably my fault. I was looking for the "extends Configured implements Tool" part. I will double check when I get home rather than send you on a wild goose chase. Cheers Chris On Jun 27, 2014 8:16 AM, Chris MacKenzie stu...@chrismackenziephotography.co.uk wrote: Hi, I realise my previous

How to see total pending containers ?

2014-06-27 Thread Ashwin Shankar
Hi, Is there a way to see the total pending containers in a cluster, so that we know how far behind we are with ETL? There is a pending containers field on the scheduler page under the dr.who table, but that is always zero. -- Thanks, Ashwin

Re: Configuring Hadoop Client: Where is fail-over configured.

2014-06-27 Thread Juan Carlos
Hi Charley, in hdfs-site.xml you can find the property dfs.ha.namenodes. With this property set, every client knows which NameNodes are eligible to be active; nothing else is required on the client. Regards. 2014-06-26 21:30 GMT+02:00 Charley Newtonne cnewto...@gmail.com: I have hadoop 2.4

how to replication speed up

2014-06-27 Thread 조주일
Hi. My cluster has 2 HA NameNodes and 8 DataNodes. There are 506,803 under-replicated blocks. About 1,000 blocks are replicated every 10 minutes, and 600 megabytes of traffic occurs per server. At this rate it will take a very long time to complete. How can I increase the replication rate?