Help me with architecture of a somewhat non-trivial mapreduce implementation

2012-04-18 Thread Sky USC
Please help me architect the design of my first significant MR task beyond "word count". My program works well, but I am trying to optimize performance to maximize use of the available computing resources. I have 3 questions at the bottom. Project description in an abstract sense (written in jav

RE: How do I include the newer version of Commons-lang in my jar?

2012-04-09 Thread Sky USC
Thanks for the reply. I appreciate your helpfulness. I created JARs by following the instructions at "http://blog.mafr.de/2010/07/24/maven-hadoop-job/". So external JARs are stored in the lib/ folder within a JAR. Am I summarizing this correctly: 1. If hadoop version = 0.20.203 or lower - then, the

Reading properties file from command line + passing parameters from main to mapper/reducer?

2012-04-06 Thread Sky USC
Dear friends, I am new to Hadoop on AWS. I am using AWS Elastic MapReduce. I am trying to convert an old Java program to Elastic MapReduce. I would appreciate your help. My question is about: 1. How do I pass a "Properties" file, which I used to pass via command-line parameters - to elastic map
