Re: muti-thread mapreduce

2012-12-13 Thread Yu Yang
each thread to create maps to handle different logs. 2012/12/13 Yang > but I do have run across some situations where I could benefit from > multi-threading: if your hadoop mapper is prone to random access IO (such > as looking up a TFile, or HBase, which ultimately makes a network call an

Re: muti-thread mapreduce

2012-12-12 Thread Yang
I think it won't help much, since in a hadoop cluster, people already allocate "SLOTS" to be the number of cores, supposedly the inherent parallelism can be already exploited, since different mappers/reducers are completely independent. On Wed, Dec 12, 2012 at 2:09 AM, Yu Yang

Re: How to connect to a cluster by using eclipse

2012-07-04 Thread Yang
mapred.java.child.opts) then use eclipse to connect to the tasktracker , note you need to choose between "Listen/attach" in the "Run remote application" window in eclipse On Wed, Jul 4, 2012 at 7:00 PM, Jason Yang wrote: > ramon, > > Thank for your reply very much. &g

Re: How to connect to a cluster by using eclipse

2012-07-04 Thread Jason Yang
all right, thanks~ 在 2012年7月5日星期四,Marcos Ortiz 写道: > Jason, > Ramon is right. > The best way to debug a MapReduce job is mounting a local cluster, and > then, when you have tested enough your code, then, you can > deploy it in a real distributed cluster. > On 07/04/2012 10

Re: How to connect to a cluster by using eclipse

2012-07-04 Thread Jason Yang
VM and can be easily > debugged using Eclipse. Hope this will be useful. > > > > *From:* Jason Yang [mailto:lin.yang.ja...@gmail.com 'cvml', 'lin.yang.ja...@gmail.com');>] > *Sent:* miércoles, 04 de julio de 2012 11:25 > *To:* mapreduce-user > *Subject:* Ho

How to connect to a cluster by using eclipse

2012-07-04 Thread Jason Yang
lipse? -- YANG, Lin

Re: Would I call a DLL in MR by using JNI

2012-05-23 Thread jason Yang
.dll into the PATH. > And everything works. > > Zhu, Guojun > Modeling Sr Graduate > 571-3824370 > guojun_...@freddiemac.com > Financial Engineering > Freddie Mac > > > *jason Yang * > >05/23/2012 05:37 AM > Please respond to > mapreduce-user@had

Would I call a DLL in MR by using JNI

2012-05-23 Thread jason Yang
Hi, All~ Currently, I'm trying to rewrite an algorithm to MapReduce form. Since the algorithm depends on some third-party DLLs which are written in C++, I was wondering would I call a DLL in the Map() / Reduce() by using JNI? Thanks. -- YANG, Lin

Re: correct package of FileInputFormat

2012-01-30 Thread Yang
lib.input is the new architecture, .mapred is for legacy backward compatibility if u use .mapreduce , you should use mapreduces.lib.. On Mon, Jan 30, 2012 at 10:21 AM, GUOJUN Zhu wrote: > > Hi, > > I am learning Hadoop now. I am trying to write a customized inputformat. > I found out that

controlling where a map task runs?

2012-01-17 Thread Yang
possible to spread out my mappers? Thanks Yang

hadoop streaming job IOException

2011-12-18 Thread Yu Yang
Hi, all I got the following exception when I submit a hadoop streaming job to my hadoop cluster. I wrote the mapper in perl langguage, and there is no reducer. the mapper script runs well on local server. Hadoop version: Hadoop 0.20.2-CDH3B4 Can anyone give me some help? What is the problem exact

Re: easiest way to install hadoop

2011-02-23 Thread Eric Yang
Hi, For anyone who is interested in installing Yahoo patched hadoop 0.20.2+ ( branch-0.20-security) in rpm or debian form. There is a new option. There are proposals from Owen to standardize Hadoop deployment in https://issues.apache.org/jira/browse/HADOOP-6255. I created rpm/debian package

Re: Add user jars to mapreduce

2010-01-20 Thread Eric Yang
Hi Victor, Thanks for the detailed examination. I will make sure to remove the URI prefix in my code for now. Regards, Eric On 1/20/10 5:36 AM, "Victor Hsieh" wrote: > BTW, this issue has been reported: > http://issues.apache.org/jira/browse/MAPREDUCE-752 > > On Wed, Jan 20, 2010 at 7:59 PM,

Re: Add user jars to mapreduce

2010-01-04 Thread Eric Yang
>> Your above code should work properly. Added jars will not be part of job.jar. >> The addon.jar will be uploaded by the client, the added libjars will be added >> to classpath using DistributedCache. >> So, above code should work unless your path to jar or the jar itself was >> wrong. When the

Re: Add user jars to mapreduce

2009-12-31 Thread Eric Yang
wrote: > You should use the DistributedCache. >  http://hadoop.apache.org/common/docs/current/mapred_tutorial.html#Distributed > Cache > > > > On Thu, Dec 31, 2009 at 3:21 PM, Eric Yang wrote: >> Hi, >> >> I have a mapreduce program embeded in a java appli

Add user jars to mapreduce

2009-12-31 Thread Eric Yang
Hi, I have a mapreduce program embeded in a java application, and I am trying to load additional jar files as add-on for the execution of my map reduce job. My program works like this: JobConf conf = new JobConf(new Configuration(), Demux.class); conf.setBoolean("mapred.used.genericoptionsparser"