Re: How to modify task assignment algorithm?

2010-10-07 Thread Jeff Zhang
Currently, TaskTracker send heatbeat to JobTracker, and JobTracker will send actions to TaskTracker according TaskTracker's status. I think you should do some hacking about the heatbeat part of JobTracker public synchronized HeartbeatResponse heartbeat(TaskTrackerStatus status,

Re: How to modify task assignment algorithm?

2010-10-07 Thread Shen LI
Hi, Thanks you very much for your reply. I want to run my own algorithm for this part to see if we can achieve better outcome in specific scenario. So how can I modify it? Thanks a lot! Shen On Thu, Oct 7, 2010 at 6:33 PM, Jeff Zhang wrote: > I believe it is possible. But what is your purpose

Re: How to modify task assignment algorithm?

2010-10-07 Thread Jeff Zhang
I believe it is possible. But what is your purpose? I believe current solution is good enough. On Fri, Oct 8, 2010 at 2:57 AM, Shen LI wrote: > Hi, > How can I modify the task assignment strategy in hadoop which is used to > assign tasks to different worker nodes? (Not the job scheduler) > Bi

Re: Hdfs Block Size

2010-10-07 Thread Jeff Zhang
Yes, this relates with the native file system block size and disk block size, and can reduce the disk fragmentation. On Fri, Oct 8, 2010 at 2:03 AM, rakesh kothari wrote: > Is there a reason why block size should be set to some 2^N, for some integer > N ? Does it help with block defragmentation

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/08/2010 02:52 AM, Ted Yu wrote: > Have you checked > http://download.oracle.com/javase/6/docs/api/javax/xml/stream/XMLEventReader.html? Hm, I'm working really often with StAX and I'm using the event reader in my custom input format to create the records and produce a List of XMLEvents or mor

Re: ClassCastException

2010-10-07 Thread Ted Yu
Have you checked http://download.oracle.com/javase/6/docs/api/javax/xml/stream/XMLEventReader.html? On Thu, Oct 7, 2010 at 5:49 PM, Johannes.Lichtenberger < johannes.lichtenber...@uni-konstanz.de> wrote: > On 10/08/2010 02:38 AM, Johannes.Lichtenberger wrote: > > On 10/08/2010 01:29 AM, Ted Yu wr

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/08/2010 02:38 AM, Johannes.Lichtenberger wrote: > On 10/08/2010 01:29 AM, Ted Yu wrote: >> http://download.oracle.com/javase/6/docs/api/javax/xml/stream/XMLEventWriter.html >> >> You can use an XMLOutputFactory to create an XMLEventWriter, and then use an >> XMLEventFactory to create events t

Re: how to set system properties for mapper/reducer?

2010-10-07 Thread Ted Yu
If you look at the parameters passed to your mapper/reducer, you should see something like this: -Djava.library.path=/opt//hadoop/bin/../lib/native/Linux-amd64-64:/tmp/hadoop-hadoop/mapred/local/taskTracker/jobcache/job_201010062322_0002/attempt_201010062322_0002_m_00_0/work On Thu, Oct 7, 20

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/08/2010 01:29 AM, Ted Yu wrote: > http://download.oracle.com/javase/6/docs/api/javax/xml/stream/XMLEventWriter.html > > You can use an XMLOutputFactory to create an XMLEventWriter, and then use an > XMLEventFactory to create events that can then be written to the > XMLEventWriter. I just re

custom Input fornat

2010-10-07 Thread hmchiud
Hi there, My files, each about 40G, content format like: datadatadatadatadata datadatadatadatadata datadatadatadatadata datadatadatadatadata datadatadatadatadata datadatadatadatadata

Re: how to set system properties for mapper/reducer?

2010-10-07 Thread Yin Lou
I don't understand. I want to pass something like java.library.path="/home/.../libXXX.so" so that every mapper can load that lib and use the native code. Could you give me an example? Thanks! Yin On Thu, Oct 7, 2010 at 6:05 PM, Ted Yu wrote: > Take a look at bin/hadoop: > hadoop: HADOOP_OPTS=

Re: ClassCastException

2010-10-07 Thread Ted Yu
http://download.oracle.com/javase/6/docs/api/javax/xml/stream/XMLEventWriter.html You can use an XMLOutputFactory to create an XMLEventWriter, and then use an XMLEventFactory to create events that can then be written to the XMLEventWriter. On Thu, Oct 7, 2010 at 4:05 PM, Johannes.Lichtenberger <

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/08/2010 12:01 AM, Ted Yu wrote: > http://www.ibm.com/developerworks/xml/library/x-stax2.html I think the problem would be how to serialize XMLEvents or more precisely I don't know if there's an existing StAX XMLEvent-to-String class/method. regards, Johannes

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/08/2010 12:01 AM, Ted Yu wrote: > http://www.ibm.com/developerworks/xml/library/x-stax2.html Yes, my approach is to parse a very big XML file (wikipedia revisions) with StAX in my RecordReader implementation. The key is a timestamp and the values are Lists, because I don't want to have to se

Re: how to set system properties for mapper/reducer?

2010-10-07 Thread Ted Yu
Take a look at bin/hadoop: hadoop: HADOOP_OPTS="$HADOOP_OPTS -Djava.library.path=$JAVA_LIBRARY_PATH" On Thu, Oct 7, 2010 at 12:04 PM, Yin Lou wrote: > Hi, > > Is there any way to pass system properties, like java.library.path to each > mapper/reducer? > > Thanks, > Yin >

Re: ClassCastException

2010-10-07 Thread Ted Yu
http://www.ibm.com/developerworks/xml/library/x-stax2.html On Thu, Oct 7, 2010 at 2:54 PM, Johannes.Lichtenberger < johannes.lichtenber...@uni-konstanz.de> wrote: > On 10/07/2010 05:41 PM, Ted Yu wrote: > > Since mFormatter.format() returns a String, you don't need to introduce > > newline. > > Y

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/07/2010 05:41 PM, Ted Yu wrote: > Since mFormatter.format() returns a String, you don't need to introduce > newline. > You can call paramOut.writeUTF() to save the String and call > paramIn.readUTF() to read it back. Ok, that's what I did right after replying. My value is a List or more prec

Re: Too large class path for map reduce jobs

2010-10-07 Thread Tom White
I wonder if there is a misunderstanding here - the problem is that the classpath has too many classes on it (and clashes with user classes), rather than it being a text string which is too long. I would suggest that the technical discussion of how to fix this goes onto the JIRA. Cheers, Tom On T

how to set system properties for mapper/reducer?

2010-10-07 Thread Yin Lou
Hi, Is there any way to pass system properties, like java.library.path to each mapper/reducer? Thanks, Yin

Re: ClassCastException

2010-10-07 Thread Anthony Urso
Oops, I meant Writable. On Thu, Oct 7, 2010 at 6:58 AM, Johannes.Lichtenberger wrote: > On 10/07/2010 06:46 AM, Anthony Urso wrote: >>  Hadoop is attempting to cast a Date object to WritableComparable, which >> Date does not implement, and is causing that exception. >> >> Your keys must implement

How to modify task assignment algorithm?

2010-10-07 Thread Shen LI
Hi, How can I modify the task assignment strategy in hadoop which is used to assign tasks to different worker nodes? (Not the job scheduler) Big thanks, Shen

Hdfs Block Size

2010-10-07 Thread rakesh kothari
Is there a reason why block size should be set to some 2^N, for some integer N ? Does it help with block defragmentation etc. ? Thanks, -Rakesh

Re: ClassCastException

2010-10-07 Thread Ted Yu
Since mFormatter.format() returns a String, you don't need to introduce newline. You can call paramOut.writeUTF() to save the String and call paramIn.readUTF() to read it back. The value class doesn't need to implement Comparable. On Thu, Oct 7, 2010 at 6:58 AM, Johannes.Lichtenberger < johannes.

Re: ClassCastException

2010-10-07 Thread Johannes.Lichtenberger
On 10/07/2010 06:46 AM, Anthony Urso wrote: > Hadoop is attempting to cast a Date object to WritableComparable, which > Date does not implement, and is causing that exception. > > Your keys must implement WritableComparable and your values must > implement Comparable. The values have to implemen

Re: Too large class path for map reduce jobs

2010-10-07 Thread Alejandro Abdelnur
well, if the issue is a too long classpath, the softlink thingy will give some room to breath as the total CP length will be much smaller. A On Thu, Oct 7, 2010 at 3:43 PM, Henning Blohm wrote: > So that's actually another issue, right? Besides splitting the classpath > into those three groups,

Re: Too large class path for map reduce jobs

2010-10-07 Thread Henning Blohm
So that's actually another issue, right? Besides splitting the classpath into those three groups, you want the TT to create soft-links on demand to simplify the computation of classpath string. Is that right? But it's the TT that actually starts the job VM. Why does it matter what the string actua