Re: running hadoop with gij

2008-07-17 Thread Matt Kent
There be dragons. Use the Sun JVM. On Thu, 2008-07-17 at 13:45 +0200, Gert Pfeifer wrote: Did anyone try to get hadoop running on the Gnu java environment? Does that work? Cheers, Gert

Re: JobClient question

2008-07-11 Thread Matt Kent
The best way to configure all that stuff is in hadoop-site.xml, which lives in the hadoop conf directory. Make sure that directory is on the classpath of your application. On Fri, 2008-07-11 at 11:55 -0400, Larry Compton wrote: Thanks. Is this the correct syntax? conf.set(mapred.job.tracker,
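
As an illustration of the hadoop-site.xml approach described in this reply, here is a minimal sketch of a file that sets mapred.job.tracker. The host name and port are placeholders assumed for the example, not values from the thread; substitute the address of your own JobTracker.

```xml
<?xml version="1.0"?>
<!-- hadoop-site.xml: site-specific overrides, kept in the Hadoop conf directory -->
<configuration>
  <property>
    <name>mapred.job.tracker</name>
    <!-- Placeholder host:port (an assumption for this sketch) -->
    <value>jobtracker.example.com:9001</value>
  </property>
</configuration>
```

With the directory containing this file on the application's classpath, a JobConf created by the application should pick the value up without an explicit conf.set call.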

Re: Google Protocol Buffers - structured binary data

2008-07-09 Thread Matt Kent
I have extensive experience with Thrift, and have been playing with protocol buffers for a couple days. Thrift is a more complete RPC solution, including client and server implementations, whereas PB is just a data exchange format. If you want a ready-to-go RPC server, use Thrift. If you want

Re: realtime hadoop

2008-06-24 Thread Matt Kent
at 21:54, Matt Kent [EMAIL PROTECTED] wrote: We use Hadoop in a similar manner, to process batches of data in real-time every few minutes. However, we do substantial amounts of processing on that data, so we use Hadoop to distribute our computation. Unless you have a significant amount

Re: realtime hadoop

2008-06-23 Thread Matt Kent
We use Hadoop in a similar manner, to process batches of data in real-time every few minutes. However, we do substantial amounts of processing on that data, so we use Hadoop to distribute our computation. Unless you have a significant amount of work to be done, I wouldn't recommend using Hadoop

Re: is there a way to debug hadoop from Eclipse

2008-06-17 Thread Matt Kent
Eclipse? Could we set up breakpoints, trace the running steps of the map reduce program? Richard On Mon, Jun 16, 2008 at 6:54 PM, Matt Kent [EMAIL PROTECTED] wrote: The approach I've taken is to use JMock and create a unit test for the mapreduce, then debug that within Eclipse on my

Re: is there a way to debug hadoop from Eclipse

2008-06-16 Thread Matt Kent
The approach I've taken is to use JMock and create a unit test for the mapreduce, then debug that within Eclipse on my workstation. For performance debugging, I use YourKit on the cluster. Matt On Mon, 2008-06-16 at 16:58 -0700, Mori Bellamy wrote: Hey Richard, I'm interested in the same

runtime exceptions not killing job

2008-03-17 Thread Matt Kent
throwing NPEs, the tasks just hung. Eventually they timed out and were killed, but is this expected behavior in 0.16.1? I'd prefer the job to fail quickly if NPEs are being thrown. Matt -- Matt Kent Co-Founder Persai 1221 40th St #113 Emeryville, CA 94608 [EMAIL PROTECTED]