Re: using git for SCM?

2007-07-31 Thread Jim White
Eric Baldeschwieler wrote: Interesting topic. I'd be interested in learning what would be made infinitely better. Distributed revision control is definitely superior to the conventional centralized revision control plus patching rigmarole. I don't think there is anything offering infinite i

Re: [jira] Created: (HADOOP-1384) HBase code update 0.0.0-4

2007-05-17 Thread Jim White
Doug Cutting wrote: > Jim, > Rather than making big, omnibus patches every few weeks, you should contribute a more continuous stream of narrowly targeted changes. That would encourage collaboration with others, as their patches could be interspersed with yours and have less chance of conflict

Re: JavaSpaces (Blitz?) and hadoop - comparison?

2007-03-02 Thread Jim White
Doğacan Güney wrote: On 3/2/07, Dan Creswell <[EMAIL PROTECTED]> wrote: Nigel Daley wrote: > One more difference... > ,,, > OTOH, Hadoop servers and clients are currently pre-configured with > necessary host/ports. > Indeed - I was toying with doing something about removing this pre-configurat

Re: [Lucene-hadoop Wiki] Trivial Update of "Bigtable&Sawzall" by udanax

2007-02-15 Thread Jim White
Seems to me that wiki pages are absurdly cheap and we're better off encouraging participation than discouraging it. As for noise from wiki update notifications, that must be a subscriber choice because I don't get them. For folks who don't like them I suggest turning that option off. Also wi

Re: inline checksums

2007-01-24 Thread Jim White
Doug Cutting wrote: Hairong Kuang wrote: Another option is to create a checksum file per block at the data node where the block is placed. Yes, but then we'd need a separate checksum implementation for intermediate data, and for other distributed filesystems that don't already guarantee e

Re: Administrative Request

2006-11-16 Thread Jim White
David Pollak wrote: > Would it be possible to add [Hadoop-Dev] to the subject of each message > in this mailing list? I receive hundreds of pieces of spam a day (this > is after filtering... with filter off, it's > 5,000 pieces.) One of my > mental filters is looking for people I know or sub

Now *that* is a programming contest!

2006-10-03 Thread Jim White
Netflix is offering $1 million for a 10% improvement in the quality of their suggestion software: http://blog.eweek.com/blogs/bizbytes/archive/2006/10/02/13548.aspx Jim White

TextInputFormat bug - lines which are *not* split

2006-08-21 Thread Jim White
As I don't have Hadoop installed (yet), I'm not able to offer a test case, but I'm fairly confident of a bug line TextInputFormat. The current implementation will ignore the first line of a file split when the previous split ended with a newline. There are two ways to fix this, the easiest (and m