Eric Baldeschwieler wrote:
Interesting topic.
I'd be interested in learning what would be made infinitely better.
Distributed revision control is definitely superior to the conventional
centralized revision control plus patching rigmarole.
I don't think there is anything offering infinite i
Doug Cutting wrote:
> Jim,
>
Rather than making big, omnibus patches every few weeks, you should
contribute a more continuous stream of narrowly targeted changes. That
would encourage collaboration with others, as their patches could be
interspersed with yours and have less chance of conflict
Doğacan Güney wrote:
On 3/2/07, Dan Creswell <[EMAIL PROTECTED]> wrote:
Nigel Daley wrote:
> One more difference...
> ,,,
> OTOH, Hadoop servers and clients are currently pre-configured with
> necessary host/ports.
>
Indeed - I was toying with doing something about removing this
pre-configurat
Seems to me that wiki pages are absurdly cheap and we're better off
encouraging participation than discouraging it.
As for noise from wiki update notifications, that must be a subscriber
choice because I don't get them. For folks who don't like them I
suggest turning that option off. Also wi
Doug Cutting wrote:
Hairong Kuang wrote:
Another option is to create a checksum file per block at the data node
where
the block is placed.
Yes, but then we'd need a separate checksum implementation for
intermediate data, and for other distributed filesystems that don't
already guarantee e
David Pollak wrote:
> Would it be possible to add [Hadoop-Dev] to the subject of each message
> in this mailing list? I receive hundreds of pieces of spam a day (this
> is after filtering... with filter off, it's > 5,000 pieces.) One of my
> mental filters is looking for people I know or sub
Netflix is offering $1 million for a 10% improvement in the quality of
their suggestion software:
http://blog.eweek.com/blogs/bizbytes/archive/2006/10/02/13548.aspx
Jim White
As I don't have Hadoop installed (yet), I'm not able to offer a test
case, but I'm fairly confident of a bug line TextInputFormat.
The current implementation will ignore the first line of a file split
when the previous split ended with a newline.
There are two ways to fix this, the easiest (and m