RE: Understanding file splits

2008-10-28 Thread Malcolm Matalka
h. I am using hadoop 0.18.1 by the way. -Original Message- From: Malcolm Matalka [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 28, 2008 11:41 To: core-user@hadoop.apache.org Subject: RE: Understanding file splits Thanks for the response Owen. As for the 'isSplittable' thing.

RE: Understanding file splits

2008-10-28 Thread Malcolm Matalka
). Is there any other information I can give? -Original Message- From: Doug Cutting [mailto:[EMAIL PROTECTED] On Behalf Of Doug Cutting Sent: Tuesday, October 28, 2008 11:41 To: core-user@hadoop.apache.org Subject: Re: Understanding file splits This is hard to diagnose without knowing your

RE: Understanding file splits

2008-10-28 Thread Malcolm Matalka
FSDataInputStream fileIn = fs.open(split.getPath()); Which I took from LineRecordReader. Any thoughts? Thanks -Original Message- From: Owen O'Malley [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 28, 2008 11:30 To: core-user@hadoop.apache.org Subject: Re: Understanding file

Re: Understanding file splits

2008-10-28 Thread Doug Cutting
This is hard to diagnose without knowing your InputFormat. Each split returned by your #getSplits() implementation is passed to your #getRecordReader() implementation. If your RecordReader is not stopping when you expect it to, then that's a problem in your RecordReader, no? Have you written

Re: Understanding file splits

2008-10-28 Thread Owen O'Malley
On Oct 28, 2008, at 6:29 AM, Malcolm Matalka wrote: I am trying to write an InputFormat and I am having some trouble understanding how my data is being broken up. My input is a previous hadoop job and I have added code to my record reader to print out the FileSplit's start and end position, as