RE: Custom Slicer

2011-03-23 Thread Lai Will
ache.org Cc: Lai Will Subject: Re: Custom Slicer Slicers are deprecated -- Pig now uses Hadoop InputFormats directly; you can read up what those entail in Hadoop documentation and books. As far as dealing with partial records at the beginning and end of the slice, the normal pattern is to al

Re: Custom Slicer

2011-03-01 Thread Lai Will
mobile phone. I apologize for any typos and abbreviations. - Reply message - From: "Dmitriy Ryaboy" Date: Tue, Mar 1, 2011 22:05 Subject: Custom Slicer To: "user@pig.apache.org" Cc: "Lai Will" Slicers are deprecated -- Pig now uses Hadoop InputFormats directly;

Re: Custom Slicer

2011-03-01 Thread Dmitriy Ryaboy
Slicers are deprecated -- Pig now uses Hadoop InputFormats directly; you can read up what those entail in Hadoop documentation and books. As far as dealing with partial records at the beginning and end of the slice, the normal pattern is to always read a full record even if it takes you past the c

Custom Slicer

2011-03-01 Thread Lai Will
Hello, The data I want to process is XML. It boils down to ... ... According to what I read in the documentation. When loading the file using the default Slicer, I end up in block sized chunks, that will very likely contain partial s at the beginning and at