RE: Pig indexing

2010-10-05 Thread Yan Zhou
Zebra, a contrib project under pig, is such a loader that builds indexes by itself. Yan -Original Message- From: Renato Marroquín Mogrovejo [mailto:renatoj.marroq...@gmail.com] Sent: Tuesday, October 05, 2010 7:52 AM To: pig-u...@hadoop.apache.org Subject: Re: Pig indexing Hey Dmitriy

Re: Pig indexing

2010-10-05 Thread Renato Marroquín Mogrovejo
Hey Dmitriy! I've been trying to get to this email for the last couple of weeks, and finally I am here. You were talking about Pig's merge, but there is one thing I didn't quite understand from the wiki (http://wiki.apache.org/pig/PigMergeJoin) if it uses sampling records to create indexes because