Hello everyone,

I am currently a master's student at EURECOM (www.eurecom.fr). I am working on a project related to Apache Pig as part of the EU-funded project Bigfoot (www.bigfootproject.eu).

Building on our previous work, “Duy-Hung Phan, Matteo Dell’Amico, Pietro Michiardi: On the design space of MapReduce ROLLUP aggregates” (http://www.eurecom.fr/en/publication/4212/download/rs-publi-4212_2.pdf), I am working on a new family of algorithms that address some limitations of the current ROLLUP operator in Apache Pig: IRG (in-reducer grouping), hybrid IRG, and chained-IRG. I have an implementation whose results indicate better performance than the existing ROLLUP implementation.

You can find more information on this work here: https://issues.apache.org/jira/browse/PIG-4066

I have also created a review request on Review Board: https://reviews.apache.org/r/23804/

It would be very helpful if someone could review this patch and give me some feedback. I look forward to hearing from you.

Regards,
Quang-Nhat HOANG-XUAN
