crunch-contrib would be the most standard nomenclature, yes? Even
though it's a little boring. ;-)

On Thu, Sep 27, 2012 at 9:09 AM, Matthias Friedrich <[email protected]> wrote:
> Hi,
>
> I'm fine with any that makes remotely sense to a non-native speaker :)
>
> Regards,
>   Matthias
>
> On Thursday, 2012-09-27, Rahul wrote:
>> I have named it crunch-bytes, but I like crunch-bars as well. J
>> Pool in your suggestions.
>>
>> regards
>> Rahul
>>
>> On 26-09-2012 21:36, Matthias Friedrich wrote:
>> >OK, then let's do it! As soon as we've agreed on a name, of course :)
>> >
>> >Regards,
>> >   Matthias
>> >
>> >On Wednesday, 2012-09-26, Rahul wrote:
>> >>Hi,
>> >>
>> >>I believe every project has a bunch of interesting users which can
>> >>provide additional food for thought to others. Hadoop provides lots
>> >>of random opportunities to people and the same should be possible
>> >>with crunch. I would be delighted to see what people are able to
>> >>pull off using the existing things. These contributions should be
>> >>kept in crunch as we are pretty young and at times we will go under
>> >>various refactorings, keeping them in crunch will keep them up-to
>> >>date.
>> >>
>> >>And yes, +1 to the idea of keeping dependencies to crunch-core only.
>> >>
>> >>regards,
>> >>rahul
>> >>On 26-09-2012 04:32, Josh Wills wrote:
>> >>>I like the idea of having a place in the project that showcases the
>> >>>cool things that you can do with it-- something more advanced and
>> >>>broadly applicable than the starter pipelines we have in
>> >>>crunch-examples, the kind of stuff that you can't easy do using tools
>> >>>like Hive and Pig.
>> >>>
>> >>>I also agree that we don't want to get into dependency creep, so I'd
>> >>>be inclined to limit crunch-bytes (crunch-berries? crunch-bars?
>> >>>crunch-abs?) to just those dependencies that are also in crunch-core.
>> >>>I think the Bloom Filter stuff meets this criteria.
>> >>>
>> >>>The project is still young enough that our problem is much more likely
>> >>>to be attracting new folks than it is to be getting overwhelmed with
>> >>>random contributions, so my inclination is to be welcoming.
>> >>>
>> >>>On Tue, Sep 25, 2012 at 11:29 AM, Matthias Friedrich <[email protected]> wrote:
>> >>>>Hi Rahul,
>> >>>>
>> >>>>I think it would be really great to have an ecosystem of
>> >>>>micro-libraries around Crunch for all kinds of cool stuff that is
>> >>>>relevant for smaller audiences, just like your Bloom filters.
>> >>>>
>> >>>>But since I expect most of this stuff to be so extremely special, it
>> >>>>would in my opinion make more sense to put this into small, focused
>> >>>>and independent projects that can be released separately from each
>> >>>>other and don't need to go through Crunch's review process. It would
>> >>>>make dependency management easier for users, too, in case a library
>> >>>>needs additional dependencies.
>> >>>>
>> >>>>We could maintain a registry of these projects on Crunch's homepage
>> >>>>so people can find them easily (I expect most of them would end up
>> >>>>at GitHub because it's perfect for this kind of thing). If a project
>> >>>>turns out to be interesting for a larger audience, we can still add it
>> >>>>to Crunch core.
>> >>>>
>> >>>>Regards,
>> >>>>   Matthias
>> >>>>
>> >>>>On Tuesday, 2012-09-25, Rahul wrote:
>> >>>>>There can be interesting use-cases like BloomFilters which do not
>> >>>>>have a place in the current set of Crunch modules. These functions
>> >>>>>are kind of utility functions that can be used in Crunch. We need to
>> >>>>>create a place where users can share such functions. In the earlier
>> >>>>>discussion for BloomFilters we thought of some thing that is well
>> >>>>>along the lines of PiggyBank. I had a look at the module but in
>> >>>>>Pig's structure the module is branched under contrib module as there
>> >>>>>are other modules like peeny for monitering and zebra for storage.
>> >>>>>
>> >>>>>I have created a module name *crunch-bytes* , for issue
>> >>>>>https://issues.apache.org/jira/browse/CRUNCH-75, which is direct
>> >>>>>sub-module in crunch-parent. I named it so because I felt it will
>> >>>>>providing a space to have all those interesting data computations
>> >>>>>that we can not have in core.
>> >>>>>
>> >>>>>Please share your thoughts for the same.
>> >>>>>
>> >>>>>regards,
>> >>>>>rahul
>> >>>>>
>> >>>
>>



-- 
Director of Data Science
Cloudera
Twitter: @josh_wills

Reply via email to