I think these would be an excellent addition, given the improvements David suggested. Since they span a range of Accumulo uses, they are probably best served as a contrib repository[1] rather than an Accumulo version-specific example.
In any case, this will involve adopting a non-trivial code base developed outside of the project; we'll need to go through the ASF Incubator[2]. Before that can happen we'll need to call a vote[3].

[1]: http://accumulo.apache.org/contrib.html
[2]: http://incubator.apache.org/faq.html#proposed_new_codebase
     http://incubator.apache.org/ip-clearance/index.html
[3]: http://accumulo.apache.org/bylaws.html#actions

On Tue, Nov 11, 2014 at 10:38 AM, Josh Elser <josh.el...@gmail.com> wrote:

> I've given a quick glance over them -- they look like they'd be a great
> addition!
>
> We'd have to figure out some mechanism to distribute the exercises (as we
> can't compile them), but that's a manageable problem.
>
> If you want to open an issue on JIRA, that'd be the first step to get
> these into the codebase. Some things to think about meanwhile:
>
> * Check out the coding practices and code formatting guidelines -
>   http://accumulo.apache.org/source.html#coding-practices
> * Add ASL headers to the files
> * Figure out where might be a good place to include these in the Accumulo
>   tree - maybe examples/training?
> * Consider what documentation would be needed for someone to self-guide
>   themselves through these examples
> * Look into redistribution rights on the included twitter.json file. I'm
>   not sure what Twitter's terms of service are. It may be easier to write a
>   script that will generate some example tweets. It keeps us from being
>   liable for what those tweets contain and also prevents us from having to
>   distribute a big blob.
>
> Thanks again!
>
> David Medinets wrote:
>
>> Can you add descriptions of the exercises to the README file? Many
>> people finding that page would move to the next one with details to
>> catch their interest.
>>
>> On Tue, Nov 11, 2014 at 9:49 AM, Chris Rigano <chris.p.rig...@gmail.com>
>> wrote:
>>
>>> I believe they would be of benefit.
>>>
>>> On Mon, Nov 10, 2014 at 12:43 PM, Adam J. Shook <adamjsh...@gmail.com>
>>> wrote:
>>>
>>>> I had just finished finalizing the training materials for a basic and
>>>> advanced Accumulo class my company ClearEdge IT offers. I think some of
>>>> the more advanced code tutorials would be valuable to contribute to the
>>>> Accumulo examples library.
>>>>
>>>> The examples all work with status updates from Twitter and include
>>>> topics such as:
>>>>
>>>> - Basic Reading/Writing
>>>> - Indexing tweets and creating a program to retrieve tweets based on
>>>>   given search terms
>>>> - Bulk ingestion of the tweets
>>>> - Using MapReduce to build a geo-index table for the tweets with
>>>>   latitude/longitude information via z-points
>>>> - Leveraging the geo-index to retrieve tweets from a given lat/long
>>>>   bounding box
>>>> - Custom iterators such as filters and combiners
>>>>
>>>> If this is something the community would be interested in, please take
>>>> the time to review them at https://github.com/adamjshook/accumulo-training
>>>> and let me know if there are any you think would be worth contributing.
>>>> I'd be happy to take the time to massage them to meet the standards.
>>>>
>>>> Cheers,
>>>> --Adam

--
Sean