Publishing the jar to bigtops maven is probably a good first step ,Then apps can just include it as needed...?.
I'm not against packaging if someone wants packages for this. Maybe even include it in bigtop util ? Let's move to jira, > On Aug 25, 2015, at 9:41 PM, Konstantin Boudnik <[email protected]> wrote: > > It is pretty cool indeed! > > I wonder how it needs to be structured to be: > - easy to access/use from other components wherever it is needed > - doesn't interfere with the rest of the stack > > I guess one possible way would be to implement the generator as a set of maven > artifacts, that could be installed/consumed transparently by just declaring a > dependency e.g as proposed via top-level component. > > Another way is to have a new package like we do for bigtop-utils and such. > > Perhaps this discussion should be moved to JIRA or shall we continue on the > dev@ ?? > > Cos > >> On Sun, Aug 23, 2015 at 11:53AM, RJ Nowling wrote: >> Hi BigTop, >> >> I had a discussion with Jay yesterday, we'd like to propose a new component >> for BigTop: BigTop Data Generators. >> >> BigTop Data Generators would consist of a common set of libraries for >> building data generators and three example data generators: >> >> * BigPetStore transaction generator (moved from BigPetStore) >> * BigTop Bazaar -- attendee movement and interactions with booths on a >> showroom floor, at a conference, or at a mall >> * BigTop Weatherman -- stochastic weather simulation (temperature, wind >> speed, wind chill, rainfall, etc.) per zip code. (From a model trained on >> NOAA historical weather data) >> >> We believe that creating a common set of libraries will have several >> benefits including: >> >> * Easier for others to build their own data generators >> * Make data generators smaller and easier to maintain >> * Share improvements across the data generators >> >> More details on the libraries are below. >> >> BigPetStore will be continue to focus on building and maintaining >> blueprints, powered by the BigTop Data Generators. >> >> Our vision is that we get all of Apache coming to BigTop for tools for >> building better, more comprehensive blueprints. We want to support these >> efforts through data generators and the initial set of blueprint we've been >> building. >> >> If the community is generally in support of this, I can create a top-level >> "bigtop-data-generators" directory and put the data generators and >> libraries in there. >> >> Thanks! >> >> RJ >> >> >> ------- >> Library details: >> >> So far, I've extracted the following common libraries: >> >> * Samplers -- provides classes for PDFs and various samplers >> * Name generator -- data set and samplers for generating names >> * Location data set -- data set and classes for US zip codes, their >> GPS coordinates, median house hold incomes, and population sizes >> * Product generator -- library for enumerating products from a >> specification file. Comes with default specifications for BigPetStore >> >> I also expect that I'll add libraries for: >> >> * Particle simulation -- customer movement in a room >> * Latent factor model generation -- generate latent factors and >> customer weights to create something like MovieLens data. Used in Bazaar >> for booth preferences and potentially in BigPetStore for customer item >> preferences >> >> Most of these libraries came out of the BigPetStore data generator but the >> other generators have been refactored to be based off the standard set of >> libraries.
