Re: Best Practices for Plugin Dev and Deployment

2016-04-08 Thread Mattmann, Chris A (3980)
Hi Thiago, Sorry for the top post: 1. Yes you could do conf/models, and/or an HDFS url, either one. The conf directory is packaged up when you create a *.job file for Hadoop by running ant job. That said, if your job jar includes 100-1GB model files that’s how big your *.job will be. A better

Re: Best Practices for Plugin Dev and Deployment

2016-04-08 Thread Thiago Galery
Hi Chris, thanks for the response, here are some elaborations of my initial questions on the basis of your reply. On Wed, Apr 6, 2016 at 2:12 PM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Hi Thiago, > > Welcome! > > First thing to check out: > >

Re: Best Practices for Plugin Dev and Deployment

2016-04-06 Thread Thiago Galery
Thanks for the pointers Chris On Wed, Apr 6, 2016 at 2:12 PM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Hi Thiago, > > Welcome! > > First thing to check out: > > http://wiki.apache.org/nutch/Becoming_A_Nutch_Developer > > > I would follow that by checking out info on how

Re: Best Practices for Plugin Dev and Deployment

2016-04-06 Thread Mattmann, Chris A (3980)
Hi Thiago, Welcome! First thing to check out: http://wiki.apache.org/nutch/Becoming_A_Nutch_Developer I would follow that by checking out info on how to use our Source Code repo: http://wiki.apache.org/nutch/UsingGit OK now on to your specific questions: On 4/6/16, 8:48 AM, "Thiago

Best Practices for Plugin Dev and Deployment

2016-04-06 Thread Thiago Galery
Dear list, I'm a new Nutch Developer and I have a few questions to ask you. 1 - Are there any general guidelines for plugin development (in addition to the ones specified in the wiki guide). I looked around github and it seems that many plugins are developed as a monolithic piece of code that is