Re: Nutch 2.0

2010-06-28 Thread Andrzej Bialecki
On 2010-06-28 07:49, Sami Siren wrote: One aspect that has not been discussed yet is the legal aspect. According to http://incubator.apache.org/ip-clearance/index.html there is a formal process for integrating external development efforts that have happened outside of Apache. Should we be

Re: Nutch 2.0

2010-06-28 Thread Sami Siren
On 06/28/2010 10:10 AM, Andrzej Bialecki wrote: On 2010-06-28 07:49, Sami Siren wrote: One aspect that has not been discussed yet is the legal aspect. According to http://incubator.apache.org/ip-clearance/index.html there is a formal process for integrating external development efforts that

[jira] Commented: (NUTCH-834) Separate the Nutch web site from trunk

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12883141#action_12883141 ] Chris A. Mattmann commented on NUTCH-834: Hey Julien: My recommendation would be

Re: Nutch 2.0

2010-06-28 Thread Doğacan Güney
Hey all, I will double check to make sure, but IIRC, there is no need to delete svn:nutchbase since current code on github simply builds on top of that. So why not simply merge github branch into svn? It will be a clear merge... The only problem is contributor info is messed up in github but I

Re: Nutch 2.0

2010-06-28 Thread Mattmann, Chris A (388J)
Hi Doğacan, So your proposition is to combine (a) and (b) then? That’s fine by me, so long as there are no objections from others. I can still move forward with , (e) and (g) then... Cheers, Chris On 6/28/10 8:39 AM, Doğacan Güney doga...@gmail.com wrote: Hey all, I will double check to

Re: Nutch 2.0

2010-06-28 Thread Mattmann, Chris A (388J)
Hi Guys, And, let me clarify my OK’ness with this. My assumption is that regardless of whether we physically svn:delete nutchbase in Apache SVN (the choice I went to after hearing there were *significant* changes in the Git version from that of the Apache one), and then import a fresh copy

[jira] Updated: (NUTCH-363) Fetcher normalizes everything at least twice

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-363: Fix Version/s: 2.0 (was: 1.2) Fetcher normalizes everything at

[jira] Updated: (NUTCH-833) Website is still Lucene branded

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-833: Fix Version/s: 2.0 (was: 1.2) Website is still Lucene branded

[jira] Updated: (NUTCH-50) Benchmarks Performance goals

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-50?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-50: Fix Version/s: 2.0 (was: 1.2) Benchmarks Performance goals

[jira] Updated: (NUTCH-832) Website menu has lots of broken links - in particular the API docs

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-832: Fix Version/s: 2.0 (was: 1.2) Website menu has lots of broken links

[jira] Updated: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-831: Fix Version/s: 2.0 (was: 1.2) Allow configuration of how fields

[jira] Commented: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12883415#action_12883415 ] Chris A. Mattmann commented on NUTCH-831: I applied this patch to the Nutch 1.2

Re: Nutch 2.0

2010-06-28 Thread Mattmann, Chris A (388J)
Okey dokey guys, (c), (e) and (g) are done. Julien, Doğacan, your turn on (a) and (d) and then we can all work on (e) and (f)... Cheers, Chris On 6/28/10 12:55 PM, Doğacan Güney doga...@gmail.com wrote: On Mon, Jun 28, 2010 at 20:23, Andrzej Bialecki a...@getopt.org wrote: On 2010-06-28

[jira] Resolved: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-831. Assignee: Chris A. Mattmann Fix Version/s: (was: 2.0) Resolution: