[jira] [Resolved] (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory

2023-01-08 Thread Sebastian Nagel (Jira)
] ! > different versions of the same library in nutch-2.0-dev.job and local\lib > directory > > > Key: NUTCH-849 > URL: https://issues.apache.org/jir

[jira] [Closed] (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory

2023-01-08 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel closed NUTCH-849. - > different versions of the same library in nutch-2.0-dev.job and local\lib > dir

[jira] [Updated] (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory

2014-07-07 Thread Julien Nioche (JIRA)
in nutch-2.0-dev.job and local\lib directory Key: NUTCH-849 URL: https://issues.apache.org/jira/browse/NUTCH-849 Project: Nutch Issue Type: Task Affects

[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2013-02-28 Thread kiran (JIRA)
i) Feed ii) parse-swf iii) parse-ext iv) parse-zip v) parse-metatags ( I wrote patch for this earlier, NUTCH-1478) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

[jira] [Updated] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-10-10 Thread Lewis John McGibbney (JIRA)
attention by Kiran Chitturi. Thanks for this Kiran, your contributions are greatly appreciated. Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora -- Key: NUTCH-874

[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-10-10 Thread Lewis John McGibbney (JIRA)
imports committed @revision 1396850 in 2.x head Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora -- Key: NUTCH-874 URL: https://issues.apache.org/jira

[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-10-10 Thread Hudson (JIRA)
://builds.apache.org/job/Nutch-nutchgora/375/]) NUTCH-874 Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora (part 1) (Revision 1396850) Result = SUCCESS lewismc : Files : * /nutch/branches/2.x/CHANGES.txt * /nutch/branches/2.x/src/plugin/feed/src/java/org/apache/nutch/indexer

[jira] [Updated] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-09-18 Thread Lewis John McGibbney (JIRA)
in src/plugin are compatible with Nutch 2.0 and Gora -- Key: NUTCH-874 URL: https://issues.apache.org/jira/browse/NUTCH-874 Project: Nutch Issue Type: Bug

[jira] [Updated] (NUTCH-875) Port Webgraph to Nutch 2.0

2012-09-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-875: --- Fix Version/s: (was: 2.1) 2.2 Port Webgraph to Nutch 2.0

[jira] [Updated] (NUTCH-841) Nutch 2.0 webapp

2012-09-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-841: --- Fix Version/s: (was: 2.1) 2.2 Nutch 2.0 webapp

[jira] [Updated] (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory

2012-09-18 Thread Lewis John McGibbney (JIRA)
of the same library in nutch-2.0-dev.job and local\lib directory Key: NUTCH-849 URL: https://issues.apache.org/jira/browse/NUTCH-849 Project: Nutch Issue Type

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-07 Thread Julien Nioche
Lewis, Looks like you've released 2.0. If so can you make an announcement to the mailing list + update the website. It's not really something that should go unnoticed. I know about the press release but surely it does not mean that NOTHING should be said about the release then. I see a 1.5 on a

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-07 Thread Mattmann, Chris A (388J)
Thanks for your hard work here, Lewis! Cheers, Chris On Jul 7, 2012, at 3:44 PM, Lewis John Mcgibbney wrote: Hi Julien, Believe it or not I've just spent around 45 mins waiting on committing the site... broadband in Paris is nothing short of utterly abysmal to say the very best. Please

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-07 Thread Julien Nioche
Hi Lewis, Thanks for your hard work indeed and your explanations below. Well done everybody past and present for 2.0 Julien On 7 July 2012 23:44, Lewis John Mcgibbney lewis.mcgibb...@gmail.comwrote: Hi Julien, Believe it or not I've just spent around 45 mins waiting on committing the

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-06 Thread Lewis John Mcgibbney
OK Guys I think I'll push this tonight when I finish working. I hope you got things working Chris it would be a nice way to resolve the VOTE this thread. I'll push to servers later tonight. I've been away this week so haven't quite been on the ball with this. Thanks Lewis On Wed, Jul 4, 2012

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-06 Thread Mattmann, Chris A (388J)
OK, +1 from me :) ant runtime works: job: [jar] Building jar: /Users/mattmann/tmp/nutch2/build/apache-nutch-2.0.job runtime: [mkdir] Created dir: /Users/mattmann/tmp/nutch2/runtime [mkdir] Created dir: /Users/mattmann/tmp/nutch2/runtime/local [mkdir] Created dir: /Users

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-04 Thread Lewis John Mcgibbney
Hi Chris, lewismc@lewismc-HP-Mini-110-3100:~$ java -showversion java version 1.6.0_25 Java(TM) SE Runtime Environment (build 1.6.0_25-b06) Java HotSpot(TM) Client VM (build 20.0-b11, mixed mode, sharing) lewismc@lewismc-HP-Mini-110-3100:~$ ant -v Apache Ant(TM) version 1.8.2 compiled on August

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-04 Thread Mattmann, Chris A (388J)
Thanks Lewis, here are mine: [chipotle:~/tmp/nutch2/apache-nutch-2.0] mattmann% ant -version Apache Ant(TM) version 1.8.2 compiled on May 17 2012 [chipotle:~/tmp/nutch2/apache-nutch-2.0] mattmann% java -version java version 1.6.0_33 Java(TM) SE Runtime Environment (build 1.6.0_33-b03-424-10M3720

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-03 Thread Julien Nioche
injecting to indexing. Great work Lewis. On Mon, Jun 25, 2012 at 6:32 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Everyone, A candidate for the Apache Nutch 2.0 RC3 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc3 The release candidate is a src.zip

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-03 Thread Mattmann, Chris A (388J)
Hey Julien, On Jul 3, 2012, at 7:49 AM, Julien Nioche wrote: [..snip..] OK, so basically signatures and checksums are fine +1, yep they are great. Tried to build and test and got this: [ivy:resolve] :: [..snip...] Try

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-03 Thread Mattmann, Chris A (388J)
!guava.jar [ivy:resolve] :: [ivy:resolve] [ivy:resolve] :: USE VERBOSE OR DEBUG MESSAGE LEVEL FOR MORE DETAILS BUILD FAILED /Users/mattmann/tmp/nutch2/apache-nutch-2.0/build.xml:431: impossible to resolve dependencies: resolve failed - see

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-03 Thread Lewis John Mcgibbney
] [ivy:resolve] :: USE VERBOSE OR DEBUG MESSAGE LEVEL FOR MORE DETAILS BUILD FAILED /Users/mattmann/tmp/nutch2/apache-nutch-2.0/build.xml:431: impossible to resolve dependencies: resolve failed - see output for details Total time: 1 minute 56 seconds [chipotle:~/tmp/nutch2/apache

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-02 Thread Lewis John Mcgibbney
injecting to indexing. Great work Lewis. On Mon, Jun 25, 2012 at 6:32 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Everyone, A candidate for the Apache Nutch 2.0 RC3 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc3 The release candidate

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-02 Thread Julien Nioche
for the Apache Nutch 2.0 RC3 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc3 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc3 We release Nutch 2.0 in this fashion due

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-07-02 Thread Mattmann, Chris A (388J)
, A candidate for the Apache Nutch 2.0 RC3 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc3 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc3 We release Nutch 2.0 in this fashion due

Re: [VOTE] Apache Nutch 2.0 Release Candidate #3

2012-06-27 Thread Ferdy Galema
+1 Crawling with HBaseStore works from injecting to indexing. Great work Lewis. On Mon, Jun 25, 2012 at 6:32 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Everyone, A candidate for the Apache Nutch 2.0 RC3 is available at: http://people.apache.org/~lewismc/apache-nutch

Re: Nutch 2.0 Press Announcement

2012-06-22 Thread Sally Khudairi
Mcgibbney lewis.mcgibb...@gmail.com To: Sally Khudairi s...@apache.org Cc: dev@nutch.apache.org Sent: Thursday, 21 June 2012, 16:49 Subject: Nutch 2.0 Press Announcement Good Evening Sally, First and foremost I hope you are keeping well and that the beginning of the summer has been kind to you

Nutch 2.0 Press Announcement

2012-06-21 Thread Lewis John Mcgibbney
Good Evening Sally, First and foremost I hope you are keeping well and that the beginning of the summer has been kind to you... all the good weather still to come not to worry :0) The reason I contact you is that we (the Apache Nutch community) are nearly ready to release Nutch 2.0 which

Re: [VOTE] Apache Nutch 2.0 RC2

2012-06-19 Thread Lewis John Mcgibbney
Hi Ferdy, On Mon, Jun 18, 2012 at 11:27 AM, Ferdy Galema ferdy.gal...@kalooga.comwrote: Tested it with HBase but there is a slight issue with the dependencies. After building rc2 with ivy-enabled HBase, it seems a test HBase jar is deployed in local/lib, even though it's called

Re: [VOTE] Apache Nutch 2.0 RC2

2012-06-18 Thread Julien Nioche
...@apache.org wrote: Hi Everyone, A candidate for the Apache Nutch 2.0 RC2 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc2 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc2

Re: [VOTE] Apache Nutch 2.0 RC2

2012-06-18 Thread Ferdy Galema
, A candidate for the Apache Nutch 2.0 RC2 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc2 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc2 We release Nutch 2.0

Re: [VOTE] Apache Nutch 2.0 RC2

2012-06-18 Thread Sebastian Nagel
for the Apache Nutch 2.0 RC2 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc2 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc2 We release Nutch 2.0 in this fashion due to the inclusion

Re: [VOTE] Apache Nutch 2.0 RC2

2012-06-18 Thread Ferdy Galema
and the content of the Maven repo looks OK . +1 for releasing Thanks a lot. Julien On 15 June 2012 13:48, lewis john mcgibbney lewi...@apache.org wrote: Hi Everyone, A candidate for the Apache Nutch 2.0 RC2 is available at: http://people.apache.org/~lewismc/apache-nutch

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Ferdy Galema
Agree with only releasing src. On Thu, Jun 14, 2012 at 11:32 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Or just not ship a bin release at all. Src is the only thing we really VOTE on legally though bin is provided for convenience purposes. Will type more on this

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
+1 On 15 June 2012 09:00, Ferdy Galema ferdy.gal...@kalooga.com wrote: Agree with only releasing src. On Thu, Jun 14, 2012 at 11:32 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Or just not ship a bin release at all. Src is the only thing we really VOTE on legally

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Lewis John Mcgibbney
I'll push this in an hour or so guys. Thanks for the input. Lewis On Fri, Jun 15, 2012 at 9:39 AM, Julien Nioche lists.digitalpeb...@gmail.com wrote: +1 On 15 June 2012 09:00, Ferdy Galema ferdy.gal...@kalooga.com wrote: Agree with only releasing src. On Thu, Jun 14, 2012 at 11:32

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
Before you do, could you check that NutchGora passes ant test successfully. I just tried and got an error related to the parse-tika tests. Am about to open a JIRA to update to the latest version of Tika for NutchGora which should fix the problem and put it at the same level as trunk J On 15 June

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
see https://issues.apache.org/jira/browse/NUTCH-1396 On 15 June 2012 10:43, Julien Nioche lists.digitalpeb...@gmail.com wrote: Before you do, could you check that NutchGora passes ant test successfully. I just tried and got an error related to the parse-tika tests. Am about to open a JIRA to

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Mattmann, Chris A (388J)
OK you are just making us all look bad now Juls ;) Super fast! Cheers, Chris On Jun 15, 2012, at 2:54 AM, Julien Nioche wrote: see https://issues.apache.org/jira/browse/NUTCH-1396 On 15 June 2012 10:43, Julien Nioche lists.digitalpeb...@gmail.com wrote: Before you do, could you check

[VOTE] Apache Nutch 2.0 RC2

2012-06-15 Thread lewis john mcgibbney
Hi Everyone, A candidate for the Apache Nutch 2.0 RC2 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc2 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc2 We release Nutch 2.0

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
That was not intented. Just that am on holidays, it's raining and the children were either asleep or playing nicely :-) On 15 June 2012 18:19, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: OK you are just making us all look bad now Juls ;) Super fast! Cheers, Chris On

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Ferdy Galema
(WebTableReader.java:412) -- readdb -dump works. Confirmed and ticket opened as NUTCH-1391 % ./bin/nutch fetch 1339621550-203073321 -threads 1 -parse Exception in thread main java.lang.IllegalArgumentException: arg -parse not recognized The parse argument was removed in Nutch 2.0 and now throws

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Sebastian Nagel
We only supply src distributions... Does this principle apply to Nutch 2 as well? Maybe, yes. The situation with the current binary package is uncomfortable: I had to copy/link gora-hbase and hbase jars into lib/ to get nutch running. 2012/6/13 Lewis John Mcgibbney lewis.mcgibb...@gmail.com

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Lewis John Mcgibbney
Aye this is no good at all. Depending on which backend you wish to use with Gora, you will need to go and manually fetch the correct .jar's from maven central. Does anyone else have either solution or a workaround before I push RC2 with just src dists? Thanks Lewis On Thu, Jun 14, 2012 at 4:52

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Mattmann, Chris A (388J)
Hey Guys, I think the annoyance is probably something folks can live with as they have been waiting for an official release of 2.x for years :) My +1 to roll RC #2 with or without a solution to this and mark it as a TODO. release eary, release often :) Cheers, Chris On Jun 14, 2012, at 10:04

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Julien Nioche
I disagree. You'd expect a binary release to work out of the box - which is not the case. Plus we'd have to spend more time explaining the workaround, answering the same questions over and over on the ML etc... Fixing this should not be a big deal (i.e. add the gore-x modules for the backends to

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Lewis John Mcgibbney
Hi Julien, Do you suggest with the binary release that we simply open up all gora-* deps and ship it with every jar available? Lewis On Thu, Jun 14, 2012 at 9:39 PM, Julien Nioche lists.digitalpeb...@gmail.com wrote: I disagree. You'd expect a binary release to work out of the box - which

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Julien Nioche
yep, remember that you can't build from the bin package so inevitably someone will wonder why only such or such backend is available etc... another option is to NOT have a binary release at all, in which case it is acceptable I think not to include the deps in ivy. Maybe we should at least add

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Lewis John Mcgibbney
This is what is currently done and what I was essentially proposing. I really don't know about the size of the bin artifact if we enable all gora-* dependencies before packaging it for distribution... thanks to input from yourselves we recently sorted out some size issues with 1.5, it would be

Re: VOTE Apache Nutch 2.0 RC1

2012-06-14 Thread Mattmann, Chris A (388J)
Or just not ship a bin release at all. Src is the only thing we really VOTE on legally though bin is provided for convenience purposes. Will type more on this later... Sent from my iPhone On Jun 14, 2012, at 2:18 PM, Lewis John Mcgibbney

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Ferdy Galema
Findings about Nutch-2.0 RC 1. The Nutch job jar is not present in the binary archive. This means distributed running of jobs is not supported. I'm not sure if this is a problem (since users can always build one themselves), merely pointing it out. The recently released 1.5 also lacks this job

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Ferdy Galema
:00 AM, Ferdy Galema ferdy.gal...@kalooga.comwrote: Findings about Nutch-2.0 RC 1. The Nutch job jar is not present in the binary archive. This means distributed running of jobs is not supported. I'm not sure if this is a problem (since users can always build one themselves), merely pointing

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Lewis John Mcgibbney
in development/production. Thanks very much in advance Best Lewis On Fri, Jun 8, 2012 at 3:49 PM, lewis john mcgibbney lewi...@apache.org wrote: Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org/~lewismc/nutch-2.0 The release

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Lewis John Mcgibbney
Hi Seb, Quick update On Tue, Jun 12, 2012 at 11:33 PM, Sebastian Nagel wastl.na...@googlemail.com wrote: 1 some guidance would be nice. README.txt points to http://wiki.apache.org/nutch/NutchTutorial which refers to 1.x Please see http://wiki.apache.org/nutch/Nutch2Tutorial which is an update

Suitable Nutch 2.0 Project Description

2012-06-13 Thread Lewis John Mcgibbney
, a link-graph database and parsing support handled by Apache Tika for HTML and and array other document formats. This is merely a pot shot, but I was thinking for Nutch 2.0, something like ** Apache Nutch 2.X is an experimental branch of the Apache Nutch open source web-search software project

Re: Suitable Nutch 2.0 Project Description

2012-06-13 Thread Ferdy Galema
formats. This is merely a pot shot, but I was thinking for Nutch 2.0, something like ** Apache Nutch 2.X is an experimental branch of the Apache Nutch open source web-search software project. It builds on Apache Gora for data persistence and Apache Solr for indexing adding web-specifics

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Julien Nioche
Ferdy The Nutch job jar is not present in the binary archive. This means distributed running of jobs is not supported. I'm not sure if this is a problem (since users can always build one themselves), merely pointing it out. The recently released 1.5 also lacks this job jar, so at least no

Re: Suitable Nutch 2.0 Project Description

2012-06-13 Thread Julien Nioche
-specifics, such as a crawler, a link-graph database and parsing support handled by Apache Tika for HTML and and array other document formats. This is merely a pot shot, but I was thinking for Nutch 2.0, something like ** Apache Nutch 2.X is an experimental branch of the Apache Nutch open

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Lewis John Mcgibbney
Hi Guys, Whilst updating the Nutch2Tutorial I got thinking that within Gora we don't supply binary distributions of the code, this is because when using Gora a user may wish/require to recompile the code to accomodate config changes etc. We only supply src distributions... Does this principle

Re: Suitable Nutch 2.0 Project Description

2012-06-13 Thread Mattmann, Chris A (388J)
a pot shot, but I was thinking for Nutch 2.0, something like ** Apache Nutch 2.X is an experimental branch of the Apache Nutch open source web-search software project. It builds on Apache Gora for data persistence and Apache Solr for indexing adding web-specifics, such as a crawler, a link

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Sebastian Nagel
Hi Lewis, Please see http://wiki.apache.org/nutch/Nutch2Tutorial which is an update of Julien's (I think) page on GORA_HBase. Thsi will get you rocking with HBase. The changes between Cassandra, Accumulo and the other data stores are fairly trivial. I'll managed to perform a crawl with 2.0

Re: VOTE Apache Nutch 2.0 RC1

2012-06-13 Thread Lewis John Mcgibbney
not recognized The parse argument was removed in Nutch 2.0 and now throws an illegalargumentexception. This is now normal. To enable parsing during fetching please set config in nutch-site.xml. The reason that the incorrect -parse argument is till in the Usage message, is because I was not diligent enough

Re: VOTE Apache Nutch 2.0 RC1

2012-06-12 Thread Lewis John Mcgibbney
the thorough stuff can be undertaken by those directly using the artifacts and code in development/production. Thanks very much in advance Best Lewis On Fri, Jun 8, 2012 at 3:49 PM, lewis john mcgibbney lewi...@apache.org wrote: Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1

Re: VOTE Apache Nutch 2.0 RC1

2012-06-12 Thread Mattmann, Chris A (388J)
, Jun 8, 2012 at 3:49 PM, lewis john mcgibbney lewi...@apache.org wrote: Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org/~lewismc/nutch-2.0 The release candidate is a src.zip, bin.zip, src.tar.gz and bin.tar.gz archive

Re: VOTE Apache Nutch 2.0 RC1

2012-06-12 Thread Lewis John Mcgibbney
using the artifacts and code in development/production. Thanks very much in advance Best Lewis On Fri, Jun 8, 2012 at 3:49 PM, lewis john mcgibbney lewi...@apache.org wrote: Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org/~lewismc

Re: VOTE Apache Nutch 2.0 RC1

2012-06-12 Thread Sebastian Nagel
using the artifacts and code in development/production. Thanks very much in advance Best Lewis On Fri, Jun 8, 2012 at 3:49 PM, lewis john mcgibbney lewi...@apache.org wrote: Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org

Re: VOTE Apache Nutch 2.0 RC1

2012-06-12 Thread Mattmann, Chris A (388J)
. Thanks very much in advance Best Lewis On Fri, Jun 8, 2012 at 3:49 PM, lewis john mcgibbney lewi...@apache.org wrote: Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org/~lewismc/nutch-2.0 The release candidate is a src.zip

VOTE Apache Nutch 2.0 RC1

2012-06-08 Thread lewis john mcgibbney
Good Evening Everyone, A candidate for the Apache Nutch 2.0 RC1 is available at: http://people.apache.org/~lewismc/nutch-2.0 The release candidate is a src.zip, bin.zip, src.tar.gz and bin.tar.gz archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc1 Further

[jira] [Updated] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-04-25 Thread Lewis John McGibbney (JIRA)
) 2.1 Set and classify Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora -- Key: NUTCH-874 URL: https://issues.apache.org/jira/browse/NUTCH

[jira] [Updated] (NUTCH-841) Nutch 2.0 webapp

2012-04-25 Thread Lewis John McGibbney (JIRA)
) 2.1 Nutch 2.0 webapp Key: NUTCH-841 URL: https://issues.apache.org/jira/browse/NUTCH-841 Project: Nutch Issue Type: Improvement Components: web gui Affects Versions: nutchgora

[jira] [Updated] (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory

2012-04-25 Thread Lewis John McGibbney (JIRA)
versions of the same library in nutch-2.0-dev.job and local\lib directory Key: NUTCH-849 URL: https://issues.apache.org/jira/browse/NUTCH-849 Project: Nutch

[jira] [Updated] (NUTCH-841) Nutch 2.0 webapp

2012-03-08 Thread Ferdy Galema (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema updated NUTCH-841: --- Priority: Major (was: Blocker) Nutch 2.0 webapp Key: NUTCH

[jira] [Commented] (NUTCH-841) Nutch 2.0 webapp

2012-03-08 Thread Chris A. Mattmann (Commented) (JIRA)
! Nutch 2.0 webapp Key: NUTCH-841 URL: https://issues.apache.org/jira/browse/NUTCH-841 Project: Nutch Issue Type: Improvement Components: web gui Environment: Nutch 2.0 Reporter: Chris A. Mattmann

[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-01-05 Thread Lewis John McGibbney (Commented) (JIRA)
are compatible with Nutch 2.0 and Gora -- Key: NUTCH-874 URL: https://issues.apache.org/jira/browse/NUTCH-874 Project: Nutch Issue Type: Bug Components: parser

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-24 Thread Julien Nioche
://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora/ I ran these commands to perform the move: $ svn copy -m Branching current Nutch (Gora) 2.0 trunk. See: http

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-24 Thread lewis john mcgibbney
! Cheers, Chris On Sep 23, 2011, at 4:57 PM, Mattmann, Chris A (388J) wrote: Hi Folks, Per: http://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-24 Thread lewis john mcgibbney
, Mattmann, Chris A (388J) wrote: Hi Folks, Per: http://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora/ I ran these commands to perform the move: $ svn

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-24 Thread Mattmann, Chris A (388J)
. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora/ I ran these commands to perform the move: $ svn copy -m Branching current Nutch (Gora) 2.0 trunk. See: http://s.apache.org/Zld for more information. https

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-24 Thread lewis john mcgibbney
, Mattmann, Chris A (388J) wrote: Hi Folks, Per: http://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora/ I ran these commands to perform the move

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-24 Thread Markus Jelsma
great! Hi Folks, Per: http://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora/ I ran these commands to perform the move: $ svn copy -m Branching

[NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-23 Thread Mattmann, Chris A (388J)
Hi Folks, Per: http://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched to: https://svn.apache.org/repos/asf/nutch/branches/nutchgora/ I ran these commands to perform the move: $ svn copy -m Branching current Nutch (Gora

Re: [NOTICE] Nutch trunk is now 1.4-snapshot and Nutch 2.0 trunk is now the Nutch Gora branch

2011-09-23 Thread Mattmann, Chris A (388J)
Okey dok, the news item is now published. Let the dev'ing commence! Cheers, Chris On Sep 23, 2011, at 4:57 PM, Mattmann, Chris A (388J) wrote: Hi Folks, Per: http://s.apache.org/Zld the Nutch trunk is now 1.4-snapshot development. The old (Gora-based) Nutch 2.0 trunk has been branched

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-17 Thread Mattmann, Chris A (388J)
16, 2011, at 11:44 AM, Markus Jelsma wrote: Option B) Shelve trunk in a branch and promote 1.4 to trunk. We can always choose to hardwire HBASE (option D) later. Markus Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-17 Thread Markus Jelsma
in a branch and promote 1.4 to trunk. We can always choose to hardwire HBASE (option D) later. Markus Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described before to a single one? Julien On 15 September 2011 19

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-17 Thread Mattmann, Chris A (388J)
Hey Markus, No worries. I actually have no dog in this fight to be honest. I want Gora to be successful, and I want Nutch to be successful. I haven't contributed much to Nutch 2.0 trunk but I have been to the 1.x series branch. I wish I knew more about Gora's internals (and am trying

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-17 Thread lewis john mcgibbney
no dog in this fight to be honest. I want Gora to be successful, and I want Nutch to be successful. I haven't contributed much to Nutch 2.0 trunk but I have been to the 1.x series branch. I wish I knew more about Gora's internals (and am trying to learn) so I could help more with it. I think

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread Julien Nioche
Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described before to a single one? Julien On 15 September 2011 19:55, Markus Jelsma markus.jel...@openindex.iowrote: Hi Guys, I thought I'd chime in on this thread. My comments below

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread lewis john mcgibbney
wrote: Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described before to a single one? Julien On 15 September 2011 19:55, Markus Jelsma markus.jel...@openindex.iowrote: Hi Guys, I thought I'd chime in on this thread. My

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread Markus Jelsma
Option B) Shelve trunk in a branch and promote 1.4 to trunk. We can always choose to hardwire HBASE (option D) later. Markus Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described before to a single one? Julien On 15 September

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-16 Thread Mattmann, Chris A (388J)
Jelsma wrote: Option B) Shelve trunk in a branch and promote 1.4 to trunk. We can always choose to hardwire HBASE (option D) later. Markus Am happy to call for a vote on the future of Nutch 2.0 if you want. Shall we reduce the various options described before to a single one? Julien

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-15 Thread Markus Jelsma
). The real issue behind all this is what we should do with Nutch 2.0. What follows is only my opinion and I would love to hear what others have to say on this subject. Since we (actually mostly Dogacan) wrote 2.0 and delegated the storage to Gora, the latter hasn't really taken off since

Re: Future of Nutch 2.0 [Was: Unresolved dependencies org.apache.gora#gora-hbase;0.1: not found in Nutch trunk]

2011-09-15 Thread Sami Siren
On Thu, Sep 15, 2011 at 9:55 PM, Markus Jelsma markus.jel...@openindex.io wrote: There are many things i can write about this topic right now but don't feel it's neccessary. The choice is difficult and perhaps painful but when the voting round is opened by our project lead, i will vote for

[jira] [Commented] (NUTCH-841) Nutch 2.0 webapp

2011-09-10 Thread Lewis John McGibbney (JIRA)
. Nutch 2.0 webapp Key: NUTCH-841 URL: https://issues.apache.org/jira/browse/NUTCH-841 Project: Nutch Issue Type: Improvement Components: web gui Environment: Nutch 2.0 Reporter: Chris

[jira] [Commented] (NUTCH-841) Nutch 2.0 webapp

2011-09-08 Thread Lewis John McGibbney (JIRA)
resource from our wiki. http://wiki.apache.org/nutch/NutchAdministrationUserInterface Nutch 2.0 webapp Key: NUTCH-841 URL: https://issues.apache.org/jira/browse/NUTCH-841 Project: Nutch Issue Type: Improvement

Re: InvocationTargetException with Nutch 2.0 Gora 0.2 and Cassandra 0.8.4

2011-08-30 Thread lewis john mcgibbney
Hi Tom, Well this is strange... No versions of hector in Nutch 2.0/runtime/deploy/nutch-2.0-dev.job or /local/lib however Gora 0.2 uses it a dependency as per /gora-cassandra/lib/hector-core0.8.0-1.jar I'm going to take some time later and try various debug combinations within eclipse to get

Re: InvocationTargetException with Nutch 2.0 Gora 0.2 and Cassandra 0.8.4

2011-08-30 Thread Alexis
Hi Tom, I'm having the same issue. The two missing jars in the nutch-2.0-dev.job, cassandra-all-0.8.0.jar and hector-core-0.8.0-1.jar, have been manually uploaded for the Gora build to work into gora-cassandra/lib-ext SVN directory, because for some reason I did not get them downloaded through

InvocationTargetException with Nutch 2.0 Gora 0.2 and Cassandra 0.8.4

2011-08-29 Thread lewis john mcgibbney
Hi, I believe the following error can be attributed to the java compiler finding (or not finding) more than one version of me.prettyprint.hector.api.Serializer. Has anyone experienced this whilst getting the above (or similar) setup configured and running?

RE: InvocationTargetException with Nutch 2.0 Gora 0.2 and Cassandra 0.8.4

2011-08-29 Thread Tom Davidson
Subject: InvocationTargetException with Nutch 2.0 Gora 0.2 and Cassandra 0.8.4 Hi, I believe the following error can be attributed to the java compiler finding (or not finding) more than one version of me.prettyprint.hector.api.Serializer. Has anyone experienced this whilst getting the above

[jira] [Created] (NUTCH-1094) create comprehensive documentation for Nutch 2.0 trunk

2011-08-24 Thread Lewis John McGibbney (JIRA)
create comprehensive documentation for Nutch 2.0 trunk -- Key: NUTCH-1094 URL: https://issues.apache.org/jira/browse/NUTCH-1094 Project: Nutch Issue Type: Sub-task Components

Re: Unresolved compilation problems with Nutch 2.0

2011-08-15 Thread Markus Jelsma
I've seen similar issues anmd reported at least one: https://issues.apache.org/jira/browse/NUTCH-970 Using HSQLDB did work. At least it didn't throw exceptions... On Wednesday 10 August 2011 21:45:51 lewis john mcgibbney wrote: Hi, Pretty nasty looking error when using most recent Nutch

  1   2   >