[Nutch-dev] Rules for removing session IDs

2004-12-28 Thread Chirag Chaman
Just in case anyone is looking for rules to remove session IDs: the following rules will keep duplicate pages from being indexed. They need to be added to regex-normalize.xml. These rules cover about 80% of the cases I've seen on the net. If anyone wants to send me URLs to other implementati
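
The rules themselves are cut off in this listing, so the following is only a rough sketch of the general idea: strip session identifiers from a URL so that two links to the same page normalize to the same string. The patterns, the parameter names (jsessionid, PHPSESSID, sessionid, sid) and the function name strip_session_ids are assumptions chosen for illustration; they are not the rules from the post and not the contents of regex-normalize.xml.

```python
import re

# Hypothetical patterns for illustration only; not the rules from the post.
SESSION_ID_PATTERNS = [
    # ";jsessionid=..." path parameter, as appended by Java servlet containers
    re.compile(r";jsessionid=[^?#&]*", re.IGNORECASE),
    # query parameters such as PHPSESSID=..., sessionid=..., sid=...
    re.compile(r"(?<=[?&])(?:phpsessid|sessionid|sid)=[^&#]*&?", re.IGNORECASE),
]


def strip_session_ids(url: str) -> str:
    """Return the URL with the session-ID parts listed above removed."""
    for pattern in SESSION_ID_PATTERNS:
        url = pattern.sub("", url)
    # Remove a dangling '?' or '&' left at the end of the query string.
    return re.sub(r"[?&]+(?=#|$)", "", url)
```

With rules like these, two URLs that differ only in their session ID collapse to one entry, which is what prevents the same page from being indexed twice.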

RE: [Nutch-dev] Is there any way to merge the segment directories?

2004-12-28 Thread Chirag Chaman
By 2 segments I mean -- "It will merge all the segments in the source to the target". You can also specify the list of source segments on the command line. Run the tool without any flags and it should display usage. NOTE: please get the latest one from CVS -- it went through a major rewrite.

[Nutch-dev] Fetch / Parse errors and a Bug

2004-12-28 Thread Chirag Chaman
So, after some research I think one of the 2 issues I reported earlier can be fixed. To refresh, the error in question is: > fetch okay, but can't parse http://java.sun.com/j2se/1.4.2/docs/api/java/nio/charset/Charset.html, reason: Content-Type not application/pdf: The problem is that this page d

RE: [Nutch-dev] Is there any way to merge the segment directories?

2004-12-28 Thread Chirag Chaman
Use the fastSegmentMerge tool (now the default one). Call the SegmentMergeTool with the source directory and the target directory and it will merge the 2 segments (it won't recrawl, but it will reindex).

[Nutch-dev] Is there any way to merge the segment directories?

2004-12-28 Thread Michael Sashnikov
Is there any way to merge the segment directories without recrawling everything?

RE: [Nutch-dev] (no subject)

2004-12-28 Thread Chirag Chaman
Here's another problem: fetch okay, but can't parse http://java.sun.com/j2se/1.4.2/docs/api/java/nio/channels/FileLock.html, reason: Content-Type not application/pdf: This should not even be going into the PDF plugin. After some research I seem to have found what I think is the problem. All t

Re: [Nutch-dev] Results Syndication Feed

2004-12-28 Thread Hasan Diwan
Sir: Please refer to my bug report #1092096 for a first draft. Essentially, I added a generateRSS method to the Hits class. On Tue, 28 Dec 2004 08:35:28 -0800, Doug Cutting <[EMAIL PROTECTED]> wrote: > A servlet that uses NutchBean should do most of what's required. If you > need results date-ord
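
For readers wondering what RSS output of search results might look like, here is a minimal sketch that turns a list of hits into an RSS 2.0 document. It is not the generateRSS method from the bug report, which is not shown in this listing. The function name hits_to_rss and the hit fields (title, url, summary) are hypothetical stand-ins for whatever a search front end actually returns.

```python
import xml.etree.ElementTree as ET


def hits_to_rss(query, hits, search_url):
    """Build an RSS 2.0 string from search hits.

    `hits` is assumed to be a list of dicts with the hypothetical keys
    "title", "url" and "summary".
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    # RSS 2.0 requires title, link and description on the channel.
    ET.SubElement(channel, "title").text = f"Search results for: {query}"
    ET.SubElement(channel, "link").text = search_url
    ET.SubElement(channel, "description").text = f"{len(hits)} results"
    for hit in hits:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = hit["title"]
        ET.SubElement(item, "link").text = hit["url"]
        ET.SubElement(item, "description").text = hit["summary"]
    # ElementTree escapes characters such as '&' and '<' in the text nodes.
    return ET.tostring(rss, encoding="unicode")
```

Building the feed through an XML library rather than string concatenation means titles and summaries containing markup characters are escaped automatically.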

Re: [Nutch-dev] Results Syndication Feed

2004-12-28 Thread Doug Cutting
Hasan Diwan wrote: I was wondering if anyone's working on adding RSS output of search results. If not, I have found a decent RSS library for Java and was wondering where I can get the Results of a search in the tree? Thanks for the help! I have not seen anyone do this. It should not be too hard a

[Nutch-dev] (no subject)

2004-12-28 Thread Chirag Chaman
Hello and Happy Holidays! Does anyone know what this error means and how we can avoid it? - fetching http://workforcesecurity.doleta.gov/foreign/times.asp - fetch of http://workforcesecurity.doleta.gov/foreign/times.asp failed with: java.nio.charset.IllegalCharsetNameException: Thankx CC
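
The exception text in the post is empty after the colon, which may (this is a guess, not something the thread confirms) point to a Content-Type header with an empty or malformed charset value. A minimal sketch of one defensive approach: read the charset from the header and fall back to a default when the name is missing or unknown. The function name charset_from_content_type and the utf-8 default are assumptions for illustration; this is not how the Nutch fetcher handles it.

```python
import codecs
import re

# Captures the value after "charset=", with or without surrounding quotes.
_CHARSET_RE = re.compile(r"charset\s*=\s*[\"']?([^\s;\"']*)", re.IGNORECASE)


def charset_from_content_type(content_type, default="utf-8"):
    """Return a usable codec name for a Content-Type header value."""
    match = _CHARSET_RE.search(content_type or "")
    name = match.group(1) if match else ""
    if not name:
        # No charset parameter, or an empty one such as "charset=".
        return default
    try:
        return codecs.lookup(name).name
    except LookupError:
        # The server sent a charset name that is not a known codec.
        return default
```

The point of the fallback is that one badly configured server should cost, at worst, some mis-decoded characters rather than a failed fetch.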

[Nutch-dev] [ nutch-Bugs-1092096 ] RSS Feed for searches

2004-12-28 Thread SourceForge.net
Bugs item #1092096, was opened at 2004-12-28 01:43 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=1092096&group_id=59548 Category: searcher Group: mainline Status: Open Resoluti

[Nutch-dev] Results Syndication Feed

2004-12-28 Thread Hasan Diwan
I was wondering if anyone's working on adding RSS output of search results. If not, I have found a decent RSS library for Java and was wondering where I can get the Results of a search in the tree? Thanks for the help! -- Cheers, Hasan Diwan <[EMAIL PROTECTED]> -