in context:
http://www.nabble.com/Image-Search-Engine-Input-tf3469257.html#a14112016
Sent from the Nutch - Dev mailing list archive at Nabble.com.
Hi,
My question is not strictly to do with image search but I can't help feeling
the issue is somewhat related in terms of where to store what: I want to
spell correct web pages prior to indexing and only index the corrected
terms. I still want to store the original errorful text so this can be
Steve Severance wrote:
I am not looking to really make an image retrieval engine. During indexing
referencing docs will be analyzed and text content will be associated with the
image. Currently I want to keep this in a separate index. So despite the fact
that images will be returned the
Severance [mailto:[EMAIL PROTECTED]
Sent: Monday, March 26, 2007 4:04 PM
To: nutch-dev@lucene.apache.org
Subject: Image Search Engine Input
Hey all,
I am working on the basics of an image search engine. I want to ask for
feedback on something.
Should I create a new directory in a segment parse_image
Steve Severance wrote:
So now that I have spent a few hours looking into how this works a lot more
deeply I am even more of a conundrum. The fetcher passes the contents of the
page to the parsers. It assumes that text will be output from the parsers.
For instance even the SWF parser returns
Hey all,
I am working on the basics of an image search engine. I want to ask for
feedback on something.
Should I create a new directory in a segment parse_image? And then put the
images there? If not where should I put them, in the parse_text? I created a
class ImageWritable just like the Jira
to change the way a lot of things work in
the process. Let me know what you all think.
Steve
-Original Message-
From: Steve Severance [mailto:[EMAIL PROTECTED]
Sent: Monday, March 26, 2007 4:04 PM
To: nutch-dev@lucene.apache.org
Subject: Image Search Engine Input
Hey all,
I am working