[
https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12583877#action_12583877
]
Gordon Mohr commented on NUTCH-296:
---
FYI: We've suggested image-search exte
at to see you started work on this!", but then noticed
your comment is from March 2007, not March 2008. :)
So, instead, let me ask: "Did anything come out of this?" I, too, am seeing a
need for a Nutch-based search engine.
> Image Search
>
>
context:
http://www.nabble.com/Image-Search-Engine-Input-tf3469257.html#a14112016
Sent from the Nutch - Dev mailing list archive at Nabble.com.
Hi,
My question is not strictly to do with image search but I can't help feeling
the issue is somewhat related in terms of where to store what: I want to
spell correct web pages prior to indexing and only index the corrected
terms. I still want to store the original errorful text so this c
Steve Severance wrote:
I am not looking to really make an image retrieval engine. During indexing
referencing docs will be analyzed and text content will be associated with the
image. Currently I want to keep this in a separate index. So despite the fact
that images will be returned the search
Hey guys. Thanks for the replies.
> -Original Message-
> From: Andrzej Bialecki [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, March 27, 2007 3:52 AM
> To: nutch-dev@lucene.apache.org
> Subject: Re: Image Search Engine Input
>
> Steve Severance wrote:
> > So
al
ones in content/). Another example: we could convert PDF/DOC/PPT files
to HTML, and store this output in the "HTML preview" part.
So there are 3 choices for moving forward with an image search,
1. All image data can be encoded as strings. I really don't like that choice
Hi Steve,
Good point.
We are also working on a image search. For the time being, we store the
parsed content (a downscaled version of the image) by replacing the
original content during parsing Not an ideal solution, I know!
My first reaction is that your 2nd suggestion is the way to go.
On
, images,
videos, music, etc... this is problematic. Potentially confounding the
problem even further in the case of music is that text and binary data can
come from the same file. Even if that is a problem I am not going to tackle
it.
So there are 3 choices for moving forward with an image search,
1
Hey all,
I am working on the basics of an image search engine. I want to ask for
feedback on something.
Should I create a new directory in a segment parse_image? And then put the
images there? If not where should I put them, in the parse_text? I created a
class ImageWritable just like the Jira
would be great.
Steve
> Image Search
>
>
> Key: NUTCH-296
> URL: https://issues.apache.org/jira/browse/NUTCH-296
> Project: Nutch
> Issue Type: New Feature
>Reporter: Thomas Delnoij
>
[ http://issues.apache.org/jira/browse/NUTCH-296?page=all ]
Thomas Delnoij updated NUTCH-296:
-
Description:
Per the discussion in the Nutch-User mailing list, there is a wish for an
"Image Search" add-on component that will index images.
Image Search
Key: NUTCH-296
URL: http://issues.apache.org/jira/browse/NUTCH-296
Project: Nutch
Type: New Feature
Reporter: Thomas Delnoij
Priority: Minor
Per the discussion in the Nutch-User mailing list, there is a wish for an
"Image S
Somebody try create image search based on nutch ?
Dear Developers,
Have anyone an 'image search' solution?
Regards,
Ferenc
15 matches
Mail list logo