Re: Lucene Indexing structure

Vaijanath N. Rao Sun, 04 May 2008 06:40:54 -0700

Hi Chris,

Sorry for the cross-posting and also for not making clear the problem.Let me try to explain the problem at my hand.

I am tying to write a CBIR (Content Based Image Reterival) frame workusing lucene. As each document have entities such as title, description,author and so on. I am decomposing each image and extracting featureslike color histogram, texture and other important attributes from everyimage and indexing it in lucene such a way that each of this attributeis a field. I convert the float values as string for every feature thatI have extracted from the image.

While searching for similar image I extract the same set of features forthe query Image and than query lucene to get all those images which haveatleast one of the features, than I do the re-ranking according to thedifference of the features. Once the re-ranking is done I submit theresult.Here is where I need help, I need to know an optimal way to store thevalues, so that searching take less time and I don't have to re-ranking.Is there any way I can compare array of values rather than one value.What I essentially need is to get the query of type, give me all thosefeatures which are less than K distance from the current feature.


--Thanks and Regagrds
Vaijanath

Chris Hostetter wrote:

: Hi Lucene-user and Lucene-dev,
Please do not cross post -- java-user is the suitable place for yourquestion.
: Obviously there is something wrong with the above approach (as to get the
: correct document we need to get all the documents and than do the required
: distance calculation), but that' due to lack of my knowledge of Luce and
: lucene's Index storage.
:: What I want to know how to improve upon the exsisting architecture other than
: making number of fields in the lucene equalling to total number of
: feature*size of each feature.
I suspect one of the reasons you haven't gotten much of a response yet isthat people may not understand your problem statement -- I know nothing ofImage Processing and even after googling "Color Histogram" I don't reallyunderstand how the examples you gave represent Color Histograms, or whatit would mean to search on it with your example input.
Perhaps you could describe in more detail what exactly some sampledata looks like, why certian objects should match certain queries, (andjust as importantly: why other objects shouldn't match, and give examplesof one one object is a "better" match then another object for each examplequery.
don't worry about Lucene Document/Field/QueryParse specifics -- justexplain the concepts you are dealing with.
-Hoss



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Lucene Indexing structure

Reply via email to