RE: fdt file format

2011-10-04 Thread Michael Ryan
> After those 4 bytes it should match? Thanks. Yup, seems to match after that. -Michael - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: fdt file format

2011-10-03 Thread Michael McCandless
Look at src/java/org/apache/lucene/index/FieldsWriter.java It looks like the file format docs don't specify the 4 byte (int, value=3, big-endian I think) header we write. After those 4 bytes it should match? Mike McCandless http://blog.mikemccandless.com On Mon, Oct 3, 2011 at 3:11 PM, Michael

Re: .fdt file

2008-07-11 Thread Michael McCandless
I think the FieldCache API fits for this... Its purpose is to do a one-time caching of a particular field's values, rendered as a straight array keyed by document ID. So building that cache initially takes time and memory, but then getting the value for a particular document is extremely

Re: .fdt file

2008-07-10 Thread blazingwolf7
Well, according to him, using the reader to access the index everytime a document is found to retrieve certain values is inefficient. Meaning if there is 500k document, the index will be access 500k times. It might affect the performance of the search. So I am instructed to retrieve all the neces

Re: .fdt file

2008-07-10 Thread Grant Ingersoll
On Jul 10, 2008, at 1:42 AM, blazingwolf7 wrote: Well, I am trying to extract the URL and contentLength from the ".fdt" file. I am planning to use both of these values in a filter to remove certain links to be display in the search result. The problem is, I am told not to use the IndexR

Re: .fdt file

2008-07-10 Thread blazingwolf7
Thanks. I think I will follow the advice. But just for the sack of curiosity, can what I suggest be done ? Yonik Seeley wrote: > > On Thu, Jul 10, 2008 at 1:42 AM, blazingwolf7 <[EMAIL PROTECTED]> > wrote: >> Well, I am trying to extract the URL and contentLength from the ".fdt" >> file. >> I a

Re: .fdt file

2008-07-10 Thread Yonik Seeley
On Thu, Jul 10, 2008 at 1:42 AM, blazingwolf7 <[EMAIL PROTECTED]> wrote: > Well, I am trying to extract the URL and contentLength from the ".fdt" file. > I am planning to use both of these values in a filter to remove certain > links to be display in the search result. The problem is, I am told not

Re: .fdt file

2008-07-09 Thread blazingwolf7
Well, I am trying to extract the URL and contentLength from the ".fdt" file. I am planning to use both of these values in a filter to remove certain links to be display in the search result. The problem is, I am told not to use the IndexReader to retrieve these values for each document found match

Re: .fdt file

2008-07-09 Thread Yonik Seeley
On Wed, Jul 9, 2008 at 11:13 PM, blazingwolf7 <[EMAIL PROTECTED]> wrote: > Sorry,but I am still quite new to Lucene. What exactly is "cp"? The unix command for copy (hence the smiley). Some of your recent questions seem to be suffering from an XY problem: http://www.perlmonks.org/index.pl?node_id

Re: .fdt file

2008-07-09 Thread blazingwolf7
Sorry,but I am still quite new to Lucene. What exactly is "cp"? Yonik Seeley wrote: > > On Wed, Jul 9, 2008 at 9:01 PM, blazingwolf7 <[EMAIL PROTECTED]> > wrote: >> I had recently found out that Lucene will retrieve the content of a >> document >> from a file ".fdt". I am trying to retrieve the

Re: .fdt file

2008-07-09 Thread Yonik Seeley
On Wed, Jul 9, 2008 at 9:01 PM, blazingwolf7 <[EMAIL PROTECTED]> wrote: > I had recently found out that Lucene will retrieve the content of a document > from a file ".fdt". I am trying to retrieve the entire file in one go > instead of retrieving it based on document number. can it be done? "cp" c