RE: Put field in database vs. Lucene

2005-11-02 Thread Robert Engels
If you do not put them in Lucene, performing any sort of AND search will be VERY difficult, and/or VERY slow. -Original Message- From: Mario Alejandro M. [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 02, 2005 4:08 PM To: Lucene Developers List Subject: Put field in database vs. Lucen

Put field in database vs. Lucene

2005-11-02 Thread Mario Alejandro M.
I have a requeriment in mix structured and un-structured data. Also, the non-structured data have meta-data. If I have: Id Title Author Content DateCreated DateAccesed And a lot od schemes like that... so I put all fields less Content in a database or in lucene? -- Mario Alejandro Montoya MCP w

[jira] Resolved: (LUCENE-350) counter field in segments file is not documented in fileformats.xml

2005-11-02 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-350?page=all ] Daniel Naber resolved LUCENE-350: - Fix Version: unspecified Resolution: Fixed Assign To: (was: Lucene Developers) Both name counter and segment counter are now document at

What do you think about this "term collapsing"? technique...

2005-11-02 Thread Mario Alejandro M.
I'm reading now http://research.nitle.org/lsi/lsa_explanation.htm How much valuable is it? I'm looking for a way to lower the "noise" in searching (the infamous "lucky bug, get 100.000 hits!") and improve accuracy. I plan to build a search engine for the enterprise so I don't can count in a lot

Re: Faking index merge by modifying segments file?

2005-11-02 Thread Paul Elschot
On Wednesday 02 November 2005 12:47, Otis Gospodnetic wrote: > Hello, > > --- Paul Elschot <[EMAIL PROTECTED]> wrote: ... > > > It's possible to share segments between indexes when the file system > > allows files to be present in multiple directories. > > Oh, are you saying that I could just le

RE: Faking index merge by modifying segments file?

2005-11-02 Thread Robert Engels
There only need to be sorted if segA and segB were combined so in your case, this is not needed. I am not sure that what you are describing is any different than how MultiReader works, and it does not need to perform any file copying of linking. Just create the new index. Write the documents. And

Re: Faking index merge by modifying segments file?

2005-11-02 Thread David Balmain
> This sounds like it should be possible, except for docId clashes - if > index A had a document with Id 100 and index B also has a document with > Id 100, after my index file copying, index C will end up having 2 > documents with Id 100, and that won't work. So, documents in C would > have to be

Re: Faking index merge by modifying segments file?

2005-11-02 Thread Otis Gospodnetic
Hello, --- Paul Elschot <[EMAIL PROTECTED]> wrote: > On Tuesday 01 November 2005 08:51, Otis Gospodnetic wrote: > > Hello, > > > > I spent most of today talking to some people about Lucene, and one > of > > them said how they would really like to have an "instantaneous > index > > merge", and ho

RE: Faking index merge by modifying segments file?

2005-11-02 Thread Otis Gospodnetic
Hello, --- Robert Engels <[EMAIL PROTECTED]> wrote: > Problem is the terms need to be sorted in a single segment. Are you referring to Term Dictionary (.tis and .tii files as described at http://lucene.apache.org/java/docs/fileformats.html )? If so, is that really true? I don't have an optimiz