Re: Clustering Lucene's results

2004-10-07 Thread Albert Vila
n works with the "flat" version of the Lingo algorithm, so you don't need to worry about it. Hope this gets you started with using Carrot2 and Lucene. Please let me know about any successes or failures. Dawid

Using '*' wildcard in phrase searches

2004-07-05 Thread Albert Vila
I want to use the * wildcard in the middle of a phrase search, like "java j2*". Does anyone know how I can achieve that? Thanks, Albert - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
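
The matching rule being asked for can be sketched in plain Python. This is not Lucene API, only an illustration of a phrase match in which a word ending in `*` is treated as a prefix; the function name and the sample sentence are hypothetical. Later Lucene releases appear to cover this case with a query class that accepts several alternative terms at one phrase position (MultiPhraseQuery), which is worth checking against the version in use.

```python
def phrase_prefix_match(tokens, phrase):
    """Return the start positions where `phrase` matches consecutive tokens.

    A phrase word ending in '*' matches any token that starts with the
    text before the '*'; every other word must match exactly.
    """
    words = phrase.lower().split()
    if not words:
        return []
    tokens = [t.lower() for t in tokens]

    def word_matches(word, token):
        if word.endswith("*"):
            return token.startswith(word[:-1])
        return token == word

    hits = []
    # Slide a window of len(words) tokens over the token stream.
    for start in range(len(tokens) - len(words) + 1):
        window = tokens[start:start + len(words)]
        if all(word_matches(w, t) for w, t in zip(words, window)):
            hits.append(start)
    return hits


# Hypothetical sample text, tokenized on whitespace.
sample = "we deploy java j2ee apps and java j2me games but not java swing".split()
print(phrase_prefix_match(sample, "java j2*"))
```

A real index would expand `j2*` against its term dictionary instead of scanning every token, but the rule for what counts as a match is the same.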

Re: Clustering question: searching two different indexes

2004-06-23 Thread Albert Vila
:)), so Lucene will simply make a cumulative index from your two separate indices:
  luceneID_0, code_x, title_x, content_x, language_x, date_x
  luceneID_1, code_y, title_y, content_y, language_y, date_y
  luceneID_0, code_y, cluster_y
  luceneID_1, code_x, cluster_x
Otis --- Albert Vila <[EMAIL PR

Re: Clustering question: searching two different indexes

2004-06-23 Thread Albert Vila
ll the previous index structure produce an inconsistent merged index? I want to achieve the following merged index:
  luceneID_0, code_x, title_x, content_x, language_x, date_x, cluster_x
  luceneID_1, code_y, title_y, content_y, language_y, date_y, cluster_y
Thanks. Otis Gospodnetic wrote: Albert, ---
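
The merged layout wanted here pairs each document with the cluster record that carries the same `code`, not the one that happens to share a Lucene document number. A minimal sketch of that join in plain Python, done before anything is indexed; the dictionary keys mirror the field names in the post, and the function name is hypothetical:

```python
def merge_by_code(documents, clusters):
    """Attach to each document the cluster whose `code` matches.

    Documents without a matching cluster record get cluster=None.
    """
    cluster_by_code = {c["code"]: c["cluster"] for c in clusters}
    merged = []
    for doc in documents:
        row = dict(doc)  # copy so the input records stay untouched
        row["cluster"] = cluster_by_code.get(doc["code"])
        merged.append(row)
    return merged


# Placeholder records in the same order as the two indexes in the thread.
documents = [
    {"code": "code_x", "title": "title_x"},
    {"code": "code_y", "title": "title_y"},
]
clusters = [
    {"code": "code_y", "cluster": "cluster_y"},
    {"code": "code_x", "cluster": "cluster_x"},
]
for row in merge_by_code(documents, clusters):
    print(row)
```

Joining on `code` is what keeps the result consistent even when the two sources list the records in a different order.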

Re: Clustering question: searching two different indexes

2004-06-23 Thread Albert Vila
want to do, but I'm not certain I understood you fully. Otis --- Albert Vila <[EMAIL PROTECTED]> wrote: Hi all, I was wondering if I can search using the MultiSearcher over two different indexes at the same time (with different fields). I've got one big index, with the code, tit

StandardTokenizer and e-mail

2004-05-21 Thread Albert Vila
rching for '[EMAIL PROTECTED]', for 'xyz' only, or for 'company' only. How can I achieve that? Do I need to write my own tokenizer? Thanks, Albert -- Albert Vila, R&D Project Director http://www.imente.com 902 933 242 [iMente "Information with more benefits"]
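
One way to get all three searches to hit is to emit the whole address and its parts as separate tokens at indexing time. The sketch below shows that idea in plain Python, not as a Lucene tokenizer; the address `xyz@company.com` is a hypothetical stand-in for the redacted one, and the function name is an assumption.

```python
import re


def email_tokens(address):
    """Emit the full address plus its local-part and domain pieces."""
    address = address.lower()
    local, _, domain = address.partition("@")
    # Split the local part on common separators, the domain on dots.
    parts = [p for p in re.split(r"[.\-_+]", local) if p]
    parts += [p for p in domain.split(".") if p]
    tokens = [address]
    for part in parts:
        if part not in tokens:
            tokens.append(part)
    return tokens


# Hypothetical address standing in for the redacted one in the post.
print(email_tokens("xyz@company.com"))
```

In Lucene terms this would most likely live in a custom analyzer or token filter, with the extra tokens placed at the same position as the full address.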

Re: Large document collections?

2004-03-10 Thread Albert Vila Puig
-- Albert Vila Puig http://www.imente.com [iMente, the largest aggregator of headlines in Spanish]

IndexOrderSearcher over multiple indexes

2004-01-15 Thread Albert Vila Puig
results sorted for each individual index, not across all hits. I need a specific MultiSearcher implementation that deals with this problem. Has anyone already implemented this feature? If not, any comments on how to do it would be appreciated. Thanks, Albert
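
If each index already returns its own hits in the desired order, a global ordering can be obtained with a k-way merge of the per-index lists. A rough sketch in plain Python, assuming the sort key (a date string here) is comparable across indexes; Lucene document numbers are not, since they are local to each index. The hit tuples and function name are hypothetical.

```python
import heapq
from itertools import islice


def merge_sorted_hits(per_index_hits, key, limit=None):
    """Merge hit lists that are each already sorted by `key`.

    Returns the first `limit` hits in global order (all hits if None).
    """
    merged = heapq.merge(*per_index_hits, key=key)
    return list(islice(merged, limit))


# Hypothetical (date, doc) hits, each list sorted within its own index.
index_a = [("2004-01-01", "a1"), ("2004-01-10", "a2")]
index_b = [("2004-01-05", "b1"), ("2004-01-12", "b2")]
print(merge_sorted_hits([index_a, index_b], key=lambda hit: hit[0], limit=3))
```

Because the merge is lazy, only as many hits as `limit` asks for are pulled from each index.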

Re: Remove a token from a field

2003-10-31 Thread Albert Vila Puig
I know there is no way to update a document without doing a delete/add. But I'm asking whether this feature could be implemented efficiently. Thanks. Erik Hatcher wrote: On Friday, October 31, 2003, at 03:53 AM, Albert Vila Puig wrote: Hi, Is there a way to remove a token

Remove a token from a field

2003-10-31 Thread Albert Vila Puig
Hi, Is there a way to remove a token from a document field entry? For example, I've got an UnStored field in my index and I want to remove a token from this field without doing the delete and add document (because I'm inserting the documents by date and I don't want to lose that sort order).

Lucene delete feature

2003-10-24 Thread Albert Vila Puig
Hi all, I've got a question about the delete feature. I have a very large collection of XML documents. Each document contains a classification, and one document can be in several classifications. There is no tree structure in the classifications (all the classifications are in the same leve

Is the lucene index serializable?

2003-09-23 Thread Albert Vila Puig
Can I send a small Lucene index over SOAP/TCP/HTTP/RMI? Is there a way to serialize a Lucene index? I want to send it from the Indexer server to the Search Server, and then do a merge operation in the Search Server with the previous index file. Thanks.
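
A file-system Lucene index is a directory of ordinary files, so one transport-neutral option is to pack that directory into a byte string, ship it over whatever channel is available, and unpack it on the receiving side. The sketch below does this in plain Python with a ZIP archive; the function names are hypothetical, and it assumes a flat index directory that is not being written to while it is packed. The merge itself would then be done with Lucene on the Search Server (`IndexWriter.addIndexes` looks like the relevant call).

```python
import io
import zipfile
from pathlib import Path


def pack_index(index_dir):
    """Pack every file in a flat index directory into ZIP bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(Path(index_dir).iterdir()):
            if path.is_file():
                archive.write(path, arcname=path.name)
    return buffer.getvalue()


def unpack_index(payload, target_dir):
    """Write the packed files into target_dir; return their names, sorted."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(io.BytesIO(payload)) as archive:
        for name in archive.namelist():
            # Keep only the base name so an entry cannot escape target_dir.
            (target / Path(name).name).write_bytes(archive.read(name))
    return sorted(p.name for p in target.iterdir())
```

The payload is plain bytes, so the choice between SOAP, HTTP, RMI, or a raw socket only affects how those bytes are carried.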

Distributed Indexing

2003-09-22 Thread Albert Vila Puig
Hi, I have to develop a distributed search engine for my company. I’m very interested in the Lucene index format, and I want to use it. The main problem is how to distribute the index across the different machines. The solution is not just to copy the index, because I have to manage 50 GB of data. I

Large data sets benchmark

2003-09-16 Thread Albert Vila Puig
Hello, I'm planning to use Lucene in a big environment, with large data sets (between 50 and 100 GB of data). I've seen that some people have experience with this kind of environment. I was wondering if someone could provide a benchmark with this amount of data. Another question: is someone using Lucene