FieldCache vs TermVector

2006-11-22 Thread Volodymyr Bychkoviak
: - Can TermVector be used instead of FieldCache to implement sorting (and other activities where FieldCache is used) ? - Would it be much slower? -- regards, Volodymyr Bychkoviak - To unsubscribe, e-mail: [EMAIL PROTECTED] For

Re: obtaining the number of documents stored in a .cfs file

2006-10-26 Thread Volodymyr Bychkoviak
One mistake in this code: it should be infos.counter = ++counter; instead of infos.counter = counter++; Volodymyr Bychkoviak wrote: I've used the following code to recover the index. Note: it only works with .cfs files. String path = // path to index File file = new File(path); Dire
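The correction in this message comes down to Java's pre- versus post-increment semantics. A minimal sketch, using a plain int in place of the thread's infos.counter (the class and method names here are made up for illustration and are not part of the recovery code):

```java
class CounterIncrementSketch {
    /** Returns {value assigned, counter afterwards} when using counter++. */
    static int[] postIncrement(int counter) {
        int assigned = counter++; // assigns the OLD value, then increments
        return new int[] { assigned, counter };
    }

    /** Returns {value assigned, counter afterwards} when using ++counter. */
    static int[] preIncrement(int counter) {
        int assigned = ++counter; // increments first, then assigns the NEW value
        return new int[] { assigned, counter };
    }

    public static void main(String[] args) {
        int[] post = postIncrement(5);
        int[] pre = preIncrement(5);
        System.out.println("counter++ assigns " + post[0] + ", counter becomes " + post[1]);
        System.out.println("++counter assigns " + pre[0] + ", counter becomes " + pre[1]);
    }
}
```

With counter++, the assigned value lags one behind the local counter, which appears to be the mistake the message points out.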

Re: Error using Luke

2006-10-18 Thread Volodymyr Bychkoviak
with command line like this: java -classpath luke.jar;lucene-1.9.1.jar org.getopt.luke.Luke vasu shah wrote: Thank you very much. I have indeed turned off the norms. Is there any new version of Luke that I can use? Thanks, -Vasu Volodymyr Bychkoviak <[EMAIL PROTECTED]> wro

Re: Error using Luke

2006-10-18 Thread Volodymyr Bychkoviak
?? Any help would be appreciated. Thanks, -Vasu

MultiFieldQueryParser vs concatenated field

2006-10-03 Thread Volodymyr Bychkoviak

Re: DateTools again

2006-10-03 Thread Volodymyr Bychkoviak
Thanks for the detailed explanation. John Haxby wrote: Volodymyr Bychkoviak wrote: The user has an input (a JavaScript calendar) on the page where he can choose a date to include in the search. Search resolution is day resolution. If the user enters the same date at a different time of day he will get

Re: DateTools again

2006-10-03 Thread Volodymyr Bychkoviak
h "Calendar cal = Calendar.getInstance();" John Haxby wrote: Volodymyr Bychkoviak wrote: I'm using DateTools with Resolution.DAY. I know that dates internally are converted to GMT. Converting dates "2006-10-01 00:00" and "2006-10-01 15:00" from "Etc

DateTools again

2006-10-02 Thread Volodymyr Bychkoviak
e dates are identical with day resolution. Is this a bug or am I missing something? -- regards, Volodymyr Bychkoviak
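The effect discussed in the DateTools threads can be reproduced with the standard library alone. This is only a sketch of the GMT conversion described there, not Lucene's code: the fixed UTC+3 offset is an assumption (the thread's actual time zone is cut off in the excerpt), and the yyyyMMdd day string is chosen for illustration.

```java
import java.time.LocalDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

class DayResolutionSketch {
    // Day-resolution key format; assumed here for illustration.
    private static final DateTimeFormatter DAY = DateTimeFormatter.ofPattern("yyyyMMdd");

    /** Converts a local wall-clock time to GMT and truncates it to a day string. */
    static String gmtDay(LocalDateTime local, ZoneOffset localOffset) {
        return local.atOffset(localOffset)
                    .withOffsetSameInstant(ZoneOffset.UTC)
                    .format(DAY);
    }

    public static void main(String[] args) {
        ZoneOffset plus3 = ZoneOffset.ofHours(3); // assumed local offset
        System.out.println(gmtDay(LocalDateTime.of(2006, 10, 1, 0, 0), plus3));
        System.out.println(gmtDay(LocalDateTime.of(2006, 10, 1, 15, 0), plus3));
    }
}
```

Under that assumed offset, local midnight falls on the previous GMT day, so two times on the same local date produce different day keys. One possible workaround is to build the day key from the local calendar fields before handing it to the index.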

Re: obtaining the number of documents stored in a .cfs file

2006-09-06 Thread Volodymyr Bychkoviak
One more note: this should be in package 'org.apache.lucene.index;' because it uses some package-visible classes :) Volodymyr Bychkoviak wrote: I've used the following code to recover the index. Note: it only works with .cfs files. String path = // path to index File file

Re: obtaining the number of documents stored in a .cfs file

2006-09-06 Thread Volodymyr Bychkoviak
ht number of docs for each segments or any (dummy) number will do? If I have to put the right number there, how do I get it having the cfs file? Stanislav -- regards, Volodymyr Bychkoviak

Re: Indexing very slow.

2006-07-03 Thread Volodymyr Bychkoviak
rashes windows after about 5000 docs. it says "beep" and a complete shutdown... Peter -- regards, Volodymyr Bychkoviak

Re: addIndexes() is taking infinite time ...

2006-06-20 Thread Volodymyr Bychkoviak
time grows exponentially Can anybody explain this in brief. Thanks in advance.. With Regards -- regards, Volodymyr Bychkoviak

Re: Compound / non-compound index files and SIGKILL

2006-06-06 Thread Volodymyr Bychkoviak
n orphaned files and/or a corrupted index, if killed. -Original Message- From: Volodymyr Bychkoviak [mailto:[EMAIL PROTECTED] Sent: 06 June 2006 10:54 To: java-user@lucene.apache.org Subject: Re: Compound / non-compound index files and SIGKILL If your content handlers should respond qui

Re: Compound / non-compound index files and SIGKILL

2006-06-06 Thread Volodymyr Bychkoviak
not in the content handlers, as I'd previously assumed. I'll put some debug before and after my addDocument() calls to confirm (and point my writer's infoStream to System.out). -Original Message- From: Volodymyr Bychkoviak [mailto:[EMAIL PROTECTED] Sent: 05 June 2006 18:33

Re: Compound / non-compound index files and SIGKILL

2006-06-05 Thread Volodymyr Bychkoviak
e's my question: How can I attempt to merge these orphaned into the compound index, using IndexWriter.addIndexes(), or would I be foolish attempting this? -- regards, Volodymyr Bychkoviak

Re: Efficiently paginating results.

2006-04-28 Thread Volodymyr Bychkoviak

Re: Return all distinct values

2006-03-30 Thread Volodymyr Bychkoviak

Re: Paging results

2006-03-29 Thread Volodymyr Bychkoviak

Re: File Name Search

2006-03-06 Thread Volodymyr Bychkoviak

Re: Question to Lucene Index

2006-02-24 Thread Volodymyr Bychkoviak
100"); Or, you can set a value on the JVM's command line with -D. Good luck! --MDC -- regards, Volodymyr Bychkoviak

Re: When do files in 'deleteable' get deleted?

2006-02-14 Thread Volodymyr Bychkoviak
- does it ever work on windows? As far as I can tell, nothing should be holding a lock on those files. Yet, even when I shut down the only jvm that is using these indexes, and then open a new one, and perform a search, they don't go away. Thanks, Dan -- regards, Volodymyr Bychkoviak

Re: Memory problem

2006-02-01 Thread Volodymyr Bychkoviak

Re: complex search

2005-10-31 Thread Volodymyr Bychkoviak
and use the FieldCache. -Hoss

Re: Indexing

2005-10-31 Thread Volodymyr Bychkoviak

complex search

2005-10-31 Thread Volodymyr Bychkoviak
Hi all. I have indexed a table from a database into an index and it looks like: itemId is not unique. I want to implement a search whose SQL equivalent looks like select itemId, min(price) from where groupBy itemId Is it possible to achieve? -- regards, Volodymyr Bychkoviak

Re: Terms contain spaces

2005-10-27 Thread Volodymyr Bychkoviak
t to generate for strings in quotes? Thanks Mike -- regards, Volodymyr Bychkoviak

Re: Database File Store (SQLDirectory?)

2005-10-26 Thread Volodymyr Bychkoviak

Non scored results

2005-10-21 Thread Volodymyr Bychkoviak
Hello all. If I need non-scored results, can I use the Searcher.search(Query,HitCollector) method and store a bitset of found documents? Will it give me some speed improvement? Thanks. -- regards, Volodymyr Bychkoviak
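The idea in this question, recording matches in a bitset and ignoring scores, can be sketched without Lucene on the classpath. The DocCallback interface below is a hypothetical stand-in that only mimics the shape of a per-hit callback taking a document number and a score; it is not a Lucene type, and the array of matching documents stands in for whatever the searcher would report.

```java
import java.util.BitSet;

class BitSetCollectorSketch {
    /** Hypothetical per-hit callback; a stand-in, not part of Lucene. */
    interface DocCallback {
        void collect(int doc, float score);
    }

    /** Marks every reported document in a BitSet and discards its score. */
    static BitSet collectMatches(int maxDoc, int[] matchingDocs) {
        BitSet bits = new BitSet(maxDoc);
        DocCallback callback = (doc, score) -> bits.set(doc); // score is ignored
        for (int doc : matchingDocs) {
            callback.collect(doc, 1.0f); // simulated hits
        }
        return bits;
    }

    public static void main(String[] args) {
        BitSet hits = collectMatches(10, new int[] { 1, 3, 7 });
        System.out.println(hits.cardinality() + " documents matched: " + hits);
    }
}
```

Whether this is faster than a sorted, scored result list depends on the query and the index, so it is worth measuring rather than assuming.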

Re: java on 64 bits

2005-10-21 Thread Volodymyr Bychkoviak
lation between the size of the index and the jvm used? I hope the questions make sense, thanks, roxana

Re: IndexWriter.optimize() need to much time.

2005-10-05 Thread Volodymyr Bychkoviak

Re: IndexWriter.optimize() need to much time.

2005-10-05 Thread Volodymyr Bychkoviak

Re: Renewing IndexSearcher on index change.

2005-10-04 Thread Volodymyr Bychkoviak
but this method should be used only if you are running a single instance of such a program (because one program can unlock an index locked by another program, for indexing for example) -- regards, Volodymyr Bychkoviak

Re: Renewing IndexSearcher on index change.

2005-10-04 Thread Volodymyr Bychkoviak
xception under some scorer class; unfortunately I did not keep the trace.

Re: OutOfMemory error when searching

2005-08-19 Thread Volodymyr Bychkoviak

Re: merging indexes together

2005-08-09 Thread Volodymyr Bychkoviak
eletion is done just get a new IndexReader instance to access the new documents. Aviran http://www.aviransplace.com -Original Message- From: Volodymyr Bychkoviak [mailto:[EMAIL PROTECTED] Sent: Monday, August 08, 2005 1:50 PM To: java-user@lucene.apache.org Subject: merging indexes toge

merging indexes together

2005-08-08 Thread Volodymyr Bychkoviak
owing... current time to merge indexes is about 5min. Any ideas how to optimize this? -- regards, Volodymyr Bychkoviak

Re: Lock Issue

2005-08-05 Thread Volodymyr Bychkoviak

Re: Distributable CD

2005-08-04 Thread Volodymyr Bychkoviak

Re: IOException IndexReader out of date

2005-07-07 Thread Volodymyr Bychkoviak
Dirk Hennig wrote: Volodymyr Bychkoviak wrote: the problem is that the index was modified between the indexReader.open(index); and indexReader.delete(hitId); method calls. That would explain the exception. But how? It can be modified by another indexReader or indexWriter. The program is

Re: IOException IndexReader out of date

2005-07-07 Thread Volodymyr Bychkoviak
The problem is that the index was modified between the indexReader.open(index); and indexReader.delete(hitId); method calls. regards, Volodymyr Bychkoviak Dirk Hennig wrote: Hallo, When I try to use this to remove several documents from the index

Re: Question for Wildcard Search:

2005-06-23 Thread Volodymyr Bychkoviak
ange of values (to insert gaps for "*" and "?"). This approach is good because it does not rewrite queries and never runs into OutOfMemory or TooManyClauses exceptions. regards, Volodymyr Bychkoviak 14.03.2005 13:54 Dave Kor wrote: Quoting Dave Kor <[EMAIL PROTECTED]>:

Re: Performance with multi index

2005-06-17 Thread Volodymyr Bychkoviak
Each time you close the indexWriter, RAM-cached documents are flushed to disk. If you open and close it per document, many one-document segments are created on disk. minMergeDocs sets the number of documents that are cached in RAM. JM Tinghir wrote: do you keep your indexWriter open all the time
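The flushing behaviour described in this reply can be illustrated with a toy model. Everything below is an assumption made for illustration: ToyWriter is not Lucene's IndexWriter, it only counts how many times buffered documents would be written out as a segment, and it ignores segment merging entirely.

```java
class FlushModelSketch {
    /** Toy writer: buffers documents in RAM and counts segments flushed to disk. */
    static final class ToyWriter {
        private final int minMergeDocs; // documents buffered before a flush
        private int buffered = 0;
        private int segmentsWritten = 0;

        ToyWriter(int minMergeDocs) {
            this.minMergeDocs = minMergeDocs;
        }

        void addDocument() {
            if (++buffered >= minMergeDocs) {
                flush();
            }
        }

        /** Closing flushes whatever is still buffered; returns segments written. */
        int close() {
            flush();
            return segmentsWritten;
        }

        private void flush() {
            if (buffered > 0) {
                segmentsWritten++;
                buffered = 0;
            }
        }
    }

    /** Opens and closes a writer for every document. */
    static int openClosePerDocument(int docs, int minMergeDocs) {
        int segments = 0;
        for (int i = 0; i < docs; i++) {
            ToyWriter writer = new ToyWriter(minMergeDocs);
            writer.addDocument();
            segments += writer.close();
        }
        return segments;
    }

    /** Keeps one writer open for the whole run. */
    static int singleWriter(int docs, int minMergeDocs) {
        ToyWriter writer = new ToyWriter(minMergeDocs);
        for (int i = 0; i < docs; i++) {
            writer.addDocument();
        }
        return writer.close();
    }
}
```

In this model, 100 documents with a buffer of 10 produce 100 one-document segments when the writer is reopened per document, and 10 segments when a single writer stays open.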

Re: Performance with multi index

2005-06-16 Thread Volodymyr Bychkoviak
can you measure "pure" index creation time (without creating XMLs) and one more question: do you keep your indexWriter open all the time during process? JM Tinghir wrote: Well, it just took 145 minutes to index 2670 files (450 MB) in one index (29 MB). It only took 33 minutes when I did it int

Re: Performance with multi index

2005-06-16 Thread Volodymyr Bychkoviak
my previous message lost somewhere :( reposting can you measure "pure" index creation time (without creating XMLs) and one more question: do you keep your indexWriter open all the time during process? best way to determine bottlenecks is profiling :) regards, Volodymyr Bychkoviak

Re: Performance with multi index

2005-06-16 Thread Volodymyr Bychkoviak
exing process. Perhaps you need to optimize the index? Perhaps, never tried it... JM

Re: Indexes auto creation

2005-06-13 Thread Volodymyr Bychkoviak
ctory, new StandardAnalyzer(), true); iw.close(); } else { IndexReader.unlock(FSDirectory.getDirectory(indexDirectory, false)); } } catch (IOException e) { // Exception happened when trying to unlock working index } regards, Volodymyr Bychkoviak Stephane Bailliez

Re: Negative Field Boost

2005-05-26 Thread Volodymyr Bychkoviak
Here you can play with boost: (+includes:(red frogs) -excludes:(red frogs))^2.0 excludes:(red frogs)^0.5 You can also play with the numbers to achieve the best results. This is the first solution I've thought of (I mean there may be more efficient solutions). regards, Volodymyr Bychkoviak Ryan

Re: Negative Field Boost

2005-05-26 Thread Volodymyr Bychkoviak
string query should look like: "+includes:(red frogs) -excludes:(red frogs)" You can play with MultiFieldQueryParser a bit. regards, Volodymyr Bychkoviak Ryan Skow wrote: Here is the logical structure of the document I'm working with: The 'Document' has two fields

Re: WildCard search replacement

2005-04-27 Thread Volodymyr Bychkoviak
Glad to help you. Aalap Parikh wrote: Thanks very much. Volodymyr, your trick about wildcard search replacement has helped me a lot. Great idea!!! Thanks a lot again. --- Volodymyr Bychkoviak <[EMAIL PROTECTED]> wrote: Hi. Aalap Parikh wrote: Hi, The idea about begin marker sound

Re: WildCard search replacement

2005-04-24 Thread Volodymyr Bychkoviak
should use this approach. :) regards, Volodymyr Bychkoviak

Re: WildCard search replacement

2005-04-22 Thread Volodymyr Bychkoviak
ard search like ex. *123* . But say I only need something like 123* i.e. wildcard only at the end and NOT on both sides, then how can one use your technique to avoid TooManyClauseException? Thanks, Aalap. --- Volodymyr Bychkoviak <[EMAIL PROTECTED]> wrote: I used It to measure speed and but I

Re: WildCard search replacement

2005-04-20 Thread Volodymyr Bychkoviak
. regards, Volodymyr Bychkoviak Aalap Parikh wrote: Hi, Also this analyzer is not used in any application, I wrote it only to measure search speed. So you don't use the method you described for your wildcard search trick? Thanks,

Re: What is going on with subversion.

2005-04-20 Thread Volodymyr Bychkoviak
ring[] fields, int[] flags, Analyzer analyzer) throws ParseException { return parse(query, fields, flags, analyzer, false); } } -Original Message- From: Volodymyr Bychkoviak [mailto:[EMAIL PROTECTED] Sent: Wednesday, 20 April 2005 15:21 To: java-user@lucene.apache.org Subject:

Re: What is going on with subversion.

2005-04-20 Thread Volodymyr Bychkoviak
gs, Analyzer analyzer) throws ParseException { return parse(query, fields, flags, analyzer, true); } public static Query parseOr(String query, String[] fields, int[] flags, Analyzer analyzer) throws ParseException { return parse(query, fields, flags, analyzer, fals

Re: What is going on with subversion.

2005-04-20 Thread Volodymyr Bychkoviak
Sorry, I've already read about servers moving. Can somebody mail me latest MultiFieldQueryParser.java and highlighting source code. Because I can't get it from subversion and I need it urgently. Thanks in advance. Regards, Volodymyr Bychkoviak Volodymyr Bychkoviak wrote: I can

What is going on with subversion.

2005-04-20 Thread Volodymyr Bychkoviak
I can't connect to svn.apache.org. It seems that apache.org is down.

Re: WildCard search replacement

2005-04-20 Thread Volodymyr Bychkoviak

Re: Lucene bulk indexing

2005-04-20 Thread Volodymyr Bychkoviak
Hi, The best way to determine bottlenecks is profiling. (JProfiler is a very good tool for that. It's a commercial product with a free evaluation.) I was indexing 1.5 million documents in 45 minutes. Before optimizing, it took much more time to index. Optimization was done through 'select' query changin

Re: Implementing security in search

2005-03-29 Thread Volodymyr Bychkoviak
maybe it is not the best solution, but you can form restriction clause like (+allowedRole1 +allowedRole2 +allowedRole2 -forbiddenRole1 -forbiddenRole2 ... -forbiddenRoleN) where (forbiddenRole1 ... forbiddenRoleN) are all possible roles except allowed roles. regards, Volodymyr Bychkoviak
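A restriction clause of the shape suggested here could be assembled roughly as follows. This is a sketch under assumptions: the role names are invented, the helper is not part of any library, and role names are assumed to contain no characters that would need escaping in query syntax.

```java
import java.util.List;
import java.util.Set;

class RoleClauseSketch {
    /**
     * Builds "(+allowed ... -forbidden ...)" where the forbidden roles are
     * all known roles that are not in the allowed set.
     */
    static String buildClause(List<String> allRoles, Set<String> allowedRoles) {
        StringBuilder clause = new StringBuilder("(");
        for (String role : allRoles) {
            if (allowedRoles.contains(role)) {
                clause.append('+').append(role).append(' ');
            }
        }
        for (String role : allRoles) {
            if (!allowedRoles.contains(role)) {
                clause.append('-').append(role).append(' ');
            }
        }
        if (clause.length() > 1) {
            clause.setLength(clause.length() - 1); // drop the trailing space
        }
        return clause.append(')').toString();
    }
}
```

The resulting string would then be combined with the user's own query. The forbidden list grows with the total number of roles, which may be one reason the message calls this "not the best solution".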

Result grouping

2005-03-29 Thread Volodymyr Bychkoviak
achieve this. P.S. It has to be fast enough, because it will be used in incremental search. Thanks for the answer. Regards, Volodymyr Bychkoviak.

WildCard search replacement

2005-03-14 Thread Volodymyr Bychkoviak
number ","1")); This approach works faster in case when you need to do search by very short expression and never run out of memory (or throws TooManyClauses Exception). I think this can be useful for someone who needs similar functional

Re: Remove document fails

2005-03-03 Thread Volodymyr Bychkoviak
ROTECTED]> wrote: Volodymyr Bychkoviak wrote: may be you have open IndexWriter at the same time you are trying to delete document. Alex Kiselevski wrote: Hi, I have a problem doing IndexReader.delete(int doc) and it fails on lock error. Alex Kiselevski +9.729.776.4346 (desk) +9.7

Re: possible concurrent actions table

2005-03-02 Thread Volodymyr Bychkoviak
pened. (also this instance will be unable to delete documents as well) Thanks to anyone who knows and answers, Friso

Re: Large Index managing

2005-03-02 Thread Volodymyr Bychkoviak
This is solved by keeping the document keys in a set rather than a list. Then even with two updates, the delete and add will appear only once. Miles Barr wrote: On Wed, 2005-03-02 at 05:49, Otis Gospodnetic wrote: Or you can just buffer your update requests and delete in batch and then add in batch. Or you cou
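The set-based buffering suggested in this reply might look like the sketch below. The class and method names are hypothetical, and the actual batch delete and re-add against the index are left out; only the de-duplication of keys is shown.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

class PendingUpdatesSketch {
    // A set keeps each document key once, however often it is updated;
    // LinkedHashSet also preserves the order of first arrival.
    private final Set<String> pendingKeys = new LinkedHashSet<>();

    /** Records that the document with this key needs to be reindexed. */
    void recordUpdate(String documentKey) {
        pendingKeys.add(documentKey);
    }

    /** Returns the keys to delete and then re-add in one batch, and clears the buffer. */
    List<String> drain() {
        List<String> batch = new ArrayList<>(pendingKeys);
        pendingKeys.clear();
        return batch;
    }
}
```

A caller would delete every key in the drained batch first and then add the current version of each document, so a document updated twice between batches is still deleted and added only once.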