WildCard search replacement

2005-04-19 Thread Aalap Parikh
Hi Volodymyr, About the trick you described about wildcard search replacement, you mentioned: > So I found following workaround. I index this field as > sequence of terms, each of containing single digit from > needed value. (For example I have “123214213” value > that needs to be indexed. Then i

Re: WildCard search replacement

2005-04-20 Thread Aalap Parikh
Hi, > Also this analyzer is not used in any application, I > wrote it only to > measure search speed. So you don't use the method you described for your wildcard search trick? Thanks, Aalap. - To unsubscribe, e-mail: [EMAIL PR

Re: Lucene bulk indexing

2005-04-20 Thread Aalap Parikh
Hi, I have similar issues in indexing time. I am doing a SELECT from database and getting back 10,000 rows. I then start indexing each row and hence would have 10,000 documents in my Lucene index. Each doc has 27 fields. I added some timing code to my indexing process. The DB select call takes a

Re: WildCard search replacement

2005-04-21 Thread Aalap Parikh
y job, so I > will tune it later. > This is just an example to give you an idea how it > can work. > > regards, > Volodymyr Bychkoviak > > Aalap Parikh wrote: > > >Hi, > > > > > > > >>Also this analyzer is not used in any applicat

Re: Lucene bulk indexing

2005-04-21 Thread Aalap Parikh
can fiddle > with. > > I've seen Hibernate 2.* get sluggish once its > Session gets filled up > with a lot of objects. > > Otis > > > --- Aalap Parikh <[EMAIL PROTECTED]> wrote: > > Hi, > > > > I have similar issues in indexing time.

Re: Lucene bulk indexing

2005-04-21 Thread Aalap Parikh
ically. > > There are other IndexWriter knobs you can fiddle > with. > > > > I've seen Hibernate 2.* get sluggish once its > Session gets filled up > > with a lot of objects. > > > > Otis > > > > > > --- Aalap Parikh <[EMAIL

Re: Lucene bulk indexing

2005-04-22 Thread Aalap Parikh
Hi, > : the app using JProfiler and found out that 90% of > time > : is spent in the IndexWriter.addDocument call. As > > what analyzer are you using? I am using the StandardAnalyzer (tried using SimpleAnalyzer too, but not much affect on performance). > : My machine: Pentium 4 CPU 2.40 GHz > :

Re: WildCard search replacement

2005-04-22 Thread Aalap Parikh
eed. (it can vary depending on indexing > conditions) > > Regards, > Volosymyr Bychkoviak. > > Aalap Parikh wrote: > > >Hi, > > > >Thanks for your reply. > >One more question. You mentioned that your > technique > >can be used for wildcard searc

Re: Lucene bulk indexing

2005-04-22 Thread Aalap Parikh
y. > > There are other IndexWriter knobs you can fiddle > with. > > > > I've seen Hibernate 2.* get sluggish once its > Session gets filled up > > with a lot of objects. > > > > Otis > > > > > > --- Aalap Parikh <[EMAIL PROTE

Re: WildCard search replacement

2005-04-27 Thread Aalap Parikh
Thanks very much. Volodymyr, your trick about wildcard search replacement has helped me a lot. Great idea!!! Thanks a lot again. --- Volodymyr Bychkoviak <[EMAIL PROTECTED]> wrote: > Hi. > > Aalap Parikh wrote: > > >Hi, > > > >The idea about begin marker sou

Re: searches and updates concurrency problem

2005-06-06 Thread Aalap Parikh
Hi, As per my understanding of Lucene, I think concurrent search and update to an index should not corrupt the index, given that only a single index-modifying operation is executing at any point of time. So in short, you can have multiple search operations and not more than one index update (add a

Re: view index file

2005-06-10 Thread Aalap Parikh
Hi, Use Luke. It's an excellent tool and everybody in the Lucene community uses that. http://www.getopt.org/luke/ Aalap. --- avrootshell <[EMAIL PROTECTED]> wrote: > Hi, > >I'm curious to know,if there is any way to view > the .cfs file(the > index file created). > > Someone plz shred s

Pros/Cons of a split index over a single large index

2005-06-13 Thread Aalap Parikh
Hi, I just a general question: What are the pros and cons of a split index(a number of small indexes) as opposed to a single large index? As I have repeatedly seen in various posts at this group, people have opted for split indexes in cases where they have a large number of documents (say > 1 mil

Re: Index Partitioning ( was Re: Search deadlocking under load)

2005-07-11 Thread Aalap Parikh
>I don't really know a lot about what gets loaded into memory when you >make/use a new searcher, but the one thing i've learned from experience >is >that the FieldCache (which gets used when you sort on a field) contains >every term in the field you are sorting on, and an instance of >FieldCache