Re: ways to minimize index size?

2007-06-22 Thread Sebastin
Steve, i use your idea it works for me great,once again i say thanks to you.But when i use (Index.No_NORMS ) it increase the size in the same time when i use(Index.TOKENIZED)it will reduce the size. i use the code given by you BigInteger _bi = new java

Re: ways to minimize index size?

2007-06-22 Thread Sebastin
Hi Steve, thanks for your reply a lot.its now compress upto 50% of the original size.is there any other possiblity using this code compress upto 80%. Steve Liles wrote: > > Compression aside you could index the "contents" as terms in separate > fields instead of tokenized text, and disable

Re: ways to minimize index size?

2007-06-20 Thread Steve Liles
Compression aside you could index the "contents" as terms in separate fields instead of tokenized text, and disable storing of norms: String outgoingNumber="9198408365809"; String incomingNumber="9840861114"; _doc.add(new Field("outgoingNumber", outgoingNumber, Store.NO, Index.NO_NORMS)); _doc

Re: ways to minimize index size?

2007-06-20 Thread Sebastin
Hi Erick do u have any idea on this? jm-27 wrote: > > Hi, > > I want to make my index as small as possible. I noticed about > field.setOmitNorms(true), I read in the list the diff is 1 byte per > field per doc, not huge but hey...is the only effect the score being > different? I hardly mind abo

Re: ways to minimize index size?

2007-06-19 Thread Sebastin
When i use the standardAnalyzer storage size increases.how can i minimize index store Sebastin wrote: > > > String outgoingNumber="9198408365809"; > String incomingNumber="9840861114"; > String datesc="070601"; > String imsiNumber="444021365987"; > String callType="1"; >

Re: ways to minimize index size?

2007-06-19 Thread Sebastin
String outgoingNumber="9198408365809"; String incomingNumber="9840861114"; String datesc="070601"; String imsiNumber="444021365987"; String callType="1"; //Search Fields String contents=(outgoingNumber+" "+incomingNumber+" "+dateSc+" "+imsiNumber+" "+callType ); //Displa

Re: ways to minimize index size?

2007-06-19 Thread Erick Erickson
Show us the code you use to index. Are you storing the fields? omitting norms? Throwing out stop words? Best Erick On 6/19/07, Sebastin <[EMAIL PROTECTED]> wrote: Hi Does anyone give me an idea to reduce the Index size to down.now i am getting 42% compression in my index store.i want to reduc

RE: ways to minimize index size?

2007-06-19 Thread Sebastin
Hi Does anyone give me an idea to reduce the Index size to down.now i am getting 42% compression in my index store.i want to reduce upto 70%.i use standardanalyzer to write the document.when i use SimpleAnalyzer it reduce upto 58% but i couldnt search the document.please help me to acheive. Tha

Re: ways to minimize index size?

2007-03-14 Thread Erick Erickson
OK, I caused more confusion than rendered help by my stemming statement. The only reason I mentioned it was to illustrate that performance is not linearly related to size. It took some effort to put stemming into the index, see PorterStemmer etc. This is NOT the default. So I took it out to see w

Re: ways to minimize index size?

2007-03-14 Thread jm
hi Erick, Well, typically my application will start with some hundreds of indexes...and then grow at a rate of several per day, for ever. At some point I know I can do some merging etc if needed. Size is dependant on the customer, could be up to a 1G per index. That is way I would like to minim

RE: ways to minimize index size?

2007-03-14 Thread Jeff
I found that reducing my index from 8G to 4G (through not stemming) gave me about a 10% performance improvement. How did you do this? I don't see this as an option. Jeff

Re: ways to minimize index size?

2007-03-14 Thread Erick Erickson
Store as little as possible, index as little as possible . How big is your index, and how much do you expect it to grow? I ask this because it's probably not worth your time to try to reduce the index size below some threshold... I found that reducing my index from 8G to 4G (through not stemm