Re: DocValues memory usage

2013-03-28 Thread Peter Keegan
This is wierd. I indexed using DiskDocValuesFormat as the default codec and observed 16K qps with BinaryDocValuesField. But with a simple StoredField, I observed a much higher 30K qps. When I added both fields (BinaryDocValuesField and StoredField) to the index, I observed only 100 qps on each

Consultant Inquiry

2013-03-28 Thread Nick Hoffman
I'm looking for a consultant for Lucene Solr. Our team of 3 extended OpenBravo (Java ERP) with a built-in Shopping Cart (written in JS). I'm now on the search/faceted-search part of the shopping cart, and would love to have someone lead our team in implementing Lucene Solr. Thanks for your

Re: Consultant Inquiry

2013-03-28 Thread Marc Schwarz
Hi Nick, do you have exact spezification of what you wan't to get done by solr ? do you need assistance and consultation in your office or would you be fine with remote work / assistance ? Greetings, Marc Am 28.03.2013 19:32, schrieb Nick Hoffman: I'm looking for a consultant for Lucene

Indexing Term Frequency Vectors

2013-03-28 Thread Sharon Tam
I believe that when Lucene indexes documents, it generates counts for a term by counting how many times the term appears in a particular document. Instead of having Lucene do the counting, I want to do my own counting and feed a term-frequency vector representation of a document directly into the

Storing Documents in Lucene

2013-03-28 Thread Paul
Hi, Some of the stuff I've read suggests that Lucene is not especially well-suited to storing the documents. It's supposed to be great at indexing those documents, but not so great at storing the docs themselves. Can someone shed some light on this? If this is true, then am I right to think

Re: Storing Documents in Lucene

2013-03-28 Thread Yann-Erwan Perio
On Thu, Mar 28, 2013 at 11:06 PM, Paul arach...@gmail.com wrote: Hello, Some of the stuff I've read suggests that Lucene is not especially well-suited to storing the documents. It's supposed to be great at indexing those documents, but not so great at storing the docs themselves. If you

Re: Indexing Term Frequency Vectors

2013-03-28 Thread Adrien Grand
Hi, On Thu, Mar 28, 2013 at 8:25 PM, Sharon Tam sharon...@gmail.com wrote: I believe that when Lucene indexes documents, it generates counts for a term by counting how many times the term appears in a particular document. Instead of having Lucene do the counting, I want to do my own counting

Re: Storing Documents in Lucene

2013-03-28 Thread Adrien Grand
On Thu, Mar 28, 2013 at 11:06 PM, Paul arach...@gmail.com wrote: Hi, Hi Paul, Some of the stuff I've read suggests that Lucene is not especially well-suited to storing the documents. It's supposed to be great at indexing those documents, but not so great at storing the docs themselves.