Re: Disk usage per-field
The Lucene CheckIndex program opens an index and reads many types of data from it. It's easy to start from it and change it to count up the space used by terms and stored data for field X.

On Tue, Nov 2, 2010 at 5:51 AM, Muneeb Ali wrote:
> Hi,
>
> I am currently benchmarking solr index with different fields to see the
> impact on its size/ search speed etc. A feature to find the disk usage per
> field of index would be really handy and save me alot of time. Do we have
> any updates on this?
>
> Has anyone tried writing custom code for it ?
>
> - Muneeb

--
Lance Norskog
goks...@gmail.com
Re: Disk usage per-field
Hi,

I am currently benchmarking a Solr index with different fields to see the impact on its size, search speed, etc. A feature to find the disk usage per field of an index would be really handy and would save me a lot of time. Do we have any updates on this?

Has anyone tried writing custom code for it?

- Muneeb

--
View this message in context: http://lucene.472066.n3.nabble.com/Disk-usage-per-field-tp934765p1827739.html
Sent from the Solr - User mailing list archive at Nabble.com.
Re: Disk usage per-field
You can write a Lucene program that walks the various index data structures and tots up estimated values. It's a big job. I would start with the Lucene CheckIndex program since it does most of this.

On Sat, Jul 3, 2010 at 4:32 AM, Israel Ekpo wrote:
> Currently, this feature is not available.
>
> The amount of space a field consumes varies and depends on whether the field
> is index only, stored only or indexed and stored.
>
> It also depends on how the field is analyzed
>
> On Fri, Jul 2, 2010 at 2:59 PM, Shawn Heisey wrote:
>
>> On 6/30/2010 5:44 PM, Shawn Heisey wrote:
>>
>>> Is it possible for Solr (or Luke/Lucene) to tell me exactly how much of
>>> the total index disk space is used by each field? It would also be very
>>> nice to know, for each field, how much is used by the index and how much is
>>> used for stored data.
>>
>> Still interested in this. It would be perfectly OK if such a thing were
>> completely external to Solr and required a good chunk of time to calculate.
>> I would not need to do it very often.
>
> --
> "Good Enough" is not good enough.
> To give anything less than your best is to sacrifice the gift.
> Quality First. Measure Twice. Cut Once.
> http://www.israelekpo.com/

--
Lance Norskog
goks...@gmail.com
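
To make the "walk the data and tot up estimates" idea concrete, here is a rough sketch in Python. It does not use Lucene or CheckIndex at all: the in-memory `docs` list, the `schema` mapping, the whitespace "analyzer", and the `POSTING_BYTES` constant are all assumptions made up for illustration, and real Lucene files are compressed, so the numbers only show the shape of the bookkeeping.

```python
from collections import defaultdict

# Hypothetical cost of one (term, document) posting. Real Lucene postings are
# delta-encoded and compressed, so this is only a placeholder value.
POSTING_BYTES = 4


def estimate_field_usage(docs, schema):
    """Return {field: {"stored": bytes, "indexed": bytes}} for a toy index.

    docs   -- list of {field_name: text} dictionaries
    schema -- {field_name: (indexed, stored)} with boolean flags
    """
    stored = defaultdict(int)     # raw stored bytes per field
    terms = defaultdict(set)      # distinct terms per field
    postings = defaultdict(int)   # (term, doc) pairs per field

    for doc in docs:
        for field, text in doc.items():
            indexed, is_stored = schema[field]
            if is_stored:
                stored[field] += len(text.encode("utf-8"))
            if indexed:
                # Lowercase + whitespace split stands in for a real analyzer.
                doc_terms = set(text.lower().split())
                terms[field] |= doc_terms
                postings[field] += len(doc_terms)

    usage = {}
    for field in schema:
        term_bytes = sum(len(t.encode("utf-8")) for t in terms[field])
        usage[field] = {
            "stored": stored[field],
            "indexed": term_bytes + postings[field] * POSTING_BYTES,
        }
    return usage


docs = [
    {"title": "Solr disk usage", "body": "disk usage per field"},
    {"title": "Lucene", "body": "disk"},
]
schema = {"title": (True, True), "body": (True, False)}
usage = estimate_field_usage(docs, schema)
```

A real tool would replace the toy loop with reads of the actual term dictionary, postings, and stored-field data, which is the part CheckIndex already knows how to traverse.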
Re: Disk usage per-field
Currently, this feature is not available.

The amount of space a field consumes varies and depends on whether the field is indexed only, stored only, or both indexed and stored.

It also depends on how the field is analyzed.

On Fri, Jul 2, 2010 at 2:59 PM, Shawn Heisey wrote:

> On 6/30/2010 5:44 PM, Shawn Heisey wrote:
>
>> Is it possible for Solr (or Luke/Lucene) to tell me exactly how much of
>> the total index disk space is used by each field? It would also be very
>> nice to know, for each field, how much is used by the index and how much is
>> used for stored data.
>
> Still interested in this. It would be perfectly OK if such a thing were
> completely external to Solr and required a good chunk of time to calculate.
> I would not need to do it very often.

--
"Good Enough" is not good enough.
To give anything less than your best is to sacrifice the gift.
Quality First. Measure Twice. Cut Once.
http://www.israelekpo.com/
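
As a small illustration of why analysis matters for size, the sketch below compares how many bytes of distinct terms the same text produces under two toy analyzers. This is plain Python, not Solr or Lucene code; the analyzer functions, the `STOPWORDS` list, and the sample texts are all hypothetical.

```python
# Hypothetical stop list, chosen only for this example.
STOPWORDS = {"the", "a", "of", "is"}


def whitespace_analyzer(text):
    """Split on whitespace and keep tokens exactly as written."""
    return text.split()


def lowercase_stop_analyzer(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return [t for t in text.lower().split() if t not in STOPWORDS]


def term_dictionary_bytes(texts, analyzer):
    """UTF-8 bytes needed to store each distinct term once."""
    unique_terms = set()
    for text in texts:
        unique_terms.update(analyzer(text))
    return sum(len(t.encode("utf-8")) for t in unique_terms)


texts = ["The size of the index", "the Index size"]
raw_bytes = term_dictionary_bytes(texts, whitespace_analyzer)
analyzed_bytes = term_dictionary_bytes(texts, lowercase_stop_analyzer)
```

With these inputs the lowercasing, stop-word-removing analyzer yields fewer distinct terms than the plain whitespace split, so the same field content would need a smaller term dictionary.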
Re: Disk usage per-field
On 6/30/2010 5:44 PM, Shawn Heisey wrote:
> Is it possible for Solr (or Luke/Lucene) to tell me exactly how much of
> the total index disk space is used by each field? It would also be very
> nice to know, for each field, how much is used by the index and how much is
> used for stored data.

Still interested in this. It would be perfectly OK if such a thing were completely external to Solr and required a good chunk of time to calculate. I would not need to do it very often.
Disk usage per-field
Is it possible for Solr (or Luke/Lucene) to tell me exactly how much of the total index disk space is used by each field? It would also be very nice to know, for each field, how much is used by the index and how much is used for stored data.