This doesn't answer your question, but Erick Erickson's blog on this topic is invaluable:
https://lucidworks.com/blog/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/ -----Original Message----- From: Vasu Y [mailto:vya...@gmail.com] Sent: Monday, October 3, 2016 2:09 PM To: solr-user@lucene.apache.org Subject: SOLR Sizing Hi, I am trying to estimate disk space requirements for the documents indexed to SOLR. I went through the LucidWorks blog ( https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/) and using this as the template. I have a question regarding estimating "Avg. Document Size (KB)". When calculating Disk Storage requirements, can we use the Java Types sizing ( https://docs.oracle.com/javase/tutorial/java/nutsandbolts/datatypes.html) & come up average document size? Please let know if the following assumptions are correct. Data Type Size -------------- ------ long 8 bytes tint 4 bytes tdate 8 bytes (Stored as long?) string 1 byte per char for ASCII chars and 2 bytes per char for Non-ASCII chars (Double byte chars) text 1 byte per char for ASCII chars and 2 bytes per char for Non-ASCII (Double byte chars) (For both with & without norm?) ICUCollationField 2 bytes per char for Non-ASCII (Double byte chars) boolean 1 bit? Thanks, Vasu