Hi Alexis, this is not the reason for the 20 GB overhead, but you are definitely using the suggester component the wrong way. You don't want the analysis chain to produce edge n-grams and then build the FST out of those tokens: the FST-based suggesters already match on prefixes of the analyzed text, so edge n-grams only multiply the entries that go into the FST. Read the sections of [1] on the suggesters you are interested in; it may help you understand how they work. At the very least, use an analysis chain without the edgeNGram token filter.
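As a rough sketch of what I mean, something along these lines could be a starting point. The names text_suggest, mySuggester and title are placeholders I made up, not taken from your config, and I am assuming an FST-based lookup such as AnalyzingLookupFactory; adapt it to whichever suggester you actually pick.

```xml
<!-- schema: analysis chain for the suggester, with NO edge n-gram filter.
     "text_suggest" is a placeholder name. -->
<fieldType name="text_suggest" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- solrconfig.xml: suggester built from a stored field.
     "mySuggester" and "title" are placeholders; the lookup does the
     prefix matching itself on the analyzed text. -->
<searchComponent name="suggest" class="solr.SuggestComponent">
  <lst name="suggester">
    <str name="name">mySuggester</str>
    <str name="lookupImpl">AnalyzingLookupFactory</str>
    <str name="dictionaryImpl">DocumentDictionaryFactory</str>
    <str name="field">title</str>
    <str name="suggestAnalyzerFieldType">text_suggest</str>
    <str name="buildOnStartup">false</str>
  </lst>
</searchComponent>
```

With a chain like this, each field value goes into the FST once, instead of once per edge n-gram.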
[1] http://alexbenedetti.blogspot.co.uk/2015/07/solr-you-complete-me.html

Cheers

-----
Alessandro Benedetti
Search Consultant, R&D Software Engineer, Director
Sease Ltd. - www.sease.io