Hi Jack,

On Sat, Dec 15, 2012 at 4:36 PM, Jack Krupansky <j...@basetechnology.com> wrote:
> I have seen a few inquiries concerned with the overhead of storing time of
> day for simple dates. The concerns are both storage and performance. So, the
> question/proposal is whether a variant of TrieDate with no time of day
> component, call it TrieDay or TrieDateTimeless or TrieDateNoTime (or
> incompatibly rename TrieDate to TrieDateTime and use TrieDate for the new
> format), could be stored with, say, 40% more storage efficiency and maybe a
> comparable or at least significant performance improvement for queries.

Storing only the day in a 32-bits integer could save space, but I'm
not sure Solr should provide a type for all granularities of dates?
Have you tried to set your dates' hours, minutes, seconds and
milliseconds to 0 before indexing them ? This should help postings
lists share terms and improve storage efficiency (especially with the
new Lucene41PostingsFormat).

-- 
Adrien

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to