Adrien Grand created LUCENE-6764:
------------------------------------

             Summary: Payloads should be compressed
                 Key: LUCENE-6764
                 URL: https://issues.apache.org/jira/browse/LUCENE-6764
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Adrien Grand
            Priority: Minor

I think we should at least try to do something simple, e.g. deduplicate payloads or apply simple LZ77 compression. For instance, if you use enclosing HTML tags to give different weights to individual terms, there may be many repetitions, since there are not that many unique HTML tags.
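
To make the deduplication idea concrete, here is a minimal sketch, assuming payloads are available as byte arrays in position order. The names `PayloadDedupSketch`, `Deduped` and `dedupe` are hypothetical and not part of Lucene; this only illustrates storing each distinct payload once plus a small ordinal per position, not how a postings format would actually encode it.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: not a Lucene API.
final class PayloadDedupSketch {

    /** Unique payloads plus one dictionary ordinal per original position. */
    static final class Deduped {
        final List<byte[]> dictionary;
        final int[] ordinals;

        Deduped(List<byte[]> dictionary, int[] ordinals) {
            this.dictionary = dictionary;
            this.ordinals = ordinals;
        }

        /** Recovers the payload originally stored at the given position. */
        byte[] payloadAt(int position) {
            return dictionary.get(ordinals[position]);
        }
    }

    /** Replaces each payload with the ordinal of its first occurrence. */
    static Deduped dedupe(List<byte[]> payloads) {
        // ByteBuffer compares by content, so it can serve as a map key for byte[].
        Map<ByteBuffer, Integer> seen = new HashMap<>();
        List<byte[]> dictionary = new ArrayList<>();
        int[] ordinals = new int[payloads.size()];
        for (int i = 0; i < payloads.size(); i++) {
            byte[] payload = payloads.get(i);
            ByteBuffer key = ByteBuffer.wrap(payload);
            Integer ord = seen.get(key);
            if (ord == null) {
                ord = dictionary.size();
                seen.put(key, ord);
                dictionary.add(payload);
            }
            ordinals[i] = ord;
        }
        return new Deduped(dictionary, ordinals);
    }

    public static void main(String[] args) {
        // Payloads standing in for the enclosing HTML tag of each term (assumed example data).
        List<byte[]> payloads = new ArrayList<>();
        for (String tag : new String[] {"h1", "p", "p", "b", "p", "h1"}) {
            payloads.add(tag.getBytes(StandardCharsets.UTF_8));
        }
        Deduped deduped = dedupe(payloads);
        System.out.println("unique payloads: " + deduped.dictionary.size());
        System.out.println("ordinals: " + Arrays.toString(deduped.ordinals));
    }
}
```

With only a handful of distinct tags, each position then costs a small ordinal instead of a full copy of the payload bytes. A general-purpose LZ77-style codec would be the alternative when payloads are similar but not identical.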