[ https://issues.apache.org/jira/browse/METRON-545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jon Zeolla updated METRON-545:
------------------------------
    Summary: Truncate fields larger than 32766 bytes  (was: Truncate fields larger than 32766)

> Truncate fields larger than 32766 bytes
> ---------------------------------------
>
>                 Key: METRON-545
>                 URL: https://issues.apache.org/jira/browse/METRON-545
>             Project: Metron
>          Issue Type: Sub-task
>            Reporter: Jon Zeolla
>            Priority: Minor
>
> Lucene cannot index an individual term larger than 32766 bytes. In UTF-8,
> where a character takes up to 4 bytes, that is as few as 8,191 characters.
> Since we cannot easily identify the field datatype the user intended
> (string vs integer vs ...), we should truncate any field larger than
> 32766 bytes. This should be somewhat rare, and even when it occurs we can
> leverage the dual storage (HDFS and Lucene), the integrity checking fields
> (METRON-544), and the customizability of the UI (METRON-195) to retrieve
> the full original field value.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
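
For illustration only, the truncation described above could look roughly like the sketch below. It is not Metron code: the names `MAX_TERM_BYTES` and `truncate_utf8` are hypothetical, and it assumes the field value is available as a string and is measured by its UTF-8 encoding. The point it shows is that the cut has to be made on bytes, not characters, without splitting a multi-byte character.

```python
# Hypothetical sketch, not taken from Metron.
MAX_TERM_BYTES = 32766  # the per-term byte limit cited in the issue


def truncate_utf8(value: str, max_bytes: int = MAX_TERM_BYTES) -> str:
    """Return value cut down so that its UTF-8 encoding fits in max_bytes."""
    encoded = value.encode("utf-8")
    if len(encoded) <= max_bytes:
        return value
    # Slicing bytes may leave a partial multi-byte sequence at the end;
    # errors="ignore" drops that incomplete tail instead of raising.
    return encoded[:max_bytes].decode("utf-8", errors="ignore")
```

A value made only of 4-byte characters ends up at 8,191 characters (32764 bytes), because the limit falls in the middle of the next character and the partial character is dropped.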