[ https://issues.apache.org/jira/browse/FLUME-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13815353#comment-13815353 ]
wolfgang hoschek commented on FLUME-1988:
-----------------------------------------

Tip: It looks like in this case (read all lines) you could replace the readMultiLine command with a readClob command.

> Add Support for Additional Deserializers for SpoolingDirectorySource
> --------------------------------------------------------------------
>
>                 Key: FLUME-1988
>                 URL: https://issues.apache.org/jira/browse/FLUME-1988
>             Project: Flume
>          Issue Type: New Feature
>          Components: Docs, Sinks+Sources
>    Affects Versions: v1.4.0
>            Reporter: Israel Ekpo
>            Assignee: Israel Ekpo
>              Labels: serializers
>         Attachments: EventDeserializerType.java, RegexDelimiterDeSerializer.java, ResettableTestStringInputStream.java, TestRegexDelimiterDeSerializer.java
>
>
> There are certain use cases for SpoolingDirectorySource where the events in
> the log file are not delimited with newline characters.
> Log files that contain stack traces, XML documents and pretty-printed JSON
> strings can contain multiple newline characters within each event.
> We can use alternative logic such as specific characters, strings or regular
> expressions to determine when the event is complete.
> Hence I am proposing the following new deserializers based on
> org.apache.flume.serialization.LineDeserializer
> # org.apache.flume.serialization.RegexDelimiterDeSerializer
> Allows the user to specify a regular expression that is a delimiter for
> events within the log file
> # org.apache.flume.serialization.CharSequenceDelimiterDeSerializer
> Allows the user to specify a comma-separated character sequence that is a
> delimiter for events within the log file
> The user will specify an integer for each ASCII character and we will use
> that sequence as the delimiter.
> For example, support for \r\n could be specified as 13,10
> A list of codes is available at http://www.asciitable.com/
> We will also need to update the user guide with examples on how to configure
> and specify a custom deserializer.
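
The two delimiter styles proposed in the quoted description can be illustrated with a small language-neutral sketch. This is not Flume code and not the attached implementation: the function names `parse_ascii_delimiter` and `split_events` are hypothetical, and the whole input is assumed to fit in memory, which a real streaming deserializer would not assume.

```python
import re


def parse_ascii_delimiter(spec: str) -> str:
    """Turn a comma-separated list of decimal ASCII codes (e.g. "13,10")
    into the delimiter string it describes (hypothetical helper)."""
    codes = [int(part.strip()) for part in spec.split(",")]
    for code in codes:
        if not 0 <= code <= 127:
            raise ValueError(f"not an ASCII code: {code}")
    return "".join(chr(code) for code in codes)


def split_events(text: str, delimiter_regex: str) -> list[str]:
    """Split text into events on a regex delimiter, dropping empty pieces.
    The pattern should not use capturing groups, because re.split would
    then return the captured text as extra list items."""
    return [piece for piece in re.split(delimiter_regex, text) if piece]


# Character-sequence style: "13,10" stands for \r\n, so a bare \n stays
# inside the event.
crlf = parse_ascii_delimiter("13,10")
crlf_events = split_events("a\nb\r\nc\r\n", re.escape(crlf))

# Regex style: a new event starts at a newline that is followed by a date,
# so the indented stack-trace line stays attached to its event.
log = "2013-11-06 ERROR boom\n  at Foo.bar\n2013-11-06 INFO ok\n"
regex_events = split_events(log, r"\n(?=\d{4}-\d{2}-\d{2} )")
```

Here `crlf_events` holds two events, the first of which still contains its embedded `\n`, and `regex_events` keeps the stack-trace line together with the ERROR line that precedes it.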

--
This message was sent by Atlassian JIRA
(v6.1#6144)