[ https://issues.apache.org/jira/browse/NIFI-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404093#comment-15404093 ]
Joseph Witt commented on NIFI-1868: ----------------------------------- [~mattyb149] As a general statement, please just be careful to provide as complete information as you can on the discussion itself as from a community point of view that is more important than the decision itself. It lets others help reach alternative conclusions. When I first read this I was taken back a bit by 'per offline discussion' but it does look like explained the thinking a bit. Anyway, we've had a few threads recently like this so just a reminder that we need to keep the focus on the discussion and less on the decision so that we can enable more interaction. Thanks! > Add support for Hive Streaming > ------------------------------ > > Key: NIFI-1868 > URL: https://issues.apache.org/jira/browse/NIFI-1868 > Project: Apache NiFi > Issue Type: New Feature > Reporter: Matt Burgess > Assignee: Matt Burgess > Fix For: 1.0.0 > > > Traditionally adding new data into Hive requires gathering a large amount of > data onto HDFS and then periodically adding a new partition. This is > essentially a “batch insertion”. Insertion of new data into an existing > partition is not permitted. Hive Streaming API allows data to be pumped > continuously into Hive. The incoming data can be continuously committed in > small batches of records into an existing Hive partition or table. Once data > is committed it becomes immediately visible to all Hive queries initiated > subsequently. > This case is to add a PutHiveStreaming processor to NiFi, to leverage the > Hive Streaming API to allow continuous streaming of data into a Hive > partition/table. -- This message was sent by Atlassian JIRA (v6.3.4#6332)