[ 
https://issues.apache.org/jira/browse/NIFI-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404093#comment-15404093
 ] 

Joseph Witt commented on NIFI-1868:
-----------------------------------

[~mattyb149] As a general statement, please just be careful to provide as 
complete information as you can on the discussion itself as from a community 
point of view that is more important than the decision itself.  It lets others 
help reach alternative conclusions.  When I first read this I was taken back a 
bit by 'per offline discussion' but it does look like explained the thinking a 
bit.  Anyway, we've had a few threads recently like this so just a reminder 
that we need to keep the focus on the discussion and less on the decision so 
that we can enable more interaction.  Thanks!

> Add support for Hive Streaming
> ------------------------------
>
>                 Key: NIFI-1868
>                 URL: https://issues.apache.org/jira/browse/NIFI-1868
>             Project: Apache NiFi
>          Issue Type: New Feature
>            Reporter: Matt Burgess
>            Assignee: Matt Burgess
>             Fix For: 1.0.0
>
>
> Traditionally adding new data into Hive requires gathering a large amount of 
> data onto HDFS and then periodically adding a new partition. This is 
> essentially a “batch insertion”. Insertion of new data into an existing 
> partition is not permitted. Hive Streaming API allows data to be pumped 
> continuously into Hive. The incoming data can be continuously committed in 
> small batches of records into an existing Hive partition or table. Once data 
> is committed it becomes immediately visible to all Hive queries initiated 
> subsequently.
> This case is to add a PutHiveStreaming processor to NiFi, to leverage the 
> Hive Streaming API to allow continuous streaming of data into a Hive 
> partition/table.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to