[jira] [Commented] (NIFI-1280) Create QueryFlowFile Processor

Mark Payne (JIRA) Thu, 06 Apr 2017 11:12:58 -0700

    [ 
https://issues.apache.org/jira/browse/NIFI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15959473#comment-15959473
 ]


Mark Payne commented on NIFI-1280:
----------------------------------

I've created a PR that I think is sufficient. There are a few more things that 
I would like to do, but this has dragged on long enough without me pushing 
anything, so I've pushed a PR so that people can review & hopefully get merged. 
Will create separate JIRA's for the remaining enhances that I would like to 
perform. The most significant is to allow more flexibility in choosing the 
schema to use. Rather than requiring a Schema Name be provided with a Schema 
registry would like to allow user to use an attribute or read schema from the 
content of the FlowFile itself in cases such as Avro. In addition, I want to 
add updates to include the schema on the outgoing records when appropriate.

> Create QueryFlowFile Processor
> ------------------------------
>
>                 Key: NIFI-1280
>                 URL: https://issues.apache.org/jira/browse/NIFI-1280
>             Project: Apache NiFi
>          Issue Type: Task
>          Components: Extensions
>            Reporter: Mark Payne
>            Assignee: Mark Payne
>             Fix For: 1.2.0
>
>
> We should have a Processor that allows users to easily filter out specific 
> columns from CSV data. For instance, a user would configure two different 
> properties: "Columns of Interest" (a comma-separated list of column indexes) 
> and "Filtering Strategy" (Keep Only These Columns, Remove Only These Columns).
> We can do this today with ReplaceText, but it is far more difficult than it 
> would be with this Processor, as the user has to use Regular Expressions, 
> etc. with ReplaceText.
> Eventually a Custom UI could even be built that allows a user to upload a 
> Sample CSV and choose which columns from there, similar to the way that Excel 
> works when importing CSV by dragging and selecting the desired columns? That 
> would certainly be a larger undertaking and would not need to be done for an 
> initial implementation.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (NIFI-1280) Create QueryFlowFile Processor

Reply via email to