[ https://issues.apache.org/jira/browse/NIFI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15959473#comment-15959473 ]
Mark Payne commented on NIFI-1280:
----------------------------------

I've created a PR that I think is sufficient. There are a few more things that I would like to do, but this has dragged on long enough without me pushing anything, so I've pushed a PR so that people can review it and hopefully get it merged. I will create separate JIRAs for the remaining enhancements that I would like to make.

The most significant is to allow more flexibility in choosing the schema to use. Rather than requiring that a Schema Name be provided along with a Schema Registry, I would like to allow the user to use an attribute, or to read the schema from the content of the FlowFile itself in cases such as Avro. In addition, I want to include the schema on the outgoing records when appropriate.

> Create QueryFlowFile Processor
> ------------------------------
>
>                 Key: NIFI-1280
>                 URL: https://issues.apache.org/jira/browse/NIFI-1280
>             Project: Apache NiFi
>          Issue Type: Task
>          Components: Extensions
>            Reporter: Mark Payne
>            Assignee: Mark Payne
>             Fix For: 1.2.0
>
>
> We should have a Processor that allows users to easily filter out specific
> columns from CSV data. For instance, a user would configure two different
> properties: "Columns of Interest" (a comma-separated list of column indexes)
> and "Filtering Strategy" (Keep Only These Columns, Remove Only These Columns).
> We can do this today with ReplaceText, but it is far more difficult than it
> would be with this Processor, as the user has to use Regular Expressions,
> etc. with ReplaceText.
> Eventually a Custom UI could even be built that allows a user to upload a
> Sample CSV and choose which columns to use from there, similar to the way that
> Excel works when importing CSV, by dragging and selecting the desired columns.
> That would certainly be a larger undertaking and would not need to be done for
> an initial implementation.

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)