[ https://issues.apache.org/jira/browse/NIFI-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17789248#comment-17789248 ]
Daniel Stieglitz edited comment on NIFI-11167 at 11/23/23 7:52 PM:
-------------------------------------------------------------------

[~iiojj2] I believe you can use [QueryRecord|https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.6.0/org.apache.nifi.processors.standard.QueryRecord/index.html] to filter out the empty columns. See [~exceptionfactory]'s earlier comment from 3/20/2023. Please see this very detailed [tutorial|https://medium.com/cloudera-inc/building-an-effective-nifi-flow-queryrecord-cca5ba51afd5], in particular the section titled 'How to Use It — Filter Out Columns'. As for Starting Row having a flowfile attribute scope, I believe you can open a Jira ticket for that. Thanks!

> Add Excel Record Reader
> -----------------------
>
>                 Key: NIFI-11167
>                 URL: https://issues.apache.org/jira/browse/NIFI-11167
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>            Reporter: David Handermann
>            Assignee: Daniel Stieglitz
>            Priority: Minor
>             Fix For: 2.0.0-M1, 1.23.0
>
>          Time Spent: 10h 10m
>  Remaining Estimate: 0h
>
> A new Excel Record Reader should be implemented to support reading XLSX
> spreadsheet rows as NiFi Records. This Reader will enable integration with
> various record-oriented components, obviating the need for the narrowly
> focused ConvertExcelToCSVProcessor.
> The initial version of the Excel Reader should not support the legacy binary
> XLS format.
> The ExcelReader should use a library that supports reading from a stream of
> rows to avoid consuming large amounts of heap memory during processing.
> The ExcelReader should support configurable properties to read selected
> sheets. With Excel supporting typed field values, some amount of field type
> mapping will be required. Additional input filtering properties should not be
> implemented, as existing Processors like QueryRecord support a wide variety of
> filtering and projection use cases.
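
For illustration only, the column projection suggested in the comment (dropping columns that are empty in every row) can be sketched in plain Python. QueryRecord itself is configured with SQL, so this is not NiFi code; the names `is_empty`, `drop_empty_columns`, and the sample rows are hypothetical and only show the intended effect on a set of records.

```python
from typing import Any, Dict, Iterable, List

Record = Dict[str, Any]


def is_empty(value: Any) -> bool:
    """Treat None and blank strings as empty; 0 and False still count as data."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def drop_empty_columns(records: Iterable[Record]) -> List[Record]:
    """Return copies of the records without columns that are empty in every row."""
    rows = list(records)
    # Collect, in first-seen order, the columns holding data in at least one row.
    keep: List[str] = []
    for row in rows:
        for name, value in row.items():
            if name not in keep and not is_empty(value):
                keep.append(name)
    # Project every row onto the surviving columns.
    return [{name: row.get(name) for name in keep} for row in rows]


# Hypothetical rows as a spreadsheet reader might produce them.
sample = [
    {"id": 1, "note": "", "name": "a"},
    {"id": 2, "note": None, "name": ""},
]
projected = drop_empty_columns(sample)
```

Here `note` is removed because no row holds a value for it, while `name` is kept because at least one row does.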