[jira] [Comment Edited] (NIFI-11167) Add Excel Record Reader

Daniel Stieglitz (Jira) Thu, 23 Nov 2023 11:53:05 -0800


    [ 
https://issues.apache.org/jira/browse/NIFI-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17789248#comment-17789248
 ]


Daniel Stieglitz edited comment on NIFI-11167 at 11/23/23 7:52 PM:
-------------------------------------------------------------------

[~iiojj2] 

I believe you can use 
[QueryRecord|https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.6.0/org.apache.nifi.processors.standard.QueryRecord/index.html]
 to filter out the empty columns. See [~exceptionfactory] comment earlier from 
3/20/2023.

Please see this very detailed [tutorial 
|https://medium.com/cloudera-inc/building-an-effective-nifi-flow-queryrecord-cca5ba51afd5]and
 see the section titled 'How to Use It — Filter Out Columns' .

As for Starting Row having a flowfile attribute scope, that I believe you can 
make a Jira ticket for. Thanks!


was (Author: JIRAUSER294662):
[~iiojj2] 

I believe you can use 
[QueryRecord|https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.6.0/org.apache.nifi.processors.standard.QueryRecord/index.html]
 to filter out the empty columns. See [~exceptionfactory] comment earlier from 
3/20/2023.

Please see this very detailed [tutorial 
|https://medium.com/cloudera-inc/building-an-effective-nifi-flow-queryrecord-cca5ba51afd5]and
 see the section titled 'How to Use It — Filter Out Columns' .

 

As for Starting Row having a flowfile attribute scope, that I believe you can 
make a Jira ticket for. Thanks!

> Add Excel Record Reader
> -----------------------
>
>                 Key: NIFI-11167
>                 URL: https://issues.apache.org/jira/browse/NIFI-11167
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>            Reporter: David Handermann
>            Assignee: Daniel Stieglitz
>            Priority: Minor
>             Fix For: 2.0.0-M1, 1.23.0
>
>          Time Spent: 10h 10m
>  Remaining Estimate: 0h
>
> A new Excel Record Reader should be implemented to support reading XSLX 
> spreadsheet rows as NiFi Records. This Reader will enable integration with 
> various record-oriented components, obviating the need for the narrowly 
> focused ConvertExcelToCSVProcessor. The initial version of the Excel Reader 
> should not support the legacy binary XLS format.
> The ExcelReader should use a library that supports reading from a stream of 
> rows to avoid consuming large amounts of heap memory during processing.
> The ExcelReader should support configurable properties to read selected 
> sheets. With Excel supporting typed field values, some amount of field type 
> mapping will be required. Additional input filtering properties should not be 
> implemented as existing Processors like QueryRecord support a wide variety of 
> filtering and projection use cases.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Comment Edited] (NIFI-11167) Add Excel Record Reader

Reply via email to