[ 
https://issues.apache.org/jira/browse/DRILL-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

PJ Fanning updated DRILL-8095:
------------------------------
    Description: 
I've recently added a feature to excel-streaming-reader (in v3.3.0) to 
optionally ignore cell style information. This is not enabled by default. It 
saves memory and processing time to ignore the cell styles.

The current Drill format-excel code does not use the cell styles.

At some point in the future, it may be worth having a Drill feature that allows 
it to infer the schema for the sheet based on the cell styles but until such a 
feature is added, the parsing the cell styles is a waste of compute resources.

If this sounds, useful, I can submit a PR.

  was:
I've recently added a feature to excel-streaming-reader (in v3.3.0) to 
optionally ignore cell style information. This is not enabled by default. It 
saves memory and processing time to ignore the cell styles.

The current Drill format-excel code does not use the cell styles.

At some point in the future, it may be worth having a Drill feature that allows 
it to infer the schema for the sheet based on the cell styles but until such a 
feature is added, the parsing the cell styles is a waste of compute resources.


> format-excel reader should ignore cell styles
> ---------------------------------------------
>
>                 Key: DRILL-8095
>                 URL: https://issues.apache.org/jira/browse/DRILL-8095
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Execution - Data Types
>            Reporter: PJ Fanning
>            Priority: Major
>
> I've recently added a feature to excel-streaming-reader (in v3.3.0) to 
> optionally ignore cell style information. This is not enabled by default. It 
> saves memory and processing time to ignore the cell styles.
> The current Drill format-excel code does not use the cell styles.
> At some point in the future, it may be worth having a Drill feature that 
> allows it to infer the schema for the sheet based on the cell styles but 
> until such a feature is added, the parsing the cell styles is a waste of 
> compute resources.
> If this sounds, useful, I can submit a PR.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to