[
https://issues.apache.org/jira/browse/DRILL-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
PJ Fanning updated DRILL-8095:
------------------------------
Description:
I've recently added a feature to excel-streaming-reader (in v3.3.0) to
optionally ignore cell style information. This is not enabled by default. It
saves memory and processing time to ignore the cell styles.
The current Drill format-excel code does not use the cell styles.
At some point in the future, it may be worth having a Drill feature that allows
it to infer the schema for the sheet based on the cell styles but until such a
feature is added, the parsing the cell styles is a waste of compute resources.
If this sounds, useful, I can submit a PR.
was:
I've recently added a feature to excel-streaming-reader (in v3.3.0) to
optionally ignore cell style information. This is not enabled by default. It
saves memory and processing time to ignore the cell styles.
The current Drill format-excel code does not use the cell styles.
At some point in the future, it may be worth having a Drill feature that allows
it to infer the schema for the sheet based on the cell styles but until such a
feature is added, the parsing the cell styles is a waste of compute resources.
> format-excel reader should ignore cell styles
> ---------------------------------------------
>
> Key: DRILL-8095
> URL: https://issues.apache.org/jira/browse/DRILL-8095
> Project: Apache Drill
> Issue Type: Improvement
> Components: Execution - Data Types
> Reporter: PJ Fanning
> Priority: Major
>
> I've recently added a feature to excel-streaming-reader (in v3.3.0) to
> optionally ignore cell style information. This is not enabled by default. It
> saves memory and processing time to ignore the cell styles.
> The current Drill format-excel code does not use the cell styles.
> At some point in the future, it may be worth having a Drill feature that
> allows it to infer the schema for the sheet based on the cell styles but
> until such a feature is added, the parsing the cell styles is a waste of
> compute resources.
> If this sounds, useful, I can submit a PR.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)