[
https://issues.apache.org/jira/browse/DRILL-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
PJ Fanning updated DRILL-8095:
------------------------------
Description: Upgrade to latest POI release (was: I've recently added a
feature to excel-streaming-reader (in v3.3.0) to optionally ignore cell style
information. This is not enabled by default. It saves memory and processing
time to ignore the cell styles.
The current Drill format-excel code does not use the cell styles.
At some point in the future, it may be worth having a Drill feature that allows
it to infer the schema for the sheet based on the cell styles but until such a
feature is added, parsing the cell styles is a waste of compute resources.
If this sounds useful, I can submit a PR.)
Summary: format-excel reader - upgrade to POI 5.2.0 (was: format-excel
reader should ignore cell styles)
It appears that the Drill code needs the Excel styles to work out if the cell data
is a cell, so we need to keep parsing the style data.
> format-excel reader - upgrade to POI 5.2.0
> ------------------------------------------
>
> Key: DRILL-8095
> URL: https://issues.apache.org/jira/browse/DRILL-8095
> Project: Apache Drill
> Issue Type: Improvement
> Components: Execution - Data Types
> Reporter: PJ Fanning
> Priority: Major
>
> Upgrade to latest POI release