[ 
https://issues.apache.org/jira/browse/DRILL-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17474659#comment-17474659
 ] 

Charles Givre commented on DRILL-8095:
--------------------------------------

[~pj.fanning] It would definitely be helpful to add this feature.  I don't know 
if this is related or not, but we've also run into some issues where the 
formatting seems to cause errors in parsing. 

> format-excel reader should ignore cell styles
> ---------------------------------------------
>
>                 Key: DRILL-8095
>                 URL: https://issues.apache.org/jira/browse/DRILL-8095
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Execution - Data Types
>            Reporter: PJ Fanning
>            Priority: Major
>
> I've recently added a feature to excel-streaming-reader (in v3.3.0) to 
> optionally ignore cell style information. This is not enabled by default. It 
> saves memory and processing time to ignore the cell styles.
> The current Drill format-excel code does not use the cell styles.
> At some point in the future, it may be worth having a Drill feature that 
> allows it to infer the schema for the sheet based on the cell styles but 
> until such a feature is added, the parsing the cell styles is a waste of 
> compute resources.
> If this sounds, useful, I can submit a PR.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to