[
https://issues.apache.org/jira/browse/METAMODEL-82?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152082#comment-14152082
]
Kasper Sørensen commented on METAMODEL-82:
------------------------------------------
Well yes and no. It's a tough one...
The trouble is that the data here can never be guaranteed as any other type. If
you put a number in a CSV file or an Excel sheet, it can only be treated as a
number if all the other rows have a number in this column. And how do we
guarantee that (except for obviously eager-reading all records, which isn't
acceptable IMO).
If we change this I would suggest to make a "type detection" optional (and
default disabled for compatibility). Then we could eager-read up to 1000
records or so and use the schema inference utils to determine a pretty precise
type.
> Metadata type is always VARCHAR for CSV & Excel modules
> -------------------------------------------------------
>
> Key: METAMODEL-82
> URL: https://issues.apache.org/jira/browse/METAMODEL-82
> Project: Metamodel
> Issue Type: Bug
> Reporter: Alberto Rodriguez
>
> The metadata types fetched from CSV and Excel DataContexts are always VARCHAR
> although the data of some columns are not a string.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)