[ 
https://issues.apache.org/jira/browse/PHOENIX-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prashant Vithani updated PHOENIX-5258:
--------------------------------------
    Description: 
Currently, CsvBulkLoadTool does not support reading header from the input csv 
and expects the content of the csv to match with the table schema. The support 
for the header can be added to dynamically map the schema with the header.

The proposed solution is to introduce another option for the tool `–header`. If 
this option is passed, the input columns list is constructed by reading the 
first line of the input CSV file.
 * If there is only one file, read the header from the first line and generate 
the `ColumnInfo` list.
 * If there are multiple files, read the header from all the files, and throw 
an error if the headers across files do not match.

  was:Currently, CsvBulkLoadTool does not support reading header from the input 
csv and expects the content of the csv to match with the table schema. The 
support for the header can be added to dynamically map the schema with the 
header.


> Add support for header in input CSV for CsvBulkLoadTool
> -------------------------------------------------------
>
>                 Key: PHOENIX-5258
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-5258
>             Project: Phoenix
>          Issue Type: Improvement
>            Reporter: Prashant Vithani
>            Priority: Minor
>
> Currently, CsvBulkLoadTool does not support reading header from the input csv 
> and expects the content of the csv to match with the table schema. The 
> support for the header can be added to dynamically map the schema with the 
> header.
> The proposed solution is to introduce another option for the tool `–header`. 
> If this option is passed, the input columns list is constructed by reading 
> the first line of the input CSV file.
>  * If there is only one file, read the header from the first line and 
> generate the `ColumnInfo` list.
>  * If there are multiple files, read the header from all the files, and throw 
> an error if the headers across files do not match.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to