[
https://issues.apache.org/jira/browse/PHOENIX-7278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nihal Jain updated PHOENIX-7278:
--------------------------------
Labels: bulkload (was: )
> Add support to dump BAD_ROWS in a directory for CsvBulkLoadTool
> ----------------------------------------------------------------
>
> Key: PHOENIX-7278
> URL: https://issues.apache.org/jira/browse/PHOENIX-7278
> Project: Phoenix
> Issue Type: Improvement
> Reporter: Nihal Jain
> Assignee: Nihal Jain
> Priority: Major
> Labels: bulkload
>
> CsvBulkLoadTool should have the functionality to dump BAD_ROWS into a
> specified directory. This will enhance the tool's error handling capabilities
> and provide users with a clear understanding of which rows have failed to
> load during the bulk import process.
> * CsvBulkLoadTool should have a feature to identify and isolate BAD_ROWS
> during the bulk import process.
> * The tool should provide an option for users to specify a directory where
> these BAD_ROWS will be dumped.
> * Upon execution, if there are any BAD_ROWS, the tool should create a file
> in the specified directory containing these rows.
> * The file should clearly indicate the reason for each row being labeled as
> BAD_ROW, such as data inconsistency, format error, etc.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)