[ 
https://issues.apache.org/jira/browse/CARBONDATA-400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravindra Pesala resolved CARBONDATA-400.
----------------------------------------
       Resolution: Fixed
    Fix Version/s: 1.1.0-incubating

> [Bad Records] Load data is fail and displaying the string value in beeline as 
> exception
> ---------------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-400
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-400
>             Project: CarbonData
>          Issue Type: Bug
>          Components: data-load
>    Affects Versions: 0.1.0-incubating
>         Environment: 3node cluster
>            Reporter: MAKAMRAGHUVARDHAN
>            Assignee: Akash R Nilugal
>            Priority: Minor
>             Fix For: 1.1.0-incubating
>
>          Time Spent: 4h 40m
>  Remaining Estimate: 0h
>
> Steps
> 1. Create table
> CREATE TABLE String_test2 (string_col string) STORED BY 
> 'org.apache.carbondata.format';
> 2. Load the data with 'BAD_RECORDS_ACTION'='FORCE' from a CSV file that 
> contains a string value longer than the parser's per-column character limit.
> LOAD DATA INPATH 'hdfs://hacluster/Carbon/Priyal/string5.csv' into table 
> String_test2 OPTIONS('DELIMITER'=',' , 
> 'QUOTECHAR'='"','BAD_RECORDS_LOGGER_ENABLE'='TRUE', 
> 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='string_col');
> Actual Result: The data load fails, and the string value is displayed in 
> beeline as part of the exception trace.
> Expected Result: A clear error message should be displayed, and the exception 
> trace should not be printed on the console.
> The exception thrown on the console is shown below.
> Error: com.univocity.parsers.common.TextParsingException: Error processing 
> input: Length of parsed input (100001) exceeds the maximum number of 
> characters defined in your parser settings (100000).
> Hint: Number of characters processed may have exceeded limit of 100000 
> characters per column. Use settings.setMaxCharsPerColumn(int) to define the 
> maximum number of characters a column can have
> Ensure your configuration is correct, with delimiters, quotes and escape 
> sequences that match the input format you are trying to parse
> Parser Configuration: CsvParserSettings:
>         Column reordering enabled=true
>         Empty value=null
>         Header extraction enabled=false
>         Headers=null
>         Ignore leading whitespaces=true
>         Ignore trailing whitespaces=true
>         Input buffer size=128
>         Input reading on separate thread=false
>         Line separator detection enabled=false
>         Maximum number of characters per column=100000
>         Maximum number of columns=20480
>         Null value=
>         Number of records to read=all
>         Parse unescaped quotes=true
>         Row processor=none
>         Selected fields=none
>         Skip empty lines=true
> Format configuration:
>         CsvFormat:
>                 Comment character=#
>                 Field delimiter=,
>                 Line separator (normalized)=\n
>                 Line separator sequence=\n
>                 Quote character="
>                 Quote escape character=quote escape
>                 Quote escape escape character=\0, line=0, char=100002. 
> Content parsed: 
> [hellohowareyouwelcomehellohellohellohellohellohellohellohelloheellooabcdefghijklmnopqrstuvwxyzabcqwertuyioplkjhgfdsazxcvbnmpoiuytrewqasdfghjklmnbvcxzasdghskhdgkhdbkshkjchskdhfssudkdjdudusdjhdshdshsjddshjdkdhgdhdshdhdududushdudududududududududududududududududuudududududududuudududududududududududududududududududududududududududuhellohowareyouwelcomehellohellohellohellohellohellohellohelloheellooabcdefghijklmnopqrstuvwxyzabcqwertuyioplkjhgfdsazxcvbnmpoiuytrewqasdfghjklmnbvcxzasdghskhdgkhdbkshkjchskdhfssudkdjdudusdjhdshdshsjddshjdkdhgdhdshdhdududushdudududududududududududududududududuudududududududuududududududududuu
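
The failure above is triggered by a single field longer than the 100000-character per-column limit reported in the parser configuration. As a minimal sketch of the expected behaviour (a short message instead of a full trace), the following pre-check scans a CSV for oversized fields before a load is attempted. It is not part of CarbonData or of the fix for this issue; the function names `find_oversized_fields` and `describe` are hypothetical, and the limit is assumed to match the value shown in the trace.

```python
import csv
import io

# Assumption: 100000 mirrors the "Maximum number of characters per column"
# value shown in the parser configuration above.
MAX_CHARS_PER_COLUMN = 100000


def find_oversized_fields(lines, limit=MAX_CHARS_PER_COLUMN,
                          delimiter=",", quotechar='"'):
    """Return (row_number, column_index, length) for each field longer than limit.

    `lines` is any iterable of CSV text lines, for example an open file.
    Row numbers start at 1, column indexes at 0.
    """
    # Python's csv module has its own per-field limit. Raise it (process-wide)
    # so that moderately oversized fields are reported here rather than
    # failing inside the csv module. Fields beyond this raised limit would
    # still raise csv.Error.
    csv.field_size_limit(max(csv.field_size_limit(), limit * 10))

    oversized = []
    reader = csv.reader(lines, delimiter=delimiter, quotechar=quotechar)
    for row_number, row in enumerate(reader, start=1):
        for column_index, value in enumerate(row):
            if len(value) > limit:
                oversized.append((row_number, column_index, len(value)))
    return oversized


def describe(oversized, limit=MAX_CHARS_PER_COLUMN):
    """Build one short, trace-free message per offending field."""
    return [
        f"row {r}, column {c}: {n} characters exceeds the limit of {limit}"
        for r, c, n in oversized
    ]


if __name__ == "__main__":
    # Hypothetical in-memory sample: one normal row, one oversized row.
    sample = io.StringIO("short\n" + "x" * 100001 + "\n")
    for message in describe(find_oversized_fields(sample)):
        print(message)
```

The check reports only the position and length of each offending field, so the field content itself never reaches the console.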



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
