[ https://issues.apache.org/jira/browse/ARROW-11978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Pavol Knapek updated ARROW-11978:
---------------------------------
    Description:
It would be very nice to have opt-in dynamic casting supported in the JSON schema inference process.

Example input.json file:
{"col1": "1"}
{"col1": 1}

Example schema-inference invocation:
{{pyarrow.json.read_json('input.json')}}

Expected output:
pyarrow.Table with a schema of {col1: string}

Actual output:
{{ArrowInvalid: JSON parse error: Column(/col1) changed from string to number in row 1}}

This applies to all DataTypes that are convertible to a super-type, e.g.:
Integer -> String
Object -> String
Anything -> String

> [Python] Dynamic casting during JSON schema inference
> -----------------------------------------------------
>
>                 Key: ARROW-11978
>                 URL: https://issues.apache.org/jira/browse/ARROW-11978
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 3.0.0
>            Reporter: Pavol Knapek
>            Priority: Major
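
The promotion rule requested in the description (a column whose values disagree in type widens to string) can be sketched in plain Python with only the standard library. This is an illustration of the desired behaviour, not pyarrow's implementation or API: the helper names {{kind_of}}, {{promote}}, {{infer_schema}} and {{cast_row}} are hypothetical, the type names are illustrative, and treating null as compatible with every type is an assumption.

```python
import json


def kind_of(value):
    """Map a decoded JSON value to a coarse type name (names are illustrative)."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # bool must be checked before int (bool subclasses int)
        return "bool"
    if isinstance(value, int):
        return "integer"
    if isinstance(value, float):
        return "number"
    if isinstance(value, str):
        return "string"
    return "object" if isinstance(value, dict) else "list"


def promote(current, new):
    """Return the super-type of two kinds; any real conflict widens to string."""
    if current == new or new == "null":  # assumption: null fits every type
        return current
    if current == "null":
        return new
    return "string"


def infer_schema(lines):
    """Infer one kind per column from line-delimited JSON, promoting on conflict."""
    schema = {}
    for line in lines:
        if not line.strip():
            continue
        for name, value in json.loads(line).items():
            kind = kind_of(value)
            schema[name] = promote(schema[name], kind) if name in schema else kind
    return schema


def cast_row(row, schema):
    """Render non-string values as JSON text in columns promoted to string."""
    out = {}
    for name, value in row.items():
        if schema[name] == "string" and value is not None and not isinstance(value, str):
            value = json.dumps(value)
        out[name] = value
    return out


lines = ['{"col1": "1"}', '{"col1": 1}']
schema = infer_schema(lines)
rows = [cast_row(json.loads(line), schema) for line in lines]
print(schema)
print(rows)
```

With the two-line input from the description, col1 is inferred as string and the integer 1 is rendered as the text "1" instead of raising an error. An opt-in flag, as the description asks for, would keep the current strict behaviour as the default.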