[ https://issues.apache.org/jira/browse/ARROW-11978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Pavol Knapek updated ARROW-11978: --------------------------------- Description: It would be very nice to have some opt-in dynamic casting supported in the JSON schema inference process. Example input.json file: > \{"col1": "1"} > \{"col1": 1} Example schema-inference invocation: > pyarrow.json.read_json('input.json') Expected output: > pyarrow.Table with a schema of \{col1: string} Actual output: > ArrowInvalid: JSON parse error: Column(/col1) changed from string to number > in row 1 This applies for all the DataTypes, convertible to a super-type, i.e.: > Integer -> String > Object -> String > Anything -> String was: It would be very nice to have some opt-in dynamic casting supported in the JSON schema inference process. Example input.json file: {"col1": "1"} \{"col1": 1} Example schema-inference invocation: {{pyarrow.json.read_json('input.json')}} Expected output: pyarrow.Table with a schema of \{col1: string} Actual output: {{ArrowInvalid: JSON parse error: Column(/col1) changed from string to number in row 1}} This applies for all the DataTypes, convertible to a super-type, i.e.: Integer -> String Object -> String Anything -> String > [Python] Dynamic casting during JSON schema inference > ----------------------------------------------------- > > Key: ARROW-11978 > URL: https://issues.apache.org/jira/browse/ARROW-11978 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Affects Versions: 3.0.0 > Reporter: Pavol Knapek > Priority: Major > > It would be very nice to have some opt-in dynamic casting supported in the > JSON schema inference process. > > Example input.json file: > > \{"col1": "1"} > > \{"col1": 1} > > Example schema-inference invocation: > > pyarrow.json.read_json('input.json') > > Expected output: > > pyarrow.Table with a schema of \{col1: string} > > Actual output: > > ArrowInvalid: JSON parse error: Column(/col1) changed from string to number > > in row 1 > > This applies for all the DataTypes, convertible to a super-type, i.e.: > > Integer -> String > > Object -> String > > Anything -> String -- This message was sent by Atlassian Jira (v8.3.4#803005)