yadavay-amzn opened a new pull request, #3524: URL: https://github.com/apache/parquet-java/pull/3524
Fixes #3364 ## Problem `parquet cat` (and other CLI commands) reject valid Parquet files with column names containing hyphens (e.g. `Creation-Time`). The Parquet spec allows any UTF-8 string as a field name, but the `AvroSchemaConverter` fails because Avro's `Schema.Field` name validation only allows `[A-Za-z_][A-Za-z0-9_]*`. ## Fix Temporarily disable Avro name validation during Parquet-to-Avro schema conversion in `AvroSchemaConverter.convert()`. The field names are already valid per the Parquet spec — the restriction is purely an Avro naming convention that should not apply when reading Parquet files. ## Testing - Added `testHyphenatedColumnName` test in `TestAvroSchemaConverter` - All 42 existing tests continue to pass -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
