yadavay-amzn opened a new pull request, #3524:
URL: https://github.com/apache/parquet-java/pull/3524

   Fixes #3364
   
   ## Problem
   `parquet cat` (and other CLI commands) reject valid Parquet files with 
column names containing hyphens (e.g. `Creation-Time`). The Parquet spec allows 
any UTF-8 string as a field name, but the `AvroSchemaConverter` fails because 
Avro's `Schema.Field` name validation only allows `[A-Za-z_][A-Za-z0-9_]*`.
   
   ## Fix
   Temporarily disable Avro name validation during Parquet-to-Avro schema 
conversion in `AvroSchemaConverter.convert()`. The field names are already 
valid per the Parquet spec — the restriction is purely an Avro naming 
convention that should not apply when reading Parquet files.
   
   ## Testing
   - Added `testHyphenatedColumnName` test in `TestAvroSchemaConverter`
   - All 42 existing tests continue to pass


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to