novemberkilo commented on pull request #1063: URL: https://github.com/apache/arrow-rs/pull/1063#issuecomment-1053939744
@alamb I have an update. 1. I have confirmed that the issue persists as of current `master` 2. I have re-examined and confirmed the approach -- I think that the test here is correct 3. I've built `json2parquet` with the fix that appears in this PR (see [this comment](https://github.com/apache/arrow-rs/pull/1063#pullrequestreview-838891472) specifically) and this results in a parquet file. 4. I can use `pyarrow` to read the file written out in step 3. I am not sure whether I have uncovered a bug with `array_reader` or if I am writing out a dud parquet file. I want to investigate further and if I am reasonably sure that I have uncovered a bug, I will create an issue with a minimal example to reproduce. I am going to close this PR for now because it is stale and a bit noisy. I will recreate a clean PR when I am clearer about the outcome of the code that appears here. Copying @tustvold because I wonder if this is now close to the work you've done recently in https://github.com/apache/arrow-rs/pull/1246 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
