TheNeuralBit commented on PR #21762: URL: https://github.com/apache/beam/pull/21762#issuecomment-1150534792
> and convert_dtypes requires looking at the entire PCollection to figure out the proxy object Right, I know we can't do `convert_dtypes` exactly, I just meant we could add something similar to it, that assumes all object columns are coercible to strings, and raises error at execution time if they're not. I think doing that conversion explicitly with the DataFrame API would be preferable to plumbing the typehint through the schema code. But maybe that uglies things up to have to do `read_csv().coerce_objects_to_strings()`? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
