rohdesamuel commented on a change in pull request #14778:
URL: https://github.com/apache/beam/pull/14778#discussion_r637118393
##########
File path: sdks/python/apache_beam/runners/interactive/interactive_beam.py
##########
@@ -539,8 +541,16 @@ def collect(pcoll, n='inf', duration='inf',
include_window_info=False):
# Run the pipeline and bring the PCollection into memory as a Dataframe.
in_memory_square = head(square, n=5)
"""
+ # Remember the element type so we can make an informed decision on how to
+ # collect the result in elements_to_df.
if isinstance(pcoll, DeferredBase):
- pcoll = to_pcollection(pcoll)
+ # Get the proxy so we can get the output shape of the DataFrame.
+ element_type = pcoll._expr.proxy()
Review comment:
Done, added a TODO associated with that JIRA to track this.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]