[
https://issues.apache.org/jira/browse/BEAM-11155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17548963#comment-17548963
]
Danny McCormick commented on BEAM-11155:
----------------------------------------
This issue has been migrated to https://github.com/apache/beam/issues/20573
> Series.str.repeat zipping operation produces incorrect proxy
> ------------------------------------------------------------
>
> Key: BEAM-11155
> URL: https://issues.apache.org/jira/browse/BEAM-11155
> Project: Beam
> Issue Type: Bug
> Components: sdk-py-core
> Reporter: Brian Hulette
> Priority: P3
>
> https://github.com/apache/beam/pull/13139#discussion_r513684704
> This proxy is incorrectly inferred as bool.
> {code}
> In [10]: proxy.dtypes
> Out[10]:
> str object
> repeats int64
> dtype: object
> In [11]: proxy.str.str.repeat(proxy.repeats)
> Out[11]: Series([], Name: str, dtype: bool)
> {code}
> The actual operation does produce object though:
> {code}
> In [13]: df.str.str.repeat(df.repeats)
> Out[13]:
> 0 AAA
> 1 B
> 2 CCCC
> 3 DDDDD
> 4 EE
> Name: str, dtype: object
> {code}
> Currently we work around this by specifying the proxy manually, maybe it can
> be fixed upstream?
--
This message was sent by Atlassian Jira
(v8.20.7#820007)