[ https://issues.apache.org/jira/browse/BEAM-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15947561#comment-15947561 ]
Tibor Kiss commented on BEAM-778: --------------------------------- I'm still working on seek() implementation and I have noticed that there is no lock to protect the {{_read_buffer}} object. I'm not completely sure if it is a valid scenario that multiple threads accessing the same _CompressedFile object though. Any thoughts on extending this class with a lock on {{_read_buffer}}? > Make fileio._CompressedFile seekable. > ------------------------------------- > > Key: BEAM-778 > URL: https://issues.apache.org/jira/browse/BEAM-778 > Project: Beam > Issue Type: Improvement > Components: sdk-py > Reporter: Chamikara Jayalath > Assignee: Tibor Kiss > Fix For: Not applicable > > > We have a TODO to make fileio._CompressedFile seekable. > https://github.com/apache/incubator-beam/blob/python-sdk/sdks/python/apache_beam/io/fileio.py#L692 > Without this, compressed file objects produce for FileBasedSource > implementations may not be able to use libraries that utilize methods seek() > and tell(). > For example tarfile.open(). -- This message was sent by Atlassian JIRA (v6.3.15#6346)