[ 
https://issues.apache.org/jira/browse/BEAM-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fabian updated BEAM-7266:
-------------------------
    Summary: Pipeline run does not terminate because of Dataflow runner can not 
close file system writer  (was: Pipeline run does not terminate because of 
Dataflow runner can close file system writer)

> Pipeline run does not terminate because of Dataflow runner can not close file 
> system writer
> -------------------------------------------------------------------------------------------
>
>                 Key: BEAM-7266
>                 URL: https://issues.apache.org/jira/browse/BEAM-7266
>             Project: Beam
>          Issue Type: Bug
>          Components: io-python-gcp, runner-dataflow
>    Affects Versions: 2.11.0
>            Reporter: Fabian
>            Priority: Major
>
> We are using Apache Beam in version 2.11.0 (Python SDK) with the Dataflow 
> runner running on the Google Cloud Platform. Two pipeline runs did not 
> terminate, i.e. after multiple days (instead of some minutes) they where 
> still running. The only error that was logged is:
> If fails to close a writer:
> {code:java}
> Traceback (most recent call last):
>   File 
> "/usr/local/lib/python2.7/dist-packages/dataflow_worker/batchworker.py", line 
> 649, in do_work
>     work_executor.execute()
>   File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/executor.py", 
> line 178, in execute
>     op.finish()
>   File "dataflow_worker/native_operations.py", line 93, in 
> dataflow_worker.native_operations.NativeWriteOperation.finish
>     def finish(self):
>   File "dataflow_worker/native_operations.py", line 94, in 
> dataflow_worker.native_operations.NativeWriteOperation.finish
>     with self.scoped_finish_state:
>   File "dataflow_worker/native_operations.py", line 95, in 
> dataflow_worker.native_operations.NativeWriteOperation.finish
>     self.writer.__exit__(None, None, None)
>   File 
> "/usr/local/lib/python2.7/dist-packages/dataflow_worker/nativeavroio.py", 
> line 277, in __exit__
>     self._data_file_writer.close()
>   File "/usr/local/lib/python2.7/dist-packages/avro/datafile.py", line 220, 
> in close
>     self.writer.close()
>   File 
> "/usr/local/lib/python2.7/dist-packages/apache_beam/io/filesystemio.py", line 
> 202, in close
>     self._uploader.finish()
>   File "/usr/local/lib/python2.7/dist-packages/apache_beam/io/gcp/gcsio.py", 
> line 606, in finish
>     raise self._upload_thread.last_error  # pylint: disable=raising-bad-type
> NotImplementedError{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to