[
https://issues.apache.org/jira/browse/ARROW-4836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17661858#comment-17661858
]
Rok Mihevc commented on ARROW-4836:
-----------------------------------
This issue has been migrated to [issue
#21350|https://github.com/apache/arrow/issues/21350] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [Python] "Cannot tell() a compressed stream" when using
> RecordBatchStreamWriter
> -------------------------------------------------------------------------------
>
> Key: ARROW-4836
> URL: https://issues.apache.org/jira/browse/ARROW-4836
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.12.1
> Reporter: Mike Pedersen
> Assignee: Antoine Pitrou
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.15.0
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> RecordBatchStreamWriter does not seem to work with compressed streams:
> {code:python}
> >>> import pyarrow as pa
> >>> pa.__version__
> '0.12.1'
> >>> stream = pa.output_stream('/tmp/a.gz')
> >>> batch = pa.RecordBatch.from_arrays([pa.array([1])], ['a'])
> >>> writer = pa.RecordBatchStreamWriter(stream, batch.schema)
> >>> writer.write(batch)
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "pyarrow/ipc.pxi", line 181, in pyarrow.lib._RecordBatchWriter.write
>   File "pyarrow/ipc.pxi", line 196, in pyarrow.lib._RecordBatchWriter.write_batch
>   File "pyarrow/error.pxi", line 89, in pyarrow.lib.check_status
> pyarrow.lib.ArrowNotImplementedError: Cannot tell() a compressed stream
> {code}
> As I understand the documentation, this should be possible, right?