pokemaster7 opened a new issue #9104:
URL: https://github.com/apache/arrow/issues/9104
Is it possible to embed a feather file in another file (with known
offset/length) and read the feather portion in a correct and performant way?
Here is a naive idea of what I'm trying to do, though it throws an error for
some reason:
``` python
import pandas as pd
import numpy as np
import os
df = pd.DataFrame(np.random.randint(0,100,size=(15, 4)),
columns=list('ABCD'))
pth = "TMP"
with open(pth, "wb") as fh:
fh.write(b"\x01") # custom header, one byte
df.to_feather(fh)
with open(pth, "rb") as gh:
gh.read(1) # read header
print(pd.read_feather(gh)) # throws 'pyarrow.lib.ArrowInvalid: Not a
Feather V1 or Arrow IPC file'
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]