[ 
https://issues.apache.org/jira/browse/ARROW-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rok Mihevc updated ARROW-2264:
------------------------------
    External issue URL: https://github.com/apache/arrow/issues/18222

> [Python] Efficiently serialize numpy arrays with dtype of unicode fixed 
> length string
> -------------------------------------------------------------------------------------
>
>                 Key: ARROW-2264
>                 URL: https://issues.apache.org/jira/browse/ARROW-2264
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 0.8.0
>            Reporter: Mitar
>            Assignee: Robert Nishihara
>            Priority: Major
>             Fix For: 0.10.0
>
>
> Looking at the numpy array serialization code it seems that if I have a dtype 
> like "<U3" this will go through custom ndarray serializer and not through an 
> efficient one.
> {{Example:}}{{>>> np.array(['aaa', 'bbb'])}}
> {{array(['aaa', 'bbb'], dtype='<U3')}}
> This should be able to work, no? It has fixed offsets and memory layout.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to