[ 
https://issues.apache.org/jira/browse/HDDS-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16991858#comment-16991858
 ] 

YiSheng Lien commented on HDDS-2443:
------------------------------------

Hello, I have just shared a method for using pyarrow with Ozone.
Performance measurements and further details will be added to this JIRA.
(If you are interested, here is the [link|https://hackmd.io/@oT-VsVwvR6WYbprBccqsKw/cxorm]; 
see also the attachments [^Ozone with pyarrow.html] and [^Ozone with pyarrow.odt].)
Thanks.

> Python client/interface for Ozone
> ---------------------------------
>
>                 Key: HDDS-2443
>                 URL: https://issues.apache.org/jira/browse/HDDS-2443
>             Project: Hadoop Distributed Data Store
>          Issue Type: New Feature
>          Components: Ozone Client
>            Reporter: Li Cheng
>            Priority: Major
>         Attachments: Ozone with pyarrow.html, Ozone with pyarrow.odt, 
> OzoneS3.py
>
>
> Original idea: item #25 in 
> [https://cwiki.apache.org/confluence/display/HADOOP/Ozone+project+ideas+for+new+contributors]
> Ozone Client (Python) for Data Science Notebooks such as Jupyter.
>  # Size: Large
>  # PyArrow: [https://pypi.org/project/pyarrow/]
>  # Python -> libhdfs HDFS JNI library (HDFS, S3, ...) -> Java client API. 
> Impala uses libhdfs.
>  
> Paths to try:
> # S3 interface: Ozone S3 gateway (already supported) + AWS Python client 
> (boto3)
> # Python native RPC
> # pyarrow + libhdfs, which uses the Java client under the hood.
> # Python + C interface of a Go / Rust Ozone library. I created POC Go / Rust 
> clients earlier, which can be improved if the libhdfs interface is not good 
> enough. [By [~elek]]
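
As a rough illustration of the first path in the list above (Ozone S3 gateway + boto3), a minimal sketch might look like the following. The endpoint address, credentials, region, and the helper names `s3_client_kwargs` and `roundtrip` are assumptions made for illustration; they are not taken from the issue or its attachments, and a real deployment would supply its own values.

```python
"""Sketch: reach Ozone through its S3 gateway with boto3 (path 1 above)."""

# Assumed values for illustration only; adjust to the actual deployment.
S3G_ENDPOINT = "http://localhost:9878"  # hypothetical S3 gateway address
ACCESS_KEY = "ozone-access-key"         # placeholder credential
SECRET_KEY = "ozone-secret-key"         # placeholder credential


def s3_client_kwargs(endpoint=S3G_ENDPOINT,
                     access_key=ACCESS_KEY,
                     secret_key=SECRET_KEY):
    """Build the keyword arguments passed to boto3.client("s3", ...)."""
    return {
        "endpoint_url": endpoint,
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
        # boto3 needs a region to sign requests; this value is an assumption.
        "region_name": "us-east-1",
    }


def roundtrip(bucket, key, payload):
    """Create a bucket, write one object, read it back, return the bytes."""
    import boto3  # imported lazily so s3_client_kwargs works without boto3

    client = boto3.client("s3", **s3_client_kwargs())
    client.create_bucket(Bucket=bucket)
    client.put_object(Bucket=bucket, Key=key, Body=payload)
    return client.get_object(Bucket=bucket, Key=key)["Body"].read()
```

With a gateway running at the assumed endpoint, `roundtrip("demo-bucket", "hello.txt", b"hello ozone")` would be expected to return the bytes that were written. Keeping the connection settings in a separate helper makes it easy to point the same code at a different gateway.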



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: ozone-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: ozone-issues-h...@hadoop.apache.org
