[ 
https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14746623#comment-14746623
 ] 

Xiaoyong Zhu commented on HADOOP-3315:
--------------------------------------

Hi guys,

Is there a TFile parser for any non-JVM language, for example Python or .NET? 
Or does anyone have a prototype that we could reference? Our scenario is that 
we want to parse TFiles offline on non-Hadoop machines. Besides what I 
mentioned above, do you have any other suggestions? Also, which source file 
should we look at for the implementation details?

Thanks!
Xiaoyong

> New binary file format
> ----------------------
>
>                 Key: HADOOP-3315
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3315
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: io
>            Reporter: Owen O'Malley
>            Assignee: Hong Tang
>             Fix For: 0.20.1
>
>         Attachments: HADOOP-3315_20080908_TFILE_PREVIEW_WITH_LZO_TESTS.patch, 
> HADOOP-3315_20080915_TFILE.patch, TFile Specification 20081217.pdf, 
> hadoop-3315-0507.patch, hadoop-3315-0509-2.patch, hadoop-3315-0509.patch, 
> hadoop-3315-0513.patch, hadoop-3315-0514.patch, hadoop-3315-0601.patch, 
> hadoop-3315-0602.patch, hadoop-3315-0605.patch, hadoop-3315-0612.patch, 
> hadoop-3315-0623-2.patch, hadoop-3315-0701-yhadoop-20.patch, 
> hadoop-3315-0710-1-hadoop-20.patch, hadoop-trunk-tfile.patch, 
> hadoop-trunk-tfile.patch
>
>
> SequenceFile's block compression format is too complex and requires 4 codecs 
> to compress or decompress. It would be good to have a file format that only 
> needs 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
