[
https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14746623#comment-14746623
]
Xiaoyong Zhu commented on HADOOP-3315:
--------------------------------------
Hi guys,
We want to check if there's a TFile parser in non-JVM languages? For example in
Python/.NET, etc., or anyone has some prototypes that we could reference...Our
scenario is that we want to parse the TFile in non-Hadoop machines offline -
besides what I mentioned above, do you have any suggestions? Also, which file
should we look at for the implementation details....?
Thanks!
Xiaoyong
> New binary file format
> ----------------------
>
> Key: HADOOP-3315
> URL: https://issues.apache.org/jira/browse/HADOOP-3315
> Project: Hadoop Common
> Issue Type: New Feature
> Components: io
> Reporter: Owen O'Malley
> Assignee: Hong Tang
> Fix For: 0.20.1
>
> Attachments: HADOOP-3315_20080908_TFILE_PREVIEW_WITH_LZO_TESTS.patch,
> HADOOP-3315_20080915_TFILE.patch, TFile Specification 20081217.pdf,
> hadoop-3315-0507.patch, hadoop-3315-0509-2.patch, hadoop-3315-0509.patch,
> hadoop-3315-0513.patch, hadoop-3315-0514.patch, hadoop-3315-0601.patch,
> hadoop-3315-0602.patch, hadoop-3315-0605.patch, hadoop-3315-0612.patch,
> hadoop-3315-0623-2.patch, hadoop-3315-0701-yhadoop-20.patch,
> hadoop-3315-0710-1-hadoop-20.patch, hadoop-trunk-tfile.patch,
> hadoop-trunk-tfile.patch
>
>
> SequenceFile's block compression format is too complex and requires 4 codecs
> to compress or decompress. It would be good to have a file format that only
> needs
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)