[ https://issues.apache.org/jira/browse/ORC-8?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15105933#comment-15105933 ]
ASF GitHub Bot commented on ORC-8:
----------------------------------
Github user omalley commented on the pull request:
https://github.com/apache/orc/pull/15#issuecomment-172677305
Ok, I just extended the Reader API to get the additional details. I also
added a "--verbose" flag to file-metadata that prints the streams and encoding
information for each stripe.
```
owen@laptop> tools/src/file-metadata -v ../examples/decimal.orc
{ "name": "../examples/decimal.orc",
  "type": "struct<_col0:decimal(10,5)>",
  "rows": 6000,
  "stripe count": 1,
  "format": "0.12", "writer version": "original",
  "compression": "none",
  "file length": 16337,
  "content": 16186, "stripe stats": 44, "footer": 89, "postscript": 17,
  "row index stride": 10000,
  "user metadata": {
  },
  "stripes": [
    { "stripe": 0, "rows": 6000,
      "offset": 3, "length": 16183,
      "index": 54, "data": 16079, "footer": 50,
      "encodings": [
        { "column": 0, "encoding": "direct" },
        { "column": 1, "encoding": "direct rle2" }
      ],
      "streams": [
        { "id": 0, "column": 0, "kind": "index",
          "offset": 3, "length": 7 },
        { "id": 1, "column": 1, "kind": "index",
          "offset": 10, "length": 47 },
        { "id": 2, "column": 1, "kind": "present",
          "offset": 57, "length": 12 },
        { "id": 3, "column": 1, "kind": "data",
          "offset": 69, "length": 13673 },
        { "id": 4, "column": 1, "kind": "secondary",
          "offset": 13742, "length": 2394 }
      ]
    }
  ]
}
```
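The per-stream offsets and lengths in the verbose output can be cross-checked against the stripe's "index", "data", and "footer" sizes. The following is only a rough Python sketch of such a check, not part of the ORC tools: the helper name `check_stripe` is hypothetical, and it assumes the output is valid JSON with the field names shown above and that only streams of kind "index" count toward the index section.

```python
import json

# Stripe entry copied from the verbose output above ("encodings" omitted).
STRIPE_JSON = """
{ "stripe": 0, "rows": 6000,
  "offset": 3, "length": 16183,
  "index": 54, "data": 16079, "footer": 50,
  "streams": [
    { "id": 0, "column": 0, "kind": "index", "offset": 3, "length": 7 },
    { "id": 1, "column": 1, "kind": "index", "offset": 10, "length": 47 },
    { "id": 2, "column": 1, "kind": "present", "offset": 57, "length": 12 },
    { "id": 3, "column": 1, "kind": "data", "offset": 69, "length": 13673 },
    { "id": 4, "column": 1, "kind": "secondary", "offset": 13742, "length": 2394 }
  ]
}
"""


def check_stripe(stripe):
    """Hypothetical helper: return a list of layout problems (empty if none)."""
    problems = []
    expected_offset = stripe["offset"]  # first stream starts at the stripe offset
    index_bytes = 0
    data_bytes = 0
    for stream in stripe["streams"]:
        # Each stream should begin exactly where the previous one ended.
        if stream["offset"] != expected_offset:
            problems.append(
                "stream %d starts at %d, expected %d"
                % (stream["id"], stream["offset"], expected_offset)
            )
        expected_offset = stream["offset"] + stream["length"]
        # Assumption: only streams of kind "index" belong to the index section.
        if stream["kind"] == "index":
            index_bytes += stream["length"]
        else:
            data_bytes += stream["length"]
    if index_bytes != stripe["index"]:
        problems.append(
            "index streams total %d, stripe reports %d"
            % (index_bytes, stripe["index"])
        )
    if data_bytes != stripe["data"]:
        problems.append(
            "data streams total %d, stripe reports %d"
            % (data_bytes, stripe["data"])
        )
    total = stripe["index"] + stripe["data"] + stripe["footer"]
    if total != stripe["length"]:
        problems.append(
            "index + data + footer is %d, stripe length is %d"
            % (total, stripe["length"])
        )
    return problems


if __name__ == "__main__":
    for problem in check_stripe(json.loads(STRIPE_JSON)):
        print(problem)
```

For the decimal.orc stripe shown above, the list comes back empty: the two index streams add up to 54 bytes, the remaining three to 16079, and 54 + 16079 + 50 equals the stripe length of 16183.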
> Replace tool implementation to use Reader interface
> ---------------------------------------------------
>
> Key: ORC-8
> URL: https://issues.apache.org/jira/browse/ORC-8
> Project: Orc
> Issue Type: Bug
> Reporter: Owen O'Malley
> Assignee: Owen O'Malley
>
> Currently FileMetadata is using custom parsing of the ORC footer and
> postscript. It should use the standard interface from Reader.