[
https://issues.apache.org/jira/browse/CRUNCH-340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Shi updated CRUNCH-340:
----------------------------
Attachment: crunch-340-v3.patch
ChangeLog:
- add an example ScanHiveTable, which can dump arbitrary hive table into tsv
format
- add FromHCat, a helper class to create HCatSource
- remove HCatalogTarget code skeleton, which will be done in another ticket
> HCatSource
> ----------
>
> Key: CRUNCH-340
> URL: https://issues.apache.org/jira/browse/CRUNCH-340
> Project: Crunch
> Issue Type: New Feature
> Reporter: Chao Shi
> Attachments: crunch-340-v2.patch, crunch-340-v3.patch,
> crunch-340.patch
>
>
> This patch adds HCatSource, which enables crunch pipeline to read from Hive
> tables. This is the very first version, leaving a few TODOs in code.
> It adds new dependency from crunch-core to hcatalog (as well as several hive
> components). I guess maybe we should create a new subproject (e.g.
> crunch-hcatalog) rather than add it into crunch-core.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)