[
https://issues.apache.org/jira/browse/HIVE-5871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14124301#comment-14124301
]
Lefty Leverenz commented on HIVE-5871:
--------------------------------------
Doc note: MultiDelimitSerDe needs to be documented in the wiki (with version
information and a link to this JIRA ticket).
It belongs in some existing docs, and a new doc with limitations and usage
examples could be a child page to the SerDe doc (or a new section in the SerDe
doc):
* [SerDe -- Built-in, Third-Party, and Custom SerDes |
https://cwiki.apache.org/confluence/display/Hive/SerDe#SerDe-Built-in,Third-Party,andCustomSerDes]
* [DDL -- Create Table -- Row Format, Storage Format, and SerDe |
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-RowFormat,StorageFormat,andSerDe]
* possibly [DDL -- Add SerDe Properties |
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-AddSerDeProperties]
* [Developer Guide -- Hive SerDe (add to "Also:" list at end of section) |
https://cwiki.apache.org/confluence/display/Hive/DeveloperGuide#DeveloperGuide-HiveSerDe]
* optionally [HCatalog Storage Formats -- SerDes and Storage Formats (first
paragraph) |
https://cwiki.apache.org/confluence/display/Hive/HCatalog+StorageFormats#HCatalogStorageFormats-SerDesandStorageFormats]
A release note could include the example from [~lirui]'s first comment.
> Use multiple-characters as field delimiter
> ------------------------------------------
>
> Key: HIVE-5871
> URL: https://issues.apache.org/jira/browse/HIVE-5871
> Project: Hive
> Issue Type: Improvement
> Components: Contrib
> Affects Versions: 0.12.0
> Reporter: Rui Li
> Assignee: Rui Li
> Labels: TODOC14
> Fix For: 0.14.0
>
> Attachments: HIVE-5871.2.patch, HIVE-5871.3.patch, HIVE-5871.4.patch,
> HIVE-5871.5.patch, HIVE-5871.6.patch, HIVE-5871.patch
>
>
> By default, hive only allows user to use single character as field delimiter.
> Although there's RegexSerDe to specify multiple-character delimiter, it can
> be daunting to use, especially for amateurs.
> In the patch, I add a new SerDe named MultiDelimitSerDe. With
> MultiDelimitSerDe, users can specify a multiple-character field delimiter
> when creating tables, in a way most similar to typical table creations.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)