[
https://issues.apache.org/jira/browse/HADOOP-3380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12596484#action_12596484
]
Doug Cutting commented on HADOOP-3380:
--------------------------------------
> What's the relationship between this proposal and
> JobConf#getOutputValueGroupingComparator() and
> JobConf#getOutputKeyComparator()?
Those are ways to override the "natural" (or default) comparator under
MapReduce. This proposal is about defining the natural comparator. If we had
a good configurable comparator, then we perhaps wouldn't need those methods,
but I'm not sure... The framework might set io.comparator.context=grouping,
and then the configurable comparator implementation could use this to decide to
use the user-specified value of io.record.compare.grouping or somesuch. Yuck!
BTW, those methods should both be altered to return RawComparator, not a
WritableComparator, no?
> need comparators in serializer framework
> ----------------------------------------
>
> Key: HADOOP-3380
> URL: https://issues.apache.org/jira/browse/HADOOP-3380
> Project: Hadoop Core
> Issue Type: New Feature
> Components: io
> Reporter: Doug Cutting
>
> The new serialization framework permits Hadoop to incorporate different
> serialization systems, including Hadoop's Writable, Thrift, Java
> Serialization, etc. It provides a generic, extensible means
> (SerializationFactory) to create serializers and deserializers for arbitrary
> Java classes. However it does not include a generic means to create
> comparators for these classes. Comparators are required for MapReduce keys
> and many other computations. Thus we should enhance the serialization
> framwork to provide comparators too.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.