(I think that I need to implement my own OutputCollector, but I don't know how to tell hadoop to use it.) How can I do this?
-Derek
You probably need to define your own OutputFormat and tell Hadoop to use it by calling setOutputFormat method of JobConf. OutputFormat instance is used to create RecordWriter instance which is used by OutputCollector to process output data. You may want to take a look at implementation of SequenceFileOutputFormat for example

Reply via email to