Andrew,

I'm about to dive into figuring out how to use Camus without Avro.  Perhaps we 
should join forces?  (Be warned thought! My java fu is low at the moment. :) ).

-Ao


On Aug 12, 2013, at 11:20 PM, Andrew Psaltis <andrew.psal...@webtrends.com> 
wrote:

> Kam,
> I am perfectly fine if you pick this up. After thinking about it for a
> while, we are going to upgrade to Kafka 0.8.0 and also use Camus as it
> more closely matches our use case, with the caveat of we do not use Avro.
> With that said, I will try and work on the back-port of custom data writer
> patch[1], however, I am not sure how quickly I will get this done as we
> are going to work towards upgrading our Kafka cluster.
> 
> Thanks,
> Andrew
> 
> [1] 
> https://github.com/linkedin/camus/commit/87917a2aea46da9d21c8f67129f6463af5
> 2f7aa8
> 
> 
> 
> 
> 
> On 8/12/13 6:16 PM, "Kam Kasravi" <kamkasr...@yahoo.com> wrote:
> 
>> I would like to do this refactoring since I did a high level consumer a
>> while ago. 
>> A few weeks ago I had opened KAFKA-949 Kafka on Yarn which I was also
>> hoping to add to contribute.
>> It's almost done. KAFKA-949 is paired with BIGTOP-989 which adds kafka
>> 0.8 to the bigtop distribution.
>> KAFKA-949 basically allows kafka brokers to be started up using sysvinit
>> services and would ease some of the
>> startup/configuration issues that newbies have when getting started with
>> kafka. Ideally I would like to
>> fold a number of kafka/bin/* commands into the kafka service. Andrew
>> please let me know if would like to
>> pick this up instead. Thanks!
>> 
>> Kam
>> 
>> 
>> ________________________________
>> From: Jay Kreps <jay.kr...@gmail.com>
>> To: Ken Goodhope <kengoodh...@gmail.com>
>> Cc: Andrew Psaltis <psaltis.and...@gmail.com>;
>> dibyendu.bhattacha...@pearson.com; "camus_...@googlegroups.com"
>> <camus_...@googlegroups.com>; "ao...@wikimedia.org"
>> <ao...@wikimedia.org>; Felix GV <fe...@mate1inc.com>; Cosmin Lehene
>> <cleh...@adobe.com>; "d...@kafka.apache.org" <d...@kafka.apache.org>;
>> "users@kafka.apache.org" <users@kafka.apache.org>
>> Sent: Saturday, August 10, 2013 3:30 PM
>> Subject: Re: Kafka/Hadoop consumers and producers
>> 
>> 
>> So guys, just to throw my 2 cents in:
>> 
>> 1. We aren't deprecating anything. I just noticed that the Hadoop contrib
>> package wasn't getting as much attention as it should.
>> 
>> 2. Andrew or anyone--if there is anyone using the contrib package who
>> would
>> be willing to volunteer to kind of adopt it that would be great. I am
>> happy
>> to help in whatever way I can. The practical issue is that most of the
>> committers are either using Camus or not using Hadoop at all so we just
>> haven't been doing a good job of documenting, bug fixing, and supporting
>> the contrib packages.
>> 
>> 3. Ken, if you could document how to use Camus that would likely make it a
>> lot more useful to people. I think most people would want a full-fledged
>> ETL solution and would likely prefer Camus, but very few people are using
>> Avro.
>> 
>> -Jay
>> 
>> 
>> On Fri, Aug 9, 2013 at 12:27 PM, Ken Goodhope <kengoodh...@gmail.com>
>> wrote:
>> 
>>> I just checked and that patch is in .8 branch.   Thanks for working on
>>> back porting it Andrew.  We'd be happy to commit that work to master.
>>> 
>>> As for the kafka contrib project vs Camus, they are similar but not
>>> quite
>>> identical.  Camus is intended to be a high throughput ETL for bulk
>>> ingestion of Kafka data into HDFS.  Where as what we have in contrib is
>>> more of a simple KafkaInputFormat.  Neither can really replace the
>>> other.
>>> If you had a complex hadoop workflow and wanted to introduce some Kafka
>>> data into that workflow, using Camus would be a gigantic overkill and a
>>> pain to setup.  On the flipside, if what you want is frequent reliable
>>> ingest of Kafka data into HDFS, a simple InputFormat doesn't provide you
>>> with that.
>>> 
>>> I think it would be preferable to simplify the existing contrib
>>> Input/OutputFormats by refactoring them to use the more stable higher
>>> level
>>> Kafka APIs.  Currently they use the lower level APIs.  This should make
>>> them easier to maintain, and user friendly enough to avoid the need for
>>> extensive documentation.
>>> 
>>> Ken
>>> 
>>> 
>>> On Fri, Aug 9, 2013 at 8:52 AM, Andrew Psaltis
>>> <psaltis.and...@gmail.com>wrote:
>>> 
>>>> Dibyendu,
>>>> According to the pull request:
>>>> https://github.com/linkedin/camus/pull/15it was merged into the
>>>> camus-kafka-0.8
>>>> branch. I have not checked if the code was subsequently removed,
>>>> however,
>>>> two at least one the important files from this patch
>>>> (camus-api/src/main/java/com/linkedin/camus/etl/RecordWriterProvider.jav
>>>> a)
>>>> is still present.
>>>> 
>>>> Thanks,
>>>> Andrew
>>>> 
>>>> 
>>>> On Fri, Aug 9, 2013 at 9:39 AM,
>>>> <dibyendu.bhattacha...@pearson.com>wrote:
>>>> 
>>>>> Hi Ken,
>>>>> 
>>>>> I am also working on making the Camus fit for Non Avro message for our
>>>>> requirement.
>>>>> 
>>>>> I see you mentioned about this patch (
>>>>> 
>>>>> https://github.com/linkedin/camus/commit/87917a2aea46da9d21c8f67129f646
>>>>> 3af52f7aa8)
>>>>> which supports custom data writer for Camus. But this patch is not
>>>>> pulled
>>>>> into camus-kafka-0.8 branch. Is there any plan for doing the same ?
>>>>> 
>>>>> Regards,
>>>>> Dibyendu
>>>>> 
>>>>> --
>>>>> You received this message because you are subscribed to a topic in the
>>>>> Google Groups "Camus - Kafka ETL for Hadoop" group.
>>>>> To unsubscribe from this topic, visit
>>>>> https://groups.google.com/d/topic/camus_etl/KKS6t5-O-Ng/unsubscribe.
>>>>> To unsubscribe from this group and all its topics, send an email to
>>>>> camus_etl+unsubscr...@googlegroups.com.
>>>>> For more options, visit https://groups.google.com/groups/opt_out.
>>>>> 
>>>> 
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups
>>>> "Camus - Kafka ETL for Hadoop" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an
>>>> email to camus_etl+unsubscr...@googlegroups.com.
>>>> For more options, visit https://groups.google.com/groups/opt_out.
>>>> 
>>>> 
>>>> 
>>> 
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups
>>> "Camus - Kafka ETL for Hadoop" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an
>>> email to camus_etl+unsubscr...@googlegroups.com.
>>> For more options, visit https://groups.google.com/groups/opt_out.
> 
> -- 
> You received this message because you are subscribed to the Google Groups 
> "Camus - Kafka ETL for Hadoop" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to camus_etl+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/groups/opt_out.
> 
> 

Reply via email to