Review Request 34626: SAMZA-688 Fix for YarnJobFactory not supporting 'https' filesystem

2015-05-22 Thread Jose Barrueta
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/34626/ --- Review request for samza. Repository: samza Description --- Adding 'fs.h

Re: Not able to consume produced data

2015-05-22 Thread Yan Fang
Hi Renato, There may be a misunderstanding of the concept. A Consumer feeds messages into Samza, while a Producer sends messages from Samza to other systems. So if you implement the Consumer, you should be able to see the messages in the StreamTask. That's why you confused Naveen. Cheers, Fang, Y

Re: Not able to consume produced data

2015-05-22 Thread Guozhang Wang
Hello Renato, Could you paste your console consumer command here? Guozhang On Fri, May 22, 2015 at 9:43 AM, Naveen S wrote: > Hey Renato, > Is there any specific reason why you are extending the > blocking envelope class instead of implementing a StreamTask ? > http://samza

Re: Not able to consume produced data

2015-05-22 Thread Naveen S
Hey Renato, Is there any specific reason why you are extending the blocking envelope class instead of implementing a StreamTask ? http://samza.apache.org/learn/documentation/0.9/api/overview.html --Naveen On Fri, May 22, 2015 at 8:06 AM, Renato Marroquín Mogrovejo < renatoj.m

Fwd: Not able to consume produced data

2015-05-22 Thread Renato Marroquín Mogrovejo
Hi there, I have developed an RSS reader application using Samza but I am having problems when using kafka-console-consumer to read what has been gotten from the RSS feeds. My RssConsumer uses an object RssFeed which fetches (until stopped by the RssConsumer) all new rss feeds and hands them back

Re: Samza job throughput much lower than Kafka throughput

2015-05-22 Thread George Li
Hi Guozhang, I pre-populated the topic with 25 million rows. There are no new writes to the topic. Below are the metrics for a run that consumes about 6 million rows from topic throughput-test2 at the 60th second. To isolate the problem, I am going to write a simple non-samza program that consum

Re: Number of partitions

2015-05-22 Thread Davide Simoncelli
Garry, Thanks for sharing. That was an interesting read! I have a few questions for you. How did you test your needed throughput? Only at Kafka level or with your Samza job as well? Did you measure it with metrics? Thanks Davide > On 21 May 2015, at 8:50 pm, Garry Turkington > wrote: > > Hi,

Re: Review Request 34564: SAMZA-401: add utilization metrics for the event loop

2015-05-22 Thread Yan Fang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/34564/#review84886 --- samza-core/src/main/scala/org/apache/samza/container/RunLoop.scala