Re: Review Request 35577: Fix SAMZA-714

2015-06-17 Thread Guozhang Wang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35577/ --- (Updated June 17, 2015, 6:48 p.m.) Review request for samza. Bugs: SAMZA-714

Re: Review Request 35348: SAMZA-707 Remove dependency on samza-serializers in the samza.apache.org/startup/download page

2015-06-17 Thread Aleksandar Bircakovic
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35348/ --- (Updated June 17, 2015, 8:15 p.m.) Review request for samza. Changes ---

Re: Measuring Samza Job Throughput

2015-06-17 Thread Yi Pan
Hi, Milinda, Tao @LinkedIn has done some Samza benchmark test using a standard word-count task. You may want to reach out to him for some detailed ideas on how to set up the perf tests. Best! -Yi On Wed, Jun 17, 2015 at 11:25 AM, Milinda Pathirage mpath...@umail.iu.edu wrote: Thank you all

Re: Measuring Samza Job Throughput

2015-06-17 Thread Milinda Pathirage
Thank you all for the ideas. I'll have a look at KafkaSystem metrics and SamzaContainerMetrics. Milinda On Wed, Jun 17, 2015 at 2:38 AM, Tao Feng fengta...@gmail.com wrote: Hi, One metric I could think of related to Samza job throughput is the process-envelop metric listed in

Re: Review Request 35577: Fix SAMZA-714

2015-06-17 Thread Guozhang Wang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35577/ --- (Updated June 17, 2015, 6:47 p.m.) Review request for samza. Bugs: KAFKA-714

Review Request 35577: Fix SAMZA-714

2015-06-17 Thread Guozhang Wang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35577/ --- Review request for samza. Bugs: KAFKA-714

Re: Review Request 35397: Fix SAMZA-697

2015-06-17 Thread Guozhang Wang
On June 17, 2015, 9:03 p.m., Yan Fang wrote: samza-core/src/main/java/org/apache/samza/task/TaskClassLoader.java, line 75 https://reviews.apache.org/r/35397/diff/2/?file=985834#file985834line75 use if to be consistent? I thought there may be some class paths that contain multiple

Re: Review Request 35348: SAMZA-707 Remove dependency on samza-serializers in the samza.apache.org/startup/download page

2015-06-17 Thread Navina Ramesh
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35348/#review88287 --- Ship it! Ship It! - Navina Ramesh On June 17, 2015, 8:15 p.m.,

[VOTE] Apache Samza 0.9.1 RC0

2015-06-17 Thread Guozhang Wang
Hey all, This is a call for a vote on a release of Apache Samza 0.9.1. This is a bug-fix release against 0.9.0. The release candidate can be downloaded from here: http://people.apache.org/~guozhang/samza-0.9.1-rc0/ The release candidate is signed with pgp key 911402D8, which is included in the

Re: Review Request 35397: Fix SAMZA-697

2015-06-17 Thread Yan Fang
On June 17, 2015, 9:03 p.m., Yan Fang wrote: samza-core/src/main/scala/org/apache/samza/container/TaskInstance.scala, lines 103-111 https://reviews.apache.org/r/35397/diff/2/?file=985837#file985837line103 a little concernted about this. This means we will load the class every

Re: Review Request 35397: Fix SAMZA-697

2015-06-17 Thread Yan Fang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35397/#review88260 ---

Re: Measuring Samza Job Throughput

2015-06-17 Thread Tao Feng
Hi, One metric I could think of related to Samza job throughput is the process-envelop metric listed in SamzaContainerMetrics. This counter get incremented whenever the container process meaningful message(

Re: Measuring Samza Job Throughput

2015-06-17 Thread Chris Riccomini
Hey Milinda, Specifically, for bytes/sec, you might want to look at serde metrics. I believe the serde manager tracks bytes serialized and deserialized per second. The consumers and producers also do this for Kafka, but on a more granular basis. If you want container-level throughput, serde

Re: Measuring Samza Job Throughput

2015-06-17 Thread Chris Riccomini
Hmm, correction. I think this has to be done at tbhe KafkaSystem level. We allow consumers and producers to return non-byte messages, which means nothing in container can safely assume that a message is a byte array except the serde manager. I took a look there but didn't see any byte throughout

Re: improving hello-samza / testing

2015-06-17 Thread Tim Williams
Thanks for the great feedback! the SAMZA-681 patch looks great, when it's merged it'd be nice to add some tests to hello-samza that demonstrate it... Thanks again, --tim On Wed, Jun 17, 2015 at 12:17 AM, Chris Riccomini criccom...@apache.org wrote: Hey Tim, This is a really good discussion