Re: Fast way search data in kafka

2017-03-23 Thread Marko Bonaći
You can use something like this to get a comma-separated list of all filed in a folder: ls -l | awk '{print $9}' ORS=',' Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http:/

Re: Tracking when a batch of messages has arrived?

2016-12-04 Thread Marko Bonaći
Do you know in advance (when sending the first message) how many messages that batch is going to have? Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://semat

Re: rack aware consumer

2016-09-30 Thread Marko Bonaći
nc replica becomes the leader. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On Thu, Sep 29, 2016 at 7:30 PM, Ezra Stuetzel <e

Re: Publish to 1 topic, consume from N

2016-09-15 Thread Marko Bonaći
enough for any type of use case? What do you think cannot be achieved? Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On Thu, Sep 15,

Re: Kafka related queries

2016-09-05 Thread Marko Bonaći
BTW regarding latency: https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Con

Re: Kafka related queries

2016-09-05 Thread Marko Bonaći
lag; i.e. lag can fit in the OS page cache so you're not even hitting disk when consuming) measured in low 10s of ms. No read replicas. You only read from partition's master. I.e. replicas are used to achieve redundancy. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log

Re: Questions about Apache Kafka

2016-08-24 Thread Marko Bonaći
Hi Karin, regarding 5 (fsyncing to disk), take a look at the broker configuration parameters whose names start with log.flush. http://kafka.apache.org/documentation.html#brokerconfigs Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Sup

Re: Monitoring offset lag

2016-07-07 Thread Marko Bonaći
Hi Tom, if you need a commercially proven lag monitoring solution (and all other Kafka and ZK metrics) take a look at our SPM. Hope you don't mind me plugging this one in :) [image: Inline image 1] Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr

Re: Not able to run multiple consumer instances

2016-04-29 Thread Marko Bonaći
to assign the second consumer with a different group, each consumer would consume all messages (independently of one another). BTW, Kafka is not broadcasting anything, your consumers are pulling messages out of Kafka :) Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log

Re: Filter plugins in Kafka

2016-04-26 Thread Marko Bonaći
Instantly reminded me of Streams API, where you can use Java8 streams semantics (filter being one of them) to do the first thing in Gouzhang's response (filter messages from one topic into another - I assume that's what you were looking for). Marko Bonaći Monitoring | Alerting | Anomaly Detection

Re: kafka producers and consumers on different machine

2016-04-20 Thread Marko Bonaći
I'm assuming that you created a topic with replication factor 3, while having only a single broker. Try with replication factor 1 or add additional brokers. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext &

Re: please add me

2016-04-14 Thread Marko Bonaći
Also sent to: ggol...@hortonworks.com Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On Fri, Apr 15, 2016 at 1:47 AM, Gurudi

Re: please add me

2016-04-14 Thread Marko Bonaći
Automated reply: thank you for attempting to subscribe to Kafka mailing list. To finish the subscription process send email to users-subscr...@kafka.apache.org < users-subscr...@kafka.apache.org> :) Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Managemen

Consumer Lag JMX in 0.9

2016-03-23 Thread Marko Bonaći
+ Has anything changed in 0.9? Thanks Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html>

Re: Would Kafka streams be a good choice for a collaborative web app?

2016-03-22 Thread Marko Bonaći
-providers Amazon Kinesis would also work. Anything really that would "outsource" the initial effort until you're ready to commit to Kafka. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sem

Re: Would Kafka streams be a good choice for a collaborative web app?

2016-03-21 Thread Marko Bonaći
regardless of the load. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On Mon, Mar 21, 2016 at 6:25 PM, Ben Stopford <b...@

Re: Spark Streaming + Kafka 0.9

2016-03-01 Thread Marko Bonaći
These two issues track progress of Kafka consumer 0.9. https://github.com/apache/spark/pull/10953 https://github.com/apache/spark/pull/11143 Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/>

Re: Determine the number of consumers

2016-01-08 Thread Marko Bonaći
Hello again Cosmin :) I think this is because offsets are kept in a special _consumer_offsets topic, as opposed to ZK previously. Take a look here: http://search-hadoop.com/m/uyzND1T1i3BNkRFM1=Re+Kafka+0+8+2+1+how+to+read+from+__consumer_offsets+topic+ Marko Bonaći Monitoring | Alerting | Anomaly

Re: Bizarre crash when creating a consumer

2016-01-07 Thread Marko Bonaći
Hi Cosmin, do you have default server configuration on these new nodes you're setting up? I'd check consumer's socket.timeout.ms, maybe someone set it to 30 instead of 30 000 :) Speaking from my own experience (I had the same symptom and this turned out to be the cause). Marko Bonaći Monitoring

Re: Bizarre crash when creating a consumer

2016-01-07 Thread Marko Bonaći
Actually, why don't you use the same code as outlined here (that includes timeout in props): http://kafka.apache.org/090/javadoc/org/apache/kafka/clients/consumer/KafkaConsumer.html Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Sup

Re: Best way to commit offset on demand

2016-01-04 Thread Marko Bonaći
n requests? FINALLY: tell us more about your use case. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On Mon, Jan 4, 2016 at 1

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-30 Thread Marko Bonaći
here: https://github.com/quantifind/KafkaOffsetMonitor Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On Wed, Dec 30, 2015 at

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-30 Thread Marko Bonaći
by default) I'm still on Kafka 0.8, so I can't shed any light on your issue. Thx for the AdminClient info. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-29 Thread Marko Bonaći
I was refering to Dana Powers's answer in the link I posted (to use a client API). You can find an example here: http://kafka.apache.org/090/javadoc/org/apache/kafka/clients/consumer/KafkaConsumer.html Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-29 Thread Marko Bonaći
We recently had the same question: http://search-hadoop.com/m/uyzND1kM7q1gElhy1 Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html> On T

Re: how to reset kafka offset in zookeeper

2015-12-18 Thread Marko Bonaći
-class.sh kafka.tools.ImportZkOffsets --input-file /tmp/zk-offsets --zkconnect localhost:2181 Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Contact <http://sematext.com/about/contact.html>

Re: how to reset kafka offset in zookeeper

2015-12-18 Thread Marko Bonaći
Hmm, I guess you're right Tod :) Just to confirm, you meant that, while you're changing the exported file it might happen that one of the segment files becomes eligible for cleanup by retention, which would then make the imported offsets out of range? Marko Bonaći Monitoring | Alerting | Anomaly

Re: failed with LeaderNotAvailableError -

2015-12-17 Thread Marko Bonaći
Spotify's image since it contains both ZK and Kafka, but I think the latest version they built is 0.8.2.1, so you might have to build the new image yourself if you need 0.9, but that's trivial to do. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr

Consumer lag JMX in 0.9

2015-12-16 Thread Marko Bonaći
Hi, there was a problem with JMX consumer lag in 0.8: http://search-hadoop.com/m/uyzND14v72215XZpK=Re+Consumer+lag+lies+orphaned+offsets+ Has anything changed now with 0.9? Thanks

Re: Maximum Topic Length in Kafka

2015-11-29 Thread Marko Bonaći
Did not know that quotas landed in 0.9. Very nice! Being able to throttle clients that don't have real-time SLAs (in favor of those who do) is a great addition. Thanks for that Grant. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Sup

Re: Maximum Topic Length in Kafka

2015-11-29 Thread Marko Bonaći
case. Perhaps you could check Consumer offsets from your Producer and then decide based on that information whether to throttle Producer or not. Could get complicated really fast, though. Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Sup

Re: Maximum Topic Length in Kafka

2015-11-28 Thread Marko Bonaći
AFAIK there is no such notion as maximum length of a topic, i.e. offset has no limit, except Long.MAX_VALUE I think, which should be enough for a couple of lifetimes (9 * 10E18, or quintillion or million trillions). What would be the purpose of that, besides being a nice foot-gun :) Marko Bonaći

Producer retries

2015-11-24 Thread Marko Bonaći
running producers, brokers and consumers on AWS. Is it possible that the network is that much flaky? What's your experience? Thanks, Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Support Sematext <http://sematext.com/> | Con

Monitoring leader changes

2015-11-03 Thread Marko Bonaći
g the same thing? Or should I just forget about it and use the recommended approach from the low-level consumer code example in the wiki (which I currently use as the fallback)? Thanks, Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsearch Su