Re: Metron 0.7.1 (METRON INDEXING) bug

2019-03-29 Thread Ali Nazemian
You need to first make sure delete.topic.enable is set to true. Make sure there is no producer and consumer registered to the topic. (Stop all storm topologies if you can and make sure all of them are stopped entirely.) Then, you can use kafka topic cli to delete the topic by using --delete [topic_

Unified enrichment topology debug timestamp fields

2018-07-15 Thread Ali Nazemian
Hi All, The recent version of Metron ships with unified enrichment topology. It looks like this topology does come with the detailed timestamp fields at it was covered in the previous enrichment architecture. What are the timestamp fields available with the new topology? Given we have been using s

Filter fields from landing Elasticsearch only

2018-06-11 Thread Ali Nazemian
Hi All, I was wondering if it is possible to filter out some fields from landing indexer only (Elasticsearch) from Metron side. Either in enrichment or indexing topology. I want to store all the parsed/enriched fields in HDFS, but I want to filter some of them for Elasticsearch. Cleary, we can de

Re: Alert View Deduplication

2018-02-22 Thread Ali Nazemian
Hi Jack, Good to see you here. Would it help if you can introduce a signature for every event and then try to filter based on the signature? a duplicate message might be related to the source or at least once guarantee of Storm. Cheers, Ali On Fri, Feb 23, 2018 at 3:14 PM, Jack Burgess wrote:

Re: Stellar post-parsing transformation conditional statement

2018-02-13 Thread Ali Nazemian
Any example would be appreciated. On 9 Feb. 2018 15:37, "Ali Nazemian" wrote: > Match is not available in our version. What about MAP_GET? How can I use > it for this matter? I couldn't understand what the solution is. > > On Fri, Feb 9, 2018 at 3:02 PM,

Re: Stellar post-parsing transformation conditional statement

2018-02-08 Thread Ali Nazemian
ement in there I > expect. See the match statement at https://github.com/apache/ > metron/blob/master/metron-stellar/stellar-common/README.md under core > functions (it’s relatively new) > > Simon > > Sent from my iPhone > > On 9 Feb 2018, at 03:55, Ali Nazemian wrote: &

Stellar post-parsing transformation conditional statement

2018-02-08 Thread Ali Nazemian
Hi All, I was wondering how we can address if statement in the config section to have a different mapping in certain conditions. The following syntax is not acceptable. { "parserClassName": "org.apache.metron.parsers.asa.BasicAsaParser", "filterClassName": null, "sensorTopic": "test-asa", "writer

Re: Define a function that can be used in Stellar

2018-02-02 Thread Ali Nazemian
metron.enrichment.stellar.ObjectGet >> that was purpose-built to retrieve files from HDFS. If you wanted to >> retrieve a configuration from HDFS that would be a good example (if you >> can't just use that functions directly). >> >> On Fri, Feb 2, 2018 at 8:50 AM A

Re: Define a function that can be used in Stellar

2018-02-02 Thread Ali Nazemian
t; global config option. > > Simon > > On 2 Feb 2018, at 13:42, Ali Nazemian wrote: > > Does it mean every time the function gets called it will load the config, > but if I use the global one it will only read it one time and it will be > available in memory? > > On 2 F

Re: HBase enrichment vs Stellar enrichment for HBase look up

2018-02-02 Thread Ali Nazemian
use the Stellar version, and in fact that’s the > general direction the project is heading. We haven’t quite deprecated the > plain HBase Bolt… but Stellar is definitely the preferred option. > > Simon > > > On 2 Feb 2018, at 07:10, Ali Nazemian wrote: > > > > Hi All, &g

Re: Define a function that can be used in Stellar

2018-02-02 Thread Ali Nazemian
, since most instances of > stellar apply global config to their context. > > Simon > > > On 2 Feb 2018, at 07:14, Ali Nazemian wrote: > > Will be any problem if the Stellar function we want to implement need to > load an external config file? > > Cheers, > A

Re: Define a function that can be used in Stellar

2018-02-01 Thread Ali Nazemian
Will be any problem if the Stellar function we want to implement need to load an external config file? Cheers, Ali On Thu, Jan 18, 2018 at 4:58 PM, Ali Nazemian wrote: > Thanks, All. > > Yes, Nick. It is highly related to our use case and the way that we are > going to enrich

HBase enrichment vs Stellar enrichment for HBase look up

2018-02-01 Thread Ali Nazemian
Hi All, Is there any performance difference between HBase enrichment and Stellar enrichment? We have an HBase enrichment that we need to have a customised key for it. HBase enrichment doesn't give us the full flexibility of using any logic for a Key generation, so I was wondering whether there wil

Re: Define a function that can be used in Stellar

2018-01-17 Thread Ali Nazemian
t; > > On Wed, Jan 17, 2018 at 7:38 AM Simon Elliston Ball < > si...@simonellistonball.com> wrote: > > Have you looked at the recent TLSH functions in Stellar? We already have > that for similarity preserving hashes. > > > > Simon > > > > > On 1

Re: Define a function that can be used in Stellar

2018-01-17 Thread Ali Nazemian
ed, Jan 17, 2018 at 6:29 AM Ali Nazemian > wrote: > >> Thanks, Simon. We have already got a script to deal with classpath >> management for the parsers. We should be able to use it for this extension >> as well. >> >> Yeah, I agree. It will be much easier to defi

Re: Define a function that can be used in Stellar

2018-01-17 Thread Ali Nazemian
e dev list is probably the notion of > defining stellar functions in stellar, which would be a much simpler > solution than custom java functions if you can already express you logic in > stellar. > > Simon > > > On 17 Jan 2018, at 10:37, Ali Nazemian wrote: > > Hi Simon, >

Re: Define a function that can be used in Stellar

2018-01-17 Thread Ali Nazemian
r example. Is that what > you’re looking for? Maybe some sort of syntax to create a named stellar > function similar to the way we create lambdas? > > Simon > > > On 17 Jan 2018, at 07:25, Ali Nazemian wrote: > > > > Hi all, > > > > Is there any way that

Define a function that can be used in Stellar

2018-01-16 Thread Ali Nazemian
Hi all, Is there any way that we can define a function that can be used rather than duplicating a logic multiple times? Cheers, Ali

Bizarre situation with Kafka Offset Lag

2017-12-03 Thread Ali Nazemian
Hi, We have a very bizarre situation with one of our platforms. Our problem is we have about 6-7 mins extra latency on our platform. We have noticed there is about 60k-70k total lag on the indexing consumer of the indexing topic. This lag neither decrease nor increase! The bizarre situation is thi

Re: java.io.IOException: Stream closed on writing to HDFS

2017-09-06 Thread Ali Nazemian
>> I am not sure why the stream is closed. But, I have opened >> https://issues.apache.org/jira/browse/METRON-1153, because we should >> verify the stream before attempting to write. >> >> >> On September 3, 2017 at 21:28:16, Ali Nazemian (alinazem...@gmail.com) &

Re: java.io.IOException: Stream closed on writing to HDFS

2017-09-04 Thread Ali Nazemian
, I have opened > https://issues.apache.org/jira/browse/METRON-1153, because we should > verify the stream before attempting to write. > > > On September 3, 2017 at 21:28:16, Ali Nazemian (alinazem...@gmail.com) > wrote: > > > Hi all, > > We have run into an issue o

java.io.IOException: Stream closed on writing to HDFS

2017-09-03 Thread Ali Nazemian
Hi all, We have run into an issue on Indexing topology on the HDFS bolt recently. We are using HDFS TDE for encryption at rest and it is working properly for 2-3 days. After that, we can see the following exception frequently on HDFS writer bolt and the throughput of this topology drops significan

[no subject]

2017-09-03 Thread Ali Nazemian

Re: Geo enrichment failure after blocking internet connectivity

2017-07-31 Thread Ali Nazemian
cal file on the Metron master node, as long as that file > exists prior to Ambari's attempt to use it. > > Let me know if that solves the problem; I haven't taken a look at that > stuff in a little bit, so I may have to dig a bit deeper if that doesn't > resolve it. >

Geo enrichment failure after blocking internet connectivity

2017-07-27 Thread Ali Nazemian
Hi, Recently we have blocked internet connection to one of our platforms. After we had restarted Enrichment topology, we found out that topology cannot start anymore and it keeps throwing the following exception. 2017-07-28 04:41:38.816 o.a.c.f.r.c.TreeCache [ERROR] java.lang.IllegalStateExcepti

Re: How to change Elasticsearch indexing policy

2017-07-14 Thread Ali Nazemian
ly going to kill your disks at any reasonable > scale. > > Simon > > > On 14 Jul 2017, at 10:31, Ali Nazemian wrote: > > > > Hi, > > > > I am investigating different tuning aspects, and I was wondering how I > can change the policy of Elasticsearch index

How to change Elasticsearch indexing policy

2017-07-14 Thread Ali Nazemian
Hi, I am investigating different tuning aspects, and I was wondering how I can change the policy of Elasticsearch indexing. Currently, as a default behaviour, events are stored in separate indices hourly. How can I change this behaviour? Is this a hard-coded design or I can change it through confi

Re: Treat Triage boost aggregation

2017-06-22 Thread Ali Nazemian
ike the score aggregated? > > On Thu, Jun 22, 2017 at 8:07 PM, Ali Nazemian > wrote: > >> Thanks, Casey and Nick. Is there any way that we can somehow overcome >> this requirement with the current features? Exclude MAAS. >> >> On Thu, Jun 22, 2017 at 11:42 PM,

Re: Treat Triage boost aggregation

2017-06-22 Thread Ali Nazemian
Stella wrote: > >> That's correct that it's the last step. Honestly, the threat triage >> functions were added prior to Stellar really being a thing. We should >> allow arbitrary stellar statements in there rather than a fixed approach, >> so it's

Treat Triage boost aggregation

2017-06-22 Thread Ali Nazemian
Hi all, I know there are four different Treat Triage aggregation functions we can use for the case of triggering multiple rules. These functions are "max', "min", "mean", "positive mean". I was wondering whether there is any way I can implement the following logic with the Treat Triage functions f

Re: Metron in-memory enrichment

2017-06-19 Thread Ali Nazemian
be nice if our enrichment cache mechanism reported hit/miss >> stats or something. >> >> >> On June 19, 2017 at 09:58:25, Ali Nazemian (alinazem...@gmail.com) wrote: >> >> I have already increased the cache value. However, clearly, the HBase >> enrichment i

Re: Metron in-memory enrichment

2017-06-19 Thread Ali Nazemian
same clientside caching as >>> the Hbase bolt? >>> >>> Simon >>> >>> On 19 Jun 2017, at 06:21, Casey Stella wrote: >>> >>> In order to do that, the easiest thing to do is to create a stellar >>> function to load and do in-me

Metron in-memory enrichment

2017-06-18 Thread Ali Nazemian
Hi all, We are using Metron HBase enrichment for a few use cases, but we have noticed the achievable throughput is not very great. I was wondering whether there is a way to load the external enrichment data in-memory and use it with normal Stellar enrichments. In our use cases, the number of rows

Re: Metron components connectivity

2017-05-31 Thread Ali Nazemian
; Hortonworks for HDP. You probably need to look mostly at the ports > specified in your config files. > > I would suggest that you treat all HDP nodes as able to talk to each other > across all ports - but limit anything which talks to those nodes. That is a > lot easier.

Re: Metron components connectivity

2017-05-30 Thread Ali Nazemian
o those nodes. That is a > lot easier. > > > On 30 May 2017 at 10:49, Ali Nazemian wrote: > >> Hi all, >> >> For deploying Metron in production, we need to specify all of the port >> and protocols connectivities. I was wondering how Metron components

Re: Metron components connectivity

2017-05-30 Thread Ali Nazemian
it before giving up. > > I would be happy to share what I have, or help maintain something more > complete if it exists somewhere that I'm not aware of. > > Jon > > On Tue, May 30, 2017, 5:49 AM Ali Nazemian wrote: > >> Hi all, >> >> For deploying Metron

Metron components connectivity

2017-05-30 Thread Ali Nazemian
Hi all, For deploying Metron in production, we need to specify all of the port and protocols connectivities. I was wondering how Metron components connected to each other. Is there any document available regarding the ports and connectivities of Metron components? Regards, Ali

Re: Metron HBase conditional enrichment

2017-05-24 Thread Ali Nazemian
"tenant_name+device_type+default_device". > > Yes, you can. You've got if/else, JOIN, IS_EMPTY, and others that should > make implementing this logic pretty easy. > > > > > On Tue, May 23, 2017 at 10:34 PM, Ali Nazemian > wrote: > >> Hi, >> >

Metron HBase conditional enrichment

2017-05-23 Thread Ali Nazemian
Hi, I was wondering how I can manage Stellar syntax to be aligned with the following structure for the HBase enrichment: HBase_row_key: tenant_name+device_type+device_name At the high-level, I need to create a separate field via a post-parse Stellar function to be a concatenation of tenan_name,

Re: Kafka spout error in the new HCP product

2017-05-16 Thread Ali Nazemian
ion. There is a PR out for this currently: > https://github.com/apache/metron/pull/584 > > Casey > > On Tue, May 16, 2017 at 4:26 AM, Ali Nazemian > wrote: > >> I am still facing this issue and couldn't manage to fix it. I would be >> really grateful If somebody c

Re: Kafka spout error in the new HCP product

2017-05-16 Thread Ali Nazemian
I am still facing this issue and couldn't manage to fix it. I would be really grateful If somebody can help me. Thanks, Ali On Sun, May 14, 2017 at 1:58 PM, Ali Nazemian wrote: > I was wrong. I think I couldn't increase the timeout value for Kafka spout > properly. Therefore,

Re: Kafka spout error in the new HCP product

2017-05-13 Thread Ali Nazemian
e I didn't have this issue with the previous version? On Sun, May 14, 2017 at 3:00 AM, Ali Nazemian wrote: > Hi, > > I have installed the new version of HCP recently. I can see that the > following error has appeared in Storm UI at Kafka spout sectio

Kafka spout error in the new HCP product

2017-05-13 Thread Ali Nazemian
Hi, I have installed the new version of HCP recently. I can see that the following error has appeared in Storm UI at Kafka spout section related to Parser topologies: org.apache.kafka.clients.consumer.CommitFailedException: Commit cannot be completed since the group has already rebalanced and ass

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-22 Thread Ali Nazemian
through disabling Storm reliability!! Another wired fact is I have this problem only for the enrichments and indexing topologies. All of the parsers are fine! On Sun, Apr 23, 2017 at 12:39 AM, Ali Nazemian wrote: > In response to your question for decreasing the value of spout pending, no >

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-22 Thread Ali Nazemian
In response to your question for decreasing the value of spout pending, no even with the value of 10 failure ratio was the same. However, throughput dropped significantly. On Sun, Apr 23, 2017 at 12:27 AM, Ali Nazemian wrote: > I have noticed if I decrease the parallelism for spouts the fail

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-22 Thread Ali Nazemian
spout pending config lower do you get to a point with no > errors (at obvious consequences to throughput)? Also how many ackers are > you running? > > On Sat, Apr 22, 2017 at 00:50 Ali Nazemian wrote: > >> I have disabled the reliability retry by setting the number of >>

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
that. On Sat, Apr 22, 2017 at 2:36 PM, Ali Nazemian wrote: > Is the following fact rings any bell? > > There is no failure at the bolt level acknowledgement, but from the > topology status, the rate of failure is very high! This is the same > scenario for both indexing and enrich

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
Is the following fact rings any bell? There is no failure at the bolt level acknowledgement, but from the topology status, the rate of failure is very high! This is the same scenario for both indexing and enrichment topologies. On Sat, Apr 22, 2017 at 2:29 PM, Ali Nazemian wrote: > The va

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
at's your storm configuration for topology.max.spout.pending? If it's > not set, then try setting it to 1000 and bouncing the topologies. > > On Fri, Apr 21, 2017 at 12:54 PM, Ali Nazemian > wrote: > >> No, nothing ... >> >> On Sat, Apr 22, 2017 at 2:46 AM, Casey Stella wrot

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
No, nothing ... On Sat, Apr 22, 2017 at 2:46 AM, Casey Stella wrote: > Anything going on in the kafka broker logs? > > On Fri, Apr 21, 2017 at 12:24 PM, Ali Nazemian > wrote: > >> Although this is a test platform with a way less spec than production, it >> should

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
atencies are pretty high. I think what's happening is > that the tuples aren't being acked fast enough and are timing out. How > taxed is your ES box? Can you drop the batch size down to maybe 100 and > see what happens? > > On Fri, Apr 21, 2017 at 12:05 PM, Ali Nazemi

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
0:53 AM, Casey Stella >> wrote: >> >>> Could I see a little more of that screen? Specifically what the bolts >>> look like. >>> >>> On Fri, Apr 21, 2017 at 11:51 AM, Ali Nazemian >>> wrote: >>> >>>> Please find the st

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
Please find the storm-UI screenshot as follows. http://imgur.com/FhIrGFd On Sat, Apr 22, 2017 at 1:41 AM, Ali Nazemian wrote: > Hi Casey, > > - topology.message.timeout: It was 30s at first. I have increased it to > 300s, no changes! > - It is a very basic geo-enrichment and

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
errors in the >> logs. Would you mind sending over a screenshot of the indexing topology >> from the storm UI? You might not be able to paste the image on the mailing >> list, so maybe an imgur link would be in order. >> >> Thanks, >> >> Casey >> >> O

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
are > several catch blocks across the different topologies that transform errors > into json objects and forward them on to the indexing topology. If you're > not seeing anything in the worker logs it's likely the errors were captured > there instead. > > Ryan > > On Fri,

Re: High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
No everything is fine at the log level. Also, when I checked resource consumption at the workers, there had been plenty resources still available! On Fri, Apr 21, 2017 at 10:04 PM, Casey Stella wrote: > Seeing anything in the storm logs for the workers? > > On Fri, Apr 21, 2017 at

High percentage of failed/timed out tuples after performance tuning!

2017-04-21 Thread Ali Nazemian
Hi all, After I tried to tune the Metron performance I have noticed the rate of failure for the indexing/enrichment topologies are very high (about 95%). However, I can see the messages in Elasticsearch. I have tried to increase the timeout value for the acknowledgement. It didn't fix the problem.

Re: Metron performance tuning best practice

2017-04-20 Thread Ali Nazemian
md#notes-on-performance-tuning> > > Jon > > On Thu, Apr 20, 2017 at 8:45 AM Ali Nazemian > wrote: > >> Hi all, >> >> I was wondering what the best practice would be in terms of defining a >> right value for the number of workers and executors as well as

Metron performance tuning best practice

2017-04-20 Thread Ali Nazemian
Hi all, I was wondering what the best practice would be in terms of defining a right value for the number of workers and executors as well as right value for spout and bolt parallelisation? What about the number of partitions for "indexing", "enrichments" and device parsers Kafka topics? I have s