[ https://issues.apache.org/jira/browse/METRON-1717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16609601#comment-16609601 ]
ASF GitHub Bot commented on METRON-1717: ---------------------------------------- Github user merrimanr commented on a diff in the pull request: https://github.com/apache/metron/pull/1187#discussion_r216416991 --- Diff: metron-analytics/metron-profiler-common/README.md --- @@ -0,0 +1,386 @@ +<!-- +Licensed to the Apache Software Foundation (ASF) under one +or more contributor license agreements. See the NOTICE file +distributed with this work for additional information +regarding copyright ownership. The ASF licenses this file +to you under the Apache License, Version 2.0 (the +"License"); you may not use this file except in compliance +with the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + +Unless required by applicable law or agreed to in writing, software +distributed under the License is distributed on an "AS IS" BASIS, +WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +See the License for the specific language governing permissions and +limitations under the License. +--> +# Metron Profiler + +* [Introduction](#introduction) +* [Getting Started](#getting-started) +* [Profiles](#profiles) +* [Examples](#examples) + +## Introduction + +The Profiler is a feature extraction mechanism that can generate a profile describing the behavior of an entity. An entity might be a server, user, subnet or application. Once a profile has been generated defining what normal behavior looks-like, models can be built that identify anomalous behavior. + +This is achieved by summarizing the telemetry data consumed by Metron over tumbling windows. A summary statistic is applied to the data received within a given window. Collecting these values across many windows result in a time series that is useful for analysis. + +Any field contained within a message can be used to generate a profile. A profile can even be produced by combining fields that originate in different data sources. A user has considerable power to transform the data used in a profile by leveraging the Stellar language. + +There are three separate ports of the Profiler that share this common code base. +* The [Storm Profiler](../metron-profiler-storm/README.md) builds low-latency profiles over streaming data sets. +* The [Spark Profiler](../metron-profiler-spark/README.md) backfills profiles using archived telemetry. +* The [REPL Profiler](../metron-profiler-repl/README.md) allows profiles to be tested and debugged within the Stellar REPL. + +## Getting Started + +1. [Create a profile](../metron-profiler-repl/README.md#getting-started) using the Stellar REPL. Validate your profile using mock data, then apply real, live data. + +1. [Backfill your profile](../metron-profiler-spark/README.md#getting-started) using archived telemetry to see how your profile behaves over time. + +1. [Deploy your profile](../metron-profiler-storm/README.md#getting-started) to Storm to maintain a low-latency profile over a streaming data set. + +1. [Retrieve your profile data](../metron-profiler-client/README.md) using the Stellar API so that you can build enrichments, alert on abnormalities --- End diff -- period at the end > Relocate Storm Profiler Code > ---------------------------- > > Key: METRON-1717 > URL: https://issues.apache.org/jira/browse/METRON-1717 > Project: Metron > Issue Type: Sub-task > Reporter: Nick Allen > Assignee: Nick Allen > Priority: Major > > The Storm port of the Profiler currently lives in > metron-analytics/metron-profiler. This should be moved to > metron-analytics/metron-profiler-storm. This would mirror the project names > for the Spark port (metron-profiler-spark) and the REPL port > (metron-profiler-repl). > The package name for the Storm port of the Profiler should be changed to > org.apache.metron.profiler.storm. This would mimic the package name used for > Spark; org.apache.metron.profiler.spark. -- This message was sent by Atlassian JIRA (v7.6.3#76005)