[GitHub] metron issue #652: METRON-1039: Add ZIP function to Stellar

2017-07-21 Thread jjmeyer0
Github user jjmeyer0 commented on the issue:

https://github.com/apache/metron/pull/652
  
@cestella open ticket for ZIP_WITH and ZIP_WITH_INDEX. To make sure we are 
on the same page for ZIP_WITH_INDEX, can you skim over the jira when you have 
time?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] metron issue #622: METRON-1005 Create Decodable Row Key for Profiler

2017-07-21 Thread mattf-horton
Github user mattf-horton commented on the issue:

https://github.com/apache/metron/pull/622
  
And btw, since there is no easily expressed algorithm for the NLP part of 
the problem, I'm +1 on doing both a decodable rowkey and a ToC.  For the 
existing profiles that @cestella expressed concern about, I would point out 
that as long as one DOES have the Profile specs still lying around, it's 
actually easy to re-write the old Profiles into new format with decodable 
rowkeys.  That is a very modest-sized program, the main problem being noticing 
and dealing with duplicate titled Profiles with different periodDurations.  But 
the info I pointed out in the paper helps sufficiently, I think.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] metron issue #622: METRON-1005 Create Decodable Row Key for Profiler

2017-07-21 Thread mattf-horton
Github user mattf-horton commented on the issue:

https://github.com/apache/metron/pull/622
  
Here's what I've got on decoding old rowkeys:
https://gist.github.com/mattf-horton/8e685e373b1a3fa6aeec8ef8828be096

The format of the keys is
`salt (4B) + profile name (?) + entity name (?) + groupvalues (?) + period 
(8B)`
with most of it (all but the salt and period number) in the clear as 
human-readable strings.

Deducing periodDuration has a nice arithmetic answer, I think.
The NLP issues are of course harder.  Enjoy the read, it's only two pages.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---