[ 
https://issues.apache.org/jira/browse/METRON-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16095221#comment-16095221
 ] 

ASF GitHub Bot commented on METRON-1005:
----------------------------------------

Github user cestella commented on the issue:

    https://github.com/apache/metron/pull/622
  
    So, in my mind the feature here is the enablement of batch analytics on the 
profiles.  To that end, I'm in general in favor of a decodable row key. I think 
that the question really isn't a ToC *or* a decodable rowkey.  I think, rather, 
we will want both.  The two will follow different access patterns.  A decodable 
rowkey sans ToC will be suitable only for full table scan-style access.  A ToC 
would enable to slice or dice by profile/entity/etc.  
    
    That being said, a ToC without a decodable rowkey is substantially less 
nice.  Without being able to decode the rowkey, we will not be able to 
regenerate the ToC to provide alternative indexing.  I see this as a first step 
to enable a broader discussion on just what kind of access semantics beyond 
Get/Put we want to place on the profiles.
    
    All that to say, I'm in favor of the effort.  I worry at the impact going 
forward to existing profiles, though.  From the point where we do this, we will 
create a fork whereby new profiles and old profiles diverge.  I think we need 
to discuss the migration story more explicitly and see if it is plausible to 
create a migration tool that is fuzzy (i.e. will look at the existing profiles 
and try to pick them apart).
    
    I'd be ok for that work to be a follow-on, but I would want the plan to be 
very explicit and I would be -1 for a release until it's in.


> Create Decodable Row Key for Profiler
> -------------------------------------
>
>                 Key: METRON-1005
>                 URL: https://issues.apache.org/jira/browse/METRON-1005
>             Project: Metron
>          Issue Type: Improvement
>    Affects Versions: 0.3.0
>            Reporter: Nick Allen
>            Assignee: Nick Allen
>             Fix For: Next + 1
>
>
> To be able to answer the types of questions that I outlined in METRON-450, we 
> need a row key that is decodable.  Right now there is no logic to decode a 
> row key, nor is the existing row key easily decodable.  
> Once the row keys can be decoded, you could scan all of the row keys in the 
> Profiler's HBase table, decode each of them and extract things like, the 
> names of all your profiles, the names of entities within a profile, the 
> period duration of a given profile.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to