[jira] [Updated] (YARN-5715) introduce entity prefix for return and sort order

Rohith Sharma K S (JIRA) Thu, 13 Oct 2016 01:41:46 -0700

     [ 
https://issues.apache.org/jira/browse/YARN-5715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Rohith Sharma K S updated YARN-5715:
------------------------------------
    Attachment: YARN-5715-YARN-5355.03.patch

Updated patch with following delta changes from previous
# Changed object *Long* to primitive *long* in TimelineEntity object. And all 
the subsequent getter of idPrefix is changed to primitive long
# Currently I have added support for storing FileSystemTimelineWriterImpl. 
IdPrefix is used as directory in where entities are stored. i.e 
cluster_id/user_id/flow_name/flow_version/12345678/app_id/world/*0*/hello.thist.
 Again problem with FileSystemStorage support is sorting !!! Here, we can not 
use bytes as directory. I think need to ignore entityPrefix for file system 
storage and carry on with default sorting order. Thoughts?

> introduce entity prefix for return and sort order
> -------------------------------------------------
>
>                 Key: YARN-5715
>                 URL: https://issues.apache.org/jira/browse/YARN-5715
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Sangjin Lee
>            Assignee: Rohith Sharma K S
>            Priority: Critical
>         Attachments: YARN-5715-YARN-5355.01.patch, 
> YARN-5715-YARN-5355.02.patch, YARN-5715-YARN-5355.03.patch
>
>
> While looking into YARN-5585, we have come across the need to provide a sort 
> order different than the current entity id order. The current entity id order 
> returns entities strictly in the lexicographical order, and as such it 
> returns the earliest entities first. This may not be the most natural return 
> order. A more natural return/sort order would be from the most recent 
> entities.
> To solve this, we would like to add what we call the "entity prefix" in the 
> row key for the entity table. It is a number (long) that can be easily 
> provided by the client on write. In the row key, it would be added before the 
> entity id itself.
> The entity prefix would be considered mandatory. On all writes (including 
> updates) the correct entity prefix should be set by the client so that the 
> correct row key is used. The entity prefix needs to be unique only within the 
> scope of the application and the entity type.
> For queries that return a list of entities, the prefix values will be 
> returned along with the entity id's. Queries that specify the prefix and the 
> id should be returned quickly using the row key. If the query omits the 
> prefix but specifies the id (query by id), the query may be less efficient.
> This JIRA should add the entity prefix to the entity API and add its handling 
> to the schema and the write path. The read path will be addressed in 
> YARN-5585.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

[jira] [Updated] (YARN-5715) introduce entity prefix for return and sort order

Reply via email to