Sangjin Lee created YARN-5109:
---------------------------------

             Summary: timestamps are stored unencoded causing parse errors
                 Key: YARN-5109
                 URL: https://issues.apache.org/jira/browse/YARN-5109
             Project: Hadoop YARN
          Issue Type: Sub-task
          Components: timelineserver
    Affects Versions: YARN-2928
            Reporter: Sangjin Lee
            Priority: Blocker


When we store timestamps (for example as part of the row key or part of the 
column name for an event), the bytes are used as is without any encoding. If 
the byte value happens to contain a separator character we use (e.g. "!" or 
"="), it causes a parse failure when we read it.

I came across this while looking into this error in the timeline reader:
{noformat}
2016-05-17 21:28:38,643 WARN 
org.apache.hadoop.yarn.server.timelineservice.storage.common.TimelineStorageUtils:
 incorrectly formatted column name: it will be discarded
{noformat}

I traced the data that was causing this, and the column name (for the event) 
was the following:
{noformat}
i:e!YARN_RM_CONTAINER_CREATED=\x7F\xFF\xFE\xABDY=\x99=YARN_CONTAINER_ALLOCATED_HOST
{noformat}

Note that the column name is supposed to be of the format (event 
id)=(timestamp)=(event info key). However, observe the timestamp portion:
{noformat}
\x7F\xFF\xFE\xABDY=\x99
{noformat}

The presence of the separator ("=") causes the parse error.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org

Reply via email to