[ https://issues.apache.org/jira/browse/HDFS-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14494799#comment-14494799 ]
Billie Rinaldi commented on HDFS-8088: -------------------------------------- I would also lean towards goal #1, figuring out what parts of an operation took the longest. I have it on my to-do list to analyze the spans Accumulo is creating and cut down on the ones that don't seem to be adding much information. I think targeting 2.7.1 for changes in HDFS tracing will be fine for Accumulo. I've made our sampling configurable and made our span receiver able to configurably drop 0ms spans to handle any excess of spans from HDFS tracing. (I know dropping some of the spans in a trace is not a recommended procedure, but it ends up being okay since nearly all 0ms spans have no children.) > Reduce the number of HTrace spans generated by HDFS reads > --------------------------------------------------------- > > Key: HDFS-8088 > URL: https://issues.apache.org/jira/browse/HDFS-8088 > Project: Hadoop HDFS > Issue Type: Improvement > Reporter: Colin Patrick McCabe > Assignee: Colin Patrick McCabe > Attachments: HDFS-8088.001.patch > > > HDFS generates too many trace spans on read right now. Every call to read() > we make generates its own span, which is not very practical for things like > HBase or Accumulo that do many such reads as part of a single operation. > Instead of tracing every call to read(), we should only trace the cases where > we refill the buffer inside a BlockReader. -- This message was sent by Atlassian JIRA (v6.3.4#6332)