[ https://issues.apache.org/jira/browse/LOG4J2-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Remko Popma updated LOG4J2-1179:
--------------------------------
Description:
Reorganize and extend the performance data on the site.

*Async Loggers Manual Page*

Should be more focused. Proposed changes:
(x) Move the _"Location, location, location..."_ section on location information to the general Performance page (keep the anchors and link to the relevant Performance page section to avoid breaking existing links)
(x) Similarly, move the _"Throughput of Logging With Location (includeLocation="true")"_ table with throughput results to the general Performance page
(x) Move the _"FileAppender vs. RandomAccessFileAppender"_ section to the general Performance page. (Again, keep the anchors and link to the new section on the Performance page to avoid breaking links.)
(x) Rewrite the opening paragraph of the Async Loggers manual page to remove the reference to the RandomAccessFile appender
(x) Rewrite the section on _Latency_:
* The histogram shows service time; more useful for users is response time (service time + wait time).
* The bar chart on "average latency" is nonsense: latency is not normally distributed, so terms like "average latency" don't make sense. Remove this. (A histogram showing the full range of percentiles _does_ make sense.)
* The bar chart with the maximum of 99.99% of observations is better than an average, but it still has large drawbacks: it shows service time (omitting the crucial wait time), and how high are the peaks in the 0.01% we did not report? Better to remove this and instead show a histogram with the full range of percentiles.

*Performance Page*

1. (/) Briefly explain the various aspects of "performance": peak measured throughput (what kind of bursts can we deal with?), sustained throughput, and response time (service time + wait time).
2. Then show how Log4j 2 compares to the alternatives (Logback, Log4j-1.2 and JUL) on all three of these performance dimensions.
3. Finally, document some performance trade-offs for Log4j 2 functionality.

*2. Comparison to alternative logging libraries*

(/) Peak throughput comparison of Async Loggers vs. async appenders for bursty logging.
(x) Response time comparison of Async Loggers vs. async appenders.
(/) Parameterized messages: use these JMH [benchmark results|https://issues.apache.org/jira/browse/LOG4J2-1278?focusedCommentId=15216236&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15216236]? (Looks like parameterized messages are currently quite expensive...) A sketch of the API style under test follows after this list.
(?) Clarify that the Async Appender exists to minimize dependencies, but should be avoided if performance is a concern. The Async Appender is NOT the default and should NOT be used for benchmarking. (I found [this loggly article|https://www.loggly.com/blog/benchmarking-java-logging-frameworks/] very frustrating in that respect.) Should probably also clarify that any benchmark that tests with only one thread doing the logging is of limited use.
(x) Compare the performance impact of including location information across logging libraries.
For the various appenders, compare Log4j 2 to the alternatives with regard to max sustained throughput (and, separately, response time):
(/) [File Appender max sustained throughput|https://issues.apache.org/jira/browse/LOG4J2-1297?focusedCommentId=15256490&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15256490]
(x) File Appender response time comparison
(?) Socket appender (TCP/UDP)
(?) Syslog appender (TCP/UDP)
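To make the parameterized-messages and Async Loggers items above concrete, here is a minimal sketch of the API style being measured. The parameterized {{logger.debug()}} call and the {{Log4jContextSelector}} system property are the standard Log4j 2 API and setting; the class name and surrounding code are hypothetical, for illustration only.

{code:java}
import org.apache.logging.log4j.LogManager;
import org.apache.logging.log4j.Logger;

// Hypothetical example class illustrating parameterized messages.
// To route these calls through Async Loggers (not the Async Appender),
// start the JVM with:
// -DLog4jContextSelector=org.apache.logging.log4j.core.async.AsyncLoggerContextSelector
public class ParameterizedMessageExample {
    private static final Logger logger = LogManager.getLogger(ParameterizedMessageExample.class);

    public static void main(String[] args) {
        String user = "alice";
        int attempts = 3;

        // Parameterized message: placeholder substitution only happens
        // if DEBUG is enabled, avoiding needless String construction.
        logger.debug("Login by {} after {} attempts", user, attempts);

        // String concatenation: builds the full message string
        // even when DEBUG is disabled.
        logger.debug("Login by " + user + " after " + attempts + " attempts");
    }
}
{code}

The JMH results linked above concern the first, parameterized style.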
*3. Log4j 2 functionality performance trade-offs*

(x) Cost of the various layouts (Gelf, HTML, XML, CSV, Pattern)
(x) Cost of various Pattern Layout options
(x) Cost of the various appenders (File, RandomAccessFile, MemoryMappedFile, Console, Rewrite, others?). Use the same layout for the comparison, perhaps the PatternLayout with the {{%d \[%t\] %p %c - %m%n}} pattern.
(x) Cost of the various APIs/wrappers (SLF4J, Log4j1, JUL, Commons Logging)
(?) JDBC appenders? Different JDBC drivers and target databases may have very different performance, so this may become a big project. We could do a quick comparison of the JDBC appender (writing to the JDK's Derby DB) against the FileAppender, just to get an idea of max sustained throughput?
-------------------
Of the existing Performance page sections:
(x) Briefly mention that disabled logging has no measurable cost, but de-emphasize this section by moving it down the page.
(x) I like the part about the filters because it (a) compares Log4j 2 to Logback and (b) considers multithreaded applications. I'll turn this into a JMH test and show the result as a bar chart; a minimal sketch of such a test follows below.
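As a rough illustration of that planned filter test, a minimal JMH sketch. The class name, the thread count, and the assumption that the {{log4j2.xml}} on the classpath drops DEBUG events (for example via a ThresholdFilter with level INFO) are my own choices for illustration, not something decided in this issue.

{code:java}
import java.util.concurrent.TimeUnit;

import org.apache.logging.log4j.LogManager;
import org.apache.logging.log4j.Logger;
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.State;
import org.openjdk.jmh.annotations.Threads;

// Measures the throughput of a log call that is rejected by a filter.
// Assumes a log4j2.xml on the classpath that filters out DEBUG events,
// e.g. a ThresholdFilter with level="INFO".
@State(Scope.Benchmark)
@BenchmarkMode(Mode.Throughput)
@OutputTimeUnit(TimeUnit.MILLISECONDS)
@Threads(4) // multiple threads, since filter cost under contention is the point
public class FilteredLoggingBenchmark {
    private static final Logger logger = LogManager.getLogger(FilteredLoggingBenchmark.class);

    @Benchmark
    public void debugCallRejectedByFilter() {
        // The filter rejects this event, so the benchmark measures only
        // the per-call overhead of the level/filter check itself.
        logger.debug("Filtered out: {}", "param");
    }
}
{code}

An equivalent Logback benchmark with the same filter configuration would provide the second series for the bar chart.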
> Log4j performance documentation
> -------------------------------
>
>                 Key: LOG4J2-1179
>                 URL: https://issues.apache.org/jira/browse/LOG4J2-1179
>             Project: Log4j 2
>          Issue Type: Documentation
>          Components: Documentation, Performance Benchmarks
>    Affects Versions: 2.4.1
>            Reporter: Remko Popma
>            Assignee: Remko Popma
>             Fix For: 2.6
>
>         Attachments: ParamMsgThrpt1T.png, ParamMsgThrpt2T.png, ParamMsgThrpt4T.png