[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei-Chiu Chuang updated HADOOP-15598: - Resolution: Fixed Fix Version/s: 3.0.4 3.1.1 3.2.0 Status: Resolved (was: Patch Available) Pushed 002 patch to trunk, branch-3.1 and branch-3.0. Thanks! > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Major > Fix For: 3.2.0, 3.1.1, 3.0.4 > > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png, > hadoop-sync-contention.svg > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os.arch is > "sparc". java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Attachment: hadoop-sync-contention.svg > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png, > hadoop-sync-contention.svg > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os.arch is > "sparc". java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Description: When profiling a multi-threaded hive streaming ingest, observed lock contention on java.util.Properties getProperty() to check if os.arch is "sparc". java.util.Properties internally uses HashTable. HashTable.get() is synchronized method. In the test application, on a 30s profile with 64 threads ~40% CPU time is spent on getProperty() contention. See attached snapshot. (was: When profiling a multi-threaded hive streaming ingest, observed lock contention on java.util.Properties getProperty() to check if os is "sparc". java.util.Properties internally uses HashTable. HashTable.get() is synchronized method. In the test application, on a 30s profile with 64 threads ~40% CPU time is spent on getProperty() contention. See attached snapshot.) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os.arch is > "sparc". java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei-Chiu Chuang updated HADOOP-15598: - Target Version/s: 3.2.0, 3.1.1, 3.0.4 (was: 3.2.0, 3.1.1) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei-Chiu Chuang updated HADOOP-15598: - Priority: Major (was: Minor) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Target Version/s: 3.2.0, 3.1.1 (was: 3.2.0) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Affects Version/s: 3.1.1 > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0, 3.1.1 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated HADOOP-15598: Status: Patch Available (was: Open) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated HADOOP-15598: Status: Open (was: Patch Available) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated HADOOP-15598: Attachment: HADOOP-15598.1.patch > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, HADOOP-15598.1.patch, Screen Shot > 2018-07-11 at 1.45.06 AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Priority: Minor (was: Major) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, Screen Shot 2018-07-11 at 1.45.06 > AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Attachment: Screen Shot 2018-07-11 at 2.01.54 AM.png > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Minor > Attachments: HADOOP-15598.1.patch, Screen Shot 2018-07-11 at 1.45.06 > AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Issue Type: Improvement (was: Bug) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Improvement > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, Screen Shot 2018-07-11 at 1.45.06 > AM.png, Screen Shot 2018-07-11 at 2.01.54 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Status: Patch Available (was: Open) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Bug > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, Screen Shot 2018-07-11 at 1.45.06 > AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Attachment: HADOOP-15598.1.patch > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Bug > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Major > Attachments: HADOOP-15598.1.patch, Screen Shot 2018-07-11 at 1.45.06 > AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-15598) DataChecksum calculate checksum is contented on hashtable synchronization
[ https://issues.apache.org/jira/browse/HADOOP-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HADOOP-15598: --- Summary: DataChecksum calculate checksum is contented on hashtable synchronization (was: DataChecksum calculate checksum is contented on hashtable lock) > DataChecksum calculate checksum is contented on hashtable synchronization > - > > Key: HADOOP-15598 > URL: https://issues.apache.org/jira/browse/HADOOP-15598 > Project: Hadoop Common > Issue Type: Bug > Components: common >Affects Versions: 3.2.0 >Reporter: Prasanth Jayachandran >Priority: Major > Attachments: Screen Shot 2018-07-11 at 1.45.06 AM.png > > > When profiling a multi-threaded hive streaming ingest, observed lock > contention on java.util.Properties getProperty() to check if os is "sparc". > java.util.Properties internally uses HashTable. HashTable.get() is > synchronized method. In the test application, on a 30s profile with 64 > threads ~40% CPU time is spent on getProperty() contention. See attached > snapshot. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org