Thomas Marquardt created HADOOP-15547:
-----------------------------------------

             Summary: WASB: listStatus performance
                 Key: HADOOP-15547
                 URL: https://issues.apache.org/jira/browse/HADOOP-15547
             Project: Hadoop Common
          Issue Type: Bug
          Components: fs/azure
    Affects Versions: 3.0.2, 2.9.1
            Reporter: Thomas Marquardt
            Assignee: Thomas Marquardt


The WASB implementation of FileSystem.listStatus is very slow because of an O(n!) algorithm for removing duplicates, and it uses too much memory because of the extra conversion from BlobListItem to FileMetadata to FileStatus. It takes over 30 minutes to list 700,000 files.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org
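To illustrate the kind of cost the description refers to, here is a minimal sketch of duplicate removal over a listing, done once by repeated scanning and once with a hash set. It is not the WASB code: the class name ListingDedup, the method names, and the use of plain path strings in place of BlobListItem or FileMetadata objects are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;

// Hypothetical illustration only; these names do not come from the WASB source.
class ListingDedup {

    // Scan-based removal: each candidate is compared against every entry
    // kept so far, so the total work grows quadratically with listing size.
    static List<String> dedupByScan(List<String> paths) {
        List<String> unique = new ArrayList<>();
        for (String p : paths) {
            if (!unique.contains(p)) { // linear scan on every iteration
                unique.add(p);
            }
        }
        return unique;
    }

    // Hash-based removal: a LinkedHashSet answers "seen before?" in constant
    // expected time and keeps first-seen order, so the pass is linear on average.
    static List<String> dedupByHash(List<String> paths) {
        return new ArrayList<>(new LinkedHashSet<>(paths));
    }

    public static void main(String[] args) {
        List<String> listing =
            Arrays.asList("dir/a", "dir/b", "dir/a", "dir/c", "dir/b");
        System.out.println(dedupByScan(listing));
        System.out.println(dedupByHash(listing));
    }
}
```

Both methods return the same entries in the same order; only the amount of work differs. Whether a hash-based pass is the right fix for WASB depends on how the listing code is structured, which this message does not show.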