[ https://issues.apache.org/jira/browse/CASSANDRA-5555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13666403#comment-13666403 ]
Nick Bailey commented on CASSANDRA-5555: ---------------------------------------- At the very least we should make the heap size configurable with an environment variable instead of hard coded in the executable. But I would still prefer a solution that doesn't require using as much memory to begin with. That makes it easier to restore multiple column families at the same time (potentially hundreds of thousands of sstables), and makes it easier to run the sstableloader on a node that is already running cassandra (restoring a local backup instead of a remote one). > Allow sstableloader to handle a larger number of files > ------------------------------------------------------ > > Key: CASSANDRA-5555 > URL: https://issues.apache.org/jira/browse/CASSANDRA-5555 > Project: Cassandra > Issue Type: Improvement > Components: Tools > Reporter: Tyler Hobbs > Assignee: Dave Brosius > Fix For: 1.2.6 > > Attachments: 5555-01.txt > > > With the default heap size, sstableloader will OOM when there are roughly 25k > files in the directory to load. It's easy to reach this number of files in a > single LCS column family. > By avoiding creating all SSTableReaders up front in SSTableLoader, we should > be able to increase the number of files that sstableloader can handle > considerably. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira