ppkarwasz commented on PR #1327: URL: https://github.com/apache/commons-lang/pull/1327#issuecomment-2524066753
@IBue, > note on performance: boxing is the main responsible for increase in memory footprint here, that is +3GB for a 100M input sequence, but the current implementation takes already a 10² order of magnitude more runtime than the proposed one for a 10M input sequence (minutes!) and a result index at half of the input size. Can you tell us more on the usage you have for this method? That will tell us a lot on how it should be optimized. A common usage for this method could be extracting contiguous series of digits from a long string. With series of 10-20 digits it doesn't really make sense to go parallel, but if you want to validate a 100M input to make sure it only contains digits, you can sacrifice more memory. **Note**: The `indexOfAnyBut` method can have multiple implementations and switched between them based on the input size. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
