ppkarwasz commented on PR #1327:
URL: https://github.com/apache/commons-lang/pull/1327#issuecomment-2524066753

   @IBue,
   
   > note on performance: boxing is the main responsible for increase in memory 
footprint here, that is +3GB for a 100M input sequence, but the current 
implementation takes already a 10² order of magnitude more runtime than the 
proposed one for a 10M input sequence (minutes!) and a result index at half of 
the input size.
   
   Can you tell us more on the usage you have for this method? That will tell 
us a lot on how it should be optimized.
   
   A common usage for this method could be extracting contiguous series of 
digits from a long string. With series of 10-20 digits it doesn't really make 
sense to go parallel, but if you want to validate a 100M input to make sure it 
only contains digits, you can sacrifice more memory.
   
   **Note**: The `indexOfAnyBut` method can have multiple implementations and 
switched between them based on the input size.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to