rzo1 commented on PR #1728: URL: https://github.com/apache/stormcrawler/pull/1728#issuecomment-3575983623
> I haven't looked at the details yet but I agree with @rzo1 that this feels like it should be a ParseFilter. I thought a bit about it (over the last weekend) and I think implementing this as a separate bolt would be ok, as it allows the NLP processing to run as an independent step, enabling better isolation, scaling, and more precise measurement of tuple processing time without impacting the parsing bolt (at all). This also allows redaction to be treated as an isolated use case, separate from the core parsing logic. So I would be ok with a bolt in that case. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
