rzo1 commented on PR #1728:
URL: https://github.com/apache/stormcrawler/pull/1728#issuecomment-3575983623

   > I haven't looked at the details yet but I agree with @rzo1 that this feels 
like it should be a ParseFilter.
   
   I thought a bit about it (over the last weekend) and I think implementing 
this as a separate bolt would be ok, as it allows the NLP processing to run as 
an independent step, enabling better isolation, scaling, and more precise 
measurement of tuple processing time without impacting the parsing bolt (at 
all). This also allows redaction to be treated as an isolated use case, 
separate from the core parsing logic. So I would be ok with a bolt in that case.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to