On 25/06/2024 16:21, Bill Cole wrote:
On 2024-06-24 at 17:18:11 UTC-0400 (Mon, 24 Jun 2024 17:18:11 -0400)
Mark London <m...@psfc.mit.edu>
is rumored to have said:
I received a spam email with the text below that wasn't caught by
SpamAssassin (at least mine). The text actually looks like something
that was generated using ChatGPT. In any event, I put the text
through ChatGPT and asked if it looked like spam. At the bottom of
this email is its analysis. I've not been fully reading this
group. Has there been any work to allow SpamAssassin to use AI?
"Artificial intelligence" does not exist. It is a misnomer.
Large language models like ChatGPT have a provenance problem. There's
no way to know why exactly the model "says" anything. In a single
paragraph, ChatGPT is capable of making completely and directly
inconsistent assertions. The only way to explain that is that despite
appearances, a request to answer the ham/spasm question generates text
with no semantic connection to the original, but which seems like an
explanation.
SpamAssassin's code and rules all come from ASF committers, and the
scores are determined by examining the scan results from contributors
and optimizing them to a threshold of 5.0. Every scan of a message
results in a list of hits against documented rules. The results can be
analyzed and understood.
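As a rough illustration of that scoring model (the rule names and
per-rule scores below are invented for the example, not real
SpamAssassin rules), a scan amounts to summing the scores of every
rule that hit and comparing the total to the 5.0 threshold:

```python
# Minimal sketch of SpamAssassin-style additive scoring.
# Rule names and scores are hypothetical, for illustration only.
THRESHOLD = 5.0

def classify(hits):
    """hits: dict mapping rule name -> score for each rule that matched.
    Returns (total score, spam verdict)."""
    total = sum(hits.values())
    return total, total >= THRESHOLD

hits = {
    "EXAMPLE_SUBJ_ALLCAPS": 1.5,   # hypothetical subject-line rule
    "EXAMPLE_URI_SHORTENER": 2.2,  # hypothetical URI rule
    "EXAMPLE_BAYES_99": 3.5,       # hypothetical Bayes bucket
}
total, is_spam = classify(hits)
print(f"score={total:.1f} spam={is_spam}")
```

Because every verdict is just this sum over named, documented rules,
any decision can be traced back to the exact rules that fired.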
We know that ChatGPT and other LLMs that are publicly available have
been trained on data to which they had no license. There is no way to
remove any particular ingested data. There's no way to know where any
particular LLM will have problems and no way to fix those problems.
This all puts them outside of the boundaries we have as an ASF
project. However, we do have a plugin architecture, so it is possible
for third parties to create a plugin for LLM integration.
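Real SpamAssassin plugins are Perl modules built on
Mail::SpamAssassin::Plugin; purely as a language-neutral sketch of
what a third-party LLM check could look like (every name below is
invented), such a plugin would wrap the model behind an ordinary
rule that contributes a configured score like any other:

```python
# Hypothetical sketch only: a rule hit driven by an external model.
# This is not SpamAssassin's actual plugin API (which is Perl).

def llm_spam_check(message_text, llm_score_fn, cutoff=0.8):
    """Return True (i.e. the rule 'hits') when the external model's
    estimated spam probability meets the cutoff. The hit would then
    add its configured score toward the 5.0 threshold, so the rest
    of the scoring stays inspectable even if the model is not."""
    probability = llm_score_fn(message_text)
    return probability >= cutoff

# Usage with a stub model standing in for the LLM call:
hit = llm_spam_check("WIN A PRIZE!!!", lambda text: 0.95)
print(f"rule hit: {hit}")
```

Keeping the model behind a scored rule preserves one property the
post emphasizes: the final verdict remains a documented sum of hits,
even when one input to it is opaque.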
I asked a well-known chatbot: What would Bill Cole say if he were
asked about integrating AI into SpamAssassin?
Bill Cole, the maintainer of SpamAssassin, is likely to be
enthusiastic about the potential of AI to improve spam detection.
Here's what he might say:
* Improved Spam Filtering: AI could be used to analyze spam emails
with greater accuracy than traditional rule-based methods.
SpamAssassin currently relies on matching email content to predefined
rules, but AI could identify spam based on subtler cues like phrasing,
sender behavior, and content structure.
* Adapting to Evolving Threats: Spammers constantly develop new
tactics to bypass filters. AI can learn and adapt to these evolving
threats much faster than manually updating rules.
* Mitigating False Positives: AI could help reduce the number of
legitimate emails mistakenly flagged as spam (false positives). This
is a major concern for SpamAssassin users.
However, Bill Cole might also mention some challenges:
* Training Data Bias: The effectiveness of AI depends on the quality
and balance of training data. Biases in the training data could lead
to the AI disproportionately flagging certain types of emails as spam.
* Transparency and Explainability: It can be difficult to understand
how AI systems arrive at their decisions. This lack of transparency
could make it hard to debug errors or identify potential biases.
* Computational Cost: Training and running AI models can require
significant computational resources, which might not be feasible for
all SpamAssassin users.
Overall, Bill Cole would likely be cautiously optimistic about the
potential of AI to improve SpamAssassin. He would acknowledge the
potential benefits but also highlight the challenges that need to be
addressed.
:)