Re: [spambayes-dev] A spambayes-esque approach for 4 email categories

2024-04-25 Thread Skip Montanaro
Just stumbled upon a statement the other day which indicated the scikit-learn package has a classification subsystem. Might be worth reading up on it, even if it turns out not to be useful here. https://scikit-learn.org/stable/auto_examples/classification/plot_classifier_comparison.html Skip

Re: [spambayes-dev] Dev environment setup

2023-11-15 Thread Skip Montanaro
me error on loading of the Add In, and the add-in is > disabled. Any ideas on where I could go to see that exception? I was hoping > something would go in the EventViewer, or in a log file somewhere, but I > haven’t found anything yet. > > > > Thanks, > > jwc > > >

Re: [spambayes-dev] Dev environment setup

2023-11-13 Thread Skip Montanaro
Thanks for responding Mark. As you indicated, SpamBayes has been on long-term hiatus. The biggest impediments for me are a) Gmail does a good job, and b) I've so far been unable to convince anyone with Windows packaging experience to update that side of things. That said, porting most of it to Pyth

Re: [spambayes-dev] Python 3 branch

2022-09-04 Thread Skip Montanaro
> With the sudden flurry of emails about SpamBayes, I went ahead and created a new python3 branch at > > https://github.com/smontanaro/spambayes > > This is the result of a basic run of 2to3 nearly two years ago (!) followed by most functional changes to Neil Schemenauer's Python 3 partial port: >

[spambayes-dev] Python 3 branch

2022-08-25 Thread Skip Montanaro
With the sudden flurry of emails about SpamBayes, I went ahead and created a new python3 branch at https://github.com/smontanaro/spambayes This is the result of a basic run of 2to3 nearly two years ago (!) followed by most functional changes to Neil Schemenauer's Python 3 partial port: https://g

Re: [spambayes-dev] ImportError: No module named spambayes.resources (line 12 of setup_all.py)

2022-08-25 Thread Skip Montanaro
> > I'm afraid I have no idea about that. It looks like I don't even have > the latest spambayes source tree here, and don't even have svn to update > it! > Hi Mark, Good to see you're still around. The current SpamBayes source is on GitHub: https://github.com/smontanaro/spambayes My intent in

Re: [spambayes-dev] ImportError: No module named spambayes.resources (line 12 of setup_all.py)

2022-08-24 Thread Skip Montanaro
> > You should be able to find the version of spambayes that setup_all.py > is trying to use. Does it in fact have a directory called resources? > At the top-level spambayes directory, there is a spambayes/windows/setup_all.py. That suggests to me that something about the way the Windows package i

[spambayes-dev] sb_imapfilter with Hotmail causes loss of emails

2021-02-01 Thread Skip Montanaro
(willingly, at least). I will poke around a bit though. Thanks, Skip -- Forwarded message - From: Alexei Potashnik Date: Mon, Feb 1, 2021 at 2:38 PM Subject: Re: [smontanaro/spambayes] sb_imapfilter with Hotmail causes loss of emails (#6) To: smontanaro/spambayes Cc: Skip Mont

Re: [spambayes-dev] Bayesian Spam Filter

2020-04-23 Thread Skip Montanaro
> I am looking to get involved in some python volunteer work, I greatly enjoy > this programming language and have completed some study in this area. > > Checking through volunteer opportunities on python.org I have come across > yours. > > Do you still need help? Liam, I'm not sure where you'v

Re: [spambayes-dev] Anybody still have a test ham/spam database?

2018-07-12 Thread Skip Montanaro
> > I have various corpora still but they're on drives that I would have to > dig out of storage. If the existing offers fall through let me know and I > can pull them out. > I think I'm good, at least for now. Between what I had already and what Matt had, I should have enough. Of course, if I nee

Re: [spambayes-dev] Anybody still have a test ham/spam database?

2018-07-12 Thread Skip Montanaro
> > Your current task is much clearer: > > > ... I am looking to insure that a Py3 port of SpamBayes > > works the same as the Py2 code. > > For _that_ purpose, you can take any pile of email at all; split it into > "ham" and "spam" at random, and "just" ensure you get the same results from > the o
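The parity check described in the quoted advice can be sketched roughly as follows. This is only an illustration under stated assumptions: `random_split` and `compare_scores` are hypothetical helper names, not part of SpamBayes, and the score dictionaries are assumed to come from running the Py2 and Py3 code over the same messages. Because `random.shuffle` does not produce the same ordering under Python 2 and Python 3 even with the same seed, the split should be done once and saved to disk so both interpreters train on identical labels.

```python
import random


def random_split(messages, seed=0):
    """Shuffle a pile of messages and cut it in half as arbitrary 'ham' and 'spam'.

    Hypothetical helper: the labels are meaningless on purpose, since the goal
    is only to feed identical training data to both implementations.
    """
    rng = random.Random(seed)
    shuffled = list(messages)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


def compare_scores(old_scores, new_scores, tolerance=1e-9):
    """Return the message ids whose scores differ between the two runs.

    Both arguments map a message id to a float score. An id missing from
    either run counts as a mismatch.
    """
    mismatches = []
    for msg_id in sorted(set(old_scores) | set(new_scores)):
        old = old_scores.get(msg_id)
        new = new_scores.get(msg_id)
        if old is None or new is None or abs(old - new) > tolerance:
            mismatches.append(msg_id)
    return mismatches
```

An empty list from `compare_scores` would suggest the port scores those messages the same way as the original; any ids it returns are the ones worth inspecting by hand.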

Re: [spambayes-dev] Anybody still have a test ham/spam database?

2018-07-10 Thread Skip Montanaro
> Sorry, Skip - I don't. And I was surprised just now to see that we > apparently never checked test data files into the Sourceforge source tree > either! > > But it shouldn't matter. SB learns pretty quickly, and it would be better to > use _current_ examples of spam and ham anyway (their cha

Re: [spambayes-dev] Anybody still have a test ham/spam database?

2018-07-10 Thread Skip Montanaro
> I have both the Ham and Spam directories that I think were Tim's > original data and also a corpus of (as of today) 38,509 spam emails. > I would be happy to give you any of that. Though you'll have to > explain to me how to use this newfangled Google Drive. Thanks, Matt. Sharing email sent. Let

[spambayes-dev] Anybody still have a test ham/spam database?

2018-07-10 Thread Skip Montanaro
I'm going to take a crack at porting SpamBayes to Python 3. For that I should probably have some test data. My goal is to replicate existing behavior, not improve the breed. I long ago deleted what I used BITD. Does anyone still have their setup? If so, let me know. I can provide a writable folder

[spambayes-dev] Need a bit of help with the Windows installer

2017-12-28 Thread Skip Montanaro
It seems I might have successfully built a Windows installer using the Appveyor continuous integration platform (https://www.appveyor.com/). I don't have direct access to Windows, however I was able to run the installer without error using WINE, and start up the POP3 proxy. I didn't go any farther.

[spambayes-dev] Need a bit of help with the Windows installer

2017-12-27 Thread Skip Montanaro
It seems I might have successfully built a Windows installer using the Appveyor continuous integration platform. I don't have direct access to Windows, however I was able to run the installer without error using WINE, and start up the POP3 proxy. I've not tried to go any farther. I'm looking for a

[spambayes-dev] SpamBayes 1.1b3

2017-11-23 Thread Skip Montanaro
It's so long between releases that I am not sure how I should handle version numbers, but I just uploaded spambayes-1.1b3.tar.gz to PyPI. It fixes a single bug in the tokenizer uncovered a couple days ago as Mark Sapiro and I were getting gate_news (the SpamBayes-wielding Usenet-to-email gateway) g

[spambayes-dev] Image spam

2016-10-08 Thread Skip Montanaro
Anybody still here? I noticed recently that a number of mortgage refinance spams using images to contain the text have slipped through Gmail's spam filters. It's nice to recall that we had that problem pretty well solved several years ago. :-) Skip

Re: [spambayes-dev] Fwd: Message with no Package: tag cannot be processed! (Trying to contact Debian's SpamBayes package maintainer)

2015-07-02 Thread Skip Montanaro
Adding spambayes-dev back in. Hopefully the conversation will be smoother now that we have a working email address for you. Rather than respond point-by-point, I'm just going to quote one part of your note: Skip> I'm not sure I've seen any bug reports, and like I said, I had no idea Skip> it was in

Re: [spambayes-dev] Spambayes and the Debian Project

2015-07-01 Thread Skip Montanaro
I don't think it would be a big deal to get you added as an official maintainer. I'm not sure I've seen any bug reports, and like I said, I had no idea it was included in any Linux distributions. Do you have specific bugs to report? Skip Montanaro On Mon, Jun 29, 2015 at 10:07 AM

[spambayes-dev] SpamBayes 1.1b2

2015-05-20 Thread Skip Montanaro
I released SpamBayes 1.1b2. It fixes two bugs. One was an incorrectly structured except clause. The other handles the case where urlparse.urlparse fails to parse some bogus URLs. I uploaded to PyPI and was able to pip install it on mail.python.org, so despite the fact that it's been ages since I d
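As a rough illustration of the second fix, a tokenizer can guard the URL parse so that one bogus URL does not abort processing of the whole message. This is a sketch, not the actual SpamBayes change: `safe_urlparse` is a hypothetical name, and the example uses Python 3's `urllib.parse.urlparse` rather than the Python 2 `urlparse.urlparse` mentioned in the message.

```python
from urllib.parse import urlparse


def safe_urlparse(url):
    """Return the parsed URL, or None if the URL is too malformed to parse.

    Hypothetical helper: urlparse raises ValueError for some inputs, such as
    an unbalanced '[' in the host part.
    """
    try:
        return urlparse(url)
    except ValueError:
        return None


# A well-formed URL parses normally; a bogus one yields None instead of raising.
parsed = safe_urlparse("http://example.com/offer")
broken = safe_urlparse("http://[::1")
```

Returning `None` leaves it to the caller to decide whether to skip the URL or emit a token noting that it was unparseable.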

Re: [spambayes-dev] Volunteer Opportunities

2013-10-25 Thread Skip Montanaro
> I may be interested helping with your project, if there any opportunities I > can tell you about myself. Glad to have help. Our biggest stumbling block for the past several years has been lack of help on the Windows side (creating installers and debugging issues Windows users encounter). Pretty

Re: [spambayes-dev] New Disgruntled SpamBayes user

2013-04-26 Thread Skip Montanaro
> I have Outlook Express 6 and Windows XP SP2. > I had to restore my PC to before I installed Spambayes to get it back to > square one. > After the restore, it worked of course. > > However, this was an awful lot of work for a piece of software I thought had > been tested. > > I look forward t

Re: [spambayes-dev] [Spambayes] SpamBayes classifier to the arabic language _ Help

2013-02-22 Thread Skip Montanaro
with spambayes/TESTING.txt to learn how to test your changes. Finally, you will need fairly large collections of spam and ham emails. The TESTING file should describe the requirements there. Skip Montanaro ___ spambayes-dev mailing list spambayes-dev@python.org http://mail.python.org/mailman/listinfo/spambayes-dev

[spambayes-dev] GitHub

2013-02-06 Thread Skip Montanaro
After some discussion with people on chic...@python.org, I decided to at least clone the SpamBayes repository into Git. I put it up on GitHub because that's the hip git place to be. (I'm nothing if not hip.) That git repo is disconnected from its Subversion source. I will probably push the repo b

Re: [spambayes-dev] [Spambayes] Outlook 2013

2013-01-15 Thread Skip Montanaro
This is a discussion better held on spambayes-dev, so I'm cc'ing that list and removing the spambayes list. Since most of the people here won't have seen the original note, I've included it below in its entirety. On Mon, Jan 14, 2013 at 5:15 AM, Thomas Hruska wrote: > On

[spambayes-dev] If we don't get some more Windows help pretty soon...

2011-06-24 Thread Skip Montanaro
If we can't find someone with the time and inclination to dig into problems running SpamBayes in a Win7 64-bit+Outlook 2010 environment pretty soon, I think it might be time to think about closing up shop. Other than the occasional, "Help! I deleted my Junk Suspects folder!" questions, the bulk o

Re: [spambayes-dev] some tokenising ideas for someone who wants to experiment

2005-06-16 Thread Skip Montanaro
Anthony> Got your own ideas for tokenising tricks that are worth trying? Anthony> Post them, we can collect them somewhere for people who want to Anthony> experiment... There's a "Stuff to try" section on the front page of the Spambayes wiki: http://www.entrian.com/sbwiki Skip

Re: [spambayes-dev] [Spambayes-checkins] website related.ht, 1.15, 1.16

2005-05-23 Thread Skip Montanaro
Anthony> Log Message: Anthony> wecanstopspam is a spam site now (ironically) ... Anthony> - Gary Robinson has a well-organized Anthony> - http://wecanstopspam.org/jsp/Wiki?StartingPoints Spam Wiki. Anthony> - Any idea what happened? Skip

Re: [spambayes-dev] address book

2005-05-17 Thread Skip Montanaro
>> Spambayes does not pay attention to my address book email addresses >> and mark as spam mail that should not as they come from friends. How >> can I train Spambayes to accept as Ham all emails coming from anybody >> in my address book ? I tried .. but could not find. Check the

[spambayes-dev] RE: [Spambayes-checkins] spambayes/spambayes Corpus.py, 1.21, 1.22 FileCorpus.py, 1.18, 1.19

2005-03-17 Thread Skip Montanaro
>> future imports have to come first... Tony> I thought that too, but if I run this script (with 2.2), it works: ... Sure, but not with CVS HEAD: % python Python 2.5a0 (#75, Mar 15 2005, 21:55:51) [GCC 3.3 20030304 (Apple Computer, Inc. build 1671)] on darwin Type "
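The rule under discussion can be checked without touching any real module by compiling two small source strings. This is a minimal sketch on a current Python 3 interpreter, not the session quoted above; `compiles_cleanly`, `GOOD` and `BAD` are names made up for the illustration.

```python
def compiles_cleanly(source):
    """Return True if the source compiles, False if it raises SyntaxError."""
    try:
        compile(source, "<example>", "exec")
    except SyntaxError:
        return False
    return True


# The __future__ import is the first statement: accepted.
GOOD = "from __future__ import division\nimport os\n"

# An ordinary import precedes the __future__ import: rejected at compile time.
BAD = "import os\nfrom __future__ import division\n"

good_ok = compiles_cleanly(GOOD)
bad_ok = compiles_cleanly(BAD)
```

Only a module docstring, comments and blank lines may appear before a `from __future__` import.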

Re: [spambayes-dev] Bug Report

2005-02-28 Thread Skip Montanaro
Andrew> What seems to happen is: Andrew> * Email client connects to POP3PROXY Andrew> * POP3PROXY tries to connect to proxied mail server Andrew> * POP3PROXY goes into an infinite loop, FILLING up its log file Andrew> with the same error message and the CPU usage goes to 100%.

RE: [spambayes-dev] "approximately" the same size

2005-01-24 Thread Skip Montanaro
Kenny> Mathematically, the total number of tokens should have no effect Kenny> on the probabilities. We only count a token once per message, Kenny> and we divide the number of messages that have contained the Kenny> token by the total number of messages. The total number of K
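The counting rule Kenny describes (each token counted at most once per message, then divided by the number of messages) can be sketched as below. This only illustrates that one ratio for a single corpus; it is not the full SpamBayes probability calculation, which does more than this, and `token_message_fractions` is a hypothetical name.

```python
from collections import Counter


def token_message_fractions(messages):
    """For each token, the fraction of messages containing it at least once.

    `messages` is a list of token lists, one per message. Repeats of a token
    within a single message are ignored, so message length has no effect.
    """
    counts = Counter()
    for tokens in messages:
        counts.update(set(tokens))  # set() collapses repeats within a message
    total = len(messages)
    return {token: n / total for token, n in counts.items()}


# "free" appears three times in the first message but is counted once.
fractions = token_message_fractions([
    ["free", "free", "free", "offer"],
    ["free", "meeting"],
])
```

Here `fractions["free"]` is 1.0 and `fractions["offer"]` is 0.5, regardless of how many tokens each message generated.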

[spambayes-dev] "approximately" the same size

2005-01-21 Thread Skip Montanaro
When we tell people not to let their ham/spam imbalance get too bad, we are referring to the number of messages trained. There is another way to look at this imbalance though: number of tokens generated from each stream. For me, ham messages are much larger on average than spam messages. Consequ

RE: [spambayes-dev] minor csv module problem

2005-01-20 Thread Skip Montanaro
>> Perhaps we could replace '\r' with ' ' in the subject before >> tokenizing without losing much/any accuracy. I don't believe we can >> get whitespace in body tokens. Tony> +1. Tony> (I presume that this is a nicer solution than having our own csv Tony> subclass that h
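The proposed workaround amounts to a one-line normalization applied to the subject before it reaches the tokenizer. A minimal sketch, assuming a hypothetical `normalize_subject` helper and using a plain whitespace split as a stand-in for the real tokenizer:

```python
def normalize_subject(subject):
    """Replace carriage returns with spaces so no token can contain '\\r'."""
    return subject.replace("\r", " ")


def subject_tokens(subject):
    """Stand-in tokenizer: split the normalized subject on whitespace."""
    return normalize_subject(subject).split()


# A subject with an embedded carriage return yields clean tokens.
tokens = subject_tokens("Cheap\rloans today")
```

Since the result is split on whitespace anyway, turning '\r' into ' ' changes which tokens are produced only for subjects that actually contained a carriage return.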

[spambayes-dev] minor csv module problem

2005-01-20 Thread Skip Montanaro
In my message training I train into a pickle (faster at that point), then use sb_dbexpimp to dump it to a csv file. For use by sb_bnfilter I then convert that to a Berkeley db file. (The csv file also serves as a convenient debug/interchange format.) The Python csv module is used both to write a

Re: [spambayes-dev] More stupid beats smart timcv.py results

2005-01-18 Thread Skip Montanaro
Tony> The latter was prompted by a comment in JGC's latest newsletter Tony> (though I'm sure I've seen this somewhere before, too). Who's JGC? Has anyone tried de-l33t-ing words that contain numbers? http://www.bbc.co.uk/dna/h2g2/A787917 Skip
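One way to experiment with the de-l33t-ing idea is a simple digit-to-letter translation applied only to words that mix digits and letters. This is a sketch for experimentation, not something SpamBayes is known to do: `LEET_MAP` and `deleet` are made-up names, and the mapping is an assumption (for instance, "1" could just as well stand for "l" as for "i").

```python
# Hypothetical digit-to-letter mapping; the choices here are assumptions.
LEET_MAP = str.maketrans({"0": "o", "1": "i", "3": "e", "4": "a", "5": "s", "7": "t"})


def deleet(word):
    """Translate l33t digits to letters, but only in mixed digit/letter words.

    Purely numeric tokens (years, prices) and purely alphabetic words are
    returned unchanged.
    """
    has_digit = any(ch.isdigit() for ch in word)
    has_alpha = any(ch.isalpha() for ch in word)
    if has_digit and has_alpha:
        return word.translate(LEET_MAP)
    return word


examples = [deleet(w) for w in ["fr33", "m0rtgage", "2005", "hello"]]
```

Whether this helps or hurts is an empirical question: the original l33t spelling is itself a strong spam clue, so an experiment might generate the translated token in addition to the original rather than in place of it.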

Re: [spambayes-dev] require subscription?

2004-12-06 Thread Skip Montanaro
>> ... http://mail.python.org/mailman/admindb/python-mode \ >> http://mail.python.org/mailman/admindb/pydotorg \ >> http://mail.python.org/mailman/admindb/python-dev \ >> http://mail.python.org/mailman/admindb/python-help \ >> http://mail.python.org/mailman/admindb/pythonmac-si

RE: [spambayes-dev] require subscription?

2004-12-06 Thread Skip Montanaro
Tony> I suppose I would be willing to moderate (non-member postings to) Tony> [EMAIL PROTECTED] for a while to see how it went, but I Tony> wouldn't want to be the only one doing so. I'll help out. Any potential European moderators? You might want to take a look at the mmfold.py scr

RE: [spambayes-dev] require subscription?

2004-12-03 Thread Skip Montanaro
Barry> I sure wish Mailman did a mail-back confirmation for non-member Barry> postings. I also wish I knew someone who could do something Barry> about that. Don't mail-back confirmations just help overload the net? Maybe I misunderstand what the term means. Skip

RE: [spambayes-dev] require subscription?

2004-12-03 Thread Skip Montanaro
Tony> Requiring registration for [EMAIL PROTECTED] would break the Tony> ability to automatically submit a problem report that sb_server Tony> offers (and Outlook may offer in the future). I don't have the Tony> data, but I suspect that a great many of the people that report T