Re: vampirex plugin naw ruz edition avalaible to download

2003-03-23 Thread Jonathan Angliss
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

On Sunday, March 23, 2003, Jonathan Angliss wrote...

>> http://W%57%57.%42%49%4cET%49%4b.G%49S%49S.R%55 instead of real URL?

> url decode :) I'll write some code, and post it shortly.

Find attached a URL decode routine.  Not sure how perfect it is,
haven't given it a full blown road run yet, but tested it on a few
encoded strings that I have, including this one.  Format for usage is:

  str = URLDecode( EncodedString );

For example:

  str = URLDecode( 'http://W%57%57.%42%49%4cET%49%4b.G%49S%49S.R%55');

Str will then return http://WWW.BILETIK.GISIS.RU

Hope that helps.

- --
Jonathan Angliss
([EMAIL PROTECTED])

-BEGIN PGP SIGNATURE-
Comment: Fingerprint: 676A 1701 665B E343 E393  B8D2 2B83 E814 F8FD 1F73

iQA/AwUBPn51mSuD6BT4/R9zEQK3fgCeNTgzYcR9D38ozkZlyMNSI4kNa2oAoP3U
aIlCy2djV/bdfbCy3lj2HQ7o
=PkyT
-END PGP SIGNATURE-

decode.pas
Description: Binary data

Current version is 1.62 | "Using TBDEV" information:
http://www.silverstones.com/thebat/TBUDLInfo.html


Re: vampirex plugin naw ruz edition avalaible to download

2003-03-23 Thread Jonathan Angliss
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

On Sunday, March 23, 2003, Alexey N. Vinogradov wrote...

> If spammer include a comment inside HTML - as

> PORNOGRAPHY, or

I've seen a lot of this recently.  If you're using the right
languages, you can simple use a regular expression to ignore that, I'm
not sure on the syntax, but a general idea would be something like:

  $str = replace( '\<\!--\d+--\>' , '' , $str );

Of course... I'm probably off, but it probably gives you an idea. I
doubt the delphi's replace function takes regexp, but there is some
regexp handling in there somewhere.

> http://W%57%57.%42%49%4cET%49%4b.G%49S%49S.R%55 instead of real URL?

url decode :) I'll write some code, and post it shortly.

- --
Jonathan Angliss
([EMAIL PROTECTED])

-BEGIN PGP SIGNATURE-
Comment: Fingerprint: 676A 1701 665B E343 E393  B8D2 2B83 E814 F8FD 1F73

iQA/AwUBPn5vCyuD6BT4/R9zEQJDqgCgikfVnoTLGZQg185TMcgZ0zKUU9IAn0hn
tMlnCvZzcIuIFYqiUIRAqDGh
=WTtG
-END PGP SIGNATURE-



Current version is 1.62 | "Using TBDEV" information:
http://www.silverstones.com/thebat/TBUDLInfo.html


Re[2]: vampirex plugin naw ruz edition avalaible to download

2003-03-23 Thread Task Control
Hi tbdev at thebat.dutaint.com:


In the past Alexey was post:


ANV> Hey! Remember me? I have translated an interface of PacSpam (aka
ANV> Vampire) some time ago...
if you make a translate of the actual files, i'm very happy.

ANV> Now  I  am  about  finishing  my own filter
congratulations, i woulfd like test it.

ANV> -  based  on  completely another principle of regarding spam. The
ANV> idea  is  baesyan  method. Good [...] the moment I collected only
ANV> 500 :(.
Yeah, i was read about this, i do not like it, because your collected
spam files are not my spam files, the method is great, but, each user
need have a spam collection.


ANV> I  just want to ask some question to you - because you also wrote
ANV> your  own  Vampire  and of course know what to deal with The Bat!
ANV> interface.

ANV> how  it  is possible to know EXACT size of a part? If I just call
ANV> the  function  with  NULL  as  a bufer and zero as a size - can I
ANV> receive a necessary size as result?

  procedure GetStr(Idx: Integer; VAR str_part: string;
 VAR size_part: integer);
  var
I : Integer;
  begin
Result := '';
try
  I := GetData(MsgID, Idx, nil, 0);
  if I > 0 then
  begin
if I > Sz then
begin
  ReallocMem(P, I);
  Sz := I;
end;
I := GetData(MsgID, Idx, P, Sz);
size_part:=I; { < LOOK HERE }
if I > 0 then SetString(Result, P, I);
str_part:=Result;
  end;
except
  Result := '';
  str_part:=Result;
end;
  end; {GetStr}

ANV> And two other question - when you filtering a text letter by Vampire,
ANV> what are you doing with such things:
ANV> PORNOGRAPHY, or
ANV> http://W%57%57.%42%49%4cET%49%4b.G%49S%49S.R%55 instead of real URL?

I'm looking for a external library that change valid html code to a
simple text code.

I'm waiting the complete interface with decoded body support.

-- 
Best Regards,
 Task Control 
   mail: TaskControl at SoftHome dot net

Using: 
- Windows 98 4.10.1998 
- AVG 6.0 Free Edition
- The Bat! 1.63 Beta/7
- Trillian PRO 1.0 C



Current version is 1.62 | "Using TBDEV" information:
http://www.silverstones.com/thebat/TBUDLInfo.html


Re: vampirex plugin naw ruz edition avalaible to download

2003-03-23 Thread Kjartan Ásþórsson
ANV> Now I am about finishing my own filter - based on completely
ANV> another principle of regarding spam. The idea is baesyan method.

That is the same method as used in the new Mozilla, right?

You say you are about to finish this project. When can we expect the
first version to be born?

- kjartan



Current version is 1.62 | "Using TBDEV" information:
http://www.silverstones.com/thebat/TBUDLInfo.html


Re: vampirex plugin naw ruz edition avalaible to download

2003-03-23 Thread Alexey N. Vinogradov
Hello, Task. 
You wrote in 

TC> Hellow tbdev at thebat.dutaint.com

TC>   I wish you a happy Naw-Rúz and the best for the new year.

TC>   today (160-01-01) i updated the plugin vampirex
TC>   where? http://fyberger.tripod.com/vampire/vampire.htm

TC>   changes
TC>   - new non-letters
TC>   - save expert mode configuration
TC>   - a picture of me :-)

Hey! Remember me? I have translated an interface of PacSpam (aka
Vampire) some time ago... Now I am about finishing my own filter -
based on completely another principle of regarding spam. The idea is
baesyan method. Good side that it is not necessary to investigate a
mass of spam trying to determine the rules for how to filter them. The
method based on self-studying and need only an answer to you about a
letter - is it spam or not. All further analysis is the deal of the
method itself. Bad side (for a first time) is that you need many spam
letters to "train" the program (about 2000. For the moment I collected
only 500 :(.

I just want to ask some question to you - because you also wrote your
own Vampire and of course know what to deal with The Bat! interface.

For working of the filter I need only raw head and raw body of a
letter. There is function for getting it in the interface which
receive the address of buffer and the size of it. As I understand it
means that I must assign the size before call the function. But how it
is possible to know EXACT size of a part? If I just call the function
with NULL as a bufer and zero as a size - can I receive a necessary
size as result?

And two other question - when you filtering a text letter by Vampire,
what are you doing with such things:

If spammer include a comment inside HTML - as

PORNOGRAPHY, or

http://W%57%57.%42%49%4cET%49%4b.G%49S%49S.R%55 instead of real URL?


-- 
Sincerely,
 Alexey.
Using TB 1.63b7 on WinXP SP1 Corp + MUI RU, spelling by ORFO2002
   mailto:[EMAIL PROTECTED]



Current version is 1.62 | "Using TBDEV" information:
http://www.silverstones.com/thebat/TBUDLInfo.html