Hi,

On Thu, Aug 24, 2017 at 8:00 PM, Alex <mysqlstud...@gmail.com> wrote:
> Hi,
>
> On Wed, Aug 23, 2017 at 3:01 PM, Matus UHLAR - fantomas
> <uh...@fantomas.sk> wrote:
>> On 22.08.17 14:55, Alex wrote:
>>>
>>> We've been hit a number of times lately by phishing attacks using PDF
>>> documents with a link in them. Has anyone had any success in blocking
>>> these PDFs?
>>>
>>> You can download one such example here:
>>> https://www.dropbox.com/s/b97pcvl1wm1oocq/pdf-phish.pdf?dl=0
>>>
>>> I know there was a PDF OCR plugin of some sort, but I don't recall it
>>> being all that effective. Ideas greatly appreciated.
>>
>>
>> I think you mean PDFassassin, but I'd prefer ExtractText
>> both described at
>> https://wiki.apache.org/spamassassin/UnmaintainedCustomPlugins
>
> Both links to download ExtractText are dead :-( Given it's from 2007,
> is there any reasonable expectation it would even come close to
> working anyway?

Much to my surprise, I've managed to find it and actually make it
(almost) work. Does someone feel like helping me figure it out the
rest of the way?
https://github.com/DavidGoodwin/ExtractText

The plugin consists of an "ExtractText" part to extract text from PDFs
and an OpenXML part that extracts text from Word docs.

I'm having a problem with the OpenXML.pm plugin. It's lacking a new() function:
Aug 26 16:01:53.512 [18151] warn: plugin: failed to create instance of
plugin Mail::SpamAssassin::Plugin::OpenXML: Can't locate object method
"new" via package "Mail::SpamAssassin::Plugin::OpenXML" at (eval 2566)
line 1.

I'm not very good at OO perl. Would someone have some ideas? I've
pasted it here.
https://pastebin.com/Ac8fHJ3X

Reply via email to