The Sysadmin’s Daily Grind: FuzzyOCR

1000 MASTERPIECES

Article from Issue 74/2007

The latest trend is to hide spam in images. The admin’s response: an OCR tool that extracts the text and feeds it to the spam filter.

If you run SpamAssassin, the FuzzyOCR plugin is a good choice of image evaluation tool. FuzzyOCR isn’t hard to install, apart from a few dependencies you have to satisfy first. Make sure your version of SpamAssassin is as up to date as possible; it should be anyway, of course, but version 3.1.4 or later is a must. You also need the NetPBM tools, the convert binary from ImageMagick, Giflib, two Perl modules, and gocr for optical character recognition.
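Before installing the plugin, it can help to check which of these dependencies are already in place. The following is a minimal sketch in Python, not part of FuzzyOCR itself. The executable names in `REQUIRED_TOOLS` are assumptions about what each package typically installs and may differ on your distribution. The two Perl modules are not checked because the text does not name them.

```python
import shutil

# Assumed executable names for each dependency; adjust to your distribution.
REQUIRED_TOOLS = {
    "spamassassin": "SpamAssassin",
    "giftopnm": "NetPBM",
    "convert": "ImageMagick",
    "giftext": "Giflib",
    "gocr": "gocr",
}

# Minimum SpamAssassin version named in the text.
MIN_SPAMASSASSIN = "3.1.4"


def missing_tools(tools):
    """Return the names from `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


def version_at_least(found, minimum):
    """Compare dotted version strings numerically, so 3.1.10 ranks above 3.1.4."""
    def parse(version):
        return tuple(int(part) for part in version.split("."))
    return parse(found) >= parse(minimum)


if __name__ == "__main__":
    missing = missing_tools(REQUIRED_TOOLS)
    for name in missing:
        print(f"missing: {name} ({REQUIRED_TOOLS[name]})")
    if not missing:
        print("all assumed executables found on PATH")
```

To check the SpamAssassin version, pass the version string your installation reports to `version_at_least`, for example `version_at_least("3.1.7", MIN_SPAMASSASSIN)`. Comparing the parts as integers avoids the trap of plain string comparison, which would rank 3.1.10 below 3.1.4.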


