Bayesian filter

Filtering of email using probabilities of the occurrence of individual words in ham and spam

A Bayesian filter is a spam filter that uses the probabilities of individual words occurring in ham (legitimate) and spam emails to compute the probability that an email containing those words is spam.
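The calculation can be illustrated with a minimal sketch. It assumes the simplest "naive" form of the technique, in which words are treated as independent of one another and only words present in the message are considered. The function name, the probability tables and the prior of 0.5 are hypothetical values chosen for illustration; real filters differ in how they estimate and combine these figures.

```python
import math

def spam_probability(words, p_word_given_spam, p_word_given_ham, prior_spam=0.5):
    """Combine per-word probabilities into P(spam | words) using Bayes' rule.

    Assumes words are independent (the 'naive' assumption). Words missing
    from either table are ignored.
    """
    # Work in log space so that many small probabilities do not underflow.
    log_spam = math.log(prior_spam)
    log_ham = math.log(1.0 - prior_spam)
    for word in set(words):
        if word in p_word_given_spam and word in p_word_given_ham:
            log_spam += math.log(p_word_given_spam[word])
            log_ham += math.log(p_word_given_ham[word])
    diff = log_ham - log_spam
    if diff > 700:  # math.exp would overflow; the message is almost certainly ham
        return 0.0
    # P(spam | words) = 1 / (1 + P(ham, words) / P(spam, words))
    return 1.0 / (1.0 + math.exp(diff))

# Hypothetical per-word probabilities, for illustration only.
p_spam = {"viagra": 0.2, "meeting": 0.01}
p_ham = {"viagra": 0.001, "meeting": 0.1}

print(spam_probability(["viagra"], p_spam, p_ham))   # close to 1
print(spam_probability(["meeting"], p_spam, p_ham))  # close to 0
```

A message containing only words the filter has never seen keeps the prior probability, which is why the choice of prior matters for short or unusual messages.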

The likelihood of specific words occurring in ham and spam differs considerably between users. For instance, medical terminology and stock exchange-related words are commonly seen in spam emails, yet employees in the pharmaceutical or financial industries, respectively, are likely to see these words in legitimate email. Bayesian filters therefore work best if they are 'taught' by the user what is to be considered spam and what is ham.
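The effect of per-user training can be sketched as follows. This is a simplified illustration, not a description of any particular product: the class `UserFilter`, its method names and the training messages are all hypothetical, word probabilities are estimated from message counts with add-one smoothing, and only words present in the message contribute to the score.

```python
import math
from collections import Counter

class UserFilter:
    """A toy per-user Bayesian filter trained on messages labelled by the user."""

    def __init__(self):
        self.word_counts = {"spam": Counter(), "ham": Counter()}
        self.message_counts = {"spam": 0, "ham": 0}

    def train(self, text, label):
        # Count each distinct word once per message.
        self.word_counts[label].update(set(text.lower().split()))
        self.message_counts[label] += 1

    def _p_word(self, word, label):
        # Add-one smoothing keeps unseen words from producing zero probabilities.
        return (self.word_counts[label][word] + 1) / (self.message_counts[label] + 2)

    def spam_probability(self, text):
        # Start from smoothed prior odds, then add the evidence of each word.
        log_odds = math.log(
            (self.message_counts["spam"] + 1) / (self.message_counts["ham"] + 1)
        )
        for word in set(text.lower().split()):
            log_odds += math.log(self._p_word(word, "spam") / self._p_word(word, "ham"))
        return 1.0 / (1.0 + math.exp(-log_odds))

# Two users who label different mail as spam and ham (made-up messages).
office_worker = UserFilter()
office_worker.train("cheap prescription pills", "spam")
office_worker.train("meeting agenda attached", "ham")

pharmacist = UserFilter()
pharmacist.train("win lottery money now", "spam")
pharmacist.train("prescription dosage update", "ham")
pharmacist.train("prescription order confirmed", "ham")

# The same word scores differently depending on who trained the filter.
print(office_worker.spam_probability("prescription"))
print(pharmacist.spam_probability("prescription"))
```

In this sketch the word "prescription" pushes the score towards spam for the office worker and towards ham for the pharmacist, which mirrors the difference between users described above.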

Spammers are known to use various techniques in their attempts to bypass spam filters, such as adding random bits of text or hiding the content of the email in an image. An overview of such techniques can be found in The Spammers' Compendium on this website.
