Justin du Coeur (jducoeur) wrote,

Anti-spam measures

I'm beginning to think that the best weapon in the war against spam may be a dictionary. I don't believe I've gotten a single spam to the Borough mailing list in the past month that did *not* have at least one, and usually several, misspellings in the Subject. Clearly the spammers have cottoned to the recent findings that misspellings internal to a word get glossed over by the eye, and they're using this as a way of getting around keyword-based spam filters.

Hmm. I wonder if there's an anti-spam filter that does permutations? It doesn't seem like that hard an enhancement, and it would be very useful: if a Subject keyword has the same first and last letters as one marked in a spam filter, but the internal letters are rearranged, and the exact word is not in the dictionary, you mark it as even *more* likely to be spam, since someone appears to be trying to hide a spam keyword. If things get worse, I may go do some research in this direction...
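For the curious, here is a rough sketch of what that check might look like in Python. It is only an illustration of the idea above, not an existing filter: the function names, the keyword set, the dictionary set, and the two weights are all made up for the example, and a real filter would need a proper word list and better tokenizing.

```python
import re


def scrambled_match(word, spam_keywords, dictionary):
    """Return the spam keyword that `word` appears to be a scramble of, or None.

    Assumes `spam_keywords` and `dictionary` are sets of lowercase words
    (both hypothetical inputs for this sketch).
    """
    w = word.lower()
    # A real dictionary word, or an exact keyword hit, is not a disguise.
    if w in dictionary or w in spam_keywords:
        return None
    for kw in spam_keywords:
        if (len(w) == len(kw)
                and w[0] == kw[0]                         # same first letter
                and w[-1] == kw[-1]                       # same last letter
                and sorted(w[1:-1]) == sorted(kw[1:-1])): # same inner letters, reordered
            return kw
    return None


def subject_score(subject, spam_keywords, dictionary, exact=1.0, scrambled=2.0):
    """Score a Subject line; scrambled keywords count for more than exact ones.

    The weights are arbitrary placeholders, chosen only to show the
    "even more likely to be spam" rule.
    """
    score = 0.0
    for word in re.findall(r"[A-Za-z]+", subject):
        w = word.lower()
        if w in spam_keywords:
            score += exact
        elif scrambled_match(w, spam_keywords, dictionary) is not None:
            score += scrambled
    return score


# Toy data, purely for illustration.
SPAM_KEYWORDS = {"mortgage", "viagra"}
DICTIONARY = {"low", "rates", "mortgage", "message"}

plain = subject_score("Low mortgage rates", SPAM_KEYWORDS, DICTIONARY)
disguised = subject_score("Low mrotgage rates", SPAM_KEYWORDS, DICTIONARY)
```

With the toy data, the disguised Subject scores higher than the plain one, which is the whole point: the misspelling itself becomes evidence. Comparing the sorted inner letters keeps the check cheap, since there is no need to generate every permutation of each keyword.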

