[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Asrg] Spam detection system proposal
On Wed, 5 Mar 2003, Vernon Schryver wrote:
> > What are the *detectable* differences between a spammer and a legitimate
> > mass mailer, assuming we can't read the minds of the recipients?
> There are no such differences, detectable or not.
Then this ASRG is a waste of time.
> ] > This means that purely from
> ] > message contents and headers it is difficult to tell the difference.
> ] Exactly.
> "Exactly"? Who blocks messages simply because they have List-Unsubscribe
> and similar headers such as those listed in RFC 2369?
You misunderstood. I meant "Exactly -- it's hard to tell the difference
purely from message contents and headers."
> I disagree, because many spammers work hard to remove bad addresses
> from their target lists.
Really? Don't you think it's worth a shot to try to gather hard data?
If you're right, then my idea is no good. If you're wrong, then it is.
Unfortunately, without setting up a system to gather this data, we'll
never know.
> Generalizations such as
> "all spammers have lots of bad addresses in their lists" are as wrong
> as "all spammers use open relays" or "spam involves forged headers."
I never said that. I said I believe that many spammers have lots of bad
addresses, simply based on how they obtain addresses in the first place.
Maybe you're right; I don't know. But we should at least try to find out.
> I think the only way to detect spam runs is to examine passing mail bodies
> and look for those that are substantially identical and therefore bulk.
Bulk != Spam. Any system to detect "similar but not identical"
messages can be thwarted if it uses a checksum scheme, and is too slow
to be practical if it uses more sophisticated message-closeness
measures.
--
David.
_______________________________________________
Asrg mailing list
Asrg@ietf.org
https://www1.ietf.org/mailman/listinfo/asrg