[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Asrg] differential confidence



Dave CROCKER wrote:
> 
> Chris Lewis wrote:
>>   Each of the numbers is
>> scaled differently in a computation something like this:
>>
>> if (((complaints * cf + contentblocked * bf + trap * tf) / non-blocked)
>>> 1) {
>> 	go block the IP
>> }
>>
>> [Notice that we're not factoring in blocked IP.  Specifically to avoid
>> the thresholder locking up thru positive feedback ;-).  They're blocked
>> anyway, so it doesn't matter.]

>> Where cf, bf, and tf are chosen thru experience and experimentation.

> Relative to my question, what you've said is that you have a scaling factor 
> specifically for the TIS hits.  That lets you tune its specific confidence level.
> 
> But you don't cite any of the "good" attributes, nevermind scaling factors, per 
> your reference to good/bad.

Not quite - the counts in the numerator are "bads" and in the
denominator the "goods".  Each of the counts have scaling factors
(normalized so that "passed through"'s scaling factor is 1).

I don't look at them as confidence indicators, but rather as scaling
factors to derive some sort of notion of "what's the probability of a
given email from this IP being spam?".

I form that equation that way strictly thru force of habit, because
given the headroom I want it to have, pendantic accuracy is irrelevant.
 A different equation for "block this IP?" would be something like this:

(complaints * cf + contentblocked * bf + trap * tf)
--------------------------------------------------- > maxratio
(complaints + contentblocked + trap + passthru)

Where if cf, bf and tf were all 1, is a simple ratio of spam/total.

It's probably easier to comprehend, but, might not have the
characteristics I want.  I'll play with it.

> I'm being pedantic because it was the differential nature of postive vs. 
> negative measurements that intrigued me, in terms of practical impact.

_______________________________________________
Asrg mailing list
Asrg at irtf.org
https://www.irtf.org/mailman/listinfo/asrg