Let's think about the statistical significance a bit. The Net-wide average is about 3 people per host, but most hosts have only one or two people behind them. The discrepancy may well be made up for by a small number of hosts handling very large numbers of people.in principle yes ... however I doubt that the systems at the top of theWe then carefully worked through all the calculations, using the best data that we could obtain -- and we did indeed come to the conclusion that proof-of-work is not a viable proposal :(That's a very interesting paper, thank you. I wonder, however, what the distribution curves are like when "regular correspondents" are exempted from proof-of-work, not just mailing lists. Would it be possible to re-examine the MTA logs for this type of pattern?
curve (sending lots of email per day) would have regular correspondents.
Besides the people running mailing lists, they will be e-commerce
systems sending acknowledgements, hospitals confirming appointments, fax
delivery systems relaying incoming messages etc.
I'm afraid I don't have much insight into how business e-mail patterns go. That's why I'm asking you for the statistics. :)By "regular correspondents" I mean people who know each other wellI don't see why one should expect any correlation between machine speed
enough to send mail regularly, not necessarily frequently - even once a
week over a period of months. I ask this because I expect that users
with slow machines - who would otherwise be the group most
inconvenienced by proof-of-work schemes - send mail that mostly falls
into this category. I don't know, however, how much of the overall
picture is accounted for by these.
and regularity of sending email. Many businesses will not splash out for
admin staff machines, so it is they as well as aged parents who might be
expected to have old kit :)
For the brute-force proof-of-work scheme you assume in the paper, this is undoubtedly true. I'm asking for more statistics to try and reveal whether the ways we've thought of, for making it less brute-force, are viable.indeed so ... though you should note that there is not much differenceFor future work, it might be instructive to identify various non-spam use-cases which appear to have a high proof-of-work load - ie. on the "long tail" of the distribution curves presented - and consider practical ways of relieving or accommodating it.
between spam viability thresholds and the average case, let alone power-
users.
For proof-of-work to look plausible (and not a high-risk strategy) I'dFor my own usage pattern, assuming proof-of-work is exempted for regular correspondents, this is approximately true. I talk almost exclusively to mailing lists and people I know pretty well.
like to see factors of a thousand or more between plausible workloads
for legitimate senders and any economically viable spamming activity :-(