Re: [p2pi] Charter and problem statement
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [p2pi] Charter and problem statement



Maybe it is better to discuss if ALTO has to determine legit. of content or not. IMHO, it is not a protocol design problem.

Damien Saucez

Laird Popkin wrote:

Some content owners are optimistic that if they provide high quality content for free (ad supported) that they could not only grow the download volumes with people that don't currently download, but shift significant usage from piracy.

I'm not saying that those estimates are guaranteed to be right, just that it's possible that they could be, and that ALTO should keep that in mind when designing the protocol.

- Laird Popkin, CTO, Pando Networks
 mobile: 646/465-0570

----- Original Message -----
From: "Stas Khirman" <stas at khirman.com>
To: "Laird Popkin" <laird at pando.com>, "Richard Woundy" <Richard_Woundy at cable.comcast.com>
Cc: p2pi at ietf.org
Sent: Tuesday, July 22, 2008 12:01:59 PM (GMT-0500) America/New_York
Subject: RE: [p2pi] Charter and problem statement

I will not be so optimistic....
Just a few numbers to consider:
- From time of system deployment (2003?), Comcast sold 6 billions VoDs
views.
- iTune sold only sold 125 million TV shows and 7 million movies since its
launch (2005?)
- Mininova ( most popular BT portal) facilitated 3.3 billion torrents
downloads ( about 80% - movies and TV shows) in 12 months (May 2007-May
2008)
- ThePirateBay and IsoHunt have in total more daily visitors then Mininova
(and so - facilitated downloads)

-----Original Message-----
From: Laird Popkin [mailto:laird at pando.com]
Sent: Tuesday, July 22, 2008 8:52 AM
To: Richard Woundy
Cc: p2pi at ietf.org; Stas Khirman
Subject: Re: [p2pi] Charter and problem statement

This is a good point. There are estimates that "legit" p2p content could
rapidly grow to be much larger than "pirate" p2p content. For example, if
every home on broadband watched one HD TV show a day, that would be more
data than the internet moves now (or can move from any central location).

- Laird Popkin, CTO, Pando Networks
 mobile: 646/465-0570

----- Original Message -----
From: "Richard Woundy" <Richard_Woundy at cable.comcast.com>
To: "Stas Khirman" <stas at khirman.com>, "Laird Popkin" <laird at pando.com>
Cc: p2pi at ietf.org
Sent: Saturday, July 19, 2008 12:38:31 PM (GMT-0500) America/New_York
Subject: RE: [p2pi] Charter and problem statement

In this use case, inclusion of content id into ALTO request makes no
sense at all but opens a Pandora box of potential liabilities. Making it
optional with end-user manual configuration will not help - even if small
part of users keep it ON, it will create a sufficient reason for DMCA-like
request to filter P2P content.

Maybe the client could determine by out-of-band means (e.g. metadata)
whether it makes sense to send the content ID in an ALTO request.

There is an underlying assumption here that every P2P user is going to
request at least some illicit content (copyright infringing or kiddie
porn). What if P2P was primarily used for "legitimate" content
distribution? How would that assumption influence our requirements?

At the same time, I would caution against assuming that the lack of
content identification in any ALTO request implies that the user is
attempting something illegal. There are obviously many privacy concerns
besides trying to evade one's legal system...

-- Rich

________________________________

From: p2pi-bounces at ietf.org on behalf of Stas Khirman
Sent: Sat 7/19/2008 1:13 AM
To: 'Laird Popkin'; Woundy, Richard
Cc: p2pi at ietf.org
Subject: Re: [p2pi] Charter and problem statement



-----Original Message-----
From: Laird Popkin [mailto:laird at pando.com]
Sent: Friday, July 18, 2008 3:57 AM
To: Richard Woundy
Cc: p2pi at ietf.org; Stas Khirman
Subject: Re: [p2pi] Charter and problem statement

I agree  with Richard - as long as content ID's are optional and
abstracted (e.g. content ID's are not Torrent ID's) I don't think
there's
a privacy issue.
[Stas Khirman]

Laird and Richard,

Thank you for clarifying this issue - I convinced that content id does not
create additional problems in use case when ISP is using cache servers.
Indeed, if any kind of caching implemented on ISP level, they already have
privacy/DMCA questions resolved somehow. It also true for use case of P2P
CDN that use P4P architecture (ALTO server polled by pTracker ). In those
use cases, including content id in ALTO request makes no harm, and can be
quite useful for cache location and [probably] other goals (differentiated
QoS ?)

However, I still worry about use case that seems to be more "popular" ( in
term on number of users) : due variety of [legal] reasons,  ISP does not
install caching at all, but want to use ALTO-powered transport
optimization
for BitTorrent, eDonkey and similar P2P traffic (assuming that major P2P
clients adopted ALTO).

In this use case, inclusion of content id into ALTO request makes no sense
at all but opens a Pandora box of potential liabilities. Making it
optional
with end-user manual configuration will not help - even if small part of
users keep it ON, it will create a sufficient reason for DMCA-like request
to filter P2P content. And it is not just hypothetical speculation - see
for
example http://yro.slashdot.org/article.pl?sid=07/07/05/1512247&from=rss
http://torrentfreak.com/isps-should-block-bittorrent-and-tpb-071226/ .

Probably, "optionally" can be achieved if during ALTO server discovery
stage
( DNS?) client receives some kind of "configuration" flag and will be
mandated NOT TO send content id. Will it pass legal scrutiny? Will be ALTO
server used by DMCA/IFPA as a proof of ISP ability to filter P2P ? I have
no
idea, but assume that the better "DMCA-proof" solution is to separate
"content-sensitive" functionalities (like cache discovery) from "content
agnostic" ones.


There are a number of use cases where metadata related to the content
could be useful in allowing the ALTO server to make decisions. For
example,
an ISP might want to implement policies in their ALTO server such as:

- Don't cache consumer generated content (requires being able to
differentiate between CGC and commercial content).
- Provide different guidance to different applications (requires
knowledge
of p2p application).
- Provide guidance based on distribution of that content (requires ID of
that specific content).

It's a good point that once content is cached the ISP knows a lot about
the content and its distribution. In particular, since it's
participating
in the p2p network it knows the IP addresses of people exchanging the
content, whatever metadata is exposed about the content in the protocol
(e.g. size, number of files, names). And for p2p networks that don't
secure content on caches (Pando does, but some others do not) the ISP
has
the content on the cache server and can inspect it directly. But this is
(IMO, IANAL) similar to the case of an HTTP cache or an email server -
the
ISP is providing a service, and privacy of use of that service is
protected by policy, not by technology.

Given all of that, ALTO can certainly be applied to content that isn't
cached, and it should preserve privacy for that content. :-)

- Laird Popkin, CTO, Pando Networks
 mobile: 646/465-0570

----- Original Message -----
From: "Richard Woundy" <Richard_Woundy at cable.comcast.com>
To: "Stas Khirman" <stas at khirman.com>, "Laird Popkin" <laird at pando.com>
Cc: p2pi at ietf.org
Sent: Thursday, July 17, 2008 10:34:48 PM (GMT-0500) America/New_York
Subject: RE: [p2pi] Charter and problem statement

It seems that we agree on the key point - content identity in ALTO
["peer
selection"] request will be harmful.

It will be very important to get confirmation/rejection of this
assumption
from ISP's representatives.

I've been thinking about this since yesterday, and I'm not really sure I
agree with this statement.

I definitely understand that there would likely be privacy concerns if
ALTO mandated content identification in its messages. Content identities
should be an optional parameter in the ALTO message exchange (if we can
agree that it should be kept at all).

But I would think that having the content identity could help an ALTO
server make a better recommendation, at least for P2P caches.

Consider the case when the ISP is operating an ALTO server as well as a
P2P cache. If there isn't an issue of violating customer privacy, why
would it be undesirable to reveal the content identity to the ALTO
server, when it is necessary to reveal the content identity to the P2P
caching server?

In fact, the P2P caching server knows potentially more valuable
information than the content identity. ALTO would only know about
metadata for content, whereas the P2P cache would have a copy of the
actual content.

To illustrate, suppose that some content's metadata (e.g. torrent)
indicates that the file is "Spiderman" and not much more. It is possible
that the file represents a DVD rip of a Spiderman movie. It is equally
possible that the file represents a digital home video of someone's
child wearing a Spiderman costume for Halloween. Examining the actual
content (as stored on the P2P cache) would help determine which scenario
is true.

-- Rich

-----Original Message-----
From: p2pi-bounces at ietf.org [mailto:p2pi-bounces at ietf.org] On Behalf Of
Stas Khirman
Sent: Wednesday, July 16, 2008 5:55 PM
To: 'Laird Popkin'
Cc: p2pi at ietf.org
Subject: Re: [p2pi] Charter and problem statement

-----Original Message-----
From: Laird Popkin [mailto:laird at pando.com]
Sent: Wednesday, July 16, 2008 11:15 AM

I agree that keeping content identity out of cache location is a good
idea.

[Stas Khirman]
Laird,

It seems that we agree on the key point - content identity in ALTO
["peer
selection"] request will be harmful.

It will be very important to get confirmation/rejection of this
assumption
from ISP's representatives.


That being said, cache location doesn't need to be content-specific,
because one the p2p client connects to the caches, the caches can
determine what they do on a content-specific basis. So ALTO could (for
example) decide which cache servers to send to a user based on the
user's
network location and protocol (e.g. Pando caches might run on
different
servers from Kontiki caches or HTTP caches, etc.). But as long as ALTO
can
return multiple cache servers (i.e. pools) the cache servers can
manage
whether to cache specific content, or where to assign it,
transparently to
ALTO, without ALTO having to know (for cache location) what content is
assigned where. This is good for the caches, as it gives them maximum
flexibility, without having to implement a complex state-change
mechanism
between the cache servers and the ALTO server.

[Stas Khirman]
Lets review a few usage cases to decide if ALTO can locate optimal cache
without content identity.

1.) Cache implemented as a single virtual peer or as a set of peers with
the
same cached content. Certainly, no content knowledge needed to locate a
most
appropriate server. However, such architecture doesn't scale.

2.) Cache implemented as distributed set of servers where given content
may
be located in some subset of nodes. If ALTO return [ordered] list of all
known servers, it will be up to client application to poll them to
discover
which one has appropriate content. Sometimes it will work, but with
large
list it takes significant time, especially harmful for applications that
uses cache for bootstrapped playback.

3.) Cache implemented in master-slave hierarchy. ALTO returns a pointer
to a
master cache (cache director). Upon client connection, master redirects
it
to a slave cache that has appropriate content. However, "master" do not
have
enough knowledge to pick "ALTO optimized" slave server from set of
appropriate ( I assume that popular content may be presented in multiple
cache locations). Master can resolve this issue with a second ALTO
lookup,
but it not different from separated "discovery" and "optimization" calls
as
I suggested.

4.) Cache service implemented by third party CDN ( I assume you referred
to
this specific case).  In this case decision what cache to be used is
under
control on another party (CDN owner) who may have "optimization" policy
quite different then originating ISP (those diminishing value of ISP-own
ALTO service). Probably it is extreme speculation, but in theory, Tier-1
ISP
operated CDN may use "selection" policy that increase its own profit,
but
not decrease cost of originating ISP.




_______________________________________________
p2pi mailing list
p2pi at ietf.org
https://www.ietf.org/mailman/listinfo/p2pi
_______________________________________________
p2pi mailing list
p2pi at ietf.org
https://www.ietf.org/mailman/listinfo/p2pi




_______________________________________________
p2pi mailing list
p2pi at ietf.org
https://www.ietf.org/mailman/listinfo/p2pi

_______________________________________________
p2pi mailing list
p2pi at ietf.org
https://www.ietf.org/mailman/listinfo/p2pi



Note: Messages sent to this list are the opinions of the senders and do not imply endorsement by the IETF.