Re: "RFC 2461bis" issue: MTU handling
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: "RFC 2461bis" issue: MTU handling
On 23 okt 2003, at 11:08, Erik Nordmark wrote:
[Having hosts discover and use a larger than standard MTU between them]
This might be useful when the L2 doesn't have any MTU limitations.
For instance, with IP over ATM the default MTU is 9k but AAL5 has a 16
bit length field (if I don't misremember). Thus a pair of nodes can
agree to use e.g. 37.5k MTU.
Isn't the MTU for IP over ATM something around 4500 bytes? And ATM is
mostly a point-to-point protocol where hosts with a larger MTU are
restricted by the need to be compatible with hosts with a regular MTU
isn't relevant.
But for links where the L2 has tigher limits things are quite a bit
more difficuly. Taking Ethernet as an example. Having two hosts say
that they want to use a 9k MTU over Ethernet is fine as long as the
bridges/switches on the path all support that.
Yes, this is an important limitation.
But how can the hosts know that?
The simple solution would be for routers to announce the maximum packet
size to be used on a subnet.
Even if the hosts can determine this (send a 9k packet and see if it
gets through?), what happens when an addition Ethernet switch or wire
comes up (or goes away) and spanning tree changes the L2 path between
those two hosts so that the packets no longer go through the same set
of bridges/switches?
In theory it should be possible to create some kind of layer 2 PMTUD.
But in practice I would expect operators to simply limit the MTU for
the full subnet to the minimum of the MTUs supported by all switches on
the subnet.
If one would want to pursue this I think the best way would be for
IEEE 802 to define a "frame size discovery" protocol at L2 so that
hosts can robustly determine the frame size a given destination
MAC address and be notified when it changes.
Such a capability would enhance a layer 3 neighbor MTU management
capability, but it isn't required. Operators can simply have routers
annouce an upper limit on the MTU that may be used on a subnet. This
doesn't impede the best case scenario where all the switches support a
good sized MTU. Since the number of switches per subnet is typically
much smaller than the number of hosts per subnet, getting all switches
to support jumboframes is significantly simpler than getting all hosts
to do the same.
But since Ethernet jumboframes are non-standard (even though more and
more commonly used it seems) and there is concern about the coverage
of the Ethernet CRC for larger frames, the IEEE has been unvilling to
look into this.
Well we use a non-(IEEE-)standard way to encapsulate IP in ethernet
frames (Ethernet II rather than 802.3) so we're not without precedent
here.
I'm not a math or physics guy, but I'm not necessarily convinced by the
CRC arguments. The ethernet CRC is pretty strong, it's not going to
immediately break down at 9000 bytes or so, especially not in wired
networks with very low bit error rates. And if the IEEE picks this up
there is no reason they can't drop in a stronger CRC. As for the
TCP/UDP checksum... yes, this is a problem (even at 1500 or ~4500 bytes
that many IP over xxx RFCs specify) but why would the IEEE care?
Once you have such L2 capability then a ND option for "I can receive
packets with this MTU" allowing the a larger value than the default
link MTU, would make sense to me. But without the L2 capability it
doesn't seem very useful.
But what would be the usefulness of such a layer 2 feature if higher
layers don't take advantage of it? I think it's the other way around:
having the layer 3 feature is useful in itself, when we have this the
layer 2 feature is a nice bonus.
--------------------------------------------------------------------
IETF IPv6 working group mailing list
ipv6@ietf.org
Administrative Requests: https://www1.ietf.org/mailman/listinfo/ipv6
--------------------------------------------------------------------
Note: Messages sent to this list are the opinions of the senders and do not imply endorsement by the IETF.