idnits 2.17.1 

draft-ietf-aqm-recommendation-08.txt:

  Checking boilerplate required by RFC 5378 and the IETF Trust (see
  https://trustee.ietf.org/license-info):
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt:
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/checklist :
  ----------------------------------------------------------------------------

  -- The draft header indicates that this document obsoletes RFC2309, but the
     abstract doesn't seem to directly say this.  It does mention RFC2309
     though, so this could be OK.


  Miscellaneous warnings:
  ----------------------------------------------------------------------------

  == The copyright year in the IETF Trust and authors Copyright Line does not
     match the current year

  -- The document date (August 13, 2014) is 3544 days in the past.  Is this
     intentional?


  Checking references for intended status: Best Current Practice
  ----------------------------------------------------------------------------

     (See RFCs 3967 and 4897 for information about using normative references
     to lower-maturity documents in RFCs)

  ** Obsolete normative reference: RFC 5405 (Obsoleted by RFC 8085)

  -- Obsolete informational reference (is this intentional?): RFC  793
     (Obsoleted by RFC 9293)

  -- Obsolete informational reference (is this intentional?): RFC  896
     (Obsoleted by RFC 7805)

  -- Obsolete informational reference (is this intentional?): RFC 2309
     (Obsoleted by RFC 7567)

  -- Obsolete informational reference (is this intentional?): RFC 2460
     (Obsoleted by RFC 8200)

  -- Obsolete informational reference (is this intentional?): RFC 4960
     (Obsoleted by RFC 9260)


     Summary: 1 error (**), 0 flaws (~~), 1 warning (==), 7 comments (--).

     Run idnits with the --verbose option for more detailed information about
     the items above.

--------------------------------------------------------------------------------


2	Network Working Group                                      F. Baker, Ed.
3	Internet-Draft                                             Cisco Systems
4	Obsoletes: 2309 (if approved)                          G. Fairhurst, Ed.
5	Intended status: Best Current Practice            University of Aberdeen
6	Expires: February 14, 2015                               August 13, 2014

8	         IETF Recommendations Regarding Active Queue Management
9	                    draft-ietf-aqm-recommendation-08

11	Abstract

13	   This memo presents recommendations to the Internet community
14	   concerning measures to improve and preserve Internet performance.  It
15	   presents a strong recommendation for testing, standardization, and
16	   widespread deployment of active queue management (AQM) in network
17	   devices, to improve the performance of today's Internet.  It also
18	   urges a concerted effort of research, measurement, and ultimate
19	   deployment of AQM mechanisms to protect the Internet from flows that
20	   are not sufficiently responsive to congestion notification.

22	   The note largely repeats the recommendations of RFC 2309, and
23	   replaces these after fifteen years of experience and new research.

25	Status of This Memo

27	   This Internet-Draft is submitted in full conformance with the
28	   provisions of BCP 78 and BCP 79.

30	   Internet-Drafts are working documents of the Internet Engineering
31	   Task Force (IETF).  Note that other groups may also distribute
32	   working documents as Internet-Drafts.  The list of current Internet-
33	   Drafts is at http://datatracker.ietf.org/drafts/current/.

35	   Internet-Drafts are draft documents valid for a maximum of six months
36	   and may be updated, replaced, or obsoleted by other documents at any
37	   time.  It is inappropriate to use Internet-Drafts as reference
38	   material or to cite them other than as "work in progress."

40	   This Internet-Draft will expire on February 14, 2015.

42	Copyright Notice

44	   Copyright (c) 2014 IETF Trust and the persons identified as the
45	   document authors.  All rights reserved.

47	   This document is subject to BCP 78 and the IETF Trust's Legal
48	   Provisions Relating to IETF Documents
49	   (http://trustee.ietf.org/license-info) in effect on the date of
50	   publication of this document.  Please review these documents
51	   carefully, as they describe your rights and restrictions with respect
52	   to this document.  Code Components extracted from this document must
53	   include Simplified BSD License text as described in Section 4.e of
54	   the Trust Legal Provisions and are provided without warranty as
55	   described in the Simplified BSD License.

57	Table of Contents

59	   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
60	     1.1.  Congestion Collapse . . . . . . . . . . . . . . . . . . .   3
61	     1.2.  Active Queue Management to Manage Latency . . . . . . . .   3
62	     1.3.  Document Overview . . . . . . . . . . . . . . . . . . . .   4
63	     1.4.  Changes to the recommendations of RFC2309 . . . . . . . .   5
64	     1.5.  Requirements Language . . . . . . . . . . . . . . . . . .   6
65	   2.  The Need For Active Queue Management  . . . . . . . . . . . .   6
66	     2.1.  AQM and Multiple Queues . . . . . . . . . . . . . . . . .   9
67	     2.2.  AQM and Explicit Congestion Marking (ECN) . . . . . . . .  10
68	     2.3.  AQM and Buffer Size . . . . . . . . . . . . . . . . . . .  10
69	   3.  Managing Aggressive Flows . . . . . . . . . . . . . . . . . .  11
70	   4.  Conclusions and Recommendations . . . . . . . . . . . . . . .  14
71	     4.1.  Operational deployments SHOULD use AQM procedures . . . .  15
72	     4.2.  Signaling to the transport endpoints  . . . . . . . . . .  15
73	       4.2.1.  AQM and ECN . . . . . . . . . . . . . . . . . . . . .  16
74	     4.3.  AQM algorithms deployed SHOULD NOT require operational
75	           tuning  . . . . . . . . . . . . . . . . . . . . . . . . .  17
76	     4.4.  AQM algorithms SHOULD respond to measured congestion, not
77	           application profiles. . . . . . . . . . . . . . . . . . .  19
78	     4.5.  AQM algorithms SHOULD NOT be dependent on specific
79	           transport protocol behaviours . . . . . . . . . . . . . .  19
80	     4.6.  Interactions with congestion control algorithms . . . . .  20
81	     4.7.  The need for further research . . . . . . . . . . . . . .  21
82	   5.  IANA Considerations . . . . . . . . . . . . . . . . . . . . .  22
83	   6.  Security Considerations . . . . . . . . . . . . . . . . . . .  22
84	   7.  Privacy Considerations  . . . . . . . . . . . . . . . . . . .  22
85	   8.  Acknowledgements  . . . . . . . . . . . . . . . . . . . . . .  22
86	   9.  References  . . . . . . . . . . . . . . . . . . . . . . . . .  23
87	     9.1.  Normative References  . . . . . . . . . . . . . . . . . .  23
88	     9.2.  Informative References  . . . . . . . . . . . . . . . . .  24
89	   Appendix A.  Change Log . . . . . . . . . . . . . . . . . . . . .  27
90	   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  28

92	1.  Introduction

94	   The Internet protocol architecture is based on a connectionless end-
95	   to-end packet service using the Internet Protocol, whether IPv4
96	   [RFC0791] or IPv6 [RFC2460].  The advantages of its connectionless
97	   design: flexibility and robustness, have been amply demonstrated.
98	   However, these advantages are not without cost: careful design is
99	   required to provide good service under heavy load.  In fact, lack of
100	   attention to the dynamics of packet forwarding can result in severe
101	   service degradation or "Internet meltdown".  This phenomenon was
102	   first observed during the early growth phase of the Internet in the
103	   mid 1980s [RFC0896][RFC0970], and is technically called "congestion
104	   collapse" and was a key focus of RFC2309.

106	   Since 1998, when RFC2309 was written, the Internet has become used
107	   for a variety of traffic.  In the current Internet low latency is
108	   extremely important for many interactive and transaction-based
109	   applications.  The same type of technology that RFC2309 advocated for
110	   combating congestion collapse is also effective at limiting delays to
111	   reduce the interaction delay experienced by applications.  While
112	   there is still a need to avoid congestion collapse, there is now also
113	   a focus on reducing network latency using the same technology.

115	1.1.  Congestion Collapse

117	   The original fix for Internet meltdown was provided by Van Jacobsen.
118	   Beginning in 1986, Jacobsen developed the congestion avoidance
119	   mechanisms [Jacobson88] that are now required for implementations of
120	   the Transport Control Protocol (TCP) [RFC0768] [RFC1122].  These
121	   mechanisms operate in Internet hosts to cause TCP connections to
122	   "back off" during congestion.  We say that TCP flows are "responsive"
123	   to congestion signals (i.e., packets that are dropped or marked with
124	   explicit congestion notification [RFC3168]).  It is primarily these
125	   TCP congestion avoidance algorithms that prevent the congestion
126	   collapse of today's Internet.  Similar algorithms are specified for
127	   other non-TCP transports.

129	   However, that is not the end of the story.  Considerable research has
130	   been done on Internet dynamics since 1988, and the Internet has
131	   grown.  It has become clear that the congestion avoidance mechanisms
132	   [RFC5681], while necessary and powerful, are not sufficient to
133	   provide good service in all circumstances.  Basically, there is a
134	   limit to how much control can be accomplished from the edges of the
135	   network.  Some mechanisms are needed in the network devices to
136	   complement the endpoint congestion avoidance mechanisms.  These
137	   mechanisms may be implemented in network devices that include
138	   routers, switches, and other network middleboxes.

140	1.2.  Active Queue Management to Manage Latency

142	   Internet latency has become a focus of attention to increase the
143	   responsiveness of Internet applications and protocols.  One major
144	   source of delay is the build-up of queues in network devices.

146	   Queueing occurs whenever the arrival rate of data at the ingress to a
147	   device exceeds the current egress rate.  Such queueing is normal in a
148	   packet-switched network and often necessary to absorb bursts in
149	   transmission and perform statistical multiplexing of traffic, but
150	   excessive queueing can lead to unwanted delay, reducing the
151	   performance of some Internet applications.

153	   RFC 2309 introduced the concept of "Active Queue Management" (AQM), a
154	   > class of technologies that, by signaling to common congestion-
155	   controlled transports such as TCP, manages the size of queues that
156	   build in network buffers.  RFC 2309 also describes a specific AQM
157	   algorithm, Random Early Detection (RED), and recommends that this be
158	   widely implemented and used by default in routers.

160	   With an appropriate set of parameters, RED is an effective algorithm.
161	   However, dynamically predicting this set of parameters was found to
162	   be difficult.  As a result, RED has not been enabled by default, and
163	   its present use in the Internet is limited.  Other AQM algorithms
164	   have been developed since RC2309 was published, some of which are
165	   self-tuning within a range of applicability.  Hence, while this memo
166	   continues to recommend the deployment of AQM, it no longer recommends
167	   that RED or any other specific algorithm is used as a default;
168	   instead it provides recommendations on how to select appropriate
169	   algorithms and recommends that algorithms should be used that a
170	   recommended algorithm is able to automate any required tuning for
171	   common deployment scenarios.

173	   Deploying AQM in the network can significantly reduce the latency
174	   across an Internet path and since writing RFC2309, this has become a
175	   key motivation for using AQM in the Internet.  In the context of AQM,
176	   it is useful to distinguish between two related classes of
177	   algorithms: "queue management" versus "scheduling" algorithms.  To a
178	   rough approximation, queue management algorithms manage the length of
179	   packet queues by marking or dropping packets when necessary or
180	   appropriate, while scheduling algorithms determine which packet to
181	   send next and are used primarily to manage the allocation of
182	   bandwidth among flows.  While these two mechanisms are closely
183	   related, they address different performance issues and operate on
184	   different timescales.  Both may be used in combination.

186	1.3.  Document Overview

188	   The discussion in this memo applies to "best-effort" traffic, which
189	   is to say, traffic generated by applications that accept the
190	   occasional loss, duplication, or reordering of traffic in flight.  It
191	   also applies to other traffic, such as real-time traffic that can
192	   adapt its sending rate to reduce loss and/or delay.  It is most
193	   effective when the adaption occurs on time scales of a single Round
194	   Trip Time (RTT) or a small number of RTTs, for elastic traffic
195	   [RFC1633].

197	   Two performance issues are highlighted:

199	   The first issue is the need for an advanced form of queue management
200	   that we call "Active Queue Management", AQM.  Section 2 summarizes
201	   the benefits that active queue management can bring.  A number of AQM
202	   procedures are described in the literature, with different
203	   characteristics.  This document does not recommend any of them in
204	   particular, but does make recommendations that ideally would affect
205	   the choice of procedure used in a given implementation.

207	   The second issue, discussed in Section 4 of this memo, is the
208	   potential for future congestion collapse of the Internet due to flows
209	   that are unresponsive, or not sufficiently responsive, to congestion
210	   indications.  Unfortunately, while scheduling can mitigate some of
211	   the side-effects of sharing a network queue with an unresponsive
212	   flow, there is currently no consensus solution to controlling the
213	   congestion caused by such aggressive flows.  Methods such as
214	   congestion exposure (ConEx) [RFC6789] offer a framework [CONEX] that
215	   can update network devices to alleviate these effects.  Significant
216	   research and engineering will be required before any solution will be
217	   available.  It is imperative that work to mitigate the impact of
218	   unresponsive flows is energetically pursued, to ensure acceptable
219	   performance and the future stability of the Internet.

221	   Section 4 concludes the memo with a set of recommendations to the
222	   Internet community on the use of AQM and recommendations for defining
223	   AQM algorithms.

225	1.4.  Changes to the recommendations of RFC2309

227	   This memo replaces the recommendations in [RFC2309], which resulted
228	   from past discussions of end-to-end performance, Internet congestion,
229	   and RED in the End-to-End Research Group of the Internet Research
230	   Task Force (IRTF).  It follows experience with this and other
231	   algorithms, and the AQM discussion within the IETF [AQM-WG].

233	   While RFC2309 described AQM in terms of the length of a queue.  This
234	   memo changes this, to use AQM to refer to any method that allows
235	   network devices to control either the queue length and/or the mean
236	   time that a packet spends in a queue.

238	   This memo also explicitly obsoletes the recommendation that Random
239	   Early Detection (RED) was to be used as the default AQM mechanism for
240	   the Internet.  This is replaced by a detailed set of recommendations
241	   for selecting an appropriate AQM algorithm.  As in RFC2309, this memo
242	   also motivates the need for continued research, but clarifies the
243	   research with examples appropriate at the time that this memo is
244	   published.

246	1.5.  Requirements Language

248	   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
249	   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
250	   document are to be interpreted as described in [RFC2119].

252	2.  The Need For Active Queue Management

254	   Active Queue Management (AQM) is a method that allows network devices
255	   to control the queue length or the mean time that a packet spends in
256	   a queue.  Although AQM can be applied across a range of deployment
257	   environments, the recommendations in this document are directed to
258	   use in the general Internet.  It is expected that the principles and
259	   guidance are also applicable to a wide range of environments, but may
260	   require tuning for specific types of link/network (e.g. to
261	   accommodate the traffic patterns found in data centres, the
262	   challenges of wireless infrastructure, or the higher delay
263	   encountered on satellite Internet links).  The remainder of this
264	   section identifies the need for AQM and the advantages of deploying
265	   AQM methods.

267	   The traditional technique for managing the queue length in a network
268	   device is to set a maximum length (in terms of packets) for each
269	   queue, accept packets for the queue until the maximum length is
270	   reached, then reject (drop) subsequent incoming packets until the
271	   queue decreases because a packet from the queue has been transmitted.
272	   This technique is known as "tail drop", since the packet that arrived
273	   most recently (i.e., the one on the tail of the queue) is dropped
274	   when the queue is full.  This method has served the Internet well for
275	   years, but it has four important drawbacks:

277	   1.  Full Queues

279	       The tail drop discipline allows queues to maintain a full (or,
280	       almost full) status for long periods of time, since tail drop
281	       signals congestion (via a packet drop) only when the queue has
282	       become full.  It is important to reduce the steady-state queue
283	       size, and this is perhaps the most important goal for queue
284	       management.

286	       The naive assumption might be that there is a simple tradeoff
287	       between delay and throughput, and that the recommendation that
288	       queues be maintained in a "non-full" state essentially translates
289	       to a recommendation that low end-to-end delay is more important
290	       than high throughput.  However, this does not take into account
291	       the critical role that packet bursts play in Internet
292	       performance.  For example, even though TCP constrains the
293	       congestion window of a flow, packets often arrive at network
294	       devices in bursts [Leland94].  If the queue is full or almost
295	       full, an arriving burst will cause multiple packets to be dropped
296	       from the same flow.  Bursts of loss can result in a global
297	       synchronization of flows throttling back, followed by a sustained
298	       period of lowered link utilization, reducing overall throughput
299	       [Flo94], [Zha90]

301	       The goal of buffering in the network is to absorb data bursts and
302	       to transmit them during the (hopefully) ensuing bursts of
303	       silence.  This is essential to permit transmission of bursts of
304	       data.  Normally small queues are preferred in network devices,
305	       with sufficient queue capacity to absorb the bursts.  The
306	       counter-intuitive result is that maintaining normally-small
307	       queues can result in higher throughput as well as lower end-to-
308	       end delay.  In summary, queue limits should not reflect the
309	       steady state queues we want to be maintained in the network;
310	       instead, they should reflect the size of bursts that a network
311	       device needs to absorb.

313	   2.  Lock-Out

315	       In some situations tail drop allows a single connection or a few
316	       flows to monopolize the queue space starving other connection
317	       preventing them from getting room in the queue [Flo92].

319	   3.  Mitigating the Impact of Packet Bursts

321	       Large burst of packets can delay other packets, disrupting the
322	       control loop (e.g. the pacing of flows by the TCP ACK-Clock), and
323	       reducing the performance of flows that share a common bottleneck.

325	   4.  Control loop synchronization

327	       Congestion control, like other end-to-end mechanisms, introduces
328	       a control loop between hosts.  Sessions that share a common
329	       network bottleneck can therefore become synchronised, introducing
330	       periodic disruption (e.g.  jitter/loss). "lock-out" is often also
331	       the result of synchronization or other timing effects

333	   Besides tail drop, two alternative queue management disciplines that
334	   can be applied when a queue becomes full are "random drop on full" or
335	   "head drop on full".  When a new packet arrives at a full queue using
336	   the random drop on full discipline, the network device drops a
337	   randomly selected packet from the queue (which can be an expensive
338	   operation, since it naively requires an O(N) walk through the packet
339	   queue).  When a new packet arrives at a full queue using the head
340	   drop on full discipline, the network device drops the packet at the
341	   front of the queue [Lakshman96].  Both of these solve the lock-out
342	   problem, but neither solves the full-queues problem described above.

344	   We know in general how to solve the full-queues problem for
345	   "responsive" flows, i.e., those flows that throttle back in response
346	   to congestion notification.  In the current Internet, dropped packets
347	   provide a critical mechanism indicating congestion notification to
348	   hosts.  The solution to the full-queues problem is for network
349	   devices to drop or ECN-mark packets before a queue becomes full, so
350	   that hosts can respond to congestion before buffers overflow.  We
351	   call such a proactive approach AQM.  By dropping or ECN-marking
352	   packets before buffers overflow, AQM allows network devices to
353	   control when and how many packets to drop.

355	   In summary, an active queue management mechanism can provide the
356	   following advantages for responsive flows.

358	   1.  Reduce number of packets dropped in network devices

360	       Packet bursts are an unavoidable aspect of packet networks
361	       [Willinger95].  If all the queue space in a network device is
362	       already committed to "steady state" traffic or if the buffer
363	       space is inadequate, then the network device will have no ability
364	       to buffer bursts.  By keeping the average queue size small, AQM
365	       will provide greater capacity to absorb naturally-occurring
366	       bursts without dropping packets.

368	       Furthermore, without AQM, more packets will be dropped when a
369	       queue does overflow.  This is undesirable for several reasons.
370	       First, with a shared queue and the tail drop discipline, this can
371	       result in unnecessary global synchronization of flows, resulting
372	       in lowered average link utilization, and hence lowered network
373	       throughput.  Second, unnecessary packet drops represent a waste
374	       of network capacity on the path before the drop point.

376	       While AQM can manage queue lengths and reduce end-to-end latency
377	       even in the absence of end-to-end congestion control, it will be
378	       able to reduce packet drops only in an environment that continues
379	       to be dominated by end-to-end congestion control.

381	   2.  Provide a lower-delay interactive service

383	       By keeping a small average queue size, AQM will reduce the delays
384	       experienced by flows.  This is particularly important for
385	       interactive applications such as short web transfers, POP/IMAP,
386	       DNS, terminal traffic (telnet, ssh, mosh, RDP, etc), gaming or
387	       interactive audio-video sessions, whose subjective (and
388	       objective) performance is better when the end-to-end delay is
389	       low.

391	   3.  Avoid lock-out behavior

393	       AQM can prevent lock-out behavior by ensuring that there will
394	       almost always be a buffer available for an incoming packet.  For
395	       the same reason, AQM can prevent a bias against low capacity, but
396	       highly bursty, flows.

398	       Lock-out is undesirable because it constitutes a gross unfairness
399	       among groups of flows.  However, we stop short of calling this
400	       benefit "increased fairness", because general fairness among
401	       flows requires per-flow state, which is not provided by queue
402	       management.  For example, in a network device using AQM with only
403	       FIFO scheduling, two TCP flows may receive very different share
404	       of the network capacity simply because they have different round-
405	       trip times [Floyd91], and a flow that does not use congestion
406	       control may receive more capacity than a flow that does.  AQM can
407	       therefore be combined with a scheduling mechanism that divides
408	       network traffic between multiple queues (section 2.1).

410	   4.  Reduce the probability of control loop synchronization

412	       The probability of network control loop synchronization can be
413	       reduced if network devices introduce randomness in the AQM
414	       functions that trigger congestion avoidance at the sending host.

416	2.1.  AQM and Multiple Queues

418	   A network device may use per-flow or per-class queuing with a
419	   scheduling algorithm to either prioritize certain applications or
420	   classes of traffic, limit the rate of transmission, or to provide
421	   isolation between different traffic flows within a common class.  For
422	   example, a router may maintain per-flow state to achieve general
423	   fairness by a per-flow scheduling algorithm such as various forms of
424	   Fair Queueing (FQ) [Dem90] [Sut99], including Weighted Fair Queuing
425	   (WFQ), Stochastic Fairness Queueing (SFQ) [McK90] Deficit Round Robin
426	   (DRR) [Shr96], [Nic12], and/or a Class-Based Queue scheduling
427	   algorithm such as CBQ [Floyd95].  Hierarchical queues may also be
428	   used e.g., as a part of a Hierarchical Token Bucket (HTB), or
429	   Hierarchical Fair Service Curve (HFSC) [Sto97].  These methods are
430	   also used to realize a range of Quality of Service (QoS) behaviours
431	   designed to meet the need of traffic classes (e.g. using the
432	   integrated or differentiated service models).

434	   AQM is needed even for network devices that use per-flow or per-class
435	   queuing, because scheduling algorithms by themselves do not control
436	   the overall queue size or the size of individual queues.  AQM
437	   mechanisms might need to control the overall queue sizes, to ensure
438	   that arriving bursts can be accommodated without dropping packets.
439	   AQM should also be used to control the queue size for each individual
440	   flow or class, so that they do not experience unnecessarily high
441	   delay.  Using a combination of AQM and scheduling between multiple
442	   queues has been shown to offer good results in experimental and some
443	   types of operational use.

445	   In short, scheduling algorithms and queue management should be seen
446	   as complementary, not as replacements for each other.

448	2.2.  AQM and Explicit Congestion Marking (ECN)

450	   An AQM method may use Explicit Congestion Notification (ECN)
451	   [RFC3168] instead of dropping to mark packets under mild or moderate
452	   congestion.  ECN-marking can allow a network device to signal
453	   congestion at a point before a transport experiences congestion loss
454	   or additional queuing delay [ECN-Benefit].  Section 4.2.1 describes
455	   some of the benefits of using ECN with AQM.

457	2.3.  AQM and Buffer Size

459	   It is important to differentiate the choice of buffer size for a
460	   queue in a switch/router or other network device, and the
461	   threshold(s) and other parameters that determine how and when an AQM
462	   algorithm operates.  The optimum buffer size is a function of
463	   operational requirements and should generally be sized to be
464	   sufficient to buffer the largest normal traffic burst that is
465	   expected.  This size depends on the number and burstiness of traffic
466	   arriving at the queue and the rate at which traffic leaves the queue.

468	   One objective of AQM is to minimize the effect of lock-out, where one
469	   flow prevents other flows from effectively gaining capacity.  This
470	   need can be illustrated by a simple example of drop-tail queuing when
471	   a new TCP flow injects packets into a queue that happens to be almost
472	   full.  A TCP flow's congestion control algorithm [RFC5681] increases
473	   the flow rate to maximize its effective window.  This builds a queue
474	   in the network, inducing latency to the flow and other flows that
475	   share this queue.  Once a drop-tail queue fills, there will also be
476	   loss.  A new flow, sending its initial burst, has an enhanced
477	   probability of filling the remaining queue and dropping packets.  As
478	   a result, the new flow can be effectively prevented from effectively
479	   sharing the queue for a period of many RTTs.  In contrast, AQM can
480	   minimize the mean queue depth and therefore reducing the probability
481	   that competing sessions can materially prevent each other from
482	   performing well.

484	   AQM frees a designer from having to limit the buffer space assigned
485	   to a queue to achieve acceptable performance, allowing allocation of
486	   sufficient buffering to satisfy the needs of the particular traffic
487	   pattern.  Different types of traffic and deployment scenarios will
488	   lead to different requirements.  The choice of AQM algorithm and
489	   associated parameters is therefore a function of the way in which
490	   congestion is experienced and the required reaction to achieve
491	   acceptable performance.  This latter is the primary topic of the
492	   following sections.

494	3.  Managing Aggressive Flows

496	   One of the keys to the success of the Internet has been the
497	   congestion avoidance mechanisms of TCP.  Because TCP "backs off"
498	   during congestion, a large number of TCP connections can share a
499	   single, congested link in such a way that link bandwidth is shared
500	   reasonably equitably among similarly situated flows.  The equitable
501	   sharing of bandwidth among flows depends on all flows running
502	   compatible congestion avoidance algorithms, i.e., methods conformant
503	   with the current TCP specification [RFC5681].

505	   In this document a flow is known as "TCP-friendly" when it has a
506	   congestion response that approximates the average response expected
507	   of a TCP flow.  One example method of a TCP-friendly scheme is the
508	   TCP-Friendly Rate Control algorithm [RFC5348].  In this document, the
509	   term is used more generally to describe this and other algorithms
510	   that meet these goals.

512	   There are a variety of types of network flow.  Some convenient
513	   classes that describe flows are: (1) TCP Friendly flows, (2)
514	   unresponsive flows, i.e., flows that do not slow down when congestion
515	   occurs, and (3) flows that are responsive but are less responsive to
516	   congestion than TCP.  The last two classes contain more aggressive
517	   flows that can pose significant threats to Internet performance.

519	   1.  TCP-Friendly flows

521	       A TCP-friendly flow responds to congestion notification within a
522	       small number of path Round Trip Times (RTT), and in steady-state
523	       it uses no more capacity than a conformant TCP running under
524	       comparable conditions (drop rate, RTT, packet size, etc.).  This
525	       is described in the remainder of the document.

527	   2.  Non-Responsive Flows
528	       The User Datagram Protocol (UDP) [RFC0768] provides a minimal,
529	       best-effort transport to applications and upper-layer protocols
530	       (both simply called "applications" in the remainder of this
531	       document) and does not itself provide mechanisms to prevent
532	       congestion collapse and establish a degree of fairness [RFC5405].

534	       There is a growing set of UDP-based applications whose congestion
535	       avoidance algorithms are inadequate or nonexistent (i.e, a flow
536	       that does not throttle its sending rate when it experiences
537	       congestion).  Examples include some UDP streaming applications
538	       for packet voice and video, and some multicast bulk data
539	       transport.  If no action is taken, such unresponsive flows could
540	       lead to a new congestion collapse.  Some applications can even
541	       increase their traffic volume in response to congestion (e.g. by
542	       adding forward error correction when loss is experienced), with
543	       the possibility that they contribute to congestion collapse.

545	       In general, UDP-based applications need to incorporate effective
546	       congestion avoidance mechanisms [RFC5405].  Further research and
547	       development of ways to accomplish congestion avoidance for
548	       presently unresponsive applications continue to be important.
549	       Network devices need to be able to protect themselves against
550	       unresponsive flows, and mechanisms to accomplish this must be
551	       developed and deployed.  Deployment of such mechanisms would
552	       provide an incentive for all applications to become responsive by
553	       either using a congestion-controlled transport (e.g.  TCP, SCTP
554	       [RFC4960] and DCCP [RFC4340].) or by incorporating their own
555	       congestion control in the application [RFC5405], [RFC6679].

557	       Lastly, some applications (e.g. current web browsers) open a
558	       large numbers of short TCP flows for a single session.  This can
559	       lead to each individual flow spending the majority of time in the
560	       exponential TCP slow start phase, rather than in TCP congestion
561	       avoidance.  The resulting traffic aggregate can therefore be much
562	       less responsive than a single standard TCP flow.

564	   3.  Transport Flows that are less responsive than TCP

566	       A second threat is posed by transport protocol implementations
567	       that are responsive to congestion, but, either deliberately or
568	       through faulty implementation, reduce less than a TCP flow would
569	       have done in response to congestion.  This covers a spectrum of
570	       behaviours between (1) and (2).  If applications are not
571	       sufficiently responsive to congestion signals, they may gain an
572	       unfair share of the available network capacity.

574	       For example, the popularity of the Internet has caused a
575	       proliferation in the number of TCP implementations.  Some of
576	       these may fail to implement the TCP congestion avoidance
577	       mechanisms correctly because of poor implementation.  Others may
578	       deliberately be implemented with congestion avoidance algorithms
579	       that are more aggressive in their use of capacity than other TCP
580	       implementations; this would allow a vendor to claim to have a
581	       "faster TCP".  The logical consequence of such implementations
582	       would be a spiral of increasingly aggressive TCP implementations,
583	       leading back to the point where there is effectively no
584	       congestion avoidance and the Internet is chronically congested.

586	       Another example could be an RTP/UDP video flow that uses an
587	       adaptive codec, but responds incompletely to indications of
588	       congestion or responds over an excessively long time period.
589	       Such flows are unlikely to be responsive to congestion signals in
590	       a timeframe comparable to a small number of end-to-end
591	       transmission delays.  However, over a longer timescale, perhaps
592	       seconds in duration, they could moderate their speed, or increase
593	       their speed if they determine capacity to be available.

595	       Tunneled traffic aggregates carrying multiple (short) TCP flows
596	       can be more aggressive than standard bulk TCP.  Applications
597	       (e.g. web browsers and peer-to-peer file-sharing) have exploited
598	       this by opening multiple connections to the same endpoint.

600	   The projected increase in the fraction of total Internet traffic for
601	   more aggressive flows in classes 2 and 3 could pose a threat to the
602	   performance of the future Internet.  There is therefore an urgent
603	   need for measurements of current conditions and for further research
604	   into the ways of managing such flows.  This raises many difficult
605	   issues in finding methods with an acceptable overhead cost that can
606	   identify and isolate unresponsive flows or flows that are less
607	   responsive than TCP.  Finally, there is as yet little measurement or
608	   simulation evidence available about the rate at which these threats
609	   are likely to be realized, or about the expected benefit of
610	   algorithms for managing such flows.

612	   Another topic requiring consideration is the appropriate granularity
613	   of a "flow" when considering a queue management method.  There are a
614	   few "natural" answers: 1) a transport (e.g.  TCP or UDP) flow (source
615	   address/port, destination address/port, protocol); 2) Differentiated
616	   Services Code Point, DSCP; 3) a source/destination host pair (IP
617	   address); 4) a given source host or a given destination host, or
618	   various combinations of the above; 5) a subscriber or site receiving
619	   the Internet service (enterprise or residential).

621	   The source/destination host pair gives an appropriate granularity in
622	   many circumstances, However, different vendors/providers use
623	   different granularities for defining a flow (as a way of
624	   "distinguishing" themselves from one another), and different
625	   granularities may be chosen for different places in the network.  It
626	   may be the case that the granularity is less important than the fact
627	   that a network device needs to be able to deal with more unresponsive
628	   flows at *some* granularity.  The granularity of flows for congestion
629	   management is, at least in part, a question of policy that needs to
630	   be addressed in the wider IETF community.

632	4.  Conclusions and Recommendations

634	   The IRTF, in publishing [RFC2309], and the IETF in subsequent
635	   discussion, has developed a set of specific recommendations regarding
636	   the implementation and operational use of AQM procedures.  The
637	   recommendations provided by this document are summarised as:

639	   1.  Network devices SHOULD implement some AQM mechanism to manage
640	       queue lengths, reduce end-to-end latency, and avoid lock-out
641	       phenomena within the Internet.

643	   2.  Deployed AQM algorithms SHOULD support Explicit Congestion
644	       Notification (ECN) as well as loss to signal congestion to
645	       endpoints.

647	   3.  The algorithms that the IETF recommends SHOULD NOT require
648	       operational (especially manual) configuration or tuning.

650	   4.  AQM algorithms SHOULD respond to measured congestion, not
651	       application profiles.

653	   5.  AQM algorithms SHOULD NOT interpret specific transport protocol
654	       behaviours.

656	   6.  Transport protocol congestion control algorithms SHOULD maximize
657	       their use of available capacity (when there is data to send)
658	       without incurring undue loss or undue round trip delay.

660	   7.  Research, engineering, and measurement efforts are needed
661	       regarding the design of mechanisms to deal with flows that are
662	       unresponsive to congestion notification or are responsive, but
663	       are more aggressive than present TCP.

665	   These recommendations are expressed using the word "SHOULD".  This is
666	   in recognition that there may be use cases that have not been
667	   envisaged in this document in which the recommendation does not
668	   apply.  Therefore, care should be taken in concluding that one's use
669	   case falls in that category; during the life of the Internet, such
670	   use cases have been rarely if ever observed and reported.  To the
671	   contrary, available research [Choi04] says that even high speed links
672	   in network cores that are normally very stable in depth and behavior
673	   experience occasional issues that need moderation.  The
674	   recommendations are detailed in the following sections.

676	4.1.  Operational deployments SHOULD use AQM procedures

678	   AQM procedures are designed to minimize the delay and buffer
679	   exhaustion induced in the network by queues that have filled as a
680	   result of host behavior.  Marking and loss behaviors provide a signal
681	   that buffers within network devices are becoming unnecessarily full,
682	   and that the sender would do well to moderate its behavior.

684	   The use of scheduling mechanisms, such as priority queuing, classful
685	   queuing, and fair queuing, is often effective in networks to help a
686	   network serve the needs of a range of applications.  Network
687	   operators can use these methods to manage traffic passing a choke
688	   point.  This is discussed in [RFC2474] and [RFC2475].  When
689	   scheduling is used AQM should be applied across the classes or flows
690	   as well as within each class or flow:

692	   o  AQM mechanisms need to control the overall queue sizes, to ensure
693	      that arriving bursts can be accommodated without dropping packets.

695	   o  AQM mechanisms need to allow combination with other mechanisms,
696	      such as scheduling, to allow implementation of policies for
697	      providing fairness between different flows.

699	   o  AQM should be used to control the queue size for each individual
700	      flow or class, so that they do not experience unnecessarily high
701	      delay.

703	4.2.  Signaling to the transport endpoints

705	   There are a number of ways a network device may signal to the end
706	   point that the network is becoming congested and trigger a reduction
707	   in rate.  The signalling methods include:

709	   o  Delaying transport segments (packets) in flight, such as in a
710	      queue.

712	   o  Dropping transport segments (packets) in transit.

714	   o  Marking transport segments (packets), such as using Explicit
715	      Congestion Control[RFC3168] [RFC4301] [RFC4774] [RFC6040]
716	      [RFC6679].

718	   Increased network latency is used as an implicit signal of
719	   congestion.  E.g., in TCP additional delay can affect ACK Clocking
720	   and has the result of reducing the rate of transmission of new data.
721	   In the Real Time Protocol (RTP), network latency impacts the RTCP-
722	   reported RTT and increased latency can trigger a sender to adjust its
723	   rate.  Methods such as Low Extra Delay Background Transport (LEDBAT)
724	   [RFC6817] assume increased latency as a primary signal of congestion.
725	   Appropriate use of delay-based methods and the implications of AQM
726	   presently remains an area for further research.

728	   It is essential that all Internet hosts respond to loss [RFC5681],
729	   [RFC5405][RFC4960][RFC4340].  Packet dropping by network devices that
730	   are under load has two effects: It protects the network, which is the
731	   primary reason that network devices drop packets.  The detection of
732	   loss also provides a signal to a reliable transport (e.g.  TCP, SCTP)
733	   that there is potential congestion using a pragmatic heuristic; "when
734	   the network discards a message in flight, it may imply the presence
735	   of faulty equipment or media in a path, and it may imply the presence
736	   of congestion.  To be conservative, a transport must assume it may be
737	   the latter."  Unreliable transports (e.g. using UDP) need to
738	   similarly react to loss [RFC5405]

740	   Network devices SHOULD use an AQM algorithm to determine the packets
741	   that are marked or discarded due to congestion.  Procedures for
742	   dropping or marking packets within the network need to avoid
743	   increasing synchronization events, and hence randomness SHOULD be
744	   introduced in the algorithms that generate these congestion signals
745	   to the endpoints.

747	   Loss also has an effect on the efficiency of a flow and can
748	   significantly impact some classes of application.  In reliable
749	   transports the dropped data must be subsequently retransmitted.
750	   While other applications/transports may adapt to the absence of lost
751	   data, this still implies inefficient use of available capacity and
752	   the dropped traffic can affect other flows.  Hence, congestion
753	   signalling by loss is not entirely positive; it is a necessary evil.

755	4.2.1.  AQM and ECN

757	   Explicit Congestion Notification (ECN) [RFC4301] [RFC4774] [RFC6040]
758	   [RFC6679] is a network-layer function that allows a transport to
759	   receive network congestion information from a network device without
760	   incurring the unintended consequences of loss.  ECN includes both
761	   transport mechanisms and functions implemented in network devices,
762	   the latter rely upon using AQM to decider when and whether to ECN-
763	   mark.

765	   Congestion for ECN-capable transports is signalled by a network
766	   device setting the "Congestion Experienced (CE)" codepoint in the IP
767	   header.  This codepoint is noted by the remote receiving end point
768	   and signalled back to the sender using a transport protocol
769	   mechanism, allowing the sender to trigger timely congestion control.
770	   The decision to set the CE codepoint requires an AQM algorithm
771	   configured with a threshold.  Non-ECN capable flows (the default) are
772	   dropped under congestion.

774	   Network devices SHOULD use an AQM algorithm that marks ECN-capable
775	   traffic when making decisions about the response to congestion.
776	   Network devices need to implement this method by marking ECN-capable
777	   traffic or by dropping non-ECN-capable traffic.

779	   Safe deployment of ECN requires that network devices drop excessive
780	   traffic, even when marked as originating from an ECN-capable
781	   transport.  This is a necessary safety precaution because:

783	   1.  A non-conformant, broken or malicious receiver could conceal an
784	       ECN mark, and not report this to the sender;

786	   2.  A non-conformant, broken or malicious sender could ignore a
787	       reported ECN mark, as it could ignore a loss without using ECN;

789	   3.  A malfunctioning or non-conforming network device may "hide" an
790	       ECN mark (or fail to correctly set the ECN codepoint at an egress
791	       of a network tunnel).

793	   In normal operation, such cases should be very uncommon, however
794	   overload protection is desirable to protect traffic from
795	   misconfigured or malicious use of ECN (e.g. a denial-of-service
796	   attack that generates ECN-capable traffic that is unresponsive to CE-
797	   marking).

799	   An AQM algorithm that supports ECN needs to define the threshold and
800	   algorithm for ECN-marking.  This threshold MAY differ from that used
801	   for dropping packets that are not marked as ECN-capable, and SHOULD
802	   be configurable.

804	   Network devices SHOULD use an algorithm to drop excessive traffic
805	   (e.g. at some level above the threshold for CE-marking), even when
806	   the packets are marked as originating from an ECN-capable transport.

808	4.3.  AQM algorithms deployed SHOULD NOT require operational tuning

810	   A number of AQM algorithms have been proposed.  Many require some
811	   form of tuning or setting of parameters for initial network
812	   conditions.  This can make these algorithms difficult to use in
813	   operational networks.

815	   AQM algorithms need to consider both "initial conditions" and
816	   "operational conditions".  The former includes values that exist
817	   before any experience is gathered about the use of the algorithm,
818	   such as the configured speed of interface, support for full duplex
819	   communication, interface MTU and other properties of the link.  The
820	   latter includes information observed from monitoring the size of the
821	   queue, experienced queueing delay, rate of packet discard, etc.

823	   This document therefore specifies that AQM algorithms that are
824	   proposed for deployment in the Internet have the following
825	   properties:

827	   o  SHOULD NOT require tuning of initial or configuration parameters.
828	      An algorithm needs to provide a default behaviour that auto-tunes
829	      to a reasonable performance for typical network operational
830	      conditions.  This is expected to ease deployment and operation.
831	      Initial conditions, such as the interface rate and MTU size or
832	      other values derived from these, MAY be required by an AQM
833	      algorithm.

835	   o  MAY support further manual tuning that could improve performance
836	      in a specific deployed network.  Algorithms that lack such
837	      variables are acceptable, but if such variables exist, they SHOULD
838	      be externalized (made visible to the operator).  Guidance needs to
839	      be provided on the cases where auto-tuning is unlikely to achieve
840	      acceptable performance and to identify the set of parameters that
841	      can be tuned.  For example, the expected response of an algorithm
842	      may need to be configured to accommodate the largest expected Path
843	      RTT, since this value can not be known at initialization.  This
844	      guidance is expected to enable the algorithm to be deployed in
845	      networks that have specific characteristics (paths with variable/
846	      larger delay; networks where capacity is impacted by interactions
847	      with lower layer mechanisms, etc).

849	   o  MAY provide logging and alarm signals to assist in identifying if
850	      an algorithm using manual or auto-tuning is functioning as
851	      expected. (e.g., this could be based on an internal consistency
852	      check between input, output, and mark/drop rates over time).  This
853	      is expected to encourage deployment by default and allow operators
854	      to identify potential interactions with other network functions.

856	   Hence, self-tuning algorithms are to be preferred.  Algorithms
857	   recommended for general Internet deployment by the IETF need to be
858	   designed so that they do not require operational (especially manual)
859	   configuration or tuning.

861	4.4.  AQM algorithms SHOULD respond to measured congestion, not
862	      application profiles.

864	   Not all applications transmit packets of the same size.  Although
865	   applications may be characterized by particular profiles of packet
866	   size this should not be used as the basis for AQM (see next section).
867	   Other methods exist, e.g.  Differentiated Services queueing, Pre-
868	   Congestion Notification (PCN) [RFC5559], that can be used to
869	   differentiate and police classes of application.  Network devices may
870	   combine AQM with these traffic classification mechanisms and perform
871	   AQM only on specific queues within a network device.

873	   An AQM algorithm should not deliberately try to prejudice the size of
874	   packet that performs best (i.e.  Preferentially drop/mark based only
875	   on packet size).  Procedures for selecting packets to mark/drop
876	   SHOULD observe the actual or projected time that a packet is in a
877	   queue (bytes at a rate being an analog to time).  When an AQM
878	   algorithm decides whether to drop (or mark) a packet, it is
879	   RECOMMENDED that the size of the particular packet should not be
880	   taken into account [RFC7141].

882	   Applications (or transports) generally know the packet size that they
883	   are using and can hence make their judgments about whether to use
884	   small or large packets based on the data they wish to send and the
885	   expected impact on the delay or throughput, or other performance
886	   parameter.  When a transport or application responds to a dropped or
887	   marked packet, the size of the rate reduction should be proportionate
888	   to the size of the packet that was sent [RFC7141].

890	   AQM-enabled system MAY instantiate different instances of an AQM
891	   algorithm to be applied within the same traffic class.  Traffic
892	   classes may be differentiated based on an Access Control List (ACL),
893	   the packet Differentiated Services Code Point (DSCP) [RFC5559],
894	   enabling use of the ECN field (i.e. any of ECT(0), ECT(1) or
895	   CE)[RFC3168] [RFC4774], a multi-field (MF) classifier that combines
896	   the values of a set of protocol fields (e.g.  IP address, transport,
897	   ports) or an equivalent codepoint at a lower layer.  This
898	   recommendation goes beyond what is defined in RFC 3168, by allowing
899	   that an implementation MAY use more than one instance of an AQM
900	   algorithm to handle both ECN-capable and non-ECN-capable packets.

902	4.5.  AQM algorithms SHOULD NOT be dependent on specific transport
903	      protocol behaviours

905	   In deploying AQM, network devices need to support a range of Internet
906	   traffic and SHOULD NOT make implicit assumptions about the
907	   characteristics desired by the set transports/applications the
908	   network supports.  That is, AQM methods should be opaque to the
909	   choice of transport and application.

911	   AQM algorithms are often evaluated by considering TCP [RFC0793] with
912	   a limited number of applications.  Although TCP is the predominant
913	   transport in the Internet today, this no longer represents a
914	   sufficient selection of traffic for verification.  There is
915	   significant use of UDP [RFC0768] in voice and video services, and
916	   some applications find utility in SCTP [RFC4960] and DCCP [RFC4340].
917	   Hence, AQM algorithms should also demonstrate operation with
918	   transports other than TCP and need to consider a variety of
919	   applications.  Selection of AQM algorithms also needs to consider use
920	   of tunnel encapsulations that may carry traffic aggregates.

922	   AQM algorithms SHOULD NOT target or derive implicit assumptions about
923	   the characteristics desired by specific transports/applications.
924	   Transports and applications need to respond to the congestion signals
925	   provided by AQM (i.e. dropping or ECN-marking) in a timely manner
926	   (within a few RTT at the latest).

928	4.6.  Interactions with congestion control algorithms

930	   Applications and transports need to react to received implicit or
931	   explicit signals that indicate the presence of congestion.  This
932	   section identifies issues that can impact the design of transport
933	   protocols when using paths that use AQM.

935	   Transport protocols and applications need timely signals of
936	   congestion.  The time taken to detect and respond to congestion is
937	   increased when network devices queue packets in buffers.  It can be
938	   difficult to detect tail losses at a higher layer and this may
939	   sometimes require transport timers or probe packets to detect and
940	   respond to such loss.  Loss patterns may also impact timely
941	   detection, e.g. the time may be reduced when network devices do not
942	   drop long runs of packets from the same flow.

944	   A common objective of an elastic transport congestion control
945	   protocol is to allow an application to deliver the maximum rate of
946	   data without inducing excessive delays when packets are queued in a
947	   buffers within the network.  To achieve this, a transport should try
948	   to operate at rate below the inflexion point of the load/delay curve
949	   (the bend of what is sometimes called a "hockey-stick" curve)
950	   [Jain94].  When the congestion window allows the load to approach
951	   this bend, the end-to-end delay starts to rise - a result of
952	   congestion, as packets probabilistically arrive at non-overlapping
953	   times.  On the one hand, a transport that operates above this point
954	   can experience congestion loss and could also trigger operator
955	   activities, such as those discussed in [RFC6057].  On the other hand,
956	   a flow may achieve both near-maximum throughput and low latency when
957	   it operates close to this knee point, with minimal contribution to
958	   router congestion.  Choice of an appropriate rate/congestion window
959	   can therefore significantly impact the loss and delay experienced by
960	   a flow and will impact other flows that share a common network queue.

962	   Some applications may send less than permitted by the congestion
963	   control window (or rate).  Examples include multimedia codecs that
964	   stream at some natural rate (or set of rates) or an application that
965	   is naturally interactive (e.g., some web applications, gaming,
966	   transaction-based protocols).  Such applications may have different
967	   objectives.  They may not wish to maximize throughput, but may desire
968	   a lower loss rate or bounded delay.

970	   The correct operation of an AQM-enabled network device MUST NOT rely
971	   upon specific transport responses to congestion signals.

973	4.7.  The need for further research

975	   The second recommendation of [RFC2309] called for further research
976	   into the interaction between network queues and host applications,
977	   and the means of signaling between them.  This research has occurred,
978	   and we as a community have learned a lot.  However, we are not done.

980	   We have learned that the problems of congestion, latency and buffer-
981	   sizing have not gone away, and are becoming more important to many
982	   users.  A number of self-tuning AQM algorithms have been found that
983	   offer significant advantages for deployed networks.  There is also
984	   renewed interest in deploying AQM and the potential of ECN.

986	   In 2013, an obvious example of further research is the need to
987	   consider the use of Map/Reduce applications in data centers; do we
988	   need to extend our taxonomy of TCP/SCTP sessions to include not only
989	   "mice" and "elephants", but "lemmings"?  "Lemmings" are flash crowds
990	   of "mice" that the network inadvertently try to signal to as if they
991	   were elephant flows, resulting in head of line blocking in data
992	   center applications.

994	   Examples of other required research include:

996	   o  Research into new AQM and scheduling algorithms.

998	   o  Appropriate use of delay-based methods and the implications of
999	      AQM.

1001	   o  Research into the use of and deployment of ECN alongside AQM.

1003	   o  Tools for enabling AQM (and ECN) deployment and measuring the
1004	      performance.

1006	   o  Methods for mitigating the impact of non-conformant and malicious
1007	      flows.

1009	   o  Research to understand the implications of using new network and
1010	      transport methods on applications.

1012	   Hence, this document therefore reiterates the call of RFC 2309: we
1013	   need continuing research as applications develop.

1015	5.  IANA Considerations

1017	   This memo asks the IANA for no new parameters.

1019	6.  Security Considerations

1021	   While security is a very important issue, it is largely orthogonal to
1022	   the performance issues discussed in this memo.

1024	   Many deployed network devices use queueing methods that allow
1025	   unresponsive traffic to capture network capacity, denying access to
1026	   other traffic flows.  This could potentially be used as a denial-of-
1027	   service attack.  This threat could be reduced in network devices
1028	   deploy AQM or some form of scheduling.  We note, however, that a
1029	   denial-of-service attack that results in unresponsive traffic flows
1030	   may be indistinguishable from other traffic flows (e.g. tunnels
1031	   carrying aggregates of short flows, high-rate isochronous
1032	   applications).  New methods therefore may remain vulnerable, and this
1033	   document recommends that ongoing research should consider ways to
1034	   mitigate such attacks.

1036	7.  Privacy Considerations

1038	   This document, by itself, presents no new privacy issues.

1040	8.  Acknowledgements

1042	   The original version of this document describing best current
1043	   practice was based on the informational text of [RFC2309].  This was
1044	   written by the End-to-End Research Group, which is to say Bob Braden,
1045	   Dave Clark, Jon Crowcroft, Bruce Davie, Steve Deering, Deborah
1046	   Estrin, Sally Floyd, Van Jacobson, Greg Minshall, Craig Partridge,
1047	   Larry Peterson, KK Ramakrishnan, Scott Shenker, John Wroclawski, and
1048	   Lixia Zhang.  Although there are important differences, many of the
1049	   key arguments in the present document remain unchanged from those in
1050	   RFC 2309.

1052	   The need for an updated document was agreed to in the tsvarea meeting
1053	   at IETF 86.  This document was reviewed on the aqm@ietf.org list.
1054	   Comments were received from Colin Perkins, Richard Scheffenegger,
1055	   Dave Taht, John Leslie, David Collier-Brown and many others.

1057	   Gorry Fairhurst was in part supported by the European Community under
1058	   its Seventh Framework Programme through the Reducing Internet
1059	   Transport Latency (RITE) project (ICT-317700).

1061	9.  References

1063	9.1.  Normative References

1065	   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
1066	              Requirement Levels", BCP 14, RFC 2119, March 1997.

1068	   [RFC3168]  Ramakrishnan, K., Floyd, S., and D. Black, "The Addition
1069	              of Explicit Congestion Notification (ECN) to IP", RFC
1070	              3168, September 2001.

1072	   [RFC4301]  Kent, S. and K. Seo, "Security Architecture for the
1073	              Internet Protocol", RFC 4301, December 2005.

1075	   [RFC4774]  Floyd, S., "Specifying Alternate Semantics for the
1076	              Explicit Congestion Notification (ECN) Field", BCP 124,
1077	              RFC 4774, November 2006.

1079	   [RFC5405]  Eggert, L. and G. Fairhurst, "Unicast UDP Usage Guidelines
1080	              for Application Designers", BCP 145, RFC 5405, November
1081	              2008.

1083	   [RFC5681]  Allman, M., Paxson, V., and E. Blanton, "TCP Congestion
1084	              Control", RFC 5681, September 2009.

1086	   [RFC6040]  Briscoe, B., "Tunnelling of Explicit Congestion
1087	              Notification", RFC 6040, November 2010.

1089	   [RFC6679]  Westerlund, M., Johansson, I., Perkins, C., O'Hanlon, P.,
1090	              and K. Carlberg, "Explicit Congestion Notification (ECN)
1091	              for RTP over UDP", RFC 6679, August 2012.

1093	   [RFC7141]  Briscoe, B. and J. Manner, "Byte and Packet Congestion
1094	              Notification", BCP 41, RFC 7141, February 2014.

1096	9.2.  Informative References

1098	   [AQM-WG]   "IETF AQM WG", .

1100	   [CONEX]    Mathis, M. and B. Briscoe, "The Benefits to Applications
1101	              of using Explicit Congestion Notification (ECN)", IETF
1102	              (Work-in-Progress) draft-ietf-conex-abstract-mech, March
1103	              2014.

1105	   [Choi04]   Choi, Baek-Young., Moon, Sue., Zhang, Zhi-Li.,
1106	              Papagiannaki, K., and C. Diot, "Analysis of Point-To-Point
1107	              Packet Delay In an Operational Network", March 2004.

1109	   [Dem90]    Demers, A., Keshav, S., and S. Shenker, "Analysis and
1110	              Simulation of a Fair Queueing Algorithm, Internetworking:
1111	              Research and Experience", SIGCOMM Symposium proceedings on
1112	              Communications architectures and protocols , 1990.

1114	   [ECN-Benefit]
1115	              Welzl, M. and G. Fairhurst, "The Benefits to Applications
1116	              of using Explicit Congestion Notification (ECN)", IETF
1117	              (Work-in-Progress) , February 2014.

1119	   [Flo92]    Floyd, S. and V. Jacobsen, "On Traffic Phase Effects in
1120	              Packet-Switched Gateways", 1992.

1122	   [Flo94]    Floyd, S. and V. Jacobsen, "The Synchronization of
1123	              Periodic Routing Messages,
1124	              http://ee.lbl.gov/papers/sync_94.pdf", 1994.

1126	   [Floyd91]  Floyd, S., "Connections with Multiple Congested Gateways
1127	              in Packet-Switched Networks Part 1: One-way Traffic.",
1128	              Computer Communications Review , October 1991.

1130	   [Floyd95]  Floyd, S. and V. Jacobson, "Link-sharing and Resource
1131	              Management Models for Packet Networks", IEEE/ACM
1132	              Transactions on Networking , August 1995.

1134	   [Jacobson88]
1135	              Jacobson, V., "Congestion Avoidance and Control", SIGCOMM
1136	              Symposium proceedings on Communications architectures and
1137	              protocols , August 1988.

1139	   [Jain94]   Jain, Raj., Ramakrishnan, KK., and Chiu. Dah-Ming,
1140	              "Congestion avoidance scheme for computer networks", US
1141	              Patent Office 5377327, December 1994.

1143	   [Lakshman96]
1144	              Lakshman, TV., Neidhardt, A., and T. Ott, "The Drop From
1145	              Front Strategy in TCP Over ATM and Its Interworking with
1146	              Other Control Features", IEEE Infocomm , 1996.

1148	   [Leland94]
1149	              Leland, W., Taqqu, M., Willinger, W., and D. Wilson, "On
1150	              the Self-Similar Nature of Ethernet Traffic (Extended
1151	              Version)", IEEE/ACM Transactions on Networking , February
1152	              1994.

1154	   [McK90]    McKenney, PE. and G. Varghese, "Stochastic Fairness
1155	              Queuing",
1156	              http://www2.rdrop.com/~paulmck/scalability/paper/
1157	              sfq.2002.06.04.pdf , 1990.

1159	   [Nic12]    Nichols, K., "Controlling Queue Delay", Communications of
1160	              the ACM Vol. 55 No. 11, July, 2012, pp.42-50. , July 2002.

1162	   [RFC0768]  Postel, J., "User Datagram Protocol", STD 6, RFC 768,
1163	              August 1980.

1165	   [RFC0791]  Postel, J., "Internet Protocol", STD 5, RFC 791, September
1166	              1981.

1168	   [RFC0793]  Postel, J., "Transmission Control Protocol", STD 7, RFC
1169	              793, September 1981.

1171	   [RFC0896]  Nagle, J., "Congestion control in IP/TCP internetworks",
1172	              RFC 896, January 1984.

1174	   [RFC0970]  Nagle, J., "On packet switches with infinite storage", RFC
1175	              970, December 1985.

1177	   [RFC1122]  Braden, R., "Requirements for Internet Hosts -
1178	              Communication Layers", STD 3, RFC 1122, October 1989.

1180	   [RFC1633]  Braden, B., Clark, D., and S. Shenker, "Integrated
1181	              Services in the Internet Architecture: an Overview", RFC
1182	              1633, June 1994.

1184	   [RFC2309]  Braden, B., Clark, D., Crowcroft, J., Davie, B., Deering,
1185	              S., Estrin, D., Floyd, S., Jacobson, V., Minshall, G.,
1186	              Partridge, C., Peterson, L., Ramakrishnan, K., Shenker,
1187	              S., Wroclawski, J., and L. Zhang, "Recommendations on
1188	              Queue Management and Congestion Avoidance in the
1189	              Internet", RFC 2309, April 1998.

1191	   [RFC2460]  Deering, S. and R. Hinden, "Internet Protocol, Version 6
1192	              (IPv6) Specification", RFC 2460, December 1998.

1194	   [RFC2474]  Nichols, K., Blake, S., Baker, F., and D. Black,
1195	              "Definition of the Differentiated Services Field (DS
1196	              Field) in the IPv4 and IPv6 Headers", RFC 2474, December
1197	              1998.

1199	   [RFC2475]  Blake, S., Black, D., Carlson, M., Davies, E., Wang, Z.,
1200	              and W. Weiss, "An Architecture for Differentiated
1201	              Services", RFC 2475, December 1998.

1203	   [RFC4340]  Kohler, E., Handley, M., and S. Floyd, "Datagram
1204	              Congestion Control Protocol (DCCP)", RFC 4340, March 2006.

1206	   [RFC4960]  Stewart, R., "Stream Control Transmission Protocol", RFC
1207	              4960, September 2007.

1209	   [RFC5348]  Floyd, S., Handley, M., Padhye, J., and J. Widmer, "TCP
1210	              Friendly Rate Control (TFRC): Protocol Specification", RFC
1211	              5348, September 2008.

1213	   [RFC5559]  Eardley, P., "Pre-Congestion Notification (PCN)
1214	              Architecture", RFC 5559, June 2009.

1216	   [RFC6057]  Bastian, C., Klieber, T., Livingood, J., Mills, J., and R.
1217	              Woundy, "Comcast's Protocol-Agnostic Congestion Management
1218	              System", RFC 6057, December 2010.

1220	   [RFC6789]  Briscoe, B., Woundy, R., and A. Cooper, "Congestion
1221	              Exposure (ConEx) Concepts and Use Cases", RFC 6789,
1222	              December 2012.

1224	   [RFC6817]  Shalunov, S., Hazel, G., Iyengar, J., and M. Kuehlewind,
1225	              "Low Extra Delay Background Transport (LEDBAT)", RFC 6817,
1226	              December 2012.

1228	   [Shr96]    Shreedhar, M. and G. Varghese, "Efficient Fair Queueing
1229	              Using Deficit Round Robin", IEEE/ACM Transactions on
1230	              Networking Vol 4, No. 3 , July 1996.

1232	   [Sto97]    Stoica, I. and H. Zhang, "A Hierarchical Fair Service
1233	              Curve algorithm for Link sharing, real-time and priority
1234	              services", ACM SIGCOMM , 1997.

1236	   [Sut99]    Suter, B., "Buffer Management Schemes for Supporting TCP
1237	              in Gigabit Routers with Per-flow Queueing", IEEE Journal
1238	              on Selected Areas in Communications Vol. 17 Issue 6, June,
1239	              1999, pp. 1159-1169. , 1999.

1241	   [Willinger95]
1242	              Willinger, W., Taqqu, M., Sherman, R., Wilson, D., and V.
1243	              Jacobson, "Self-Similarity Through High-Variability:
1244	              Statistical Analysis of Ethernet LAN Traffic at the Source
1245	              Level", SIGCOMM Symposium proceedings on Communications
1246	              architectures and protocols , August 1995.

1248	   [Zha90]    Zhang, L. and D. Clark, "Oscillating Behavior of Network
1249	              Traffic: A Case Study Simulation,
1250	              http://groups.csail.mit.edu/ana/Publications/Zhang-DDC-
1251	              Oscillating-Behavior-of-Network-Traffic-1990.pdf", 1990.

1253	Appendix A.  Change Log

1255	   RFC-Editor please remove this appendix before publication.

1257	   Initial Version:  March 2013

1259	   Minor update of the algorithms that the IETF recommends SHOULD NOT
1260	   require operational (especially manual) configuration or tuning
1261	      April 2013

1263	   Major surgery.  This draft is for discussion at IETF-87 and expected
1264	   to be further updated.
1265	      July 2013

1267	   -00 WG Draft - Updated transport recommendations; revised deployment
1268	   configuration section; numerous minor edits.
1269	      Oct 2013

1271	   -01 WG Draft - Updated transport recommendations; revised deployment
1272	   configuration section; numerous minor edits.
1273	      Jan 2014 - Feedback from WG.

1275	   -02 WG Draft - Minor edits  Feb 2014 - Mainly language fixes.

1277	   -03 WG Draft - Minor edits  Feb 2013 - Comments from David Collier-
1278	      Brown and David Taht.

1280	   -04 WG Draft - Minor edits  May 2014 - Comments during WGLC: Provided
1281	      some introductory subsections to help people (with subsections and
1282	      better text). - Written more on the role scheduling.  - Clarified
1283	      that ECN mark threshold needs to be configurable. - Reworked your
1284	      "knee" para.  Various updates in response to feedback.

1286	   -05 WG Draft - Minor edits  June 2014 - New text added to address
1287	      further comments, and improve introduction - adding context,
1288	      reference to Conex, linking between sections, added text on
1289	      synchronization.

1291	   -06 WG Draft - Minor edits  July 2014 - Reorganised the introduction
1292	      following WG feedback to better explain how this relates to the
1293	      original goals of RFC2309.  Added item on packet bursts.  Various
1294	      minor corrections incorporated - no change to main
1295	      recommendations.

1297	   -07 WG Draft - Minor edits  July 2014 - Replaced ID REF by RFC 7141.
1298	      Changes made to introduction following inputs from Wes Eddy and
1299	      John Leslie.  Corrections and additions proposed by Bob Briscoe.

1301	   -08 WG Draft - Minor edits  August 2014 - Review comments from John
1302	      Leslie and Bob Briscoe.  Text corrections including; updated
1303	      Acknowledgments (RFC2309 ref) s/congestive/congestion/g; changed
1304	      the more bold language from RFC2309 to reflect a more considered
1305	      perceived threat to Internet Performance; modified the category
1306	      that is not-TCP-like to be "less responsive to congestion than
1307	      TCP" and more clearkly noted that represents a range of
1308	      behaviours.

1310	Authors' Addresses

1312	   Fred Baker (editor)
1313	   Cisco Systems
1314	   Santa Barbara, California  93117
1315	   USA

1317	   Email: fred@cisco.com

1319	   Godred Fairhurst (editor)
1320	   University of Aberdeen
1321	   School of Engineering
1322	   Fraser Noble Building
1323	   Aberdeen, Scotland  AB24 3UE
1324	   UK

1326	   Email: gorry@erg.abdn.ac.uk
1327	   URI:   http://www.erg.abdn.ac.uk