Re: [tcpm] Adoption of draft-zimmermann-tcpm-cubic?

Neal Cardwell <ncardwell@google.com> Thu, 19 March 2015 01:21 UTC

Return-Path: <ncardwell@google.com>
X-Original-To: tcpm@ietfa.amsl.com
Delivered-To: tcpm@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 85E5B1AC3CC for <tcpm@ietfa.amsl.com>; Wed, 18 Mar 2015 18:21:28 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.389
X-Spam-Level:
X-Spam-Status: No, score=-1.389 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FM_FORGED_GMAIL=0.622, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id abpGSUy_W9SQ for <tcpm@ietfa.amsl.com>; Wed, 18 Mar 2015 18:21:27 -0700 (PDT)
Received: from mail-ob0-x230.google.com (mail-ob0-x230.google.com [IPv6:2607:f8b0:4003:c01::230]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 79C7E1AC407 for <tcpm@ietf.org>; Wed, 18 Mar 2015 18:20:55 -0700 (PDT)
Received: by obdfc2 with SMTP id fc2so44118746obd.3 for <tcpm@ietf.org>; Wed, 18 Mar 2015 18:20:55 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=XwrwfNB7mwalcm5tbJ1BfR4dGhS2ZZnrRZbVnC7W5RY=; b=T9FMlyQ6Dn9LlXyNxbf2KhwV7kT69eBleLQ1w/wruNu5D6Ndqu7G7WJWkwnfx+rcVb 3jgTPr8WtiyQtPjz8g0xc2oPh7XulUnPqnGm4tPeAvhm1Pl5dC9YFj3kstnuM6qhahte /64zDWnMjigHEUoNyO5BFV64ypE/Cx8zk1mrXX1wV3/CJrwYJiPe+Tv3mIHCl6xMQn/O P2GpuYM9z78KYVgGukd44ZsjlwtaWmee3KBciMTLMI+5Zo0DVfsNcJPTJdXaNNARR7Ac oPu+Po3E7DoDqHswEjwc5zXkbD9dXtgyOVVC1qPKkr6mvP/xCmXvY3+FMbV8kB4svyHP ECMA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type; bh=XwrwfNB7mwalcm5tbJ1BfR4dGhS2ZZnrRZbVnC7W5RY=; b=QbG1w3lULztktuiVADcWxYVtAQAfhotl6WlPdUMUeK8n96cuwdAP7xS3o3amIj4y4t 4E50mcKi0IDNzQ486CIaHwXSSKDdSyjgpATOQKgHsM+2kAPjiiLHG/itJ9mxJ6RtDpSn +mrJsRYfUCfXNyJiUbgnlwXL2tUJ8bFwVpz/ig8ClFpaw0DWAZthEHudDBprbSccowTI jD515KK8T44PbmO7HMSTt4hyf/2gVFM1v2FtJ8WRKrhMXFZkWWaG5+EHlK/8f5gg5+67 6jFdwA6f1RvR2btYleyTuOdLphxv1yMmtKYPVMdaa2LCo4tYBcUrG0F0StIjvoX5DLuA QhUg==
X-Gm-Message-State: ALoCoQkzWcsZePPXwps74F0jA3A9mILDa1Bn629oWBZ4z0YgfZ+Y3xh0iYbMwR9Lqndh5WBGPRy2
MIME-Version: 1.0
X-Received: by 10.202.206.8 with SMTP id e8mr8899427oig.112.1426728054880; Wed, 18 Mar 2015 18:20:54 -0700 (PDT)
Received: by 10.202.174.72 with HTTP; Wed, 18 Mar 2015 18:20:54 -0700 (PDT)
In-Reply-To: <2235E611-C0A1-42A9-9B00-D584FF3C8B3B@netapp.com>
References: <ae97de90f83c460f8cfd0273f47611dd@hioexcmbx05-prd.hq.netapp.com> <655C07320163294895BBADA28372AF5D16C60F4D@FR712WXCHMBA15.zeu.alcatel-lucent.com> <2235E611-C0A1-42A9-9B00-D584FF3C8B3B@netapp.com>
Date: Wed, 18 Mar 2015 21:20:54 -0400
Message-ID: <CADVnQyk-hqAf_2VDTx841yTRO3SaLJNZpi3XmnQcyKgcQ4nAPA@mail.gmail.com>
From: Neal Cardwell <ncardwell@google.com>
To: "Zimmermann, Alexander" <Alexander.Zimmermann@netapp.com>
Content-Type: text/plain; charset="UTF-8"
Archived-At: <http://mailarchive.ietf.org/arch/msg/tcpm/wZicL4ps9Rk_jHccHQwyYK5qedQ>
Cc: "tcpm-chairs@tools.ietf.org" <tcpm-chairs@tools.ietf.org>, Eric Dumazet <edumazet@google.com>, "tcpm@ietf.org" <tcpm@ietf.org>
Subject: Re: [tcpm] Adoption of draft-zimmermann-tcpm-cubic?
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 19 Mar 2015 01:21:28 -0000

On Wed, Mar 18, 2015 at 6:26 AM, Zimmermann, Alexander
<Alexander.Zimmermann@netapp.com> wrote:
> Hi Michael,
>
>> Am 18.03.2015 um 11:18 schrieb Scharf, Michael (Michael) <michael.scharf@alcatel-lucent.com>:
>>
>> My understanding is that TCPM recharting is still pending and IESG waits for potential feedback until March 23 [1].
>>
>> The chairs are well aware of the strong support for draft-zimmermann-tcpm-cubic during IETF 91.
>> We plan to run a formal adoption call once our charter explicitly allows adoption in TCPM.

Regarding draft-zimmermann-tcpm-cubic-00:

  https://tools.ietf.org/id/draft-zimmermann-tcpm-cubic-00.txt

I support this draft.

But I did want to note that our experience with CUBIC at Google shows
that (1) it's important to consider stretch ACKs, and (2) it's critical to
carefully consider the maximum rate of cwnd increase for CUBIC
in congestion avoidance.

I would suggest the following small diffs to the draft:

Where the draft says (in sections "3.3. Concave region" and "3.4.
Convex region"):

  In this region, cwnd MUST be incremented by (W(t+RTT) - cwnd)/cwnd
  for each received ACK.

I'd suggest the following new text in these two spots:

  In this region, cwnd MUST be incremented by
     min((W(t+RTT) - cwnd)/cwnd, 1/2)
  for each newly-acknowledged packet.

Detailed rationale:

(1) First is the issue of the treatment of "stretch ACKs" covering
more than 2 packets.  The CUBIC paper and the Linux code through v3.18
cap the de facto maximum rate of cwnd increase (in congestion
avoidance) at 1 packet for every alternate ACK. In Google's experience
with high-BDP paths with receiver hosts using receive
offload/aggregation mechanisms (LRO and GRO in Linux terminology), the
ACKs for up to 40 or more packets at a time caused this cap of "1
packet for every alternate ACK" to result in cwnd increases that were
too slow, leading to underutilization. So our team at Google
contributed some changes in v3.19 that changed the rate of increase to
be expressed in terms of packets ACKed, rather than number of ACKs.
This approach allows full utilization even with receiver offload
mechanisms.

(2) Second is the issue of the maximum rate of increase of cwnd. As it
stands, this document says in section "3.4. Convex region" that:

   In this region, cwnd MUST be incremented by (W(t+RTT) - cwnd)/cwnd
   for each received ACK.

There is similar language in section "3.3. Concave region".

Since the rate of increase of a cubic function can be quite large, if
we take this language literally then in steep sections of the curve
this can require that the sender "MUST" increase cwnd by quite
a large amount.

I am not sure what other CUBIC implementations do, but this does not
match the CUBIC paper or the Linux implementation. As mentioned above,
the CUBIC paper and the Linux code through v3.18 actually cap the de
facto maximum rate of cwnd increase (in congestion avoidance) at 1
packet for every alternate ACK.

In our recent experiments with using CUBIC for YouTube video traffic
using the revised, stretch-ACK-savvy CUBIC logic in Linux v3.19 we saw
retransmit rates double due to v3.19 allowing cwnd to increase by 1
packet for every packet ACKed.

So in Linux v4.0 we set a limit of increasing cwnd by at most 1 packet
for every 2 packets ACKed, which restored the retransmit rates to
their previous low levels.

For reference:

(1) Commits in Linux v3.19 to make CUBIC stretch-ACK-savvy:

  http://git.kernel.org/cgit/linux/kernel/git/davem/net.git/commit/?id=e73ebb0881ea5534ce606c1d71b4ac44db5c6930
  http://git.kernel.org/cgit/linux/kernel/git/davem/net.git/commit/?id=814d488c61260521b1b3cc97063700a5a6667c8f
  http://git.kernel.org/cgit/linux/kernel/git/davem/net.git/commit/?id=9cd981dcf174d26805a032aefa791436da709bee
  http://git.kernel.org/cgit/linux/kernel/git/davem/net.git/commit/?id=d6b1a8a92a1417f8859a6937d2e6ffe2dfab4e6d

(2) Commits slated for Linux v4.0 to fix excessive cwnd growth in CUBIC:

  http://git.kernel.org/cgit/linux/kernel/git/davem/net.git/commit/?id=9949afa42be0b76f5832db112ce51bb6b35b2abb
  http://git.kernel.org/cgit/linux/kernel/git/davem/net.git/commit/?id=d578e18ce93f5d33a7120fd57c453e22a4c0fc37

neal