<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE rfc SYSTEM 'rfc2629.dtd' []>
<rfc ipr="trust200902" category="exp" docName="draft-ietf-tcpinc-tcpcrypt-14">
<?rfc toc="yes"?>
<?rfc symrefs="yes"?>
<?rfc sortrefs="yes"?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>
<?rfc private=""?>
<?rfc topblock="yes"?>
<?rfc comments="no"?>
<front>
<title abbrev="tcpcrypt">Cryptographic protection of TCP Streams (tcpcrypt)</title>

<author initials="A." surname="Bittau" fullname="Andrea Bittau">
<organization>Google</organization>
<address>
<postal>
<street>345 Spear Street</street>
<city>San Francisco, CA</city>
<code>94105</code>
<country>US</country>
<region></region>
</postal>
<phone></phone>
<email>bittau@google.com</email>
<uri></uri>
</address>
</author>
<author initials="D." surname="Giffin" fullname="Daniel B. Giffin">
<organization>Stanford University</organization>
<address>
<postal>
<street>353 Serra Mall, Room 288</street>
<city>Stanford, CA</city>
<code>94305</code>
<country>US</country>
<region></region>
</postal>
<phone></phone>
<email>dbg@scs.stanford.edu</email>
<uri></uri>
</address>
</author>
<author initials="M." surname="Handley" fullname="Mark Handley">
<organization>University College London</organization>
<address>
<postal>
<street>Gower St.</street>
<city>London</city>
<code>WC1E 6BT</code>
<country>UK</country>
<region></region>
</postal>
<phone></phone>
<email>M.Handley@cs.ucl.ac.uk</email>
<uri></uri>
</address>
</author>
<author initials="D." surname="Mazieres" fullname="David Mazieres">
<organization>Stanford University</organization>
<address>
<postal>
<street>353 Serra Mall, Room 290</street>
<city>Stanford, CA</city>
<code>94305</code>
<country>US</country>
<region></region>
</postal>
<phone></phone>
<email>dm@uun.org</email>
<uri></uri>
</address>
</author>
<author initials="Q." surname="Slack" fullname="Quinn Slack">
<organization>Sourcegraph</organization>
<address>
<postal>
<street>121 2nd St Ste 200</street>
<city>San Francisco, CA</city>
<code>94105</code>
<country>US</country>
<region></region>
</postal>
<phone></phone>
<email>sqs@sourcegraph.com</email>
<uri></uri>
</address>
</author>
<author initials="E." surname="Smith" fullname="Eric W. Smith">
<organization>Kestrel Institute</organization>
<address>
<postal>
<street>3260 Hillview Avenue</street>
<city>Palo Alto, CA</city>
<code>94304</code>
<country>US</country>
<region></region>
</postal>
<phone></phone>
<email>eric.smith@kestrel.edu</email>
<uri></uri>
</address>
</author>
<date year="2018" month="November" day="4"/>

<area>Internet</area>
<workgroup></workgroup>
<keyword>tcp</keyword>
<keyword>encryption</keyword>


<abstract>
<t>This document specifies tcpcrypt, a TCP encryption protocol designed
for use in conjunction with the TCP Encryption Negotiation Option
(TCP-ENO).  Tcpcrypt coexists with middleboxes by tolerating
resegmentation, NATs, and other manipulations of the TCP header.  The
protocol is self-contained and specifically tailored to TCP
implementations, which often reside in kernels or other environments
in which large external software dependencies can be undesirable.
Because the size of TCP options is limited, the protocol requires one
additional one-way message latency to perform key exchange before
application data can be transmitted.  However, the extra latency can
be avoided between two hosts that have recently established a previous
tcpcrypt connection.
</t>
</abstract>


</front>

<middle>

<section anchor="requirements-language" title="Requirements Language">
<t>The key words &quot;MUST&quot;, &quot;MUST NOT&quot;, &quot;REQUIRED&quot;, &quot;SHALL&quot;, &quot;SHALL NOT&quot;,
&quot;SHOULD&quot;, &quot;SHOULD NOT&quot;, &quot;RECOMMENDED&quot;, &quot;NOT RECOMMENDED&quot;, &quot;MAY&quot;, and
&quot;OPTIONAL&quot; in this document are to be interpreted as described in BCP
14 <xref target="RFC2119"/> <xref target="RFC8174"/> when, and only when, they appear in all
capitals, as shown here.
</t>
</section>

<section anchor="introduction" title="Introduction">
<t>This document describes tcpcrypt, an extension to TCP for
cryptographic protection of session data.  Tcpcrypt was designed to
meet the following goals:
</t>
<t>
<list style="symbols">
<t>Meet the requirements of the TCP Encryption Negotiation Option
(TCP-ENO) <xref target="I-D.ietf-tcpinc-tcpeno"/> for protecting connection
data.</t>
<t>Be amenable to small, self-contained implementations inside TCP
stacks.</t>
<t>Minimize additional latency at connection startup.</t>
<t>As much as possible, prevent connection failure in the presence of
NATs and other middleboxes that might normalize traffic or otherwise
manipulate TCP segments.</t>
<t>Operate independently of IP addresses, making it possible to
authenticate resumed sessions efficiently even when either end
changes IP address.</t>
</list>
</t>
<t>A companion document <xref target="I-D.ietf-tcpinc-api"/> describes recommended
interfaces for configuring certain parameters of this protocol.
</t>
</section>

<section anchor="encryption-protocol" title="Encryption Protocol">
<t>This section describes the operation of the tcpcrypt protocol.
The wire format of all messages is specified in <xref target="encodings"/>.
</t>

<section anchor="cryptographic-algorithms" title="Cryptographic Algorithms">
<t>Setting up a tcpcrypt connection employs three types of cryptographic
algorithms:
</t>
<t>
<list style="symbols">
<t>A <spanx style="emph">key agreement scheme</spanx> is used with a short-lived public key
to agree upon a shared secret.</t>
<t>An <spanx style="emph">extract function</spanx> is used to generate a pseudo-random key
(PRK) from some initial keying material produced by the key
agreement scheme.  The notation Extract(S, IKM) denotes the output
of the extract function with salt S and initial keying material
IKM.</t>
<t>A <spanx style="emph">collision-resistant pseudo-random function (CPRF)</spanx> is used to
generate multiple cryptographic keys from a pseudo-random key,
typically the output of the extract function.  The CPRF produces
an arbitrary amount of Output Keying Material (OKM), and we use
the notation CPRF(K, CONST, L) to designate the first L
bytes of the OKM produced by the CPRF when parameterized by key K
and the constant CONST.</t>
</list>
</t>
<t>The Extract and CPRF functions used by the tcpcrypt variants defined
in this document are the Extract and Expand functions of HKDF
<xref target="RFC5869"/>, which is built on HMAC <xref target="RFC2104"/>.  These are defined as
follows in terms of the function <spanx style="verb">HMAC-Hash(key, value)</spanx> for a
negotiated <spanx style="verb">Hash</spanx> function such as SHA-256; the symbol <spanx style="verb">|</spanx> denotes
concatenation, and the counter concatenated to the right of CONST
occupies a single octet.
</t>

<figure anchor="fig:hkdf" align="center" title="HKDF functions used for key derivation
"><artwork align="center">
HKDF-Extract(salt, IKM) -&gt; PRK
   PRK = HMAC-Hash(salt, IKM)

HKDF-Expand(PRK, CONST, L) -&gt; OKM
   T(0) = empty string (zero length)
   T(1) = HMAC-Hash(PRK, T(0) | CONST | 0x01)
   T(2) = HMAC-Hash(PRK, T(1) | CONST | 0x02)
   T(3) = HMAC-Hash(PRK, T(2) | CONST | 0x03)
   ...

   OKM  = first L octets of T(1) | T(2) | T(3) | ...
   where L &lt;= 255*OutputLength(Hash)
</artwork></figure>
<t>Lastly, once tcpcrypt has been successfully set up and encryption keys
have been derived, an algorithm for Authenticated Encryption with
Associated Data (AEAD) is used to protect the confidentiality and
integrity of all transmitted application data.  AEAD algorithms use a
single key to encrypt their input data and also to generate a
cryptographic tag to accompany the resulting ciphertext; when
decryption is performed, the tag allows authentication of the
encrypted data and of optional, associated plaintext data.
</t>
</section>

<section anchor="protocol-negotiation" title="Protocol Negotiation">
<t>Tcpcrypt depends on TCP-ENO <xref target="I-D.ietf-tcpinc-tcpeno"/> to negotiate
whether encryption will be enabled for a connection, and also which
key-agreement scheme to use.  TCP-ENO negotiates the use of a
particular TCP encryption protocol or <spanx style="emph">TEP</spanx> by including protocol
identifiers in ENO suboptions.  This document associates four TEP
identifiers with the tcpcrypt protocol, as listed in <xref target="tab:tep-ids"/> in
<xref target="iana"/>.  Each identifier indicates the use of a particular
key-agreement scheme, with an associated CPRF and length parameter.
Future standards can associate additional TEP identifiers with
tcpcrypt, following the assignment policy specified by TCP-ENO.
</t>
<t>An active opener that wishes to negotiate the use of tcpcrypt includes
an ENO option in its SYN segment.  That option includes suboptions
with tcpcrypt TEP identifiers indicating the key-agreement schemes it
is willing to enable.  The active opener MAY additionally include
suboptions indicating support for encryption protocols other than
tcpcrypt, as well as global suboptions as specified by TCP-ENO.
</t>
<t>If a passive opener receives an ENO option including tcpcrypt TEPs it
supports, it MAY then attach an ENO option to its SYN-ACK segment,
including solely the TEP it wishes to enable.
</t>
<t>To establish distinct roles for the two hosts in each connection,
tcpcrypt depends on the role-negotiation mechanism of TCP-ENO.  As one
result of the negotiation process, TCP-ENO assigns hosts unique roles
abstractly called &quot;A&quot; at one end of the connection and &quot;B&quot; at the
other.  Generally, an active opener plays the &quot;A&quot; role and a passive
opener plays the &quot;B&quot; role, but in the case of simultaneous open, an
additional mechanism breaks the symmetry and assigns a distinct role
to each host.  TCP-ENO uses the terms &quot;host A&quot; and &quot;host B&quot; to
identify each end of a connection uniquely, and this document employs
those terms in the same way.
</t>
<t>An ENO suboption includes a flag <spanx style="verb">v</spanx> which indicates the presence of
associated, variable-length data.  In order to propose fresh key
agreement with a particular tcpcrypt TEP, a host sends a one-byte
suboption containing the TEP identifier and <spanx style="verb">v = 0</spanx>.  In order to
propose session resumption (described further below) with a particular
TEP, a host sends a variable-length suboption containing the TEP
identifier, the flag <spanx style="verb">v = 1</spanx>, an identifier derived from a session
secret previously negotiated with the same host and the same TEP, and
a nonce.
</t>
<t>Once two hosts have exchanged SYN segments, TCP-ENO defines the
<spanx style="emph">negotiated TEP</spanx> to be the last valid TEP identifier in the SYN
segment of host B (that is, the passive opener in the absence of
simultaneous open) that also occurs in that of host A.  If there is no
such TEP, hosts MUST disable TCP-ENO and tcpcrypt.
</t>
<t>If the negotiated TEP was sent by host B with <spanx style="verb">v = 0</spanx>, it means that
fresh key agreement will be performed as described below in
<xref target="key-exchange"/>.  If, on the other hand, host B sent the TEP with <spanx style="verb">v =
1</spanx> and both hosts sent appropriate resumption identifiers in their
suboption data, then the key-exchange messages will be omitted in
favor of determining keys via session resumption as described in
<xref target="session-resumption"/>.  With session resumption, protected application
data MAY be sent immediately as detailed in <xref target="encryption"/>.
</t>
<t>Note that the negotiated TEP is determined without reference to the
<spanx style="verb">v</spanx> bits in ENO suboptions, so if host A offers resumption with a
particular TEP and host B replies with a non-resumption suboption with
the same TEP, that could become the negotiated TEP and fresh key
agreement will be performed.  That is, sending a resumption suboption
also implies willingness to perform fresh key agreement with the
indicated TEP.
</t>
<t>As REQUIRED by TCP-ENO, once a host has both sent and received an ACK
segment containing a valid ENO option, encryption MUST be enabled and
plaintext application data MUST NOT ever be exchanged on the
connection.  If the negotiated TEP is among those listed in
<xref target="tab:tep-ids"/>, a host MUST follow the protocol described in this
document.
</t>
</section>

<section anchor="key-exchange" title="Key Exchange">
<t>Following successful negotiation of a tcpcrypt TEP, all further
signaling is performed in the Data portion of TCP segments.  Except
when resumption was negotiated (described below in
<xref target="session-resumption"/>), the two hosts perform key exchange through two
messages, <spanx style="verb">Init1</spanx> and <spanx style="verb">Init2</spanx>, at the start of the data streams of
host A and host B, respectively.  These messages MAY span multiple TCP
segments and need not end at a segment boundary.  However, the segment
containing the last byte of an <spanx style="verb">Init1</spanx> or <spanx style="verb">Init2</spanx> message MUST have
TCP's push flag (PSH) set.
</t>
<t>The key exchange protocol, in abstract, proceeds as follows:
</t>

<figure align="center"><artwork align="center">
A -&gt; B:  Init1 = { INIT1_MAGIC, sym_cipher_list, N_A, Pub_A }
B -&gt; A:  Init2 = { INIT2_MAGIC, sym_cipher, N_B, Pub_B }
</artwork></figure>
<t>The concrete format of these messages is specified in
<xref target="key-exchange-messages"/>.
</t>
<t>The parameters are defined as follows:
</t>
<t>
<list style="symbols">
<t><spanx style="verb">INIT1_MAGIC</spanx>, <spanx style="verb">INIT2_MAGIC</spanx>: constants defined in
<xref target="constants"/>.</t>
<t><spanx style="verb">sym_cipher_list</spanx>: a list of identifiers of symmetric ciphers (AEAD
algorithms) acceptable to host A.  These are specified in
<xref target="tab:ae-algs"/> in <xref target="iana"/>.</t>
<t><spanx style="verb">sym_cipher</spanx>: the symmetric cipher selected by host B from the
<spanx style="verb">sym_cipher_list</spanx> sent by host A.</t>
<t><spanx style="verb">N_A</spanx>, <spanx style="verb">N_B</spanx>: nonces chosen at random by hosts A and B,
respectively.</t>
<t><spanx style="verb">Pub_A</spanx>, <spanx style="verb">Pub_B</spanx>: ephemeral public keys for hosts A and B,
respectively.  These, as well as their corresponding private keys,
are short-lived values that MUST be refreshed frequently.  The
private keys SHOULD NOT ever be written to persistent storage.  The
security risks associated with the storage of these keys are
discussed in <xref target="security-considerations"/>.</t>
</list>
</t>
<t>If a host receives an ephemeral public key from its peer and a
key-validation step fails (see <xref target="key-agreement-schemes"/>), it MUST
abort the connection and raise an error condition distinct from the
end-of-file condition.
</t>
<t>The ephemeral secret <spanx style="verb">ES</spanx> is the result of the key-agreement
algorithm (see <xref target="key-agreement-schemes"/>) indicated by the negotiated
TEP.   The inputs to the algorithm are the local host's ephemeral
private key and the remote host's ephemeral public key.  For example,
host A would compute <spanx style="verb">ES</spanx> using its own private key (not transmitted)
and host B's public key, <spanx style="verb">Pub_B</spanx>.
</t>
<t>The two sides then compute a pseudo-random key <spanx style="verb">PRK</spanx>, from which all
session secrets are derived, as follows:
</t>

<figure align="center"><artwork align="center">
PRK = Extract(N_A, eno-transcript | Init1 | Init2 | ES)
</artwork></figure>
<t>Above, <spanx style="verb">|</spanx> denotes concatenation; <spanx style="verb">eno-transcript</spanx> is the
protocol-negotiation transcript defined in Section 4.8 of
<xref target="I-D.ietf-tcpinc-tcpeno"/>; and <spanx style="verb">Init1</spanx> and <spanx style="verb">Init2</spanx> are the
transmitted encodings of the messages described in
<xref target="key-exchange-messages"/>.
</t>
<t>A series of &quot;session secrets&quot; are computed from <spanx style="verb">PRK</spanx> as follows:
</t>

<figure align="center"><artwork align="center">
 ss[0] = PRK
 ss[i] = CPRF(ss[i-1], CONST_NEXTK, K_LEN)
</artwork></figure>
<t>The value <spanx style="verb">ss[0]</spanx> is used to generate all key material for the current
connection.  The values <spanx style="verb">ss[i]</spanx> for <spanx style="verb">i &gt; 0</spanx> are used by session
resumption to avoid public key cryptography when establishing
subsequent connections between the same two hosts, as described later
in <xref target="session-resumption"/>.  The <spanx style="verb">CONST_*</spanx> values are constants defined
in <xref target="constants"/>.  The length <spanx style="verb">K_LEN</spanx> depends on the tcpcrypt TEP in
use, and is specified in <xref target="key-agreement-schemes"/>.
</t>
<t>Given a session secret <spanx style="verb">ss[i]</spanx>, the two sides compute a series of
master keys as follows:
</t>

<figure align="center"><artwork align="center">
mk[0] = CPRF(ss[i], CONST_REKEY | sn[i], K_LEN)
mk[j] = CPRF(mk[j-1], CONST_REKEY, K_LEN)
</artwork></figure>
<t>The process of advancing through the series of master keys is
described in <xref target="rekeying"/>.  The values <spanx style="verb">sn[i]</spanx> are &quot;session nonces.&quot;
For the initial session with <spanx style="verb">i = 0</spanx>, the session nonce is zero bytes
long.  The values for subsequent sessions are derived from fresh
connection data as described in <xref target="session-resumption"/>.
</t>
<t>Finally, each master key <spanx style="verb">mk[j]</spanx> is used to generate traffic keys for
protecting application data using authenticated encryption:
</t>

<figure align="center"><artwork align="center">
k_ab[j] = CPRF(mk[j], CONST_KEY_A, ae_key_len + ae_nonce_len)
k_ba[j] = CPRF(mk[j], CONST_KEY_B, ae_key_len + ae_nonce_len)
</artwork></figure>
<t>In the first session derived from fresh key-agreement, traffic keys
<spanx style="verb">k_ab[j]</spanx> are used by host A to encrypt and host B to decrypt, while
keys <spanx style="verb">k_ba[j]</spanx> are used by host B to encrypt and host A to decrypt.
In a resumed session, as described more thoroughly below in
<xref target="session-resumption"/>, each host uses the keys in the same way as it
did in the original session, regardless of its role in the current
session: for example, if a host played role &quot;A&quot; in the first session,
it will use keys <spanx style="verb">k_ab[j]</spanx> to encrypt in each derived session.
</t>
<t>The values <spanx style="verb">ae_key_len</spanx> and <spanx style="verb">ae_nonce_len</spanx> depend on the
authenticated-encryption algorithm selected, and are given in
<xref target="tab:ae-params"/> in <xref target="aead"/>.  The algorithm uses the first
<spanx style="verb">ae_key_len</spanx> bytes of each traffic key as an authenticated-encryption
key, and the following <spanx style="verb">ae_nonce_len</spanx> bytes as a &quot;nonce randomizer&quot;.
</t>
<t>Implementations SHOULD provide an interface allowing the user to
specify, for a particular connection, the set of AEAD algorithms to
advertize in <spanx style="verb">sym_cipher_list</spanx> (when playing role &quot;A&quot;) and also the
order of preference to use when selecting an algorithm from those
offered (when playing role &quot;B&quot;).  A companion document
<xref target="I-D.ietf-tcpinc-api"/> describes recommended interfaces for this
purpose.
</t>
<t>After host B sends <spanx style="verb">Init2</spanx> or host A receives it, that host MAY
immediately begin transmitting protected application data as described
in <xref target="encryption"/>.
</t>
<t>If host A receives <spanx style="verb">Init2</spanx> with a <spanx style="verb">sym_cipher</spanx> value that was not
present in the <spanx style="verb">sym_cipher_list</spanx> it previously transmitted in <spanx style="verb">Init1</spanx>,
it MUST abort the connection and raise an error condition distinct
from the end-of-file condition.
</t>
<t>Throughout this document, to &quot;abort the connection&quot; means to issue the
&quot;Abort&quot; command as described in <xref target="RFC0793"/>, Section 3.8.  That is, the
TCP connection is destroyed, RESET is transmitted, and the local user
is alerted to the abort event.
</t>
</section>

<section anchor="session-id" title="Session ID">
<t>TCP-ENO requires each TEP to define a <spanx style="emph">session ID</spanx> value that uniquely
identifies each encrypted connection.
</t>
<t>A tcpcrypt session ID begins with the byte transmitted by host B that
contains the negotiated TEP identifier along with the <spanx style="verb">v</spanx> bit.  The
remainder of the ID is derived from the session secret and session
nonce, as follows:
</t>

<figure align="center"><artwork align="center">
session_id[i] = TEP-byte | CPRF(ss[i], CONST_SESSID | sn[i], K_LEN)
</artwork></figure>
<t>Again, the length <spanx style="verb">K_LEN</spanx> depends on the TEP, and is specified in
<xref target="key-agreement-schemes"/>.
</t>
</section>

<section anchor="session-resumption" title="Session Resumption">
<t>If two hosts have previously negotiated a session with secret
<spanx style="verb">ss[i-1]</spanx>, they can establish a new connection without public-key
operations using <spanx style="verb">ss[i]</spanx>, the next session secret in the sequence
derived from the original PRK.
</t>
<t>A host signals willingness to resume with a particular session secret
by sending a SYN segment with a resumption suboption: that is, an ENO
suboption containing the negotiated TEP identifier of the previous
session, half of the &quot;resumption identifier&quot; for the new session, and
a &quot;resumption nonce&quot;.
</t>
<t>The resumption nonce MUST have a minimum length of zero bytes and
maximum length of eight bytes.  The value MUST be chosen randomly or
using a mechanism that guarantees uniqueness even in the face of
virtual machine cloning or other re-execution of the same session.  An
attacker who can force either side of a connection to reuse a session
secret with the same nonce will completely break the security of
tcpcrypt.  Reuse of session secrets is possible in the event of
virtual machine cloning or reuse of system-level hibernation state.
Implementations SHOULD provide an API through which to set the
resumption nonce length, and MUST default to eight bytes if they
cannot prohibit the reuse of session secrets.
</t>
<t>The resumption identifier is calculated from a session secret <spanx style="verb">ss[i]</spanx>
as follows:
</t>

<figure align="center"><artwork align="center">
resume[i] = CPRF(ss[i], CONST_RESUME, 18)
</artwork></figure>
<t>To name a session for resumption, a host sends either the first or
second half of the resumption identifier, according to the role it
played in the original session with secret <spanx style="verb">ss[0]</spanx>.
</t>
<t>A host that originally played role &quot;A&quot; and wishes to resume from a
cached session sends a suboption with the first half of the resumption
identifier:
</t>

<figure anchor="fig:active-resume-subopt" align="center" title="Resumption suboption sent when original role was &quot;A&quot;.
The TEP-byte contains a tcpcrypt TEP identifier and v = 1.
The nonce value MUST have length between 0 and 8 bytes.
"><artwork align="center">
byte     0      1             9      10
     +------+------+--...--+------+------+--...--+------+
     | TEP- |   resume[i]{0..8}   |       nonce_a       |
     | byte |                     |                     |
     +------+------+--...--+------+------+--...--+------+
</artwork></figure>
<t>Similarly, a host that originally played role &quot;B&quot; sends a suboption
with the second half of the resumption identifier:
</t>

<figure anchor="fig:passive-resume-subopt" align="center" title="Resumption suboption sent when original role was &quot;B&quot;.
The TEP-byte contains a tcpcrypt TEP identifier and v = 1.
The nonce value MUST have length between 0 and 8 bytes.
"><artwork align="center">
byte     0      1             9      10
     +------+------+--...--+------+------+--...--+------+
     | TEP- |   resume[i]{9..17}  |       nonce_b       |
     | byte |                     |                     |
     +------+------+--...--+------+------+--...--+------+
</artwork></figure>
<t>If a passive opener receives a resumption suboption containing an
identifier-half that names a session secret that it has cached and the
subobtion's TEP matches the TEP used in the previous session, it
SHOULD (with exceptions specified below) agree to resume from the
cached session by sending its own resumption suboption, which will
contain the other half of the identifier.  Otherwise, it MUST NOT
agree to resumption.
</t>
<t>If a passive opener does not agree to resumption with a particular
TEP, it MAY either request fresh key exchange by responding with a
non-resumption suboption using the same TEP, or else respond to any
other received TEP suboption.
</t>
<t>If a passive opener receives an ENO suboption with a TEP identifier
and <spanx style="verb">v = 1</spanx>, but the suboption data is less than 9 bytes in length,
it MUST behave as if the same TEP had been sent with <spanx style="verb">v = 0</spanx>.  That
is, the suboption MUST be interpreted as an offer to negotiate fresh
key exchange with that TEP.
</t>
<t>If an active opener sends a resumption suboption with a particular TEP
and the appropriate half of a resumption identifier and then, in the
same TCP handshake, receives a resumption suboption with the same TEP
and an identifier-half that does not match that resumption
identifier, it MUST ignore that suboption.  In the typical case that
this was the only ENO suboption received, this means the host MUST
disable TCP-ENO and tcpcrypt: that is, it MUST NOT send any more ENO
options and MUST NOT encrypt the connection.
</t>
<t>When a host concludes that TCP-ENO negotiation has succeeded for some
TEP that was received in a resumption suboption, it MUST then enable
encryption with that TEP using the cached session secret.  To do this,
it first constructs <spanx style="verb">sn[i]</spanx> as follows:
</t>

<figure align="center"><artwork align="center">
sn[i] = nonce_a | nonce_b
</artwork></figure>
<t>Master keys are then computed from <spanx style="verb">s[i]</spanx> and <spanx style="verb">sn[i]</spanx> as described in
<xref target="key-exchange"/>, and application data encrypted as described in
<xref target="encryption"/>.
</t>
<t>The session ID (<xref target="session-id"/>) is constructed in the same way for
resumed sessions as it is for fresh ones.  In this case the first byte
will always have <spanx style="verb">v = 1</spanx>.  The remainder of the ID is derived from the
cached session secret and the session nonce that was generated during
resumption.
</t>
<t>In the case of simultaneous open where TCP-ENO is able to establish
asymmetric roles, two hosts that simultaneously send SYN segments with
compatible resumption suboptions MAY resume the associated session.
</t>
<t>In a particular SYN segment, a host SHOULD NOT send more than one
resumption suboption (because this consumes TCP option space and is
unlikely to be a useful practice), and MUST NOT send more than one
resumption suboption with the same TEP identifier.  But in addition to
any resumption suboptions, an active opener MAY include non-resumption
suboptions describing other TEPs it supports (in addition to the TEP
in the resumption suboption).
</t>
<t>After using the session secret <spanx style="verb">ss[i]</spanx> to compute <spanx style="verb">mk[0]</spanx>,
implementations SHOULD compute and cache <spanx style="verb">ss[i+1]</spanx> for possible use by
a later session, then erase <spanx style="verb">ss[i]</spanx> from memory.  Hosts MAY retain
<spanx style="verb">ss[i+1]</spanx> until it is used or the memory needs to be reclaimed.  Hosts
SHOULD NOT write any session secrets to non-volatile storage.
</t>
<t>When proposing resumption, the active opener MUST use the lowest value
of <spanx style="verb">i</spanx> that has not already been used (successfully or not) to
negotiate resumption with the same host and for the same original
session secret <spanx style="verb">ss[0]</spanx>.
</t>
<t>A given session secret <spanx style="verb">ss[i]</spanx> MUST NOT be used to secure more than
one TCP connection.  To prevent this, a host MUST NOT resume with a
session secret if it has ever enabled encryption in the past with the
same secret, in either role.  In the event that two hosts
simultaneously send SYN segments to each other that propose resumption
with the same session secret but the two segments are not part of a
simultaneous open, both connections would need to revert to fresh
key-exchange.  To avoid this limitation, implementations MAY choose to
implement session resumption such that all session secrets derived
from a given <spanx style="verb">ss[0]</spanx> are used for either passive or active opens at
the same host, not both.
</t>
<t>If two hosts have previously negotiated a tcpcrypt session, either
host MAY later initiate session resumption regardless of which host
was the active opener or played the &quot;A&quot; role in the previous session.
</t>
<t>However, a given host MUST either encrypt with keys <spanx style="verb">k_ab[j]</spanx> for all
sessions derived from the same original session secret <spanx style="verb">ss[0]</spanx>, or
with keys <spanx style="verb">k_ba[j]</spanx>.  Thus, which keys a host uses to send segments is
not affected by the role it plays in the current connection: it
depends only on whether the host played the &quot;A&quot; or &quot;B&quot; role in the
initial session.
</t>
<t>Implementations that cache session secrets MUST provide a means for
applications to control that caching.  In particular, when an
application requests a new TCP connection, it MUST have a way to
specify two policies for the duration of the connection: 1) that
resumption requests will be ignored, and thus fresh key exchange will
be necessary; and 2) that no session secrets will be cached.  (These
policies can be specified independently or as a unit.)  And for an
established connection, an application MUST have a means to cause any
cache state that was used in or resulted from establishing the
connection to be flushed.  A companion document
<xref target="I-D.ietf-tcpinc-api"/> describes recommended interfaces for this
purpose.
</t>
</section>

<section anchor="encryption" title="Data Encryption and Authentication">
<t>Following key exchange (or its omission via session resumption), all
further communication in a tcpcrypt-enabled connection is carried out
within delimited <spanx style="emph">encryption frames</spanx> that are encrypted and
authenticated using the agreed upon keys.
</t>
<t>This protection is provided via algorithms for Authenticated
Encryption with Associated Data (AEAD).  The permitted algorithms are
listed in <xref target="tab:ae-algs"/> in <xref target="iana"/>.  Additional algorithms can be
specified in the future according to the policy in that section.  One
algorithm is selected during the negotiation described in
<xref target="key-exchange"/>.  The lengths <spanx style="verb">ae_key_len</spanx> and <spanx style="verb">ae_nonce_len</spanx>
associated with each algorithm are found in <xref target="tab:ae-params"/> in
<xref target="aead"/>, together with requirements for which algorithms MUST be
implemented.
</t>
<t>The format of an encryption frame is specified in
<xref target="encryption-frames"/>.  A sending host breaks its stream of application
data into a series of chunks.  Each chunk is placed in the <spanx style="verb">data</spanx>
portion of a &quot;plaintext&quot; value, which is then encrypted to yield a
frame's <spanx style="verb">ciphertext</spanx> field.  Chunks MUST be small enough that the
ciphertext (whose length depends on the AEAD cipher used, and is
generally slightly longer than the plaintext) has length less than
2^16 bytes.
</t>
<t>An &quot;associated data&quot; value (see <xref target="associated-data"/>) is constructed for
the frame.  It contains the frame's <spanx style="verb">control</spanx> field and the length of
the ciphertext.
</t>
<t>A &quot;frame ID&quot; value (see <xref target="frame-id"/>) is also constructed for the
frame, but not explicitly transmitted.  It contains a 64-bit <spanx style="verb">offset</spanx>
field whose integer value is the zero-indexed byte offset of the
beginning of the current encryption frame in the underlying TCP
datastream.  (That is, the offset in the framing stream, not the
plaintext application stream.)  The offset is then left-padded with
zero-valued bytes to form a value of length <spanx style="verb">ae_nonce_len</spanx>.
Because it is strictly necessary for the security of the AEAD
algorithms specified in this document, an implementation MUST NOT ever
transmit distinct frames with the same frame ID value under the same
encryption key.  In particular, a retransmitted TCP segment MUST
contain the same payload bytes for the same TCP sequence numbers, and
a host MUST NOT transmit more than 2^64 bytes in the underlying TCP
datastream (which would cause the <spanx style="verb">offset</spanx> field to wrap) before
re-keying as decribed in <xref target="rekeying"/>.
</t>
<t>With reference to the &quot;AEAD Interface&quot; described in Section 2 of
<xref target="RFC5116"/>, tcpcrypt invokes the AEAD algorithm with values taken
from the traffic key <spanx style="verb">k_ab[j]</spanx> or <spanx style="verb">k_ba[j]</spanx> for some <spanx style="verb">j</spanx>, according to
the host's role as described in <xref target="key-exchange"/>.
</t>
<t>First, the traffic key is divided into two parts:
</t>

<figure align="center"><artwork align="center">
                              ae_key_len + ae_nonce_len - 1
                                               |
byte  0                    ae_key_len          |
      |                           |            |
      v                           v            v
    +----+----+--...--+----+----+----+--...--+----+
    |             K             |        NR       |
    +----+----+--...--+----+----+----+--...--+----+

    \_____________________________________________/
                     traffic key
</artwork></figure>
<t>The first <spanx style="verb">ae_key_len</spanx> bytes of the traffic key provide the AEAD key
<spanx style="verb">K</spanx>, while the remaining <spanx style="verb">ae_nonce_len</spanx> bytes provide a &quot;nonce
randomizer&quot; value <spanx style="verb">NR</spanx>.  The frame ID is then combined via bitwise
exclusive-or with the nonce randomizer to yield <spanx style="verb">N</spanx>, the AEAD nonce
for the frame:
</t>

<figure align="center"><artwork align="center">
N = frame_ID XOR NR
</artwork></figure>
<t>The plaintext value serves as <spanx style="verb">P</spanx>, and the associated data as <spanx style="verb">A</spanx>.
The output of the encryption operation, <spanx style="verb">C</spanx>, is transmitted in the
frame's <spanx style="verb">ciphertext</spanx> field.
</t>
<t>When a frame is received, tcpcrypt reconstructs the associated data
and frame ID values (the former contains only data sent in the clear,
and the latter is implicit in the TCP stream), computes the nonce <spanx style="verb">N</spanx>
as above, and provides these and the ciphertext value to the AEAD
decryption operation.  The output of this operation is either a
plaintext value <spanx style="verb">P</spanx> or the special symbol FAIL.  In the latter case,
the implementation SHOULD abort the connection and raise an error
condition distinct from the end-of-file condition.  But if none of the
TCP segment(s) containing the frame have been acknowledged and
retransmission could potentially result in a valid frame, an
implementation MAY instead drop these segments (and &quot;renege&quot; if they
have been SACKed, according to <xref target="RFC2018"/> Section 8).
</t>
</section>

<section anchor="tcp-header-protection" title="TCP Header Protection">
<t>The <spanx style="verb">ciphertext</spanx> field of the encryption frame contains protected
versions of certain TCP header values.
</t>
<t>When the <spanx style="verb">URGp</spanx> bit is set, the <spanx style="verb">urgent</spanx> value indicates an offset
from the current frame's beginning offset; the sum of these offsets
gives the index of the last byte of urgent data in the application
datastream.
</t>
<t>A sender MUST set the <spanx style="verb">FINp</spanx> bit on the last frame it sends in the
connection (unless it aborts the connection), and MUST NOT set <spanx style="verb">FINp</spanx>
on any other frame.
</t>
<t>TCP sets the FIN flag when a sender has no more data, which with
tcpcrypt means setting FIN on the segment containing the last byte of
the last frame.  However, a receiver MUST report the end-of-file
condition to the connection's local user when and only when it
receives a frame with the <spanx style="verb">FINp</spanx> bit set.  If a host receives a
segment with the TCP FIN flag set but the received datastream
including this segment does not contain a frame with <spanx style="verb">FINp</spanx> set, the
host SHOULD abort the connection and raise an error condition distinct
from the end-of-file condition.  But if there are unacknowledged
segments whose retransmission could potentially result in a valid
frame, the host MAY instead drop the segment with the TCP FIN flag set
(and &quot;renege&quot; if it has been SACKed, according to <xref target="RFC2018"/>
Section 8).
</t>
</section>

<section anchor="rekeying" title="Re-Keying">
<t>Re-keying allows hosts to wipe from memory keys that could decrypt
previously transmitted segments.  It also allows the use of AEAD
ciphers that can securely encrypt only a bounded number of messages
under a given key.
</t>
<t>As described above in <xref target="key-exchange"/>, a master key <spanx style="verb">mk[j]</spanx> is used to
generate two encryption keys <spanx style="verb">k_ab[j]</spanx> and <spanx style="verb">k_ba[j]</spanx>.  We refer to
these as a <spanx style="emph">key-set</spanx> with <spanx style="emph">generation number</spanx> <spanx style="verb">j</spanx>.  Each host
maintains a <spanx style="emph">local generation number</spanx> that determines which key-set it
uses to encrypt outgoing frames, and a <spanx style="emph">remote generation number</spanx>
equal to the highest generation used in frames received from its peer.
Initially, these two generation numbers are set to zero.
</t>
<t>A host MAY increment its local generation number beyond the remote
generation number it has recorded.  We call this action <spanx style="emph">initiating
re-keying</spanx>.
</t>
<t>When a host has incremented its local generation number and uses the
new key-set for the first time to encrypt an outgoing frame, it MUST
set <spanx style="verb">rekey = 1</spanx> for that frame.  It MUST set <spanx style="verb">rekey = 0</spanx> in all other
cases.
</t>
<t>When a host receives a frame with <spanx style="verb">rekey = 1</spanx>, it increments its
record of the remote generation number.  If the remote generation
number is now greater than the local generation number, the receiver
MUST immediately increment its local generation number to match.
Moreover, if the receiver has not yet transmitted a segment with the
FIN flag set, it MUST immediately send a frame (with empty application
data if necessary) with <spanx style="verb">rekey = 1</spanx>.
</t>
<t>A host MUST NOT initiate more than one concurrent re-key operation if
it has no data to send; that is, it MUST NOT initiate re-keying with
an empty encryption frame more than once while its record of the
remote generation number is less than its own.
</t>
<t>Note that when parts of the datastream are retransmitted, TCP requires
that implementations always send the same data bytes for the same TCP
sequence numbers.  Thus, frame data in retransmitted segments MUST be
encrypted with the same key as when it was first transmitted,
regardless of the current local generation number.
</t>
<t>Implementations SHOULD delete older-generation keys from memory once
they have received all frames they will need to decrypt with the old
keys and have encrypted all outgoing frames under the old keys.
</t>
</section>

<section anchor="keep-alive" title="Keep-Alive">
<t>Instead of using TCP Keep-Alives to verify that the remote endpoint is
still responsive, tcpcrypt implementations SHOULD employ the re-keying
mechanism for this purpose, as follows.  When necessary, a host SHOULD
probe the liveness of its peer by initiating re-keying and
transmitting a new frame immediately (with empty application data if
necessary).
</t>
<t>As described in <xref target="rekeying"/>, a host receiving a frame encrypted under
a generation number greater than its own MUST increment its own
generation number and (if it has not already transmitted a segment
with FIN set) immediately transmit a new frame (with zero-length
application data if necessary).
</t>
<t>Implementations MAY use TCP Keep-Alives for purposes that do not
require endpoint authentication, as discussed in <xref target="verified-liveness"/>.
</t>
</section>
</section>

<section anchor="encodings" title="Encodings">
<t>This section provides byte-level encodings for values transmitted or
computed by the protocol.
</t>

<section anchor="key-exchange-messages" title="Key-Exchange Messages">
<t>The <spanx style="verb">Init1</spanx> message has the following encoding:
</t>

<figure align="center"><artwork align="center">
byte   0       1       2       3
   +-------+-------+-------+-------+
   |          INIT1_MAGIC          |
   |                               |
   +-------+-------+-------+-------+

           4        5      6       7
       +-------+-------+-------+-------+
       |          message_len          |
       |              = M              |
       +-------+-------+-------+-------+

           8
       +--------+-----+----+-----+----+---...---+-----+-----+
       |nciphers|sym_      |sym_      |         |sym_       |
       | = K    |cipher[0] |cipher[1] |         |cipher[K-1]|
       +--------+-----+----+-----+----+---...---+-----+-----+

        2*K + 9                     2*K + 9 + N_A_LEN
           |                         |
           v                         v
       +-------+---...---+-------+-------+---...---+-------+
       |           N_A           |          Pub_A          |
       |                         |                         |
       +-------+---...---+-------+-------+---...---+-------+

                           M - 1
       +-------+---...---+-------+
       |         ignored         |
       |                         |
       +-------+---...---+-------+
</artwork></figure>
<t>The constant <spanx style="verb">INIT1_MAGIC</spanx> is defined in <xref target="constants"/>.  The
four-byte field <spanx style="verb">message_len</spanx> gives the length of the entire <spanx style="verb">Init1</spanx>
message, encoded as a big-endian integer.  The <spanx style="verb">nciphers</spanx> field
contains an integer value that specifies the number of two-byte
symmetric-cipher identifiers that follow.  The <spanx style="verb">sym_cipher[i]</spanx>
identifiers indicate cryptographic algorithms in <xref target="tab:ae-algs"/> in
<xref target="iana"/>.  The length <spanx style="verb">N_A_LEN</spanx> and the length of <spanx style="verb">Pub_A</spanx> are both
determined by the negotiated TEP, as described in
<xref target="key-agreement-schemes"/>.
</t>
<t>Implementations of this protocol MUST construct <spanx style="verb">Init1</spanx> such that the
field <spanx style="verb">ignored</spanx> has zero length; that is, they MUST construct the
message such that its end, as determined by <spanx style="verb">message_len</spanx>, coincides
with the end of the field <spanx style="verb">Pub_A</spanx>.  When receiving <spanx style="verb">Init1</spanx>, however,
implementations MUST permit and ignore any bytes following <spanx style="verb">Pub_A</spanx>.
</t>
<t>The <spanx style="verb">Init2</spanx> message has the following encoding:
</t>

<figure align="center"><artwork align="center">
byte   0       1       2       3
   +-------+-------+-------+-------+
   |          INIT2_MAGIC          |
   |                               |
   +-------+-------+-------+-------+


           4        5      6       7       8       9
       +-------+-------+-------+-------+-------+-------+
       |          message_len          |  sym_cipher   |
       |              = M              |               |
       +-------+-------+-------+-------+-------+-------+

           10                      10 + N_B_LEN
           |                         |
           v                         v
       +-------+---...---+-------+-------+---...---+-------+
       |           N_B           |          Pub_B          |
       |                         |                         |
       +-------+---...---+-------+-------+---...---+-------+

                           M - 1
       +-------+---...---+-------+
       |          ignored        |
       |                         |
       +-------+---...---+-------+
</artwork></figure>
<t>The constant <spanx style="verb">INIT2_MAGIC</spanx> is defined in <xref target="constants"/>.  The
four-byte field <spanx style="verb">message_len</spanx> gives the length of the entire <spanx style="verb">Init2</spanx>
message, encoded as a big-endian integer.  The <spanx style="verb">sym_cipher</spanx> value is a
selection from the symmetric-cipher identifiers in the
previously-received <spanx style="verb">Init1</spanx> message.  The length <spanx style="verb">N_B_LEN</spanx> and the
length of <spanx style="verb">Pub_B</spanx> are both determined by the negotiated TEP, as
described in <xref target="key-agreement-schemes"/>.
</t>
<t>Implementations of this protocol MUST construct <spanx style="verb">Init2</spanx> such that the
field <spanx style="verb">ignored</spanx> has zero length; that is, they MUST construct the
message such that its end, as determined by <spanx style="verb">message_len</spanx>, coincides
with the end of the <spanx style="verb">Pub_B</spanx> field.  When receiving <spanx style="verb">Init2</spanx>, however,
implementations MUST permit and ignore any bytes following <spanx style="verb">Pub_B</spanx>.
</t>
</section>

<section anchor="encryption-frames" title="Encryption Frames">
<t>An <spanx style="emph">encryption frame</spanx> comprises a control byte and a length-prefixed
ciphertext value:
</t>

<figure align="center"><artwork align="center">
byte   0       1       2       3               clen+2
   +-------+-------+-------+-------+---...---+-------+
   |control|      clen     |        ciphertext       |
   +-------+-------+-------+-------+---...---+-------+
</artwork></figure>
<t>The field <spanx style="verb">clen</spanx> is an integer in big-endian format and gives the
length of the <spanx style="verb">ciphertext</spanx> field.
</t>
<t>The byte <spanx style="verb">control</spanx> has this structure:
</t>

<figure align="center"><artwork align="center">
bit     7                 1       0
    +-------+---...---+-------+-------+
    |          cres           | rekey |
    +-------+---...---+-------+-------+
</artwork></figure>
<t>The seven-bit field <spanx style="verb">cres</spanx> is reserved; implementations MUST set these
bits to zero when sending, and MUST ignore them when receiving.
</t>
<t>The use of the <spanx style="verb">rekey</spanx> field is described in <xref target="rekeying"/>.
</t>

<section anchor="plaintext" title="Plaintext">
<t>The <spanx style="verb">ciphertext</spanx> field is the result of applying the negotiated
authenticated-encryption algorithm to a &quot;plaintext&quot; value, which has
one of these two formats:
</t>

<figure align="center"><artwork align="center">
byte   0       1               plen-1
   +-------+-------+---...---+-------+
   | flags |           data          |
   +-------+-------+---...---+-------+


byte   0       1       2       3               plen-1
   +-------+-------+-------+-------+---...---+-------+
   | flags |    urgent     |          data           |
   +-------+-------+-------+-------+---...---+-------+
</artwork></figure>
<t>(Note that <spanx style="verb">clen</spanx> in the previous section will generally be greater
than <spanx style="verb">plen</spanx>, as the ciphertext produced by the
authenticated-encryption scheme both encrypts the application data and
provides redundancy with which to verify its integrity.)
</t>
<t>The <spanx style="verb">flags</spanx> byte has this structure:
</t>

<figure align="center"><artwork align="center">
bit    7    6    5    4    3    2    1    0
    +----+----+----+----+----+----+----+----+
    |            fres             |URGp|FINp|
    +----+----+----+----+----+----+----+----+
</artwork></figure>
<t>The six-bit value <spanx style="verb">fres</spanx> is reserved; implementations MUST set these
six bits to zero when sending, and MUST ignore them when receiving.
</t>
<t>When the <spanx style="verb">URGp</spanx> bit is set, it indicates that the <spanx style="verb">urgent</spanx> field is
present, and thus that the plaintext value has the second structure
variant above; otherwise the first variant is used.
</t>
<t>The meaning of <spanx style="verb">urgent</spanx> and of the flag bits is described in
<xref target="tcp-header-protection"/>.
</t>
</section>

<section anchor="associated-data" title="Associated Data">
<t>An encryption frame's &quot;associated data&quot; (which is supplied to the
AEAD algorithm when decrypting the ciphertext and verifying the
frame's integrity) has this format:
</t>

<figure align="center"><artwork align="center">
byte   0       1       2
   +-------+-------+-------+
   |control|     clen      |
   +-------+-------+-------+
</artwork></figure>
<t>It contains the same values as the frame's <spanx style="verb">control</spanx> and <spanx style="verb">clen</spanx>
fields.
</t>
</section>

<section anchor="frame-id" title="Frame ID">
<t>Lastly, a &quot;frame ID&quot; (used to construct the nonce for the AEAD
algorithm) has this format:
</t>

<figure align="center"><artwork align="center">
byte  0            ae_nonce_len - 8    ae_nonce_len - 1
      |                   |             |
      v                   v             v
   +-----+--...--+-----+-----+--...--+-----+
   |  0  |       |  0  |       offset      |
   +-----+--...--+-----+-----+--...--+-----+
</artwork></figure>
<t>The 8-byte <spanx style="verb">offset</spanx> field contains an integer in big-endian format.
Its value is specified in <xref target="encryption"/>.  Zero-valued bytes are
prepended to the <spanx style="verb">offset</spanx> field to form a structure of length
<spanx style="verb">ae_nonce_len</spanx>.
</t>
</section>
</section>

<section anchor="constants" title="Constant Values">
<t>The table below defines values for the constants used in the protocol.
</t>
<texttable anchor="tab:constants" title="Constant values used in the protocol
">
<ttcol align="left">Value</ttcol>
<ttcol align="left">Name</ttcol>

<c>0x01</c><c>CONST_NEXTK</c>
<c>0x02</c><c>CONST_SESSID</c>
<c>0x03</c><c>CONST_REKEY</c>
<c>0x04</c><c>CONST_KEY_A</c>
<c>0x05</c><c>CONST_KEY_B</c>
<c>0x06</c><c>CONST_RESUME</c>
<c>0x15101a0e</c><c>INIT1_MAGIC</c>
<c>0x097105e0</c><c>INIT2_MAGIC</c>
</texttable>
</section>
</section>

<section anchor="key-agreement-schemes" title="Key-Agreement Schemes">
<t>The TEP negotiated via TCP-ENO indicates the use of one of the
key-agreement schemes named in <xref target="tab:tep-ids"/> in <xref target="iana"/>.  For
example, <spanx style="verb">TCPCRYPT_ECDHE_P256</spanx> names the tcpcrypt protocol using
ECDHE-P256 together with the CPRF and length parameters specified
below.
</t>
<t>All the TEPs specified in this document require the use of
HKDF-Expand-SHA256 as the CPRF, and these lengths for nonces and
session secrets:
</t>

<figure align="center"><artwork align="center">
N_A_LEN: 32 bytes
N_B_LEN: 32 bytes
K_LEN:   32 bytes
</artwork></figure>
<t>Future documents assigning additional TEPs for use with tcpcrypt
mmight specify different values for the lengths above.  Note that the
minimum session ID length specified by TCP-ENO, together with the way
tcpcrypt constructs session IDs, implies that <spanx style="verb">K_LEN</spanx> MUST have length
at least 32 bytes.
</t>
<t>Key-agreement schemes ECDHE-P256 and ECDHE-P521 employ the ECSVDP-DH
secret value derivation primitive defined in <xref target="IEEE-1363"/>.  The named
curves are defined in <xref target="NIST-DSS"/>.  When the public-key values <spanx style="verb">Pub_A</spanx>
and <spanx style="verb">Pub_B</spanx> are transmitted as described in <xref target="key-exchange-messages"/>,
they are encoded with the &quot;Elliptic Curve Point to Octet String
Conversion Primitive&quot; described in Section E.2.3 of <xref target="IEEE-1363"/>, and
are prefixed by a two-byte length in big-endian format:
</t>

<figure align="center"><artwork align="center">
byte   0       1       2               L - 1
   +-------+-------+-------+---...---+-------+
   |   pubkey_len  |          pubkey         |
   |      = L      |                         |
   +-------+-------+-------+---...---+-------+
</artwork></figure>
<t>Implementations MUST encode these <spanx style="verb">pubkey</spanx> values in &quot;compressed
format&quot;.  Implementations MUST validate these <spanx style="verb">pubkey</spanx> values
according to the algorithm in <xref target="IEEE-1363"/> Section A.16.10.
</t>
<t>Key-agreement schemes ECDHE-Curve25519 and ECDHE-Curve448 use the
functions X25519 and X448, respectively, to perform the Diffie-Helman
protocol as described in <xref target="RFC7748"/>.  Implementations MUST check
whether the computed Diffie-Hellman shared secret is the all-zero
value and abort if so, as described in Section 6 of <xref target="RFC7748"/>.  When
using these ciphers, public-key values <spanx style="verb">Pub_A</spanx> and <spanx style="verb">Pub_B</spanx> are
transmitted directly with no length prefix: 32 bytes for Curve25519,
and 56 bytes for Curve448.
</t>
<t><xref target="tab:tep-mti"/> below specifies the requirement levels of the four TEPs
specified in this document.  In particular, all implementations of
tcpcrypt MUST support TCPCRYPT_ECDHE_Curve25519.  However, system
administrators MAY configure which TEPs a host will negotiate
independent of these implementation requirements.
</t>
<texttable anchor="tab:tep-mti" title="Requirements for implementation of TEPs
">
<ttcol align="left">Requirement</ttcol>
<ttcol align="left">TEP</ttcol>

<c>REQUIRED</c><c>TCPCRYPT_ECDHE_Curve25519</c>
<c>RECOMMENDED</c><c>TCPCRYPT_ECDHE_Curve448</c>
<c>OPTIONAL</c><c>TCPCRYPT_ECDHE_P256</c>
<c>OPTIONAL</c><c>TCPCRYPT_ECDHE_P521</c>
</texttable>
</section>

<section anchor="aead" title="AEAD Algorithms">
<t>This document uses <spanx style="verb">sym-cipher</spanx> identifiers in the messages <spanx style="verb">Init1</spanx>
and <spanx style="verb">Init2</spanx> (see <xref target="key-exchange"/>) to negotiate the use of AEAD
algorithms; the values of these identifiers are given in
<xref target="tab:ae-algs"/> in <xref target="iana"/>.  The algorithms <spanx style="verb">AEAD_AES_128_GCM</spanx> and
<spanx style="verb">AEAD_AES_256_GCM</spanx> are specified in <xref target="RFC5116"/>.  The algorithm
<spanx style="verb">AEAD_CHACHA20_POLY1305</spanx> is specified in <xref target="RFC7539"/>.
</t>
<t>Implementations MUST support certain AEAD algorithms according to
<xref target="tab:ae-params"/> below.  Note that system administrators MAY configure
which algorithms a host will negotiate independent of these
requirements.
</t>
<t>Lastly, this document uses the lengths <spanx style="verb">ae_key_len</spanx> and <spanx style="verb">ae_nonce_len</spanx>
to specify aspects of encryption and data formats.  These values
depend on the negotiated AEAD algorithm, also according to the table
below.
</t>
<texttable anchor="tab:ae-params" title="Requirement and lengths for each AEAD algorithm
">
<ttcol align="left">AEAD Algorithm</ttcol>
<ttcol align="left">Requirement</ttcol>
<ttcol align="left">ae_key_len</ttcol>
<ttcol align="left">ae_nonce_len</ttcol>

<c>AEAD_AES_128_GCM</c><c>REQUIRED</c><c>16 bytes</c><c>12 bytes</c>
<c>AEAD_AES_256_GCM</c><c>RECOMMENDED</c><c>32 bytes</c><c>12 bytes</c>
<c>AEAD_CHACHA20_POLY1305</c><c>RECOMMENDED</c><c>32 bytes</c><c>12 bytes</c>
</texttable>
</section>

<section anchor="iana" title="IANA Considerations">
<t>For use with TCP-ENO's negotiation mechanism, tcpcrypt's TEP
identifiers will need to be incorporated in IANA's &quot;TCP encryption
protocol identifiers&quot; registry under the &quot;Transmission Control
Protocol (TCP) Parameters&quot; registry, as in <xref target="tab:tep-ids"/> below.  The
various key-agreement schemes used by these tcpcrypt variants are
defined in <xref target="key-agreement-schemes"/>.
</t>
<texttable anchor="tab:tep-ids" title="TEP identifiers for use with tcpcrypt
">
<ttcol align="left">Value</ttcol>
<ttcol align="left">Meaning</ttcol>
<ttcol align="left">Reference</ttcol>

<c>0x21</c><c>TCPCRYPT_ECDHE_P256</c><c>[RFC-TBD]</c>
<c>0x22</c><c>TCPCRYPT_ECDHE_P521</c><c>[RFC-TBD]</c>
<c>0x23</c><c>TCPCRYPT_ECDHE_Curve25519</c><c>[RFC-TBD]</c>
<c>0x24</c><c>TCPCRYPT_ECDHE_Curve448</c><c>[RFC-TBD]</c>
</texttable>
<t>In <xref target="aead"/>, this document defines the use of several AEAD algorithms
for encrypting application data.  To name these algorithms, the
tcpcrypt protocol uses two-byte identifiers in the range 0x0001 to
0xFFFF inclusive, for which IANA is to maintain a new &quot;tcpcrypt AEAD
Algorithm&quot; registry under the &quot;Transmission Control Protocol (TCP)
Parameters&quot; registry.  The initial values for this registry are given
in <xref target="tab:ae-algs"/> below.
Future assignments are to be made upon satisfying either of two
policies defined in <xref target="RFC8126"/>:  &quot;IETF Review&quot; or (for non-IETF
stream specifications) &quot;Expert Review with RFC Required.&quot; IANA will
furthermore provide early allocation <xref target="RFC7120"/> to facilitate testing
before RFCs are finalized.
</t>
<texttable anchor="tab:ae-algs" title="Authenticated-encryption algorithms for use with tcpcrypt
">
<ttcol align="left">Value</ttcol>
<ttcol align="left">AEAD Algorithm</ttcol>
<ttcol align="left">Reference</ttcol>

<c>0x0001</c><c>AEAD_AES_128_GCM</c><c>[RFC-TBD] <xref target="aead"/></c>
<c>0x0002</c><c>AEAD_AES_256_GCM</c><c>[RFC-TBD] <xref target="aead"/></c>
<c>0x0010</c><c>AEAD_CHACHA20_POLY1305</c><c>[RFC-TBD] <xref target="aead"/></c>
</texttable>
</section>

<section anchor="security-considerations" title="Security Considerations">
<t>All of the security considerations of TCP-ENO apply to tcpcrypt.  In
particular, tcpcrypt does not protect against active network attackers
unless applications authenticate the session ID.  If it can be
established that the session IDs computed at each end of the
connection match, then tcpcrypt guarantees that no man-in-the-middle
attacks occurred unless the attacker has broken the underlying
cryptographic primitives (e.g., ECDH).  A proof of this property for
an earlier version of the protocol has been published <xref target="tcpcrypt"/>.
</t>
<t>To ensure middlebox compatibility, tcpcrypt does not protect TCP
headers.  Hence, the protocol is vulnerable to denial-of-service from
off-path attackers just as plain TCP is.  Possible attacks include
desynchronizing the underlying TCP stream, injecting RST or FIN
segments, and forging re-key bits.  These attacks will cause a
tcpcrypt connection to hang or fail with an error, but not in any
circumstance where plain TCP could continue uncorrupted.
Implementations MUST give higher-level software a way to distinguish
such errors from a clean end-of-stream (indicated by an authenticated
<spanx style="verb">FINp</spanx> bit) so that applications can avoid semantic truncation
attacks.
</t>
<t>There is no &quot;key confirmation&quot; step in tcpcrypt.  This is not needed
because tcpcrypt's threat model includes the possibility of a
connection to an adversary.  If key negotiation is compromised and
yields two different keys, failed integrity checks on every subsequent
frame will cause the connection either to hang or to abort. This is
not a new threat as an active attacker can achieve the same results
against a plain TCP connection by injecting RST segments or modifying
sequence and acknowledgement numbers.
</t>
<t>Tcpcrypt uses short-lived public keys to provide forward secrecy.
That is, once an implementation removes these keys from memory, a
compromise of the system will not provide any means to derive the
session secrets for past connections.  All currently-specified key
agreement schemes involve ECDHE-based key agreement, meaning a new
key-pair can be efficiently computed for each connection.  If
implementations reuse these parameters, they MUST limit the lifetime
of the private parameters as far as practical in order to minimize the
number of past connections that are vulnerable.  Of course, placing
private keys in persistent storage introduces severe risks that they
will not be destroyed reliably and in a timely fashion, and SHOULD be
avoided whenever possible.
</t>
<t>Attackers cannot force passive openers to move forward in their
session resumption chain without guessing the content of the
resumption identifier, which will be difficult without key knowledge.
</t>
<t>The cipher-suites specified in this document all use HMAC-SHA256 to
implement the collision-resistant pseudo-random function denoted by
<spanx style="verb">CPRF</spanx>.  A collision-resistant function is one for which, for
sufficiently large L, an attacker cannot find two distinct inputs
(K_1, CONST_1) and (K_2, CONST_2) such that
CPRF(K_1, CONST_1, L) = CPRF(K_2, CONST_2, L).  Collision
resistance is important to assure the uniqueness of session IDs, which
are generated using the CPRF.
</t>
<t>Lastly, many of tcpcrypt's cryptographic functions require random
input, and thus any host implementing tcpcrypt MUST have access to a
cryptographically-secure source of randomness or pseudo-randomness.
<xref target="RFC4086"/> provides recommendations on how to achieve this.
</t>
<t>Most implementations will rely on a device's pseudo-random generator,
seeded from hardware events and a seed carried over from the previous
boot.  Once a pseudo-random generator has been properly seeded, it can
generate effectively arbitrary amounts of pseudo-random data.
However, until a pseudo-random generator has been seeded with
sufficient entropy, not only will tcpcrypt be insecure, it will reveal
information that further weakens the security of the pseudo-random
generator, potentially harming other applications.  As REQUIRED by
TCP-ENO, implementations MUST NOT send ENO options unless they have
access to an adequate source of randomness.
</t>

<section anchor="asymmetric-roles" title="Asymmetric Roles">
<t>Tcpcrypt transforms a shared pseudo-random key (PRK) into
cryptographic traffic keys for each direction.  Doing so requires an
asymmetry in the protocol, as the key derivation function must be
perturbed differently to generate different keys in each direction.
Tcpcrypt includes other asymmetries in the roles of the two hosts,
such as the process of negotiating algorithms (e.g., proposing vs.
selecting cipher suites).
</t>
</section>

<section anchor="verified-liveness" title="Verified Liveness">
<t>Many hosts implement TCP Keep-Alives <xref target="RFC1122"/> as an option for
applications to ensure that the other end of a TCP connection still
exists even when there is no data to be sent.  A TCP Keep-Alive
segment carries a sequence number one prior to the beginning of the
send window, and may carry one byte of &quot;garbage&quot; data.  Such a segment
causes the remote side to send an acknowledgment.
</t>
<t>Unfortunately, tcpcrypt cannot cryptographically verify Keep-Alive
acknowledgments.  Hence, an attacker could prolong the existence of a
session at one host after the other end of the connection no longer
exists.  (Such an attack might prevent a process with sensitive data
from exiting, giving an attacker more time to compromise a host and
extract the sensitive data.)
</t>
<t>To counter this threat, tcpcrypt specifies a way to stimulate the
remote host to send verifiably fresh and authentic data, described in
<xref target="keep-alive"/>.
</t>
<t>The TCP keep-alive mechanism has also been used for its effects on
intermediate nodes in the network, such as preventing flow state from
expiring at NAT boxes or firewalls.  As these purposes do not require
the authentication of endpoints, implementations MAY safely accomplish
them using either the existing TCP keep-alive mechanism or tcpcrypt's
verified keep-alive mechanism.
</t>
</section>

<section anchor="mandatory-keyagreement-schemes" title="Mandatory Key-Agreement Schemes">
<t>This document mandates that tcpcrypt implementations provide support
for at least one key-agreement scheme: ECDHE using Curve25519.  This
choice of a single mandatory algorithm is the result of a difficult
tradeoff between cryptographic diversity and the ease and security of
actual deployment.
</t>
<t>The IETF's appraisal of best current practice on this matter
<xref target="RFC7696"/> says, &quot;Ideally, two independent sets of
mandatory-to-implement algorithms will be specified, allowing for a
primary suite and a secondary suite.  This approach ensures that the
secondary suite is widely deployed if a flaw is found in the primary
one.&quot;
</t>
<t>To meet that ideal, it might appear natural to also mandate ECDHE
using P-256.  However, implementing the Diffie-Hellman function using
NIST elliptic curves (including those specified for use with tcpcrypt,
P-256 and P-521) appears to be very difficult to achieve without
introducing vulnerability to side-channel attacks <xref target="NIST-fail"/>.
Although well-trusted implementations are available as part of large
cryptographic libraries, these can be difficult to extract for use in
operating-system kernels where tcpcrypt is usually best implemented.
In contrast, the characteristics of Curve25519 together with its
recent popularity has led to many safe and efficient implementations,
including some that fit naturally into the kernel environment.
</t>
<t><xref target="RFC7696"/> insists that, &quot;The selected algorithms need to be
resistant to side-channel attacks and also meet the performance,
power, and code size requirements on a wide variety of platforms.&quot; On
this principle, tcpcrypt excludes the NIST curves from the set of
mandatory-to-implement key-agreement algorithms.
</t>
<t>Lastly, this document encourages support for key-agreement with
Curve448, categorizing it as RECOMMENDED.  Curve448 appears likely to
admit safe and efficient implementations.  However, support is not
REQUIRED because existing implementations might not yet be
sufficiently well-proven.
</t>
</section>
</section>

<section anchor="experiments" title="Experiments">
<t>Some experience will be required to determine whether the tcpcrypt
protocol can be deployed safely and successfully across the diverse
environments of the global internet.
</t>
<t>Safety means that TCP implementations that support tcpcrypt are able
to communicate reliably in all the same settings as they would without
tcpcrypt.  As described in <xref target="I-D.ietf-tcpinc-tcpeno"/> Section 9, this
property can be subverted if middleboxes strip ENO options from
non-SYN segments after allowing them in SYN segments; or if the
particular communication patterns of tcpcrypt offend the policies of
middleboxes doing deep-packet inspection.
</t>
<t>Success, in addition to safety, means hosts that implement tcpcrypt
actually enable encryption when connecting to one another.  This
property depends on the network's treatment of the TCP-ENO handshake,
and can be subverted if middleboxes merely strip unknown TCP options
or if they terminate TCP connections and relay data back and forth
unencrypted.
</t>
<t>Ease of implementation will be a further challenge to deployment.
Because tcpcrypt requires encryption operations on frames that may
span TCP segments, kernel implementations are forced to buffer
segments in different ways than are necessary for plain TCP.  More
implementation experience will show how much additional code
complexity is required in various operating systems, and what kind of
performance effects can be expected.
</t>
</section>

<section anchor="acknowledgments" title="Acknowledgments">
<t>We are grateful for contributions, help, discussions, and feedback
from the TCPINC working group and from other IETF reviewers, including
Marcelo Bagnulo, David Black, Bob Briscoe, Jana Iyengar, Stephen Kent,
Tero Kivinen, Mirja Kuhlewind, Yoav Nir, Christoph Paasch, Eric
Rescorla, Kyle Rose, and Dale Worley.
</t>
<t>This work was funded by gifts from Intel (to Brad Karp) and from
Google; by NSF award CNS-0716806 (A Clean-Slate Infrastructure for
Information Flow Control); by DARPA CRASH under
contract #N66001-10-2-4088; and by the Stanford Secure Internet of
Things Project.
</t>
</section>

<section anchor="contributors" title="Contributors">
<t>Dan Boneh and Michael Hamburg were co-authors of the draft that became
this document.
</t>
</section>

</middle>
<back>
<references title="Normative References">
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml3/reference.I-D.ietf-tcpinc-tcpeno.xml"?>
<reference anchor='IEEE-1363'>
<front>
<title>IEEE Standard Specifications for Public-Key Cryptography (IEEE Std 1363-2000)</title>
<author><organization>IEEE</organization></author>
<date year='2000' />
</front>
</reference>
<reference anchor='NIST-DSS'>
<front>
<title>FIPS PUB 186-4: Digital Signature Standard (DSS)</title>
<author><organization>NIST</organization></author>
<date year='2013' />
</front>
</reference>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.0793.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.2018.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.2104.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.2119.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.5116.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.5869.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.7120.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.7539.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.7748.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.8126.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.8174.xml"?>
</references>
<references title="Informative References">
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml3/reference.I-D.ietf-tcpinc-api.xml"?>
<reference anchor='NIST-fail' target="https://cr.yp.to/newelliptic/nistecc-20160106.pdf">
  <front>
    <title>Failures in NIST's ECC standards</title>
    <author initials="D. J." surname="Bernstein" fullname="Daniel J. Bernstein"></author>
    <author initials="T." surname="Lange" fullname="Tanja Lange"></author>
    <date year='2016' />
  </front>
</reference>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.1122.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.4086.xml"?>
<?rfc include="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.7696.xml"?>
<reference anchor='tcpcrypt'>

<front>
<title>The case for ubiquitous transport-level encryption</title>
<author initials='A.' surname='Bittau' fullname='A. Bittau'>
<organization /></author>
<author initials='M.' surname='Hamburg' fullname='M. Hamburg'>
<organization /></author>
<author initials='M.' surname='Handley' fullname='M. Handley'>
<organization /></author>
<author initials='D.' surname='Mazieres' fullname='D. Mazieres'>
<organization /></author>
<author initials='D.' surname='Boneh' fullname='D. Boneh'>
<organization /></author>
<date year='2010' />
</front>

  <seriesInfo name="USENIX Security" value="" />
</reference>
</references>

</back>
</rfc>
