idnits 2.17.1 

draft-lu-fn-transport-05.txt:

  Checking boilerplate required by RFC 5378 and the IETF Trust (see
  https://trustee.ietf.org/license-info):
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt:
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/checklist :
  ----------------------------------------------------------------------------

     No issues found here.

  Miscellaneous warnings:
  ----------------------------------------------------------------------------

  == The copyright year in the IETF Trust and authors Copyright Line does not
     match the current year

  == The document doesn't use any RFC 2119 keywords, yet seems to have RFC
     2119 boilerplate text.

  -- The document date (August 20, 2013) is 3901 days in the past.  Is this
     intentional?

  -- Found something which looks like a code comment -- if you have code
     sections in the document, please surround them with '<CODE BEGINS>' and
     '<CODE ENDS>' lines.


  Checking references for intended status: Proposed Standard
  ----------------------------------------------------------------------------

     (See RFCs 3967 and 4897 for information about using normative references
     to lower-maturity documents in RFCs)

  == Outdated reference: A later version (-04) exists of
     draft-enyedi-rtgwg-mrt-frr-algorithm-03

  ** Downref: Normative reference to an Informational draft:
     draft-enyedi-rtgwg-mrt-frr-algorithm (ref.
     'I-D.enyedi-rtgwg-mrt-frr-algorithm')

  == Outdated reference: A later version (-10) exists of
     draft-ietf-rtgwg-mrt-frr-architecture-03

  ** Obsolete normative reference: RFC 4970 (Obsoleted by RFC 7770)

  ** Obsolete normative reference: RFC 4971 (Obsoleted by RFC 7981)


     Summary: 3 errors (**), 0 flaws (~~), 4 warnings (==), 2 comments (--).

     Run idnits with the --verbose option for more detailed information about
     the items above.

--------------------------------------------------------------------------------

1	Network Working Group                                              W. Lu
2	Internet-Draft                                                   S. Kini
3	Intended status: Standards Track                         A. Csaszar, Ed.
4	Expires: February 21, 2014                                     G. Enyedi
5	                                                             J. Tantsura
6	                                                                Ericsson
7	                                                         August 20, 2013

9	                Transport of Fast Notification Messages
10	                        draft-lu-fn-transport-05

12	Abstract

14	   This document specifies mechanisms for fast and light-weight
15	   dissemination of event notifications.  The purpose is to enable
16	   dataplane dissemination of Fast Notifications (FNs).  The draft
17	   discusses the design goals, the message container and options for
18	   delivering the notifications to all routers within a routing area.

20	Status of This Memo

22	   This Internet-Draft is submitted in full conformance with the
23	   provisions of BCP 78 and BCP 79.

25	   Internet-Drafts are working documents of the Internet Engineering
26	   Task Force (IETF).  Note that other groups may also distribute
27	   working documents as Internet-Drafts.  The list of current Internet-
28	   Drafts is at http://datatracker.ietf.org/drafts/current/.

30	   Internet-Drafts are draft documents valid for a maximum of six months
31	   and may be updated, replaced, or obsoleted by other documents at any
32	   time.  It is inappropriate to use Internet-Drafts as reference
33	   material or to cite them other than as "work in progress."

35	   This Internet-Draft will expire on February 21, 2014.

37	Copyright Notice

39	   Copyright (c) 2013 IETF Trust and the persons identified as the
40	   document authors.  All rights reserved.

42	   This document is subject to BCP 78 and the IETF Trust's Legal
43	   Provisions Relating to IETF Documents
44	   (http://trustee.ietf.org/license-info) in effect on the date of
45	   publication of this document.  Please review these documents
46	   carefully, as they describe your rights and restrictions with respect
47	   to this document.  Code Components extracted from this document must
48	   include Simplified BSD License text as described in Section 4.e of
49	   the Trust Legal Provisions and are provided without warranty as
50	   described in the Simplified BSD License.

52	Table of Contents

54	   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
55	     1.1.  Requirements Language . . . . . . . . . . . . . . . . . .   3
56	     1.2.  Acronyms  . . . . . . . . . . . . . . . . . . . . . . . .   3
57	   2.  Design Goals  . . . . . . . . . . . . . . . . . . . . . . . .   4
58	   3.  Transport Logic - Distribution of the Notifications . . . . .   4
59	     3.1.  Flooding mode . . . . . . . . . . . . . . . . . . . . . .   4
60	       3.1.1.  Duplicate Check with Flooding . . . . . . . . . . . .   5
61	     3.2.  Spanning Tree Mode  . . . . . . . . . . . . . . . . . . .   6
62	   4.  Message Encoding  . . . . . . . . . . . . . . . . . . . . . .   6
63	     4.1.  Seamless Encapsulation  . . . . . . . . . . . . . . . . .   6
64	     4.2.  Dedicated FN Message  . . . . . . . . . . . . . . . . . .   6
65	       4.2.1.  Authentication  . . . . . . . . . . . . . . . . . . .   8
66	         4.2.1.1.  Area-scoped and Link-scoped Authentication  . . .   9
67	         4.2.1.2.  Simple Password Authentication  . . . . . . . . .   9
68	         4.2.1.3.  Cryptographic Authentication for FN . . . . . . .   9
69	   5.  Security Considerations . . . . . . . . . . . . . . . . . . .  12
70	   6.  FN Packet Processing Summary  . . . . . . . . . . . . . . . .  12
71	   7.  IANA Considerations . . . . . . . . . . . . . . . . . . . . .  13
72	   8.  Acknowledgements  . . . . . . . . . . . . . . . . . . . . . .  13
73	   9.  References  . . . . . . . . . . . . . . . . . . . . . . . . .  13
74	     9.1.  Normative References  . . . . . . . . . . . . . . . . . .  13
75	     9.2.  Informative References  . . . . . . . . . . . . . . . . .  14
76	   Appendix A.  Further Options for Transport Logic  . . . . . . . .  14
77	     A.1.  Multicast Tree-based Transport  . . . . . . . . . . . . .  14
78	       A.1.1.  Fault Tolerance of a Single Distribution Tree . . . .  15
79	       A.1.2.  Pair of Redundant Trees . . . . . . . . . . . . . . .  15
80	     A.2.  Unicast . . . . . . . . . . . . . . . . . . . . . . . . .  17
81	       A.2.1.  Method  . . . . . . . . . . . . . . . . . . . . . . .  17
82	       A.2.2.  Sample Operation  . . . . . . . . . . . . . . . . . .  18
83	     A.3.  Gated Multicast through RPF Check . . . . . . . . . . . .  18
84	       A.3.1.  Loop Prevention - RPF Check . . . . . . . . . . . . .  19
85	       A.3.2.  Operation . . . . . . . . . . . . . . . . . . . . . .  19
86	     A.4.  Further Multicast Tree based Transport Options  . . . . .  20
87	       A.4.1.  Source Specific Trees . . . . . . . . . . . . . . . .  20
88	       A.4.2.  A Single Bidirectional Shared Tree  . . . . . . . . .  20
89	     A.5.  Layer 2 Networks  . . . . . . . . . . . . . . . . . . . .  21
90	   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  21

92	1.  Introduction

94	   Enabling fast dissemination of a network event to routers in a
95	   limited area could benefit multiple applications.  Existing use cases
96	   are centered around new approaches for IP Fast ReRoute such as
97	   [I-D.csaszar-ipfrr-fn].  In the future, however, multiple innovative
98	   applications may take advantage of a Fast Notification service.

100	   A hop by hop control plane based flooding mechanism is used widely
101	   today in link state routing protocols such as OSPF and ISIS to
102	   propagate routing information throughout an area.  In this mechanism,
103	   the information is processed in the control plane at each hop before
104	   being forwarded to the next.  The extra processing, scheduling, and
105	   communications overhead causes unnecessary delays in the
106	   dissemination of the information.

108	   This draft proposes a generic fast notification (FN) protocol as a
109	   separate transport layer, which focuses on delivering notifications
110	   quickly in a secure manner.  It can be used by many existing
111	   applications to enhance the performance of those applications, as
112	   well as to enable new services in the network.  This draft does not
113	   specify the payload of the notification.  Each application is
114	   required to create an own spec and define its payload as well as the
115	   preferred transport options separately.

117	1.1.  Requirements Language

119	   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
120	   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
121	   document are to be interpreted as described in RFC 2119 [RFC2119].

123	1.2.  Acronyms

125	   FN    -  Fast Notification

127	   IGP   -  Interior Gateway Protocol

129	   IS-IS -  Intermediate System to Intermediate System

131	   MD5   -  Message Digest 5

133	   OSPF  -  Open Shortest Path First

135	   RPF   -  Reverse Path Forwarding

137	   SHA   -  Secure Hash

139	   SPT   -  Shortest Path Tree

141	   STP   -  Spanning Tree Protocol

143	2.  Design Goals

145	   A light-weight event notification mechanism that could be used to
146	   facilitate quick dissemination of information in a limited area
147	   should have the following properties.

149	   1.  The mechanism should be fast.  It should provide low end to end
150	       propagation delay for the notifications.

152	   2.  The signaling mechanism should offer a high degree of reliability
153	       under network failure conditions.

155	   3.  The mechanism should be secure; that is, it should provide means
156	       to verify the authenticity of the notifications.

158	   4.  The new protocol should not be dependent upon routing protocol
159	       flooding procedures.

161	   5.  The mechanism should have low processing overhead.

163	   These design goals present a trade-off.  Proper balance needs to be
164	   found that offers good authentication and reliability while keeping
165	   processing complexity sufficiently low to enable implementation in
166	   dataplane.  This draft proposes solutions that take the above goals
167	   and trade-offs into considerations.

169	   It is important to note that information contained by the
170	   notification packet may needed to be processed at multiple points in
171	   the router (e.g. multiple linecards may need to react on that
172	   message).  This document describes the way of sending the information
173	   between nodes, but distributing this information inside the node (if
174	   needed) is out of the scope of this document.

176	3.  Transport Logic - Distribution of the Notifications

178	   The distribution of a notification to multiple receivers can be
179	   implemented in many ways.  The main body of this draft describes some
180	   such options, however, other application specific distribution
181	   mechanisms may exist.  Some more details can be found in the
182	   Appendix.

184	3.1.  Flooding mode

186	   In flooding mode, the IGP configures the dataplane cards to replicate
187	   each received FN message to each interface with a neighbour router in
188	   the same area.

190	   This happens by making use of bidirectional multicast forwarding.  In
191	   bidir multicast, all interfaces added to the multicast group can be
192	   incoming and outgoing interfaces as well.  The principle is that a
193	   router replicates the incoming packet to *all* assigned interfaces
194	   except the incoming interface.  If the local router is the source of
195	   the packet to be forwarded, then the packet is replicated to all
196	   interfaces.  That is, the decision about which interfaces should
197	   actually be used as outgoing is determined on demand.

199	   First, the FN service is assigned a multicast group address, let us
200	   call this MC-FN address.  Then, the IGP assigns all interfaces to MC-
201	   FN which lead to neighbouring routers selected by the IGP.

203	   When the FN service is instructed to disseminate a message, it
204	   creates an IP packet (as described below in Section 4) and sets its
205	   IP destination address to the MC-FN multicast address.  This IP
206	   packet is then multicasted to all IGP neighbours in the area.

208	   Recipients of FN multicast-forward the packet according to the rules
209	   of bidirectional multicast, i.e. to all interfaces which the local
210	   IGP pre-configured except the incoming interface.  As this may cause
211	   loops without pre-caution (consider three routers in a triangle),
212	   before forwarding, therefore, the forwarding engine has to perform
213	   duplicate check.

215	3.1.1.  Duplicate Check with Flooding

217	   Duplicate check can be performed in numeruous ways.

219	   Duplicate check can be performed by maintaining a short queue of
220	   previously forwarded FN messages.  Before forwarding, if the FN
221	   message is found in the queue, then it was forwarded beforehand, so
222	   it may be dropped.  Otherwise it should be forwarded and it should be
223	   added to the queue.

225	   Alternatively, the queue may contain a signature of the previously
226	   forwarded FN messages, such as an MD5 or SHA256 signature or any
227	   other hash.  This signature may be carried in the packet, e.g. due to
228	   authentication purposes, such as with the authentication mechanisms
229	   described in Section 4.2.1.

231	   In either of the above queue-based mechanisms, the size of the queue
232	   can be set to a value that corresponds to the maximal number of legal
233	   FN messages generated by a single event.  For instance, if FN is used
234	   to broadcast failure identifiers in case of failures, then it is
235	   likely that the failure of the node with the most neighbours will
236	   trigger the most FN messages (1 from each neighbour).

238	   It is also possible to use application-dependent duplicate check: the
239	   state machine of the FN-application can be left responsible to decide
240	   whether the information carried in the packet contains new
241	   information or it is a duplicate.  This is only useful in the case if
242	   the application can perform the duplicate check more efficiently than
243	   the above generic mechanisms.  Presently, [I-D.csaszar-ipfrr-fn]
244	   specifies an application-specific duplicate check procedure.

246	3.2.  Spanning Tree Mode

248	   If reliable forwarding of notification packet is not always a strict
249	   requirement, spanning trees may be used for forwarding.  In the
250	   simplest case, the nodes can build up a single spannig tree, and
251	   notification packets can be forwarded along this tree with
252	   bidirectional forwarding.  This solution has the advantage that no
253	   duplicate check is needed.  The tree may be built up with
254	   bidirectional PIM [RFC5015].

256	   Another possibility is to use Maximally Redundant Trees
257	   [I-D.ietf-rtgwg-mrt-frr-architecture], a pair of spanning trees which
258	   give some failure tolerance.  Since the common root of these trees
259	   can always be reached in the case of a single failure, and since the
260	   root can reach all the nodes, notification packets sent on both trees
261	   can tolerate any single failure, if the root propagates the packets
262	   it received on both trees.  Further details about spanning trees are
263	   described in the Appendix.

265	4.  Message Encoding

267	4.1.  Seamless Encapsulation

269	   An application may define its own message for FN to distribute
270	   quickly.  In this case, only the special destination address (e.g.
271	   MC-FN) shows that the message was sent using the FN service.

273	   In this case, the entire payload of the IP packet is determined by
274	   the application including sequence numbering and authentication.  The
275	   IP packet's protocol field can also be set by the application.

277	4.2.  Dedicated FN Message

279	   An alternative option is for the FN messages to be distributed in UDP
280	   datagrams with well-known port values in the UDP header that need to
281	   be allocated by IANA.

283	   The FN packet format inside a UDP datagram is the following:

285	       0                   1                   2                   3
286	       0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
287	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
288	      |                                                               |
289	      +-                                                             -+
290	      |                           IP Header                           |
291	      +-                +-------------+                              -+
292	      |                 | Protocol=UDP|                               |
293	      +-                +-------------+                              -+
294	      |                                                               |
295	      +-                                                             -+
296	      |                                                               |
297	      +---------------------------------------------------------------+
298	      |     UDP Source Port = FN      |   UDP Destination Port = FN   |
299	      +---------------------------------------------------------------+
300	      |                       UDP Header cont'd                       |
301	      +---------------------------------------------------------------+
302	      |                           FN Header                           |
303	      +---------------------------------------------------------------+
304	      |                              ...                              |
305	      .                                                               .
306	      .                          FN Payload                           .
307	      .                                                               .
308	      |                              ...                              |
309	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
310	      |                              ...                              |
311	      .                                                               .
312	      .                     Authentication (optional)                 .
313	      .                                                               .
314	      |                              ...                              |
315	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

317	               Figure 1: FN packet format as a UDP datagram

319	   The encoding of the FN Header is as follows:

321	       0                   1                   2                   3
322	       0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
323	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
324	      |           FN Length           |  FN App Type  | AuType|unused |
325	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

327	                       Figure 2: FN Header encoding

329	   FN Length (16 bits)
330	      The length of the FN message in bytes including the FN Header and
331	      the FN Payload.  The authentication data optionally appended to
332	      the FN packet is not considered part of the FN message: the
333	      authentication data is not included in the FN Length field,
334	      although it is included in the length field of the packet's IP
335	      header.

337	   FN App Type (8 bits)
338	      Identifies the application which should be the receiver of the
339	      notification.  A value for each application needs to be assigned
340	      by IANA.

342	   AuType
343	      Identifies the authentication procedure to be used for the packet.
344	      Authentication options are discussed in Section 4.2.1 of the
345	      specification.

347	4.2.1.  Authentication

349	   Fast Notification intends to provide a trustable service option, so
350	   that receivers of FN packets are able to verify that the packet is
351	   sent by an authentic source.  Simple password authentication and hash
352	   based authentication methods (with MD5 or SHA256) are described in
353	   the following subsections.

355	   If AuType is set to 0x0, then the FN packet is not carrying an
356	   Authentication field at the end of the packet.  Note that even in
357	   this case the FN application in the payload may still use its own
358	   authentication mechanism.

360	   If AuType is non null, an Authentication field must be appended after
361	   the FN message.  The encoding of this field is as described below.

363	       0                   1                   2                   3
364	       0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
365	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
366	      |   AuLength    |        ... Authentication Data ...            |
367	      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
368	      |                              ...                              |

370	               Figure 3: Authentication field in FN packets

372	   AuLength
373	      Describes the length of the entire Authentication field in bytes.

375	   The authentication type may be manually pre-configured or may be
376	   selected automatically.  For automatic selection, the nodes have to
377	   know what type of authentication is applicable for the rest of the
378	   nodes.  This may achieved by extending the IGP to advertise the FN
379	   authentication capabilities.  The most straightforward way to achieve
380	   this is to extend the Router Capability TLVs available both in OSPF
381	   [RFC4970] and in IS-IS [RFC4971].

383	4.2.1.1.  Area-scoped and Link-scoped Authentication

385	   Since FN is a solution to disseminate an event notification from one
386	   source to a whole area of nodes, the simplest approach would be to
387	   use per-area authentication, e.g., a common password, a common pre-
388	   shared key among all nodes in the area as described in the following
389	   sub-sections, or digital signatures.

391	   Carriers may, however, prefer per-link authentication.  In order not
392	   to lose the speed (simple per-hop processing, fast forwarding
393	   property) of FN, link-scoped authentication is suggested only if the
394	   forwarding plane supports it, i.e. if there is hardware support to
395	   verify and re-generate authentication hop-by-hop.  In such cases, the
396	   operator may need to configure a common pre-shared key only on
397	   routers connected by the same link.  It is even possible that there
398	   is no authentication on some links considered safe.

400	4.2.1.2.  Simple Password Authentication

402	   Simple password authentication guards against routers inadvertently
403	   joining the routing area; each router must first be configured with a
404	   password before it can participate in Fast Notification.

406	   The password is stored in the Authentication Data field.  AuLength is
407	   set to the length of the password in bytes plus 1.  Two AuType values
408	   for simple password authentication need to be allocated by IANA: one
409	   for area-scope and another for link-scoped.

411	   With per-link authentication mode, the Authentication field must be
412	   stripped and regenerated hop-by-hop.

414	   Simple password authentication, however, can be easily compromised as
415	   anyone with physical access to the network can read the password.

417	4.2.1.3.  Cryptographic Authentication for FN

419	   Using this authentication type, a secret key is used to generate/
420	   verify a "message digest" that is appended to the end of the FN
421	   packet.  The message digest is a one-way function of the FN packet
422	   and the secret key.  This authentication mechanism resembles the
423	   cryptographic authentication mechanism of [RFC2328].

425	4.2.1.3.1.  MD5
426	   The packet signature is created by an MD5 hash performed on an object
427	   which is the concatenation of the FN message, including the FN
428	   header, and the pre-shared secret key.  The resulting 16 byte MD5
429	   message digest is appended to the FN message into the Authentication
430	   field as shown below.

432	   The AuType in the FN header is set to indicate cryptographic
433	   authentication, the specific value is to be assigned by IANA both for
434	   area-scoped and for link-scoped versions.

436	      0                   1                   2                   3
437	      0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
438	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
439	     |   AuLength    |    Key ID     |             Unused            |
440	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
441	     |                      Message Digest (bytes 1-4)               |
442	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
443	     |                      Message Digest (bytes 5-8)               |
444	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
445	     |                     Message Digest (bytes 9-12)               |
446	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
447	     |                     Message Digest (bytes 13-16)              |
448	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

450	    Figure 4: Authentication field in FN packets with MD5 cryptographic
451	                              authentication.

453	   AuLength
454	      AuLength is set to 20 bytes.

456	   Key ID
457	      This field identifies the algorithm and secret key used to create
458	      the message digest appended to the FN packet.  This field allows
459	      that multiple pre-shared keys may exist in parallel.

461	   Message Digest
462	      The 16 byte long MD5 hash performed on an object which is the
463	      concatenation of the FN message, including the FN header, and the
464	      pre-shared secret key identified by Key ID.

466	   When receiving an FN message, if the FN header indicates MD5
467	   authentication, then the last 20 bytes of the FN message are set
468	   aside.  The recipient forwarding plane element calculates a new MD5
469	   digest of the remainder of the FN message to which it appends its own
470	   known secret key identified by Key ID.  The calculated and received
471	   digests are compared.  In case of mismatch, the FN message is
472	   discarded.

474	   In per-link authentication mode, the Authentication field must be
475	   regenerated hop-by-hop using the key of the outgoing link.

477	4.2.1.3.2.  SHA256

479	   Similarly to how MD5 authentication works, it is possible to use
480	   Secure Hash 256 hash.  Currently this is a more secure hash function
481	   than MD5.  The Authentication field would look like this:

483	      0                   1                   2                   3
484	      0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
485	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
486	     |   AuLength    |    Key ID     |             Unused            |
487	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
488	     |                      Message Digest (bytes 1-4)               |
489	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
490	     |                      Message Digest (bytes 5-8)               |
491	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
492	     |                              . . .                            |
493	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
494	     |                     Message Digest (bytes 25-28)              |
495	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
496	     |                     Message Digest (bytes 29-32)              |
497	     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

499	    Figure 5: Authentication field in FN packets with MD5 cryptographic
500	                              authentication.

502	   AuLength
503	      AuLength is set to 36 bytes.

505	   Key ID
506	      This field identifies the algorithm and secret key used to create
507	      the message digest appended to the FN packet.  This field allows
508	      that multiple pre-shared keys may exist in parallel.

510	   Message Digest
511	      The 32 bytes long SHA256 value calculated on an object which is
512	      the concatenation of the FN message, including the FN header, and
513	      the pre-shared secret key identified by Key ID.

515	   When receiving an FN message, if the FN header indicates SHA256
516	   authentication, then the last 68 bytes of the FN message are set
517	   aside.  The recipient forwarding plane element calculates a new
518	   SHA256 digest of the remainder of the FN message to which it appends
519	   its own known secret key identified by Key ID.  The calculated and
520	   received digests are compared.  In case of mismatch, the FN message
521	   is discarded.

523	   In per-link authentication mode, the Authentication field must be
524	   regenerated hop-by-hop using the key of the outgoing link.

526	4.2.1.3.3.  Digital Signatures

528	   A router may choose to use public key cryptography to digitally sign
529	   the notification to provide certification of authenticity.  This
530	   mechanism can avoid shared secret that is required for other
531	   authentication mechanisms described in this document.  This
532	   authentication mechanism resembles the authentication mechanism of
533	   OSPF with digital signatures as defined in [RFC2154].

535	5.  Security Considerations

537	   This draft has described basic optional procedures for
538	   authentication.  The mechanism, however, does not protect against
539	   replay attacks.

541	   If an application of FN require protection against replay attacks,
542	   then these applications should provide their own specific sequence
543	   numbering within the FN payload.  Recipient applications should
544	   accept FN messages only if the included sequence number is valid.

546	   Since the message digest of cryptographic authentication also covers
547	   the payload, even if an attacker knew how to construct the new
548	   sequence number, it would not be able to generate a correct message
549	   digest without the pre shared key.  This way, a sequence number in
550	   the payload combined with FN's cryptographic authentication offers
551	   sufficient protection against replay attacks.

553	6.  FN Packet Processing Summary

555	   When receiving an FN packet, a node has to perform the following
556	   steps.

558	   It has to identify that the packet is an FN packet.  This can be done
559	   utilising the destination IP address (MC-FN) or by inspecting the UDP
560	   port field.

562	   If the flooding like transport logic described in Section 3 is used
563	   the node has to perform duplicate check following the teachings in
564	   Section 3.1.1.

566	   If AuType is non-null, the node has to perform authentication check
567	   as discussed in Section 4.2.1.

569	   To protect against replay attacks, the node shall perform
570	   verification of the sequence number provided by the application.

572	   Punt and forward.  The notification may need to be multicasted but it
573	   also needs to be punted to the local application on the linecard to
574	   start processing.

576	   Authentication check, sequence number check and punting/forwarding
577	   may commence in any order deemed necessary by the operator.  If the
578	   operator prefers highest level of security, then both checks should
579	   be performed before forwarding.  If, however, the operator prefers
580	   per-hop performance but still wants to ensure that malice packets
581	   cannot harm the network, then authentication and sequence number
582	   checks may also happen after punting the packet, i.e. before
583	   processing the information contained inside the FN payload.  In this
584	   case, malicious packets may get propagated to every node but they
585	   still do not cause any change in the configuration.

587	7.  IANA Considerations

589	   A UDP port value needs to be assigned by IANA for FN.  IANA also
590	   needs to maintain values for FN App Type as applications are being
591	   proposed.

593	   Multicast addresses used for the distribution trees are either
594	   allocated by IANA or they can be a configuration parameter within the
595	   local domain.

597	8.  Acknowledgements

599	   The authors owe thanks to Acee Lindem, Joel Halpern and Jakob Heitz
600	   for their review and comments.  Also thanks to Alia Atlas for
601	   constructive feedback.

603	9.  References

605	9.1.  Normative References

607	   [I-D.enyedi-rtgwg-mrt-frr-algorithm]
608	              Envedi, G., Csaszar, A., Atlas, A., cbowers@juniper.net,
609	              c., and A. Gopalan, "Algorithms for computing Maximally
610	              Redundant Trees for IP/LDP Fast- Reroute", draft-enyedi-
611	              rtgwg-mrt-frr-algorithm-03 (work in progress), July 2013.

613	   [I-D.ietf-rtgwg-mrt-frr-architecture]
614	              Atlas, A., Kebler, R., Envedi, G., Csaszar, A., Tantsura,
615	              J., Konstantynowicz, M., and R. White, "An Architecture
616	              for IP/LDP Fast-Reroute Using Maximally Redundant Trees",
617	              draft-ietf-rtgwg-mrt-frr-architecture-03 (work in
618	              progress), July 2013.

620	   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
621	              Requirement Levels", BCP 14, RFC 2119, March 1997.

623	   [RFC2328]  Moy, J., "OSPF Version 2", STD 54, RFC 2328, April 1998.

625	   [RFC4970]  Lindem, A., Shen, N., Vasseur, JP., Aggarwal, R., and S.
626	              Shaffer, "Extensions to OSPF for Advertising Optional
627	              Router Capabilities", RFC 4970, July 2007.

629	   [RFC4971]  Vasseur, JP., Shen, N., and R. Aggarwal, "Intermediate
630	              System to Intermediate System (IS-IS) Extensions for
631	              Advertising Router Information", RFC 4971, July 2007.

633	   [RFC5015]  Handley, M., Kouvelas, I., Speakman, T., and L. Vicisano,
634	              "Bidirectional Protocol Independent Multicast (BIDIR-
635	              PIM)", RFC 5015, October 2007.

637	9.2.  Informative References

639	   [Eny2009]  Enyedi, G., Retvari, G., and A. Csaszar, "On Finding
640	              Maximally Redundant Trees in Strictly Linear Time, IEEE
641	              Symposium on Computers and Communications (ISCC)", 2009.

643	   [I-D.csaszar-ipfrr-fn]
644	              Csaszar, A., Envedi, G., Tantsura, J., Kini, S., Sucec,
645	              J., and S. Das, "IP Fast Re-Route with Fast Notification",
646	              draft-csaszar-ipfrr-fn-03 (work in progress), June 2012.

648	   [RFC2154]  Murphy, S., Badger, M., and B. Wellington, "OSPF with
649	              Digital Signatures", RFC 2154, June 1997.

651	Appendix A.  Further Options for Transport Logic

653	   The options described in this appendix represent alternative
654	   solutions to the flooding based approach described in
655	   Section Section 3.

657	   It is left for WG discussion and further evaluation to decide whether
658	   any of these options should potentially be preferred instead of
659	   redundant trees.

661	A.1.  Multicast Tree-based Transport

663	   One way of transporting an identical piece of information to several
664	   receivers at the same time is to use multicast distribution trees.  A
665	   tree based transport solution is beneficial since multicast support
666	   is already implemented in all forwarding entities, so it is possible
667	   to use existing implementations.

669	   With multicast or tree based transport, the Fast Notification (FN)
670	   packet can be recognized by a pre-configured or well known
671	   destination IP address, denoted by MC-FN in the following, which is
672	   the group address of the FN service.

674	   If the FN service is triggered to send out a notification, the
675	   notification will be encapsulated in a new IP packet, where the
676	   destination IP address is set to MC-FN.

678	A.1.1.  Fault Tolerance of a Single Distribution Tree

680	   Several solutions described in this draft use a single tree to
681	   disseminate a notification from one given source.

683	   The single tree solution is simple, however it is not redundant: a
684	   single failure may partition the tree, which will prevent
685	   notifications from reaching some nodes in the area.

687	   Different applications may have different needs for reliability.  For
688	   example, when we use fast notification to disseminate network failure
689	   information, all nodes surrounding the failure can detect and
690	   originate the failure notifications independently.  Any one of these
691	   notifications (or a subset of them) may be sufficient for the
692	   application to make the right decision.  This draft provides several
693	   different transport options from which an applications can choose.

695	A.1.2.  Pair of Redundant Trees

697	   If an FN application needs the exact same data to be distributed in
698	   the case of any single node or any single link failure, the FN
699	   service could opt to run in "redundant tree mode".

701	   A pair of "maximally redundant trees"
702	   [I-D.enyedi-rtgwg-mrt-frr-algorithm] ensures that at each single node
703	   or link failure each node still reaches the common root of the trees
704	   through at least one of the trees.  A redundant tree pair is a known
705	   prior-art graph-theoretical object that is possible to find on any
706	   2-node connected network.  Even better, it is even possible to find
707	   maximally redundant trees in networks where the 2-node connected
708	   criterion does not "fully" hold (e.g. there are a few cut vertices)
709	   [Eny2009], [I-D.ietf-rtgwg-mrt-frr-architecture].

711	   Note that the referenced algorithm(s) build a pair of trees
712	   considering a specific root.  The root can be selected in different
713	   ways, the only thing that is important that each node makes the same
714	   selection, consistently.  For instance, the node with the highest or
715	   lowest router ID can be used.

717	         #1 tree                              #2 tree
718	         +---+       +---+                    +---+       +---+
719	         | B |=======|   |                    | B |=======|   |
720	         +---+       +---+                    +---+       +---+
721	         //               \\                 //                 \
722	         //                 \\               //                   \
723	    +---+                     +---+      +---+                     +---+
724	    | A |---------------------| R |      | A |=====================| R |
725	    +---+                     +---+      +---+                     +---+
726	         \                  //               \\                   /
727	         \                //                 \\                 /
728	         +---+       +---+                    +---+       +---+
729	         |   |=======|   |                    |   |=======|   |
730	         +---+       +---+                    +---+       +---+

732	     Figure 6: Example: a pair of redundant trees (double lines) of a
733	                               common root R

735	   There is one special constraint in building the redundant trees.  A
736	   (maximally) redundant tree pair is needed, where in one of the trees
737	   the root has only one child in order to protect against the failure
738	   of the root itself.  Algorithms presented in [Eny2009],
739	   [I-D.enyedi-rtgwg-mrt-frr-algorithm] produce such trees.

741	   In redundant-tree mode, each node multicasts the requested
742	   notification on both trees, if it is possible, but at least along one
743	   of the trees.  Redundant trees require two multicast group addresses.
744	   MC-FN identifies one of the trees, and MC-FN-2 identifies the other
745	   tree.

747	   Each node multicast forwards the received notification packet (on the
748	   same tree).  The root node performs as every other node but in
749	   addition it also multicast the notification on the other tree!  I.e.
750	   it forwards a replica of the incoming notification in which it
751	   replaces the destination address identifying the other multicast
752	   distribution tree.

754	   When the network remains connected and the root remains operable
755	   after a single failure, the root will be reached on at least one of
756	   the trees.  Thus, since the root can reach every node along at least
757	   one of the trees, all the notifications will reach each node.
758	   However, when the root or the link to the root fails, that tree, in
759	   which the root has only one child, remains connected (the root is a
760	   leaf there), thus, all the nodes can be reached along that tree.

762	   For example, let us consider that in Figure 6 FN is used to
763	   disseminate failure information.  If link A-B fails, the
764	   notifications originating from node B (e.g. reporting that the
765	   connectivity from B to A is lost) will reach R on tree #1.
766	   Notifications originating from A (e.g. reporting that the
767	   connectivity from A to B is lost) will reach R on tree #2.  From R,
768	   each node is reachable through one of the trees, so each node will be
769	   notified about both events.

771	A.2.  Unicast

773	   This method addresses the need in a unique way.  It has the following
774	   properties:

776	      Plain simple, without the need of any forwarding plane change or
777	      cooperation;

779	      Short turnaround time (i.e. ready for next hit);

781	      100% link break coverage (may not work in certain node failure
782	      cases);

784	      Little change to OSPF (need encapsulation for IS-IS).

786	A.2.1.  Method

788	   The method is simple in design, easy to implement and quick to
789	   deploy.  It requires no topology changes or specific configurations.
790	   It adds little overhead to the overall system.

792	   The method sends the event message to every router in the area in an
793	   IP packet.  This appears burdensome to the sending router which has
794	   to duplicate the packet sending effort many times.  Practical
795	   experience has shown, however, that the amount of effort is not a big
796	   concern in reasonable sized networks.

798	   Normal flooding (regular or fast) process requires a router to
799	   duplicate the packet to all flooding eligible interfaces.  All
800	   routers have to be fast-flooding-aware.  This implies new code to
801	   every router in control plane and/or forwarding plane.

803	   The method uses a different approach.  It takes advantage of the
804	   given routing/forwarding table in each router in the IP domain.  The
805	   originating router of the flooding information simply sends multiple
806	   copies of the packet to each and every router in the domain.  These
807	   packets are forwarded to the destination routers at forwarding plane
808	   speed,

810	   just like the way the regular IP data traffic is handled.  No special
811	   handling in any other routers is needed.

813	   This small delay on the sender can be minimized by pre-downloading
814	   the link-broken message packets to the forwarding plane.  Since the
815	   forwarding plane already has the list of all routers which are part
816	   of the IGP routing table, the forwarding plane can dispatch the
817	   packet directly.

819	   In essence, the flooding in this method is tree based, just like a
820	   multicast tree.  The key is that no special tree is generated for
821	   this purpose; the normal routing table which is an SPF tree (SPT)
822	   plays a role of the flooding tree.  This logic guarantees that the
823	   flooding follows the shortest path and no flooding loop is created.

825	A.2.2.  Sample Operation

827	   Figure 7 depicts a scenario where router A wants to flood its message
828	   to all other routers in the domain using the unicast flooding method.

830	   Instead of sending one packet to each of its neighbor, and letting
831	   the neighbor flood the packet further, router A directly send the
832	   same packet to each router in the domain, one at a time.  In this
833	   sample network, router A sends out 5 packets.

835	                                  A---B---C---D
836	                                  \
837	                                  --E---F

839	                                  1. Packet(A->B);
840	                                  2. Packet(A->C);
841	                                  3. Packet(A->D);
842	                                  4. Packet(A->E);
843	                                  5. Packet(A->F).

845	                    Figure 7: Multiple Unicast Packets

847	   The unicast flooding procedure is solely controlled by the sending
848	   router.  No action is needed from other routers other than their
849	   normal forwarding functionalities.  This method is extremely simple
850	   and useful for quick prototyping and deployment.

852	A.3.  Gated Multicast through RPF Check

854	   This method fulfills the purpose with the following characters:

856	   1.  No need to build the multicast tree.  It is the same as the SPT
857	       computed by the IGP routing process;

859	   2.  Flooding loops are prevented by RPF Check.

861	   The method has all the benefits of multicast flooding.  It, however,
862	   does not require running multicast protocol to setup the multicast
863	   tree.  The unicast shortest path tree is used as a multicast tree.

865	A.3.1.  Loop Prevention - RPF Check

867	   In this mechanism, the distribution tree is not explicitly built.
868	   Rather, each node will first do a Reverse Path Forwarding (RPF) check
869	   before it floods the notification to other links.

871	   A special multicast address is defined and is subject to IANA
872	   approval.  This address is used to qualify the notification packet
873	   for fast flooding.  When a notification packet arrives, the receiving
874	   node will perform an IP unicast routing table lookup for the
875	   originator IP address of the notification and find the outgoing
876	   interface.  Only when the arriving interface of the notification is
877	   the same as the outgoing interface leading towards the originator IP
878	   address, will the notification be flooded to other interfaces.

880	   IP Multicast forwarding with RPF check is available on most of the
881	   routing/switching platforms.  To support flooding with RPF check, a
882	   special IP multicast group must be used.  A bi-directional IP
883	   multicast forwarding entry is created that consists of all interfaces
884	   within the flooding scope, typically an IGP area.

886	A.3.2.  Operation

888	   The Gated flooding operation is illustrated in Figure 8.

890	              All Routers, IGP Process:
891	              if (SPT ready) {
892	               duplicate the SPT as Bidir_Multicast_tree;
893	               download the multicast_tree to forwarding plane;
894	              }
895	              add FNF_multicast_group_addr;

897	                  Sender of the FNF notification:
898	              if (breakage detected) {
899	               pack the notification in a packet;
900	               send the packet to the FNF_multicast_group_addr;
901	              }

903	              Receiver of the FNF notification:
904	               if (notification received) {
905	               if (RPC_interface == incoming_interface) {
906	                multicast the notification to all other interfaces;
907	               }
908	               forward the notification to IGP for processing;

910	              }

912	                    Figure 8: Gated flooding operation

914	   Figure 9 shows a sample operation on a four-router mesh network.  The
915	   left figure is the topology.  The right figure is the shortest path
916	   tree rooted at A.

918	   Router A initiates the flooding.  But the downstream routers B, C,
919	   and D will drop all messages except the ones that come from their
920	   shortest path parent node.  For example, A's message to C via B is
921	   dropped by C, because C knows that its reverse path forwarding (RPF)
922	   nexthop is A.

924	                                    A       A
925	                                    /|\     / \
926	                                    B---C   B   C
927	                                    \|/     \
928	                                    D       D

930	              Figure 9: Loop Prevention through the RPF check

932	A.4.  Further Multicast Tree based Transport Options

934	A.4.1.  Source Specific Trees

936	   One implementation option is to rely on source specific multicast.
937	   This means that even though there is only a single multicast group
938	   address (MC-FN) allocated to the FN service, the FIB of each router
939	   is configured with forwarding information for as many trees as many
940	   FN sources (nodes) there are in the routing area, i.e. to each (S_i
941	   ,MC-FN) pair.

943	A.4.2.  A Single Bidirectional Shared Tree

945	   In the previous solution each source specific tree is a spanning
946	   tree.  It is possible to reduce the complexity of managing and
947	   configuring n spanning trees in the area by using bidirectional
948	   shared trees.  By building a bidirectional shared tree, all nodes on
949	   the tree can send and receive traffic using that single tree.  Each
950	   sent packet from any source is multicasted on the tree to all other
951	   receivers.

953	   The tree must be consistently computed at all routers.  For this, the
954	   following rules may be given:

956	   The tree can be computed as a shortest path tree rooted at e.g. the
957	   highest router-id.  When multiple paths are available, the
958	   neighbouring node in the graph e.g. with highest router-id can be
959	   picked.  When multiple paths are available through multiple
960	   interfaces to a neighbouring node, e.g. a numbered interface may be
961	   preferred over an unnumbered interface.  A higher IP address may be
962	   preferred among numbered interfaces and a higher ifIndex may be
963	   preferred among unnumbered interfaces.

965	   Note, however, that the important point is that the rules are
966	   consistent among nodes.  That is, a router may pick the lower router
967	   IDs if it is ensured that ALL routers will do the same to ensure
968	   consistency.

970	   Multicast forwarding state is installed using such a tree as a bi-
971	   directional tree.  Each router on the tree can send packets to all
972	   other routers on that tree.

974	   Note that the multicast spanning tree can be built using [RFC5015] so
975	   that each router within an area subscribes to the same multicast
976	   group address.  Using BIDIR-PIM in such a way will eventually build a
977	   multicast spanning tree among all routers within the area.  (BIDIR-
978	   PIM is normally used to build a shared, bidirectional multicast tree
979	   among multiple sources and receivers.)

981	A.5.  Layer 2 Networks

983	   Layer 2 (e.g. Ethernet) networks offer further options for
984	   distributing the notification (e.g. using spanning trees offered by
985	   STP).  Definition of these is being considered and will be included
986	   in a future revision of this draft.

988	Authors' Addresses

990	   Wenhu Lu
991	   Ericsson
992	   300 Holger Way
993	   San Jose, California  95134
994	   USA

996	   Email: Wenhu.Lu@ericsson.com

998	   Sriganesh Kini
999	   Ericsson
1000	   300 Holger Way
1001	   San Jose, California  95134
1002	   USA

1004	   Email: Sriganesh.Kini@ericsson.com
1005	   Andras Csaszar (editor)
1006	   Ericsson
1007	   Irinyi J utca 4-10
1008	   Budapest  1117
1009	   Hungary

1011	   Email: Andras.Csaszar@ericsson.com

1013	   Gabor Sandor Enyedi
1014	   Ericsson
1015	   Irinyi J utca 4-10
1016	   Budapest  1117
1017	   Hungary

1019	   Email: Gabor.Sandor.Enyedi@ericsson.com

1021	   Jeff Tantsura
1022	   Ericsson
1023	   300 Holger Way
1024	   San Jose, California  95134
1025	   USA

1027	   Email: Jeff.Tantsura@ericsson.com