Network Working Group B. Decraene Internet-Draft Orange Intended status: Standards TrackMay 4, 2015S. Litkowski Expires:November 5,December 21, 2015 Orange Business Service H. Gredler Juniper Networks, Inc. A. Lindem Cisco Systems P. Francois IMDEA Networks Institute June 19, 2015Back-offSPF Back-off algorithm for link stateIGP draft-ietf-rtgwg-backoff-algo-00IGPs draft-ietf-rtgwg-backoff-algo-01 Abstract This document defines a standard algorithm to back-off link-state IGP SPF computations. Having onestandardizedstandard algorithm improves interoperability by reducing the probability and/or duration of transient forwarding loops during the IGP convergencein the area/levelwhen thenetworkIGP reacts to multipleconsecutiveproximate IGP events. Requirements Language The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119]. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at http://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire onNovember 5,December 21, 2015. Copyright Notice Copyright (c) 2015 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License. Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 2. High level goals . . . . . . . . . . . . . . . . . . . . . . 3 3. Definitions and parameters . . . . . . . . . . . . . . . . . 3 4.PrinciplePrinciples of SPF delay algorithm . . . . . . . . . . . . . . 4 5. Specification of the SPF delay algorithm . . . . . . . . . . 5 6. Parameters . .4 6.. . . . . . . . . . . . . . . . . . . . . . . 6 7. Impact on micro-loops . . . . . . . . . . . . . . . . . . . .5 7.6 8. IANA Considerations . . . . . . . . . . . . . . . . . . . . .5 8.7 9. Security considerations . . . . . . . . . . . . . . . . . . .5 9.7 10. Acknowledgements . . . . . . . . . . . . . . . . . . . . . .5 10.7 11. References . . . . . . . . . . . . . . . . . . . . . . . . .6 10.1.7 11.1. Normative References . . . . . . . . . . . . . . . . . .6 10.2.7 11.2. Informative References . . . . . . . . . . . . . . . . .6 Author's Address .7 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . .68 1. Introduction Link stateIGP,IGPs, such as IS-IS [ISO10589-Second-Edition] and OSPF [RFC2328],performsperform distributed route computation on allnodesrouters of thearea/ level.area/level. In order to have consistent routing tables across the network, such distributed computation requires that all routers have the samevisionversion of the network topology (Link State DataBase (LSDB)) and perform their computation at the same time. In general, when the network is stable, there is a desire to compute the new SPF as soon as the failure isknown,detected in order to quickly route around the failure. However, when the network is experiencing multipleconsecutiveproximate failures over a short period of time, there is a conflicting desire to limit the frequency of SPF computations. Indeed, thisallow reducing theallows a reduction in control plane resources used byIGPIGPs and allprotocols/sub systemprotocols/subsystem reacting onitthe attendant route change, such as LDP, RSVP-TE, BGP, Fast ReRoute computations, FIBupdates...,updates... This will reduce the churn onnodesrouters and in thenetwork,network and, inparticularparticular, reduce the side effects such as micro-loopswhich may happenthat ensue duringeachIGP convergence. To allow for this,someIGPs implement a SPF back-offalgorithm have been implemented.algorithm. Different implementations choose differentalgorithms, hencealgorithms. Hence, in a multi-vendor network, it's not possible toenforceensure that all routerstriggerstrigger their SPF computation after the samewaitingdelay. This situation increases the average differential delay between routersend of RIBcompleting their SPF computation. It also increases the probability that different routers compute theirRIBFIBs based on a differentLSDB.LSDB versions. Bothincreasesfactors increase the probability and/or duration of micro-loops. To allowfor multi-vendorsmulti-vendor networkshavingto have alltheroutersdelayingdelay their SPF computations for the same duration, this document specifies astandardizedstandard algorithm.ImplementationsOptionally, implementations may offer alternativeoptionalalgorithms. 2. High level goals The high level goals of this algorithm are the following: o Very fast convergence for a singlesimple events (linkevent (e.g., link failure). oFastSlightly paced fast convergencein generalfor multiple proximate IGP events whiletheIGP stability is consideredunder control.acceptable. oA long delayDelayed convergence when the IGP stability isconsidered out of control, in order to let allproblematic. This will allow the IGP and relatedprocess calm down.processes to conserve resources during the period of instability. oAt any time,Always try to avoidusingdifferentSPF_TIMERSSPF_DELAY timers valuesfor nodesacross different routers in the area/level. Even though not allnodesrouters will receive IGPmessagemessages at the same time (due todifferencedifferences both in the distance from thesourceoriginator of the IGP event anddue to differentin floodingimplementations on the path from the source).implementations). 3. Definitions and parameters IGP events: An IGP LSDB change requiring a newRIB computation (topologyrouting table computation. Examples are a topology change, a prefix change, a metricchange).change on link or prefix... Routing table computation: computation of the routing table, by the IGP, using the IGP LSDB. No distinction isdonemade between the type of computationperformed (e.g.performed. e.g., full SPF, incremental SPF,PRC).Partial Route Computation (PRC). The type of computation is a local consideration. This document may indifferently use the terms routing table computation or SPF computation. The SPF_DELAYtimeris the delay introduced between the IGP event and the start of the routing table computation. It can take the following values: INITIAL_WAIT: a very small delay to quickly handle linkfailure. e.g.failure, e.g., 0millisecond.milliseconds. FAST_WAIT: a small delay to have a fastconvergence. e.g.convergence in case of single component failure (node, SRLG..), e.g., 50-100millisecond.milliseconds. Note: we want to be fast, but as this failurerequiresresults in multiple IGP events, being too fastincreaseincreases the probability to receive additionalIGPnetwork eventsjustimmediately after theRIBSPF computation. LONG_WAIT: a long delayaswhen the IGP isunstable. e.g.unstable, e.g., 2 seconds. Note:let's bring calm inAllow theIGP.IGP network to stabilize. TheTIME_TO_CONVERGETIME_TO_LEARN timer is thetimemaximum duration typically needed to learn all the IGP events related to a single component failure(e.g. node(e.g., router failure, SRLGfailure). e.g.failure), e.g., 1 second. It's mostly dependent on variation of failure detection times between allnodes whichrouters that areneighbouradjacent to thefailure, and thenfailure. Additionally, it may depend on the different floodingalgorithms of nodesimplementations for routers in the network. The HOLD_DOWN timer is the time needed with no received IGP eventsreceived,before consideringthatthe IGPis quiet again and we can setto be stable again, allowing the SPF_DELAYbacktoINITAL_WAIT. e.g. 5be restored to INITIAL_WAIT. e.g., 3 seconds. 4.PrinciplePrinciples of SPF delay algorithmTheFor this first IGPevent is handled very quickly (INITIAL_WAIT)event, we assume that there has been a single simple change inorder to be very reactive forthefirst event if it only needs one IGP event (e.g.network which can be taken into account using a single routing computation (e.g., link failure, prefixchange).(metric) change) and we optimize for very fast convergence, delaying the routing computation by INITIAL_WAIT. Under this assumption, there is no benefit in delaying the routing computation. In a typical network, this is the most common type of IGP event. Hence, it makes sense to optimize this case. Ifmoresubsequent IGP events are receivedquickly after,in a short period of time (TIME_TO_LEARN), weconsiderthen assume thatthey are related to the samea singlefailure, and handlecomponent failed, but that this failure requires the knowledge of multiple IGP eventsrelatively quickly (FAST_WAIT) duringin order for thetime neededIGP routing toreceiveconverge. Under this assumption, we want fast convergence since this is a normal network situation. However, there is a benefit in waiting for alltheIGP events related to this single component failure so that the IGP can compute the post- failure(TIME_TO_CONVERGE).routing table in a single route computation. In this situation, we delay the routing computation by FAST_WAIT. If IGP events are still received afterthis time,TIME_TO_LEARN seconds from the initial IGP event, then the network is presumably experiencing multiple independent failures andthewhile waiting foritsnetwork stability, the computations are delayed for a longer time(LONG_WAIT).represented by LONG_WAIT. This SPF_delay is kept until no IGP events are received for HOLD_DOWN seconds. Note: previous SPF delay algorithms used to count the number ofRIBSPF computations. However, as allnodesrouters may receive theLSPIGP eventsin aat differentwaytimes, we cannot assume that allnodesrouters will perform the same number of SPF computations or that they will schedule them at the same time. For example, assuming that the SPF delay is 50 ms,noderouter R1 may receive 3 IGP events (E1, E2, E3) in those 50 ms and hence will perform a single routing computation. While anothernoderouter R2 may only receive 2 events (E1, E2) in those50ms50 ms and hence will schedule another routing computation whenfurtherreceiving E3. That's why this documentprefers to definedefines a timelimit (TIME_TO_CONVERGE) since(TIME_TO_LEARN) from the initial event detection/reception as opposed to defining thefirst event, rather than anumber ofrouting computations.SPF computations to determine when the IGP is unstable. 5. Specification of the SPF delay algorithm Whenthe previousno IGP eventsis more thanhave occurred during the HOLD_DOWNago:interval: o The IGP is set to the QUIET state. When the IGP is in the QUIET state and an IGP event is received: o The time of this first IGP event is stored in FIRST_EVENT_TIME. o The nextRIBrouting table computationtimeisset to LSP receivescheduled at: this IGP event received time + INITIAL_WAIT. o The IGP is set to the FAST_WAIT state. When the IGP is in the FAST_WAIT state and an IGP event is received: o If more thanTIME_TO_CONVERGEthe TIME_TO_LEARN interval has passed since FIRST_EVENT_TIME, then the IGP is set to the HOLD_DOWN state. o Ifthe next RIB_computation time is in the past, set the next RIBa routing table computationtime to LSP receiveis not already scheduled, one is scheduled at: this IGP event received time + FAST_WAIT. When the IGP is in the HOLD_DOWN state and an IGP event is received: o Ifthe next RIB_computation time is in the past, set the next RIBa routing table computationtime to LSP receiveis not already scheduled, one is scheduled at: this IGP event received time + LONG_WAIT. 6. Parameters All the parameters MUST be configurable. All the delays (INITIAL_WAIT, FAST_WAIT, LONG_WAIT, TIME_TO_LEARN, HOLD_DOWN) SHOULD be configurable at the millisecond granularity. They MUST be configurable at least at the tenth of second granularity. The configurable range for all the parameters SHOULD be at least from 0 milliseconds to 60 seconds. This document does not propose default values for the parameters because these values are expected to be context dependent. Implementations are free to propose their own default values. When setting the (default) values, one SHOULD consider the customer's or their applications' requirements, the computational power of the routers, the size of the network, and, in particular, the number of IP prefixes advertised in the IGP, the frequency and number of IGP events, the number of protocols reactions/computations triggered by IGP SPF (e.g., BGP, PCEP, Traffic Engineering CSPF, Fast ReRoute computations). Note that some or all of these factors may change over the life of the network. In case of doubt, it's RECOMMENDED to play it safe and start with safe, i.e., longer timers. For the standard algorithm to be effective in mitigating micro-loops, it is RECOMMENDED that all routers in the IGP domain, or at least all the routers in the same area/level, have exactly the same configured values. 7. Impact on micro-loops Micro-loops during IGP convergence are due to anon synchronizednon-synchronized ornon orderednon-ordered update of the forwarding information tables (FIB) [RFC5715] [RFC6976][I-D.litkowski-rtgwg-spf-uloop-pb-statement]. FIB[I-D.ietf-rtgwg-spf-uloop-pb-statement]. FIBs are installed after multiple steps such as SPF wait time, SPF computation, FIBdistributiondistribution, and FIB update. This document onlyaddressaddresses the first contribution. This standardized procedure reduces the probability and/or duration of micro-loops when the IGP experience multipleconsecutiveproximate events. It does notremoveprevent allmicro-loops.micro- loops. However, it is beneficial and its cost seems limited compared to full solutions such as [RFC5715] or [RFC6976].7.8. IANA Considerations No IANA actions required.8.9. Security considerations This algorithm presented in this documenthas no impact ondoes not in any way compromise the security of the IGP.9.In fact, the HOLD_DOWN state may mitigate the effects of Denial-of-Service (DOS) attacks generating many IGP events. 10. Acknowledgements We would like to acknowledgeHannes Gredler,LesGinsbergGinsberg, Uma Chunduri, andPierre FrancoisMike Shand for the discussions and comments related to this document.10.11. References10.1.11.1. Normative References [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997.10.2.11.2. Informative References[I-D.litkowski-rtgwg-spf-uloop-pb-statement][I-D.ietf-rtgwg-spf-uloop-pb-statement] Litkowski, S., "Link State protocols SPF trigger and delay algorithm impact on IGP microloops",draft-litkowski- rtgwg-spf-uloop-pb-statement-02draft-ietf-rtgwg-spf- uloop-pb-statement-00 (work in progress),MarchMay 2015. [ISO10589-Second-Edition] International Organization for Standardization, "Intermediate system to Intermediate system intra-domain routeing information exchange protocol for use in conjunction with the protocol for providing the connectionless-mode Network Service (ISO 8473)", ISO/IEC 10589:2002, Second Edition, Nov 2002. [RFC2328] Moy, J., "OSPF Version 2", STD 54, RFC 2328, April 1998. [RFC5715] Shand, M. and S. Bryant, "A Framework for Loop-Free Convergence", RFC 5715, January 2010. [RFC6976] Shand, M., Bryant, S., Previdi, S., Filsfils, C., Francois, P., and O. Bonaventure, "Framework for Loop-Free Convergence Using the Ordered Forwarding Information Base (oFIB) Approach", RFC 6976, July 2013.Author's AddressAuthors' Addresses Bruno Decraene Orange 38 rue du General Leclerc Issy Moulineaux cedex 9 92794 France Email: bruno.decraene@orange.com Stephane Litkowski Orange Business Service Email: stephane.litkowski@orange.com Hannes Gredler Juniper Networks, Inc. 1194 N. Mathilda Ave. Sunnyvale, CA 94089 US Email: hannes@juniper.net Acee Lindem Cisco Systems 301 Midenhall Way Cary, NC 27513 USA Email: acee@cisco.com Pierre Francois IMDEA Networks Institute 1194 N. Mathilda Ave. Leganes ES Email: pierre.francois@imdea.org