Re: [karp] [IAB] Fwd: Attached: draft-lebovitz-kmart-roadmap-01-00.txt
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [karp] [IAB] Fwd: Attached: draft-lebovitz-kmart-roadmap-01-00.txt



Danny,
the lastest version of the document contains changes based on your feedback. Thanks a ton for the thorough review. See comments below...

At 02:14 PM 3/18/2009, Danny McPherson wrote:
Comments inline..

-
Seems this document is a framework, requirements, and
in part, solutions draft, and this tends to muddy the
objectives considerably.

Agree. It's a catch-all at present. I put in a paragraph in the intro stating as much, and suggesting how it might be broken up into smaller, more focused docs. We will break it up into more focused docs once the BoF is over, and the WG starts. Ideally we would have done it before, but I just didn't have the cycles.


-
I agree with Ward about per AF and even AF/SAF password
pairs, as folks setup discrete sessions based on these
today, and his other comments as well.

-danny


On Mar 11, 2009, at 1:19 AM, Gregory Lebovitz wrote:

Internet-Draft
Juniper
Intended status: Informational                            March 10,
2009
Expires: September 11, 2009

Roadmap for Cryptographic Authentication of Routing Protocol Packets
on
                                the Wire
                    draft-lebovitz-kmart-roadmap-01
....

Internet-Draft                KMART Roadmap                   March
2009

1.  Introduction
   In March 2006 the Internet Architecture Board (IAB) held a workshop
   on the topic of "Unwanted Internet Traffic".  The report from that
   workshop is documented in RFC 4948 [RFC4948].  Section 8.1 of that
   document states that "A simple risk analysis would suggest that an
   ideal attack target of minimal cost but maximal disruption is the
   core routing infrastructure."  Section 8.2 calls for "[t]ightening
   the security of the core routing infrastructure."  Four main steps
   were identified for that tightening:
   o  More secure mechanisms and practices for operating routers.
This
      work is being addressed in the OpSec Working Group.

I believe it's 'OPSEC' from a capitalization perspective.

done.


   o  Cleaning up the Internet Routing Registry repository [IRR], and
      securing both the database and the access, so that it can be
used
      for routing verifications.  This work is being conducted through
      liaisons with the RIR's globally.

The RIRs don't own most of the IRRs, and this is a bit confusing
in this context.  I'm not sure this work is really happening,
actually.

done.


   o  Specifications for cryptographic validation of routing message
      content.  This work is being done in the SIDR Working Group.

Actually, the only thing SIDR has really done to date is define
the repository structure for the RPKI.  They have not done anything
with actual use of this information in the operational routing
system itself.

made it a future reference, "this work will likely be done in the SIDR WG"


   o  Securing the routing protocols' packets on the wire
   This document addresses the last bullet, securing the packets on
the
   wire of the routing protocol exchanges.

It might be nice if you define in the introduction where the
term 'KMART' came from.

Done. Now called KARP.


1.1.  Terminology
   [to be filled out later]
   Base RP
   key_store
   KMP
   session keys

This whole sesssion filled in now.

1.2.  Requirements Language
   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in
this
   document are to be interpreted as described in RFC 2119 [RFC2119].
1.3.  Scope
   Four basic tactics may be employed in order to secure any piece of
   data as it is transmitted over the wire: privacy (or encryption),
   authentication, message integrity, and non-repudiation.  The focus
   for this effort, and the scope for this roadmap document, will be

Lebovitz &             Expires September 11, 2009
[Page 4]

Internet-Draft                KMART Roadmap                   March
2009

   message authentication and packet integrity only.  This work
   explicitly excludes, at this point in time, the other two tactics:
   privacy and non-repudiation.  Since the objective of most routing
   protocols is to broadly advertise the routing topology, routing
   messages are commonly sent in the clear; confidentiality is not
   normally required for routing protocols.  However, ensuring that
   routing peers truly are the trusted peers expected, and that no
roque
   peers or messages can compromise the stability of the routing
   environment is critical, and thus our focus.  The other two
   explicitly excluded tactics, privacy and non-repudiation, may be
   addressed in future work.
   It is possible for routing protocol packets to be transmitted
   employing all four security tactics mentioned above using existing
   standards.  For example, one could run unicast, layer 3 or above
   routing protocol packets through IPsec ESP [RFC4303].  This would
   provide the added benefit of privacy, and non-repudiation.
However,
   routing products have been fine tuned over the years for the
specific

"routing products" reads a bit odd.  Either say "routers" or
"routing protocols".

I used "router platforms and systems"


   processing necessary for these routing protocols non-encapsulated
   formats.

That last bit reads a bit odd, or is perhaps overly redundant
given the earlier part of the text.

cleaned it a bit.


Operators are, therefore, quite unwilling to explore new

s/unwilling/reluctant/

done.


   packet encapsulations for these tried and true protocols.
   In addition, at least in the case of BGP and LDP, these protocols
   already have existing mechanisms for cryptographically
authenticating
   and integrity checking the packets on the wire.  Products with
these
   mechanisms have already been produced, code has already been
written
   and both have been optimized for the existing mechanisms.  Rather
   than turn away from these mechanisms, we want to enhance them,
   updating them to modern and secure levels.

What's a "modern and secure level"?

   There are two main work phases for this roadmap, and for any BaseRP

What's "BaseRP"?  You use both it and "Base RP" throughout.  Also,
RP, as you're likely aware, is commonly used to compress 'Relying
Party" in PKI realms, so it might be confused when used in this
context -- unless explicitly pre-defined.

I've replaced "RP" and "BaseRP" with Routing Protocol throughout the document, and I defined the capitalized version in the terminology section to mean "Routing Protocol for which work is being done to provide or enhance its peer authentication mechanisms."


   work undertaken as part of this roadmap (discussed further in the
   Work Phases (Section 4.1) section).  The first is to enhance the
Base
   RP's current authentication mechanism, ensuring it employs modern

So "RP" is "routing protocol"?  You might want to define that
above, as I've not seen that employed in other places within the
IETF as such.

   cryptographic algorithms and methods for its basic operational
model,
   fulfillling the requirements defined in the Requirements

s/fulfillling/fulfilling/

done.


   (Section 4.2) section, and protecting against as many of the
threats
   as possible as defined in the Threats (Section 2.1)section.  Many
of

s/Section 2.1)section/Section 2.1( section/

done.


   the BaseRPs' current mechanisms use manual keys, so the first phase
   updates will focus on shoring up the manual key mechanisms that
   exist.
   The second work phase is to define the use of a key management
   protocol (KMP) for creating and managing session keys used in the
   BaseRPs' message authentication and data integrity functions.  It
is
   hoped that a general KMP framework -- or a small number of
frameworks


s/hoped/intended/

done


You also need to expand KMP and all the other acronyms
upon first use here.

done in the terminology section, which is now filled out in the most recent version.


   -- can be defined and leveraged for many BaseRPs.


Lebovitz &             Expires September 11, 2009
[Page 5]

Internet-Draft                KMART Roadmap                   March
2009

   Therefore, the scope of this roadmap of work includes:
   o  Making use of existing routing protocol security protocols,
where
      they exist, and enhancing or updating them as necessary for
modern
      cryptographic best practices,
   o  Developing a framework for using automatic key management in
order
      to ease deployment, lower cost of operation, and allow for rapid
      responses to security breaches, and

   o  Specifying the automated key management protocol that may be
      combined with the bits-on-the-wire mechanisms.
   The work also serves as an agreement between the Routing Area and
the
   Security Area about the priorities and work plan for incrementally
   delivering the above work.  This point is important.  There will be
   times when the best-security-possible will give way to vastly-
   improved-over-current-security-but-admittedly-not-yet-best- security-
   possible, in order that incremental progress toward a more secure
   Internet may be achieved.  As such, this document will call out
   places where agreement has been reached on such trade offs.


   This document does not contain protocol specifications.  Instead,
it
   defines the areas where protocol specification work is needed and
   sets a direction, a set of requirements, and a relative priority
for
   addressing that specification work.
   There are a set of threats to routing protocols that are considered
   in-scope for this document/roadmap, and a set considered out-of-
   scope.  These are described in detail in the Threats (Section 2)
   section below.
1.4.  Goals
   The goals and general guidance for this work roadmap follow:
   1. Provide authentication and integrity protection for packets on
the
      wire of existing routing protocols
   2. Deliver a path to incrementally improve security of the routing
      infrastructure.  The principle of crawl, walk, run will be in
      place.  Routing protocol authentication mechanisms may not go
      immediately from their current state to a state containing the
      best possible, most modern security practices.  Incremental
steps
      will need to be taken for a few very practical reasons.  First,
      there is a great deal of deployed routing devices in operating

s/is a great deal of/are a considerable number of/

done.


      networks that will not be able to run the most modern
      cryptographic mechanisms without significant and unacceptable

Lebovitz &             Expires September 11, 2009
[Page 6]

Internet-Draft                KMART Roadmap                   March
2009

      performance penalties.  The roadmap for any one routing protocol
      MUST allow for incremental improvements on existing operational
      devices.  Second, current routing protocol performance on
deployed
      devices has been achieved over the last 20 years through
extensive
      tuning of software and hardware elements, and is a constant
focus
      for improvement by vendors and operators alike.  The
introduction
      of new security mechanisms affects this performance balance.
The
      performance impact of any incremental step of security
improvement
      will need to be weighed by the community, and introduced in
such a
      way that allows the vendor and operator community a path to
      adoption that upholds reasonable performance metrics.
Therefore,
      certain specification elements may be introduced carrying the
      "SHOULD" guidance, with the intention that the same mechanism
will
      carry a "MUST" in the next release of the specification.  This
      gives the vendors and implementors the guidance they need to
tune
      their software and hardware appropriately over time.  Last, some
      security mechanisms require the build out of other operational
      support systems, and this will take time.  An example where
these
      three reasons are at play in an incremental improvement
roadmap is
      seen in the improvement of BGP's [RFC4271] security via the
update
      of the TCP Authentication Option (TCP-AO)
      [I-D.ietf-tcpm-tcp-auth-opt] effort.  It would be ideal, and
      reflect best common security practice, to have a fully specified
      key management protocol for negotiating TCP-AO's authentication
      material, using certificates for peer authentication in the
      keying.  However, in the spirit of incremental deployment, we
will
      first address issues like cryptographic algorithm agility,
replay
      attacks, TCP session resetting in the base TCP-AO protocol
before
      we layer key management on top of it.


   3. The deploy-ability of the improved security solutions on
currently
      running routing infrastructure equipment.  This begs the
      consideration of the current state of processing power available
      on routers in the network today.

While this is certainly a consideration, I'm not sure why it's
outlined in the Goals so explicitly, or even captured in the
#2 bullet point as well.

so, what's your suggestion for how to change the text?


   4. Operational deploy-ability - A solutions acceptability will also
      be measured by how deployable the solution is by common operator
      teams using common deployment processes and infrastructures.
I.e.
      We will try to make these solutions fit as well as possible into
      current operational practices or router deployment.  This will
be
      heavily influenced by operator input, to ensure that what we
      specify can -- and, more importantly, will -- be deployed once
      specified and implemented by vendors.  Deployment of
incrementally
      more secure routing infrastructure in the Internet is the final
      measure of success.

What's the actual Goal here and how do we measure and
manage to deliver on that goal?

How about this text:
"Measurably, we would like to see an increase in the number of surveyed respondents who report deploying the improved security mechanisms across their routers, both in any usage within the network, as well as the total number of routers enabled with the updated authentication mechanisms."


Lebovitz &             Expires September 11, 2009
[Page 7]

Internet-Draft                KMART Roadmap                   March
2009

      Interviews with operators show several points about routing
      security.  First, only about 25% of operators have deployed
      security in their routing protocols [REF???, Danny, you got
one?].

It's slightly better than that, but not much, depending on
whether or not what you call security is strictly transport
connection protection, or something more.  If you're referring
to IRR and prefix filtering functions, it's much worse.  Both
vary considerably inter-domain v. intra-domain.

[ISR2008] http://www.arbornetworks.com/dmdocuments/ISR2008_US.pdf,
which you can download for the raw numbers.

Got it. Thx.


      Of those who have deployed, only about [25% ??] of their routers
      are deployed with the authentication enabled.  Most report
      deploying with one single manual key throughout the entire
      network.

Most?  All that use standard TCP MD5 Signature Option stuff.

Well, for the one person that I didn't actually speak to who may have used a few different keys, I didn't want to be too extreme. But we agree, it's abismal. ;-)

One or two still reported IPSEC for transport connection
protection, as illustrated.

added that fact.


  These same operators report that the one single key has
      not been changed since it was originally installed, sometimes
five
      or more years ago.  When asked why, particularly for the case of
      BGP using TCP MD5, the following reasons are often given:

      A.  Changing the keys brings down the links/adjacencies,
          undermining Service Level Agreements (SLAs).

Might be useful to say "triggers TCP session reset" v.
"brings down links".

done.


      B.  For external peers, difficulty of coordination with the
other
          organization.  They often don't know who the contact is at
the
          other organization, so they don't know where to start, and
          doing so takes a lot of time in research.

Nah, that's not the issue.  They have the peering coordinators
contact information, it's just that the coordination function
is serialized and on a per peer/AS basis, and very cumbersome
and tedious to execute in practice.

ok. I clarified that, and softened the "finding the contact" issue, which I did hear from several of the people that I interviewed.


      C.  Keys must be changed at precisely the same time in order to
          limit connectivity outage duration.  This is incredibly
          difficult to do, operationally, especially between different
          organizations.

Or within some very small window (e.g., 60 seconds), which
I believe at least the two primary vendors support.

added.


      D.  Relatively low priority compared to other operatoinal
issues.

Yes!

      E.  Lack of staff to implement the changes device by device.
      F.  One operator reported that the same key is used for all
          customer premise equipment.  The same operator reported that
          if the customer mandated, a unique key could be created,
          although the last time this occurred it created such an
          operational headache that the administrators now usually
tell
          customers that the option doesn't even exist, to avoid the
          difficulties.  These customer-uniqe keys are never changed,
          unless the customer demands so.

So you've got three areas of concern: peers and interconnection
with other ISPs, Internal BGP and other routing sessions, and
customer and CPE devices.  All three have very different properties.

added.


      The main threat at play here is that a terminated employee from
      such an operator who had access to the one (or few) keys used
for
      authentication in these environments could easily wage an attack
      -- or offer the keys to others who would wage the attack -- and
      bring down many of the adjacencies, causing destabilization to
the
      routing system.
      Whatever mechanisms we specify need to be easier than the
current
      methods to deploy, and should provide obvious operational
      efficiency gains along with significantly better security and
      threat protection.  This combination of value may be enough to
      drive much broader adoption.

Sure..


Lebovitz &             Expires September 11, 2009
[Page 8]

Internet-Draft                KMART Roadmap                   March
2009

   5. Address the threats enumerated above in the "Threats" section
      (Section 2) for each routing protocol, along a roadmap.  Not all
      threats may be able to be addressed in the first specification
      update for any one protocol.  Roadmaps will be defined so that
      both the security area and the routing area agree on how the
      threats will be addressed completely over time.

In practice, it will be interesting to see how we gauge
whether the IETF "Security Area" and "Routing Area" agree.
I hope the ADs can help here with best determining how to
accomplish such a goal.

I think we are well on the way. The KARP BoF seems to be coming along quite well, with good cooperation between routing and Sec. This has a lot to do with the personalities currently seated in the roles. I'm hopeful for the future as well.


   6. Reuse common mechanisms across routing protocols whenever
possible
      - For example, designers should aim to re-use the key management
      protocol that will be defined for BGP's TCP-AO key establishment
      for as many other routing protocols as possible.  This is but
one
      example.

I think the is the crux of the KMART work, and that framework
we define needs to be employed and common to all the relevant
protocols here.

added.


   7. Bridge any gaps between routing and security engineers by
      recording agreements on work items, roadmaps, and guidance from
      the Area leads and Internet Architecture Board (IAB,
www.iab.org).

s/engineers/areas/

done.


Also, I'm not sure conveying that an IAB pivot function will
service and identify all gaps, so we should exercise caution
here.

Don't disagree, but also clear that IAB can help. Suggested new text?


   8. Create a re-usable architecture and guidelines for various IETF
      working teams who will address these security improvements for
      various protocols

I like this, and see it's concise overlap with item 6 above.
Should they not be one item?

Yeah, I think so. Done.


1.5.  Non-Goals
   The following two goals are considered out-of-scope for this
effort:
   o  Privacy of the packets on the wire, at this point in time.  Once
      this roadmap is realized, we may revisit work on privacy.
   o  Message content security.  This work is being deal with in other
      areas, like SIDR.

s/deal/dealt/

s/deal/addressed


1.6.  Audience
   The audience for this roadmap includes:
   o  Routing Area working group chairs and members -   These people
are

s/members/participants/

done.


        charged with updates to the routing protocol specifications.
        Any and all cryptographic authentication work on these
        specifications will occur in Routing Area working groups.

But if there's a common model to be employed across multiple
routing protocols, what "Routing Area working groups" and not
THE "routing area WG"?

Not sure. I think in order to update OSPF we need the OSPF WG to update their specs, no?


   o  Security Area reviewers of routing area documents -   These
people
        are delegated by the Security Area Directors to perform
reviews
        on routing protocol specifications as they pass through
working
        group last call or IESG review.  They will pay particular
        attention to the use of cryptographic authentication and
        corresponding security mechanisms for the routing protocols.
        They will ensure that incremental security improvements are

Lebovitz &             Expires September 11, 2009
[Page 9]

Internet-Draft                KMART Roadmap                   March
2009

        being made, in line with this roadmap.
   o  Security Area engineers -   These people partner with routing
area
        authors/designers on the security mechanisms in routing
protocol
        specifications.  Some of these security area engineers will be
        assigned by the Security Area Directors, while others will be
        interested parties.

s/parties./parties in the relevant working groups./

done.


   o  Operators -   The operators are a key audience for this work, as
        the work is considered to have succeeded if the operators
deploy
        the technology, presumably due to a perception of
significantly
        improved security value coupled with relative similarity to
        deployment complexity and cost.  Conversely, the work will be
        considered a failure if the operators do not care to deploy
it,
        either due to lack of value or perceived (or real) over-
        complexity of operations.

And as such, the GROW and OPSEC WGs should be kept squarely
in the loop as well.

ack. Added the text.


2.  Threats
   In RFC2828[RFC2828], a threat is defined as a potential for
violation
   of security, which exists when there is a circumstance, capability,
   action, or event that could breach security and cause harm.  This
   section defines the threats that are in scope for this roadmap, and
   those that are explicitly out of scope.  This document leverages
the
   "Generic Threats to Routing Protocols" model, RFC 4593 [RFC4593] ,
   capitalizes terms from that document, and offers a terse definition
   of those terms.  (More thorough description of routing protocol
   threats sources, motivations, consequences and actions can be found
   in RFC 4593 [RFC4593] itself).  The threat listings below expand
upon
   these threat definitions.
2.1.  Threats In Scope
   The threats that will be addressed in this roadmap are those from
   OUTSIDERS, attackers that may reside anywhere in the Internet, have
   the ability to send IP traffic to the router, may be able to
observe
   the router's replies, and may even control the path for a
legitimate
   peer's traffic.  These are not legitimate participants in the
routing
   protocol.  Message authentication and integrity protection
   specifically aims to identify messages originating from OUTSIDERS.
   The concept of OUTSIDERS can be further refined to include
attackers
   who are terminated employees, and those sitting on-path.



Lebovitz &             Expires September 11, 2009              [Page
10]

Internet-Draft                KMART Roadmap                   March
2009

   o  On-Path - attackers with control of a network resource or a tap
      along the path of packets between two routers.  An on-path
      outsider can attempt a man-in-the-middle attack, in addition to
      several other attack actions.

s/actions/classes/

done.


  A man-in-the-middle (MitM) attack
      occurs when an attacker who has access to packets flowing
between
      two peers tampers with those packets in such a way that both
peers
      think they are talking to each other directly, when in fact they
      are actually talking to the attacker only.  Protocols conforming
      to this roadmap will use cryptographic mechanisms to prevent a
      man-in-the-middle attacker from situating himself undetected.
   o  Terminated Employees - in this context, those who had access
      router configuration that included keys or keying material like
      pre-shared keys used in securing the routing protocol.  Using
this
      material, the attacker could attempt to impersonate a legitimate
      router.

I don't understand the vector you're conveying here?  What
does "impersonate" mean in this context?

added text to clarify it, as follows:
"... the attacker could send properly MAC'd spoofed packets appearing to come from router A to a router B, and thus impersonate a legitimate peer. The attacker could then send false traffic that changes the network from its operator's design."


The goal of addressing this source specifically is to
      call out the case where new keys or keying material becomes
      necessary very quickly, with little operational expense, upon
the
      termination of such an employee.  This grouping could also refer
      to any attacker who somehow managed to gain access to keying
      material, and said access had been detected by the operators
such
      that the operators have an opportunity to move to new keys in
      order to prevent attack.

s/attack/[an|the] attack/

done.


   These ATTACK ACTIONS are in scope for this roadmap:
   o  SPOOFING - when an illegitimate device assumes the identity of a

What's an "illegitimate device"?  A "legitimate one"?

A device born out of wedlock, duh!! JK.

Will change to "an unauthorized device"




      legitimate one.  Spoofing can be used, for example, to inject
      unrealistic routing information that causes the disruption of

s/unrealistic/malicious/

done.


      network services.  Spoofing can also be used to cause a neighbor
      relationship to form that subsequently denies the formation of
the
      relationship with the legitimate router.

Can you explain this vector to me?

Can a router configured to form an adjacency with routerA actually for two adjacencies with two router A's, simultanously? If an adjacency is formed with a spoofed/impersonated routerA, will a second adjacency be able to form with the real router A?


   o  FALSIFICATION - an action whereby an attacker sends false
routing
      information.  To falsify the routing information, an attacker
has
      to be either the originator or a forwarder of the routing
      information.  Falsification may occur by an ORIGINATOR, or a
      FORWARDER, and may involve OVERCLAIMING, MISCLAIMING, or
      MISTATEMENT of network resource reachability.

What's the difference between "MISCLAIMING" and "MISTATEMENT"?
Also, is the latter spelled as you intended?

Honestly, I don't recall. And I'm 10k meters in the air, w/o Internet connection, as I work through these edits.

From the intro paragraph to Sect 2:
This document leverages the "Generic Threats to Routing Protocols" model, <xref
target="RFC4593"
>RFC 4593</xref> , capitalizes terms from that document, and offers a terse definition of those terms. (More thorough description of routing protocol threats sources, motivations, consequences and actions can be found in <xref target="RFC4593">RFC 4593</xref> itself).



  We must be careful
      to remember that in this work we are only targeting
falsification
      from outsiders as may occur from tampering with packets in
flight.
      Falsification from BYZANTINES (see the Threats Out of Scope
      section (Section 2.2) below) are not addressed by this roadmap,
      but by other work in the IETF.

REF in order here to this "other work"?

Good point. Preventing a router from becoming a BYZANTINE is the work of OPSEC, but I don't think I'll go into that in this paragraph.

s/this roadmap, but by other work in the IETF/the KARP effort.




Lebovitz &             Expires September 11, 2009              [Page
11]

Internet-Draft                KMART Roadmap                   March
2009

   o  INTERFERENCE - when an attacker inhibits the exchanges by
      legitimate routers.  The types of interference addressed by this
      work include:
      *  ADDING NOISE
      *  REPLAYING OUT-DATED PACKETS
      *  INSERTING MESSAGES
      *  CORRUPTING MESSAGES
      *  BREAKING SYNCHRONIZATION
      *  Changing message content
   o  DoS attacks on transport sub-systems - This includes any other
DoS
      attacks specifically based on the above attack types.  This is
      when an attacker sends packets aimed at halting or preventing
the
      underlying protocol over which the routing protocol runs, for
      example halting a BGP session by sending a TCP FIN packet.
      Another example is sending packets which confuse or overwhelm a
      security mechanism itself, for example initiating an
overwhelming
      load of keying protocol initiations from bogus sources.

This example, can you explain that in the context of routing
protocols today?

It would be the case for something like MSEC today, and for any KMP that we put into place.


  All other
      possible DoS attacks are out of scope (see next section).
2.2.  Threats Out of Scope
   Threats from BYZANTINE sources -- faulty, misconfigured, or
subverted
   routers, i.e., legitimate participants in the routing protocol --
are
   out of scope for this roadmap.  Any of the attacks described in the
   above section (Section 2.1) that may be levied by a BYZANTINE
source
   are therefore also out of scope.
   In addition, these other attack actions are out of scope for this
   work:
   o  SNIFFING - passive observation of route message contents in
flight
   o  FALSIFICATION by BYZANTINE sources - unauthorized message
content
      by a legitimate source.
   o  INTERFERENCE due to:
      *  NOT FORWARDING PACKETS - cannot be prevented with
cryptographic
         authentication
      *  DELAYING MESSAGES - cannot be prevented with cryptographic
         authentication
      *  DENIAL OF RECEIPT - cannot be prevented with cryptographic
         authentication
      *  UNAUTHORIZED MESSAGE CONTENT - the work of the IETF's SIDR
         working group
         grouphttp://www.ietf.org/html.charters/sidr-charter.html).

s/grouphttp/group http/

done


      *  Any other type of DoS attack.  For example, a flood of
traffic
         that fills the link ahead of the router, so that the router
is
         rendered unusable and unreachable by valid packets is NOT an

Lebovitz &             Expires September 11, 2009              [Page
12]

Internet-Draft                KMART Roadmap                   March
2009

         attack that this work will address.  Many other such examples
         could be contrived.

It may be worth citing DS or related QoS functions to enable
better response to these sorts of threats .. or not.

possibly DSCP could help, but then again the spoof-ing attacker sending junk to fill the pipe could just as easily set DSCP markings. So it's probably not worth going into the whole discussion at this spot in the paper.


3.  Categorizing Routing Protocols
   For the purpose of this security roadmap definition, we will
   categorize the routing protocols into groups and have design teams
   focus on the specification work within those groupings.  It is
   believed that the groupings will have like requirements for their
   authentication mechanisms, and that reuse of authentication
   mechanisms will be greatest within these grouping.  The work items
   placed on the roadmap will be defined and assigned based on these
   categorizations.  It is also hoped that, down the road in the
Phase 2
   work, we can create one KMP per category (if not for several
   categories) so that the work can be easily leveraged by the various
   RP teams.  KMPs are useful for allowing simple, automated updates
of
   the traffic keys used in a base protocol.  KMPs replace the need
for
   humans, or OSS routines, to periodically replace keys on running
   systems.  It also removes the need for a chain of manual keys to be
   chosen or configured.  When configured properly, a KMP will enforce
   the key freshness policy of two peers by keeping track of the key
   lifetime and negotiating a new key at the defined interval.

Sure, but the inter-domain aspects make this rosy goal considerably
more challenging.

I don't see it. Could you clarify? Today the two domains need to exchange manual keys. In the case where KMP is used, they simply exchange a PSK, or SHA1 fingerprints of their self-signed certs (or whatever) and from then on the KMP takes care of traffic key lifetimes, renewals, etc. Did you mean something else?


3.1.  Category: Messaging Transaction Type
   The first categorization defines four types of messaging
transactions
   used on the wire by the base routing protocol, the Base RP.  They
   are:

   One-to-One     One peer router directly and intentionally
delivers a
                  route update specifically to one other peer router.
                  Examples are BGP and LDP. [question to reviewers:
                  Should we list all protocols into these categories
                  right here, or just give a few examples?]

IS-IS and OSPF have P-P modes, both on traditional PtP
link layers, and multi-access layers.  I'm not sure this
captures the distinction succinctly enough.


Good point. I'll site them as examples, saying:
"Point-to-point modes of both IS-IS and OSPF, when sent over both traditional point-to-point links and when using multi-access layers, may both also fall into this category.



   One-to-Many    A router peers with multiple other routers on a
single
                  network segment -- i.e. on link local -- such that
it
                  creates and sends one route update message which is
                  intended for consumption by multiple peers.
Examples
                  would be OSPF and IS-IS.

See above comment.

ack.


   Client-Server  A client-server routing protocol is one where one
                  router initiates a request for route information
from
                  another router, who then formulates a response to
that
                  request, and replies with the requested data.
                  Examples are a BGP Route Reflector and [????  Are

Lebovitz &             Expires September 11, 2009              [Page
13]

Internet-Draft                KMART Roadmap                   March
2009

                  there other examples?  Is this the right example?].

BGP Route Reflection is _exactly the same as BGP without
route reflection from a transport connection and message
process perspective.  The only differences that exist occur
at the protocol application layer.

So, you think we (a) should not have "Client-Server" category, (b) should have "Client-Server" category but use another example other than BGP, (c) something else?

Pls provide text.


   Multicast      Multicast protocols have unique security properties
                  because of the fact that they are inherently group-
                  based protocols and thus have group keying
                  requirements at the routing level where link-local
                  routing messages are multicasted.  Also, at least in
                  the case of PIM-SM, some messages are are sent
unicast
                  to a given peer(s), as is the case with router- close-
                  to-sender and the "Rendezvous Point".  Some work for
                  application layer message security has been done in
                  the Multicast Security working group (MSEC,
                  http://www.ietf.org/html.charters/msec-charter.html)
                  and may be helpful to review, but is not directly
                  applicable.
   [author's note: I think the above definitions need clean up.
Routing
   area folks, especially ADs, PLEASE suggest new text.]

Let's chat about this in SFO if you'd like, I agree.

Let's make some time to review these in Hiroshima. (Dang, it's taken a long time to processes this review.) I'll send you a separate email.


3.2.  Category: Peer vs. Group Keying
   The second axis of categorization groups protocols by the keying
   mechanism that will be necessary for distributing session keys to
the
   actual routing protocol transports.  They are:

   Peer keying    One router sends the keying messages directly and
only
                  to one other router, such that a one-to-one, unique
                  keying security association (SA) is established
                  between the two routers
   Group Keying   One router creates and distributes a single keying
                  message to multiple peers.  In this case an group SA
                  will be established and used between multiple peers
                  simultaneously.  Group keying exists for protocols
                  like OSPF [RFC2328] , and also for multicast
protocols
                  like PIM-SM [RFC4601].

How about the case where multiple sessions exist between
inter-domain peers and the same keys are used on every
session during a given instant?

can you give me an example of when this happens?

Also, customers, where it
may or may not be the same for managed and unmanaged CPE
devices?

Sorry, not following you here. Could you try clarifying?


3.3.  Security Characterization Vectors
   A few more considerations must be made about the protocol and its
use
   when initially categorizing the protocol and scoping the
   authentication work.



Lebovitz &             Expires September 11, 2009              [Page
14]

Internet-Draft                KMART Roadmap                   March
2009

3.3.1.  Internal vs. External Operation
   The designers must consider whether the protocol is an internal
   routing protocol or an external one, i.e.  Does it primarily run
   between peers within a single domain of control or between two
   different domains of control?  Some protocols may be used in both
   cases, internally and externally, and as such various modes of
   authentication operation may be required for the same protocol.
   While it is preferred that all routing exchanges run with the
utmost
   security mechanisms enabled in all deployments, the exhortation is
   greater for those protocols running at a peering point between two
   domains of control, and greatest for those on public exchange point
   links, because the volume of attackers are greater from the
outside.

You might want to use more technical language here to be
clear.  E.g., inter-domain PtP links versus shared access
interconnection link layers, or something to that effect.

done


   Note however that the consequences of internal attacks maybe no
less
   severe -- in fact they may be quite a bit more sever

s/sever/severe/ - unless you indeed meant sever :-)

done


-- than an
   external attack.  An example of this internal versus external
   consideration is BGP which has both EBGP and IBGP modes.  Another
   example is a multicast protocol where the neighbors are sometimes
   within a domain of control and sometimes external, like at an
   exchange link.

Are you referring to a protocol such as PIM here, or something
more akin to MSDP?

I guess it could be either, no? Think I should say, "examples would be both PIM and MSDP."?


  It would be more acceptable to give up some security
   to get some convenience by using a group key on large broadcast
   networks

I think you're mixing layers here and it's confusing.  The
link layer types are orthogonal of the routing adjacencies.
E.g., "It would be more acceptable to give up some security
to gain some efficiencies by using a group key within internal
BGP speakers under a single administrative control...".  I
don't think the example of "exchange links" make sense in this
context, as there are many bi-lateral agreements for inter-
domain interconnection and each will most certainly employ a
unique set of keys per peer, or at least per peer AS.

I'm trying to get at the point that, when on an external multi-access link layer -- at say a NAP -- and exchanging PIM, parties may want to use peer-wise unique keys, even though it's less efficient. However, on an internal multi-access link layer where PIM is exchanged, they would be well within reason to use a single group key for all peers on the link. It's this distinction between security vs convenience that I'm trying to draw out.

I tried to clear the text up.


within your domain, whereas operators may favor security
   over convenience and use unique keying on peering links.  In this
   case again, designers must consider both modes of operation and
   ensure the authentication mechanisms fit both.
   Operators are encouraged to run cryptographic authentication on all
   their adjacencies, but to work from the outside in, i.e.  The EBGP
   links are a higher priority than the IBGP links because they are
   externally facing.

s/./and as a result, more difficult to secure from external
attacks./


good point. How about:
s/./, and, as a result, more likely to be targeted in an attack.

Is that pretty much what you meant?


3.3.2.  Unique versus Shared Keys
   This section discusses security considerations of when it is

s/of when/regarding when/

done.


   appropriate to use the same authentication key inputs for multiple
   peers and when it is not.  This is largely a debate of convenience
   versus security.  It is often the case that the best secured
   mechanism is also the least convenient mechanism.  For example, an
   air gap between a host and the network absolutely prevents remote
   attacks on the host, but having to copy and carry files using the
   "sneaker net" is quite inconvenient and unscalable.

Indeed!  Conficker USB stick-enabled attacks appear to
be much more effective today that the intial Conficker
vectors that were enabled by the wormable MS08-067 :-)
File shares are the other primary propagation vector
today.

   Operators have erred on the side of convenience when it comes to
   securing routing protocols with cryptographic authentication.  Many
   do not use it at all.  Some use it only on external links, but
not on
   internal links.

and some the inverse!

  Those that do use it often use the same key for all
   peers across their entire network.  It is common to see the same
key
   in use for years, and that being the same key that was entered when

Lebovitz &             Expires September 11, 2009              [Page
15]

Internet-Draft                KMART Roadmap                   March
2009

   authentication was originally configured.

s/configured/configured or routing gear deployed/

done.


   The goal for designers is to create authentication mechanisms that
   are easy for the operators to deploy, and still use unique keys.

Unique keys where, per session?

and between peers. I clarified the text.


   Operators have the impression that they NEED shared keys, when in
   fact they do not.  What they need is the relative convenience they
   experience from deploying cryptographic authentication with shared
   keys, compared to the inconvenience they would experience if they
   deployed the same authentication mechanism using unique keys per
   pair.  An example is BGP Route Reflectors.  Here operators often
use
   the same authentication key between each client and the route
   reflector.  The roadmaps defined from this guidance document will
   allow for unique keys to be used between each client and the peer,

Don't use of confuse client/peer terminology in this context, it's
no different mechanically than non-RR BGP.

agreed.


   without sacrificing much convenience.  Designers should strive to
   deliver unique keying mechanisms with similar ease-of-deployment
   properties as today's shared keys.
   Operators must understand the consequences of using shared keys
   across many peers.  Unique keys are more secure than shared keys
   because the reduce both the attack target size and the attack
   consequence size.

Perhaps the former, not necessarily the latter, depending on
the attack vector.

ok. not sure there is new text needed?


In this context, the attack target size represents
   the number of unique routing exchanges across a network that an
   attacker may be able to observe in order to gain security
association
   credentials, i.e.  Crack the keys.  If a shared key is used across
   the entire internal domain of control, then the attack target
size is
   very large.  The larger the attack target, the easier it is for the
   attacker to gain access to analysis data, and greater the volume of
   analysis data he can access, both of which make his job easier.

This is a refutable argument.  One could submit that it's
the same cost for an attacker to gain access to one device
and data analysis set as it is to gain access to the whole.
I think the key issue here is that a common shared key
makes the attack vulnerability surface more penetrable than
unique keys.

Yes. AND, cryptogrpahically, the more protected data an attacker gets to analyze, the better for him. if he has access to the transactions of 1000 exchanges vs 1, then he will gain more data more quickly. That's the difference between the two points.

I like your last sentence, and have edited the paragraph to use it.


  In
   this context, the attack consequence size represents the amount of
   routing adjacencies that can be negatively affected once a breach
has
   occurred, i.e.  Once the keys have been acquired by the attacker.
   Again, if a shared key is used across the internal domain, then the
   consequence size is the whole network.  Ideally, unique key pairs
   would be used for each adjacency.

Given that most of these are routing 'link' local, this is
entirely dependent on the attack vector.

agreed.


   In some cases designers may need to use shared keys in order to
solve
   the given problem space.  For example, a multicast packet is sent
   once but then observed and consumed by several routing
neighbors.  If
   unique keys were used per neighbor, the benefit of multicast
would be
   erased because the casting peer would have to create a different
   announcement packet/stream for each listening peer.  Though this
may
   be desired and acceptable in some small amount of use cases, it is
   not the norm.  Shared group keys are an acceptable solution here,
and
   much work has been done already in this area (see MSEC working
   group).



Lebovitz &             Expires September 11, 2009              [Page
16]

Internet-Draft                KMART Roadmap                   March
2009

3.3.3.  Out-of-Band vs. In-line Key Management
   This section discusses the security and use case considerations for
   keys placed on devices through out-of-band configurations versus
   through one routing peer-to-peer key management protocol exchanges.
   Note, when we say here "Peer-to-Peer KMP" we do not mean in-band to
   the RP.  Instead, we mean that the exchange occurs in-line, over
IP,
   between the two routing peers directly.  In in-line KMP the peers
   themselves handle the key and security association negotiation
   between themselves directly, whereas in an out-of-band system the
   keys are placed onto the device through some other configuration or
   management method or interface.
   An example of an out-of-band mechanism could be an administrator
who
   makes a remote management connection (e.g. using SSH) to a router
and
   manually enters the keying information -- like the algorithm, the
   key(s), the lifetimes, etc.  Another example could be an OSS system
   which inputs the same information via a script over an SSH
   connection, or by pushing configuration through some other
management
   connection, standard (Netconf-based) or proprietary.
   The drawbacks of an out-of-band mechanism include: lack of scale-
   ability, complexity and speed of changing if breech is suspected.

s/breech/breach/

done.


   For example, if an employee who had access to keys was was
   terminated, or if a machine holding those keys was belived
   compromised, then the system would be considered insecure and
   vulnerable until new keys were defined by a human.  Those keys then
   need to be placed into the OSS system, manually, and the OSS system
   then needs to push the change -- often during a very limited change
   window -- into the relevant devices.  If there are multiple
   organizations involved in these connections, this process is
greatly
   complicated.
   The benefits of out-of-band mechanism is that once the new keys/
   parameters are set in OSS system they can be pushed automatically
to
   all devices within the OSS's domain of control.  Operators have
   mechanisms in place for this already.  In small environments with
few
   routers, a manual system is not difficult to employ.
   We further define an in-line key exchange as using cryptographicly
   protected identity verification, session key negotiation, and
   security association parameter negotiation between the two routing
   peers.  The KMP between the two peers may also include the
   negotiation of parameters, like algorithms, cryptographic inputs
   (e.g. initialization vectors), key life-times, etc.
   The benefits an in-line KMP are several.  An in-line KMP results in
   key(s) that are privately generated, and not recorded permanently

Lebovitz &             Expires September 11, 2009              [Page
17]

Internet-Draft                KMART Roadmap                   March
2009

   anywhere.  Since the traffic keys used in a particular connection
are
   not a fixed part of a device configuration no steal-able data
exists
   anywhere else in the operator's systems which can be stolen, e.g.
in
   the case of a terminated or turned employee.  If a server or other
   data store is stolen or compromised, the thieves gain no access to
   current traffic keys.  They may gain access to key derivation
   material, like a PSK, but not current traffic keys in use.  In this
   example, these PSKs can be updated into the device configurations
   (either manually or through an OSS) without bouncing or impacting
the
   existing session at all.  In the case of using raw assymetric
keys or
   certificates, instead of PSKs, the data theft would likely not even
   result in any compromise, as the key pairs would have been
generated
   on the routers, and never leave those routers.  In such a case no
   changes are needed on the routers; the connections will continue to
   be secure, non-compromised.

s/non-/un.

done


  Additoinally, with a KMP regular re-keys

s/Additoinally/Additionally/

done


   operations occur without any operator involvement or oversight.
This
   keeps keys fresh.
   The drawbacks to using a KMP are few.  First, a KMP requires more
   cryptographic processing for the router at the very beginning of a
   connection.  This will add some minor start-up time to connection
   establishment versus a purely manual key approach.  Once a
connection
   with traffic keys have been established via a KMP, the
performance is
   the same in the KMP and the out-of-band case.  KMPs also add
another
   layer of protocol and configuration complexity which can fail or be
   misconfigured.  This was more of an issue when these KMPs were
first
   deployed, but less so as these implementaitons and operational
   experience with them has matured.
   The desired end goal is in-line KMPs.

4.  The Roadmap
4.1.  Work Phases on any Particular Protocol
   The desired endstate for the KMART work contains several items.
   First, the people desiring to deploy securely authenticated and
   integrity validated packets between routing peers have the tools
   specified, implemented and shipping in order to deploy.  These
tools
   should be fairly simple to implement, and not more complex than the
   security mechanisms to which the operators are already accustomed.
   (Examples of security mechanisms to which router operators are
   accustomed include: the use of assymetric keys for authentication
in
   SSH for router configuration, the use of pre-shared keys (PSKs) in
   TCP MD5 for BGP protection, the use of self-signed certificates for
   HTTPS access to device Web-based user interfaces, the use of
strongly
   constructed passwords and/or identity tokens for user
identification

Lebovitz &             Expires September 11, 2009              [Page
18]

Internet-Draft                KMART Roadmap                   March
2009

   when logging into routers and management systems.)  While the tools
   that we intend to specify may not be able to stop a deployment from
   using "foobar" as an input key for every device across their entire
   routing domain, we intend to make a solid, modern security system
   that is not too much more difficult than that.  In other words,
   simplicity and deployability are keys to success.  The Base RP's
will
   specify modern cryptographic algorithms and security mechanisms.
   Routing peers will be able to employ unique, pair-wise keys per
   peering instance, with reasonable key lifetimes, and updating those
   keys on a somewhat regular basis will be operationally easy,
causing
   no service interruption.
   The reach the above described end-state using manual keys may
only be

s/The reach the/???/

s/The reach/Achieving/


   pragmatic in very small deployments.  In larger deployments, this
end
   state will be much more operationally difficult to reach with only
   manual keys.  Thus, there will be a need for key lifecycle

s/lifecycle/life cycle/

done.


   management, in the form of a key management protocol, or KMP.  We
   expect that the two forms, manual key usage and KMP usage, will co-
   exist in the real world.  For example, a provider's edge router
at a
   public exchange peering point will want to use a KMP for ensuring
   unique and fresh keys with external peers, while a manual key may
be
   used between a provider's access edge router and each of the same
   provider's customer premise routers with which it peers.
   In accordance with the desired end state just described, we define
   two main work phases for each Base RP:
   1.  Enhance the Base RP's current authentication mechanism.  This
       work involves enhancing a Base RP's current security mechanisms
       in order to achieve a consistent, modern level of security
       functionality within its existing keying framework.  It is
       understood and accepted that the existing keying frameworks are
       largely based on manual keys.  Since many operators have
already
       built operational support systems (OSS) around these manual key
       implementations, there is some automation available for an
       operator to leverage in that way, if the underlying mechanisms
       are themselves secure.  In this phase, we explicitly exclude
       embedding or creating a KMP.  A list of the requirements for
       Phase 1 work are below in the section "Requirements for Phase 1
       BaseRPs' Security Updates (Section 4.2).
   2.  Develop an automated keying framework.  The second phase will
       focus on the development of an automated keying framework to
       faciliate unique pair-wise (or perhaps group-wise, where
       applicable) keys per peering isntance.

s/isntance/instance/

done


This involves the use of
       a KMP.  A KMP is helpful because [will add a more full
       description here, sorry, ran out of time].

Nice :-)

filled this out with:
"A KMP is helpful because it negotiates unique, pair wise, random keys without administrator involvement. It also negotiates several of the SA parameters required for the secure connection, including key life times. It keeps track of those lifetimes using counters, and negotiates new keys and parameters before they expire, again, without administrator interaction. Additionally, in the event of a breach, changing the KMP key will immediately cause a rekey to occur for the Traffic Key on the connection.


The framework for any
       one BaseRP will fall under, and be able to leverage, the
generic

Lebovitz &             Expires September 11, 2009              [Page
19]

Internet-Draft                KMART Roadmap                   March
2009

       framework described below in section Section 4.3.
4.2.  Requirements for Phase 1 BaseRPs' Security Update
   Here is a proposed list of requirements that SHOULD be addressed by
   Phase 1 (according to "1." above) security updates to Base RPs
[to be
   reviewed after -01 is released]:
   1.   Clear definitions of which elements of the transmission
(frame,
        packet, segment, etc.) are protected by the authentication
        mechanism
   2.   Strong algorithms, and defined and accepted by the security
        community, MUST be specified.  The option should use
algorithms
        considered accepted by the security community, which are
        considered appropriately safe.  The use of non-standard or
        unpublished algorithms SHOULD BE avoided.
   3.   Algorithm agility for the cryptograhpic algorithms used in the
        authentication MUST be specified, i.e. more than one algorithm
        MUST be specified and it is clear how new algorithms MAY be
        specified and used.

Why must more than one algorithm be specified?

Great question. In case one gets broken suddenly. The security community has learned this the hard way. Research on how to break crypto is always going on. Breaking a cipher isn't a matter of if, but when. But it's highly unlikely that two different algo's will be broken simultaneously. So, if two are supported, and once gets broken, we can use the other until we get a new one in place. Having the ability within the protocols to allow for such an event, called crypto agility, is essential. Forcing two algos provides both a redundancy, and a mechanism for getting to that redundancy.

I placed an explanation in the text.


   4.   Secure use of simple PSKs, offering both operational
convenience
        as well as building something of a fence around stupidity,
MUST
        be specified.
   5.   Inter-connection replay protection.  Packets captured from one
        connection MUST NOT be able to be re-sent and accepted
during a
        later connection.
   6.   Intra-connection replay protection.  Packets captured during a
        connection MUST NOT be able to be re-sent and accepted during
        that same connection, to deal with long-lived connections.
   7.   A change of security parameters REQUIRES, and even forces, a
        change of session traffic keys
   8.   Intra-connection re-keying which occurs without a break or
        interruption to the current peering session, and, if possible,
        without data loss, MUST be specified.
   9.   Efficient re-keying SHOULD be provided.  The specificaion
SHOULD
        support rekeying during a connection without the need to
expend
        undue computational resources.  In particular, the
specification
        SHOULD avoid the need to try/compute multiple keys on a given
        packet.
   10.  Prevent DoS attacks as those described as in-scope in the
        threats section Section 2.1above

s/1above/1 above/

done.


   11.  Default mechanisms and algorithms specified and defined as
        REQUIRED for all implementations
   12.  Manual keying MUST be supported.
   13.  Architecture of the specification MUST consider and allows for
        future use of a KMP.


Lebovitz &             Expires September 11, 2009              [Page
20]

Internet-Draft                KMART Roadmap                   March
2009

4.3.  Common Framework
   Each of the categories of routing protocols above will require
unique
   designs for authenticating and integrity checking their protocols.
   However, a single underlying framework for delivering automatic
   keying to those solutions will be pursued.  Providing such a single
   framework will significantly reduce the complexity of each step of
   the overall roadmap.  For example, if each Base RP needed to define
   it's own key management protocol this would balloon the total
amount
   of different sockets that needed

s/that needed/that are needed/

done.


to be opened and processes that
   needed to be simultaneously running on an implementation.  It would
   also significantly increase the run-time complexity and memory
   requirements of such systems running multiple Base RPs, causing
   perhaps slower performance of such systems.  However, if we can
land
   on a very small set (perhaps one or two) of automatic key
management
   protocols, KMPs, that the various Base RP's can use, then we can
   reduce this implementation and run-time complexity.  We can also
   decrease the total amount of time implementers need to deliver the
   KMPs for the Base RPs that will provide better threat protection.
   The components for the framework are listed here, and described
   below:
   o  BaseRP security mechanism
   o  KMP
   o  KeyStore
   o  BaseRP-to-KMP API
   o  BaseRP-to-KeyStore API
   o  KMP-to-KeyStore API
   o  Common Base RP mechanisms
   o  Identifiers
   o  Proof of identity
   o  Profiles
   The framework is modularized for how keys and security association
   (SA) parameters generally get passed from a KMP to a transport
   protocol.  It contains three main blocks and APIs.


Lebovitz &             Expires September 11, 2009              [Page
21]

Internet-Draft                KMART Roadmap                   March
2009

      +------------+   +--------------------+
      |            |   |                    | Check     +-----------+
      | Identifier +-->|                    +---------->|           |
      |            |   |    KMP Function    |           |  Identity |
      +----------- +   |                    |<----------+   Proof   |
                       |                    |  Approve  |           |
                       +-+--------------+---+           +-----------+
                         |              |
         KMP-to-KeyStore |              |
            API          |              |
                        \|/             |
                 +-------+-------+      |
                 |               |      | KMP-to-BaseRP
                 |    Session    |      |  API
                 |    KeyStore   |      |
                 |               |      |
                 +-------+-------+      |
                         |              |
                         |              |
           KeyStore-to-  |              |
            BaseRP API   |              |
                         |             \|/
             +--------------------------+-------------+
             |           |                            |
             |          \|/          Common BaseRP    |
             |   +-------+-------+   Authentication   |
             |   |               |   Mechanisms       |
             +---|  Transport    |-----+--------------+
             |   |   Key(s)      |                    |
             |   |               |                    |
             |   +---------------+   Specific BaseRP  |
             |                       Authentication   |
             |                       Security         |
             |                       Mechanism        |
             |                                        |
             +----------------------------------------+

               Figure 1: Automatic Key Management Framework
   Each element of the framework is described here:

   o  Base RP -  Base RP security mechanism - In each case, the Base
RP
           will contain a mechanism for using session keys in their
           security option.

But what if that option is a function of the transport
substrate employed by the RP?  E.g., TCP MD5 Signature
Option for MSDP, LDP, BGP.

clarification placed in the text.


Lebovitz &             Expires September 11, 2009              [Page
22]

Internet-Draft                KMART Roadmap                   March
2009


   o  KeyStore -   Each implementation will also contain a protocol
           independent mechanism for storing keys, called KeyStore.
The
           key_store will have multiple different logical containers,
           one container for each session key that any given Base RP
           will need.
   o  RP-KeyStore API -   There will be an API for Base RP to retrieve
           the keys from the KeyStore.  This will enable
implementers to
           reuse the same API calls for all their Base RPs.  The API
           will necessarily include facility to retrieve other
           parameters required for the construction of the BaseRP's
           packets, like key IDs or key lifetimes, etc.

Are these other parameters contained in the KeyStore as well,
and if so, it should be indicated as such above.  I think the
word "parameters" should be in that definition.

added "here may also be associated parameters as required by the SA for any given Routing Protocol." to the KeyStore definition.


   o  KMP -   There will be an automated key management protocol, KMP.
           This KMP will run between the peers.  The KMP serves as a
           protected channel between the peers, through which they can
           negotiate and pass important data required to exchange
proof
           of key identifiers, derive session keys, determine re- keying,
           synchronize their keying state, signal various keying
events,
           notify with error messages, etc.  As an analogy, in the
IPsec
           protocol [RFC 4301, 4303 and 4306] IKEv2 is the KMP that
runs
           between the two peers, while AH and ESP are two different
           base protocols that take session keys from IKEv2 and use
them
           in their transmissions.  In the analogy, the Base RP, say
BGP
           and LDP, are analogous to ESP and AH, while the KMP is
           analogous to IKEv2 itself.

I don't like this decoupling as it should be employing the
facilities of the RP, if possible.

However, the Routing Protocols don't have KMP's, and they weren't designed to be KMPs. KMPs are VERY tricky things to get right, security wise. Look at IKE. We've been working for 15 yrs to get that right, and IKEv2 is very good, strong and secure, having undergone tremendous peer review, implementation and use. We want to leverage that goodness, not attempt to start over from scratch, one-by-one, in each routing protocol.

 New sessions and a much
larger attack surface are a primary reason no one uses this
stuff today for infrastructure protocols.

not really. Everyone uses SSH today for managing their infrastructure. It's really no different. In fact, the same ID and Identity proof could be re-used as that used in SSH already.

Also, this seems
to assume solely two peers, versus multi-access networks
and RP mechanisms.

There are keying protocols for multi-access situations, and they involve a GDOI model, where a key server exists, each peer uses IKE to get to the key server in 1:1 connections, then the key server delivers a group key. It's kind of like a route reflector for keying. At least that's one model. But the basic priniciple holds true: create a secure channel for keying and session specifics, and run the actual secured connection on its own.


   o  RP-KMP API -   There will be an API for the Base RP to request a
           session key of the KMP, and be notified when the keys are
           available for it.  The API will also contain a mechanism
for
           the KMP to notify the Base RP that there are new keys
that it
           must now use, even if it didn't request those keys.  The
API
           will also include a mechanism for the KMP to receive
requests
           for session keys and other parameters from the routing
           protocol.  The KMP will also be aware of the various Base
RPs
           and each of their unique parameters that need to be
           negotiated and returned.
   o  KMP-KeyStore API -   There will be an API for the KMP to place
           keys and parameters into the KeyStore after their
negotiation
           and derivation with the other peer.  This will enable the
           implementers to reuse the same calls for multiple KMPs that
           may be needed to address the various categories of RPs as
           described in the section definingcategories (Section 3).

s/definingcategories/defining categories/

done.




Lebovitz &             Expires September 11, 2009              [Page
23]

Internet-Draft                KMART Roadmap                   March
2009


   [after writing this all up, I'm not sure we really need the
key_store
   in the middle.  As long as we standardize fully all the calls
needed
   from any RP to any KMP, then there can be a generic hand-down
   function from the KMP to the RP when the key and parameters are
   ready.  Let's sleep on it.]
   [will need state machines and function calls for these APIs, as one
   of the work items.  In essence, there is a need for a core team to
   develop the APIs out completely in order for the RP teams to use
   them.  Need to get this team going asap.]

This should be a guiding document, not a solutions document.
Lots of concern here.

sorry, didn't quite catch that. Could you say more? Are you against the idea of a design team spinning up to draft these APIs?


   o Identifiers -   A KMP is fed by identities.  The identities are
           text strings used by the peers to indicate to each other
that
           each are known to the other, and authorized to establish
           connections.  Those identities must be represented in some
           standard string format, e.g. an IP address -- either v4 or
           v6, an FQDN, an RFC 822 email address, a Common Name [RFC
           PKI], etc.  Note that even though routers do not normally
           have email addresses, one could use an RFC 822 email
address
           string as a formatted identifier for a router.  They
would do
           so simply by putting the router's reference number or name-
           code as the "NAME" part of the address, left of the "@"
           symbol.  They would then place some locational context in
the
           "DOMAIN" part of the string, right of the "@" symbol.  An
           example would be "rtr0210 at sf.ca.us.company.com".  This
           document does not suggest this string value at all.

In the weeds...

  Instead,
           the concept is used only to clarify that the type of string
           employed does not matter.  It also does not matter what
           specific text you chose to place in that string type.  It
           only matters that the type of string -- and it's format --
           must be agreed upon by the two endpoints.

Is that a provisioning or dynamic function?  Something
else that has to be specified manually to make things
dynamic.

you have to decide what identity type and value you are using. Right now we do this implicitly; we use the IP address as the identity, and we use the manual key as the proof. All this is saying is that we could use something else, like a SubjAltName from an x.509 certificate, or a CN, and a value appropriately structured for that ID type.


Further, the
           string can be used as an identifier in this context, even
if
           the string is not actually provisioned in it's source
domain.
           For example, the email address "rtr0210 at sf.ca.us.company.com "
           may not actually exist as an email address in that domain,
           but that string of characters may still be used as an
           identifier type(s) in the routing protocol security
context.
           What is important is that the community decide on a small
but
           flexible set of Identifiers they will all support, and that
           they decide on the exact format of those string.  The
formats
           that will be used must be standardized and must be sensible
           for the routing infrastructure.

We're way in the weeds here, IMO.

Probably. The last sentence is the key. The other stuff above it was just examples. My point is that routing people get freaked out because they think KMP's means we have to use certs and PKI. But it doesn't. You could use whatever id type (or handful of them) we care to specify. For example, in IKE using PSK, we call out these 6: FQDN, RFC822name (email addr), IPaddr_v6, IPaddr_v4, X.509-Subject, X.509.SubjAltName. We could do the same or some subset for KARP. But it doesn't have to be certs.



Lebovitz &             Expires September 11, 2009              [Page
24]

Internet-Draft                KMART Roadmap                   March
2009

   o  Identity Proof -   Once the form of identity is decided, then
           there must be a cryptographic proof of that identity, that
           the peer really is who they assert themselves to be.  Proof
           of identity can be arranged between the peers in a few
ways,
           for example pre-shared keys, raw assymetric keys, or a more
           user-friendly representation of assymetric keys, like a
           certificate.  Certificates can be used in a way requiring
no
           additional supporting systems -- e.g. public keys for each
           peer can be maintained locally for verification upon
contant.

s/contant/contact/

done.



           Certificate management can be made more simple and scalable
           with the use minor additional supporting systems, as is the
           case with self-signed certificates and a flat file list of
           "approved thumbprints".  Self-signed certificates will have
           somewhat lower security properties than Certificate
Authority
           signed certificates [RFC Certs].  The use of these
different
           identity proofs vary in ease of deployment, ease of ongoing
           management, startup effort, ongoing effort and management,
           security strength, and consequences from loss of secrets
from
           one part of the system to the rest of the system.  For
           example, they differ in resistance to a security breach,
and
           the effort required to remediate the whole system in the
           event of such a breach.  The point here is that there are
           options, many of which are quite simple to employ and
deploy.

Sure, and this is in the weeds as well, IMO.  Need to uplevel a
few...

We could, but I've found this exact point to be a place of misunderstanding for many routing folks. Once they hear the above, they say, "oh, ok. I guess it's not so bad as I imagined; we already do that for SSH anyway..." And one more objection is lowered.

   o  Profiles -   Once the KMP, Identifiers and Proofs mechanisms are
           converged upon, they must be clearly profiled for each Base
           RP, so that implementors and deployers alike understand the
           different pieces of the solution, and can have similar
           configurations and interoperability across multiple
vendors'
           devices, so as to reduce management difficulty.  The
profiles
           SHOULD also provide guidance on when to use which various
           combinations of options.  This will, again, simplify use
and
           interoperability.
   Common Mechanisms - In as much as they exist, the framework will
   capture mechanisms that can be used commonly not only within a
   particular category of Base RP and Base RP to KMP, but also between
   Base RP categories.  Again, the goal here is simplifying the
   implementations and runtime code and resource requirements.
There is
   also a goal here of favoring well vetted, reviewed, operationally
   proven security mechanisms over newly brewed mechanisms that are
less
   well tried in the wild.




Lebovitz &             Expires September 11, 2009              [Page
25]

Internet-Draft                KMART Roadmap                   March
2009

4.4.  Work Items Per Routing Protocol
   Each Base RP will have a team (the [RP]-KMART team) working on
   incrementally improving their Base RP's security, These teams will
   have the following main work items:

I'm not sure what we're attempting in this section, or
if these teams are design teams or JWTs, or something
else, but I don't think this should be in the draft, although
it SHOULD be specified external to the draft.  The IETF
has facilities and mechanics to enable this, we need to
strictly employ those.

What are they; what are the best ways to capture this agreement about how to do the work?

Yes, agreed, this sort of statement about how the work will get done really is its own draft, or a charter or something. For now, I'm just using this doc as a catch all and recording the agreements to date between the ADs and ourselves.


   PHASE 1:
   Characterize the RP
      Assess the Base RP to see what authentication mechanisms it has
      today.  Does it needs significant improvement to its existing
      mechanisms or not?  This will include determining if modern,
      strong security algorithms and parameters are present.
   Define Optimal State
      List the requirements for the Base RP's session key usage and
      format to contain to modern, strong security algorithms and
      mechanisms, per the Requirements (Section 4.2)section above.

s/)section/) section/

done.


The
      goal here is to determine what is needed for they BaseRP alone
to
      be used securely with at least manual keys.
   Gap Analysis
      Enumerate the requirements for this protocol to move from its
      current security state, the first bullet, to its optimal state,
      bullet two above.

"Above" where?

just above. Clarified in the text.


   Define, Assign, Design
      Create a deliverables list of the design and specification work,
      with milestones.  Define owners.  Release a document(s)

   PHASE 2:
   KMP Analysis
      Review requirements for KMPs [RFC????].  Identify any nuances
for
      this particular protocol's needs and its use cases for KMP.
List
      the requirements that this RP has for being able to be use in
      conjunctions with a KMP.  Define the optimal state.
   Gap Analysis
      Enumerate the requirements for this protocol to move from its
      current security state to its optimal state.
   Define, Assign, Design



Lebovitz &             Expires September 11, 2009              [Page
26]

Internet-Draft                KMART Roadmap                   March
2009

      Create a deliverabels list of the design and specification work,
      with miletsones.  Define owners.  Do the design and document
work
      for a KMP to be able to generate the Base RP's session keys for
      the packets on the wire.  These will be the arguments passed in
      the API to the KMP in order to bootstrap the session keys to the
      Base RP.

Isn't this a WG chartering or WG charter update sort of thing
that should take place, to the point above?  There exists clear
framework for this today.

Yeah, I think so too. Once the WG gets chartered, this whole doc becomes multiple more focused docs... as stated above a few times.


   There will also be a team formed to work on the base framework
   mechanisms for each of the main categories, i.e. the blocks and
API's
   represented in figure 1 (Figure 1).
4.5.  Protocols in Categories
   This section groups the Base RPs into like categories, according to
   attributes set forth in Categories Section (Section 3).  Each group
   will have a design team tasked with improving the security of the
   Base RP mechanisms and defining the KMP requirements for their
group,
   then rolling both into a roadmap document upon which they will
   execute.

Again, we need to avoid mechanics of how the work should be
carried out within the IETF, and focus on the problems and
goals of the framework document here.  The other stuff needs
to be captured, but that should happen as a management function
external to this.

agreed.


   BGP, LDP and MSDP
       The Base RP's that fall into the category of the one-to-one
       peering messages, and will use peer keying protocols, AND are
all
       transmitted over TCP include BGP RFC 4271 [RFC4271], LDP
       [RFC3036] and MSDP [RFC3618].  A team will work on one
mechanism
       to cover these three protocols.  Much of the work on the BaseRP
       update for its existing authentication mechanism is already
       occuring in the TCPM Working Group, on the TCP-AO
       [I-D.ietf-tcpm-tcp-auth-opt] document, as well as its
       cryptography-helper document, TCP-AO-CRYPTO [I-D.ao-crypto].
The
       exception is the mode where LDP is used directly on the LAN
       [RFC????].  The work for this may go into the Group keying
       category (w/ OSPF) mentioned below.
   OSPF, ISIS, and RIP
       The Base RPs that fall into the category Group keying with one-
       to-many peering messages includes OSPF [RFC2328], ISIS
[RFC1195]
       and RIP [RFC2453].  Not surprisingly, all these routing
protocols
       have two other things in common.  First, they are run on a
       combination of the OSI datalink layer 2, and the OSI network
       layer 3.

What?  IS-IS over the datalink layer, and RIP and OSPF over
IP.  The statement above confuses me.

They all have specification for something that happens at layer 2, i.e. they aren't simply Layer 3 and Layer 3 alone. Their operation has specification in layer 2. Can you think of a better way to say this?


Second, they are all internal gateway protocols, or
       IGPs.  The keying mechanisms and use will be much more
       complicated to define for these.

Depends, not if it's group keying and static :-)

Lebovitz &             Expires September 11, 2009              [Page
27]

Internet-Draft                KMART Roadmap                   March
2009

   BFD
       Because it is less of a routing protocol, per se, and more of a
       peer aliveness detection mechanism, Bidirectional Forwarding
       Detection (BFD) [RFC????] will have its own team.
   RSVP [RFC????], RSVP-TE [RFC????], and PCE
       These three protocols will be handled together. [what more
       characterisation should we give here?  Routing AD's, provide
text
       pls?]

Heh, I'd drop the first RSVP, I think..  PCE, be interesting
to see where that lands.

Any explanation for why?


   PIM-SM and PIM-DM
       Finally, the multicast protocols of PIM-SM [RFC4601] and PIM-DM
       [RFC3973] will be handled together.  PIM-SM multicasts routing
       information (Hello, Join/Prune, Assert) on a link-local basis,
       using a defined multicast address.  In addition, it specifies
       unicast communication for exchange of information (Register,
       Register-Stop) between the router closest to a group sender and
       the "rendezvous point" (RP).  The RP is typically not "on-link"
       for a particular router.  While much work has been done on
       multicast security for application-layer groups, little has
been
       done to address the problem of managing hundreds or thousands
of
       small one-to-many groups with link-local scope.  This will be
       necessary if we are to have unique keys per speaking router
in a
       PIM chain.

Did we decide somewhere that unique keys are a requirement
here?  I missed that?

I guess you are right. I pushed a certain solution type into the wording. I deleted that sentence.


Such an authentication mechanism should be considered
       along with the router-to-Rendezvous Point authentication
       mechanism.  The most important issue is ensuring that only the
       "legitimate neighbors"

s/legitimate/authorized/

get the keys for (S,G), so that rogue
       routers cannot participate in the exchanges.  Another issue is
       that some of the communication may occur intra-domain, e.g. the
       link-local messages in an enterprise, while others for the same
       (*,G) may occur inter-domain, e.g. the router-to-Rendezvous
Point
       messges may be from one enterprise's router to another.  One
       possible solution proposes a region-wide "master" key server
       (possibly replicated), and one "local" key server per speaking
       router.  There is no issue with propagating the messages
outside
       the link, because link-local messages, by definition, are not
       forwarded.  This solution is offered only as an example of how
       work may progress; further discussion should occur in this work
       team.  Specification of a link-local protection mechanism for
       PIM-SM occurred in RFC 4601 [RFC4601], and this work is being
       updated in PIM-SM-LINKLOCAL [I-D.ietf-pim-sm-linklocal].
       However, the KMP part is completely unspecified, and will
require
       work outside the expertise of the PIM working group to
       accomplish, which is why this roadmap is being created.



Lebovitz &             Expires September 11, 2009              [Page
28]

Internet-Draft                KMART Roadmap                   March
2009

   These protocols are deemed out-of-scope for this current
iteration of
   the work roadmap.  Once all of the protocols listed above have had
   their work completed, or are clearly within site of completion,
then
   the community will revisit the need and interest for working on
   these:
   o  MANET
   o  FORCES
   [need text from routing ADs on why these are out of scope]
4.6.  Priorites
   Resources from both the routing area and the security area will be
   applied to work on these problem spaces as quickly as possible.
   Realizing that such resources are far from unlimited, a rank order
   priority for addressing the work of incrementally securing these
   groups of routing protocols is provided:
   o  Priority 1 - BGP / LDP / MSDP
   o  Priority 2 - BFD
   o  Priority 3 - OSPF / ISIS / RIP
   o  Priority 4 - RSVP and RSVP-TE
   By far the most important group is the Priority 1 group as these
are
   the protocols used on the most public and exposed segments of the
   networks, at the peering points between operators and between
   operators and their customers.  BFD, as a detection mechanism
   underlying the Priority 1 protocols is therefore second.

Again, we need to structure this under IETF mechanics and
ensure that the work and model are aligned, not invent a
new one.


5.  Security Considerations
   This entire document focuses on improving the security of routing
   protocols by improving or implementing cryptographic authentication
   for each routing protocol.  Security considerations are largely
   contained within the body text of the document.
   The mechanisms that will be defined under this roadmap aim to
improve
   the security, better protect against more threats, and provider far
   greater operational efficiencies than the state of things at the
time
   of this writing.

These comments are sorta rolling thoughts, please don't
take any of them to harshly and discard at will (mostly :-)

Thanks for the great and thorough review, Danny. It's taken me hours to pour through it, respond and make the appropriate changes in the text. I think that time has been well spent. Pls excuse the fact that it has taken me SOOO LONG to get this done. On we go...

Gregory.


-danny


Note: Messages sent to this list are the opinions of the senders and do not imply endorsement by the IETF.