Internet Engineering Task Force (IETF) C. Filsfils, Ed.
Request for Comments: 8754 D. Dukes, Ed.
Category: Standards Track Cisco Systems, Inc.
ISSN: 2070-1721 S. Previdi
Huawei
J. Leddy
Individual
S. Matsushima
SoftBank
D. Voyer
Bell Canada
March 2020
IPv6 Segment Routing Header (SRH)
Abstract
Segment Routing can be applied to the IPv6 data plane using a new
type of Routing Extension Header called the Segment Routing Header
(SRH). This document describes the SRH and how it is used by nodes
that are Segment Routing (SR) capable.
Status of This Memo
This is an Internet Standards Track document.
This document is a product of the Internet Engineering Task Force
(IETF). It represents the consensus of the IETF community. It has
received public review and has been approved for publication by the
Internet Engineering Steering Group (IESG). Further information on
Internet Standards is available in Section 2 of RFC 7841.
Information about the current status of this document, any errata,
and how to provide feedback on it may be obtained at
https://www.rfc-editor.org/info/rfc8754.
Copyright Notice
Copyright (c) 2020 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(https://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction
1.1. Terminology
1.2. Requirements Language
2. Segment Routing Header
2.1. SRH TLVs
2.1.1. Padding TLVs
2.1.2. HMAC TLV
3. SR Nodes
3.1. SR Source Node
3.2. Transit Node
3.3. SR Segment Endpoint Node
4. Packet Processing
4.1. SR Source Node
4.1.1. Reduced SRH
4.2. Transit Node
4.3. SR Segment Endpoint Node
4.3.1. FIB Entry Is a Locally Instantiated SRv6 SID
4.3.2. FIB Entry Is a Local Interface
4.3.3. FIB Entry Is a Nonlocal Route
4.3.4. FIB Entry Is a No Match
5. Intra-SR-Domain Deployment Model
5.1. Securing the SR Domain
5.2. SR Domain as a Single System with Delegation among
Components
5.3. MTU Considerations
5.4. ICMP Error Processing
5.5. Load Balancing and ECMP
5.6. Other Deployments
6. Illustrations
6.1. Abstract Representation of an SRH
6.2. Example Topology
6.3. SR Source Node
6.3.1. Intra-SR-Domain Packet
6.3.2. Inter-SR-Domain Packet -- Transit
6.3.3. Inter-SR-Domain Packet -- Internal to External
6.4. Transit Node
6.5. SR Segment Endpoint Node
6.6. Delegation of Function with HMAC Verification
6.6.1. SID List Verification
7. Security Considerations
7.1. SR Attacks
7.2. Service Theft
7.3. Topology Disclosure
7.4. ICMP Generation
7.5. Applicability of AH
8. IANA Considerations
8.1. Segment Routing Header Flags Registry
8.2. Segment Routing Header TLVs Registry
9. References
9.1. Normative References
9.2. Informative References
Acknowledgements
Contributors
Authors' Addresses
1. Introduction
Segment Routing (SR) can be applied to the IPv6 data plane using a
new type of routing header called the Segment Routing Header (SRH).
This document describes the SRH and how it is used by nodes that are
SR capable.
"Segment Routing Architecture" [RFC8402] describes Segment Routing
and its instantiation in two data planes: MPLS and IPv6.
The encoding of IPv6 segments in the SRH is defined in this document.
1.1. Terminology
This document uses the terms Segment Routing (SR), SR domain, SR over
IPv6 (SRv6), Segment Identifier (SID), SRv6 SID, Active Segment, and
SR Policy as defined in [RFC8402].
1.2. Requirements Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
"OPTIONAL" in this document are to be interpreted as described in
BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all
capitals, as shown here.
2. Segment Routing Header
Routing headers are defined in [RFC8200]. The Segment Routing Header
(SRH) has a new Routing Type (4).
The SRH is defined as follows:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Next Header | Hdr Ext Len | Routing Type | Segments Left |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Last Entry | Flags | Tag |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Segment List[0] (128-bit IPv6 address) |
| |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| |
...
| |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Segment List[n] (128-bit IPv6 address) |
| |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
// //
// Optional Type Length Value objects (variable) //
// //
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
where:
Next Header: Defined in [RFC8200], Section 4.4.
Hdr Ext Len: Defined in [RFC8200], Section 4.4.
Routing Type: 4.
Segments Left: Defined in [RFC8200], Section 4.4.
Last Entry: contains the index (zero based), in the Segment List, of
the last element of the Segment List.
Flags: 8 bits of flags. Section 8.1 creates an IANA registry for
new flags to be defined. The following flags are defined:
0 1 2 3 4 5 6 7
+-+-+-+-+-+-+-+-+
|U U U U U U U U|
+-+-+-+-+-+-+-+-+
U: Unused and for future use. MUST be 0 on transmission and
ignored on receipt.
Tag: Tag a packet as part of a class or group of packets -- e.g.,
packets sharing the same set of properties. When Tag is not used
at the source, it MUST be set to zero on transmission. When Tag
is not used during SRH processing, it SHOULD be ignored. Tag is
not used when processing the SID defined in Section 4.3.1. It may
be used when processing other SIDs that are not defined in this
document. The allocation and use of tag is outside the scope of
this document.
Segment List[0..n]: 128-bit IPv6 addresses representing the nth
segment in the Segment List. The Segment List is encoded starting
from the last segment of the SR Policy. That is, the first
element of the Segment List (Segment List[0]) contains the last
segment of the SR Policy, the second element contains the
penultimate segment of the SR Policy, and so on.
TLV: Type Length Value (TLV) is described in Section 2.1.
In the SRH, the Next Header, Hdr Ext Len, Routing Type, and Segments
Left fields are defined in Section 4.4 of [RFC8200]. Based on the
constraints in that section, Next Header, Header Ext Len, and Routing
Type are not mutable while Segments Left is mutable.
The mutability of the TLV value is defined by the most significant
bit in the type, as specified in Section 2.1.
Section 4.3 defines the mutability of the remaining fields in the SRH
(Flags, Tag, Segment List) in the context of the SID defined in this
document.
New SIDs defined in the future MUST specify the mutability properties
of the Flags, Tag, and Segment List and indicate how the Hashed
Message Authentication Code (HMAC) TLV (Section 2.1.2) verification
works. Note that, in effect, these fields are mutable.
Consistent with the SR model, the source of the SRH always knows how
to set the Segment List, Flags, Tag, and TLVs of the SRH for use
within the SR domain. How it achieves this is outside the scope of
this document but may be based on topology, available SIDs and their
mutability properties, the SRH mutability requirements of the
destination, or any other information.
2.1. SRH TLVs
This section defines TLVs of the Segment Routing Header.
A TLV provides metadata for segment processing. The only TLVs
defined in this document are the HMAC (Section 2.1.2) and padding
TLVs (Section 2.1.1). While processing the SID defined in
Section 4.3.1, all TLVs are ignored unless local configuration
indicates otherwise (Section 4.3.1.1.1). Thus, TLV and HMAC support
is optional for any implementation; however, an implementation adding
or parsing TLVs MUST support PAD TLVs. Other documents may define
additional TLVs and processing rules for them.
TLVs are present when the Hdr Ext Len is greater than (Last
Entry+1)*2.
While processing TLVs at a segment endpoint, TLVs MUST be fully
contained within the SRH as determined by the Hdr Ext Len. Detection
of TLVs exceeding the boundary of the SRH Hdr Ext Len results in an
ICMP Parameter Problem, Code 0, message to the Source Address,
pointing to the Hdr Ext Len field of the SRH, and the packet being
discarded.
An implementation MAY limit the number and/or length of TLVs it
processes based on local configuration. It MAY limit:
* the number of consecutive Pad1 (Section 2.1.1.1) options to 1. If
padding of more than one byte is required, then PadN
(Section 2.1.1.2) should be used.
* The length in PadN to 5.
* The maximum number of non-Pad TLVs to be processed.
* The maximum length of all TLVs to be processed.
The implementation MAY stop processing additional TLVs in the SRH
when these configured limits are exceeded.
0 1
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-----------------------
| Type | Length | Variable-length data
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-----------------------
Type: An 8-bit codepoint from the "Segment Routing Header TLVs"
[IANA-SRHTLV]. Unrecognized Types MUST be ignored on receipt.
Length: The length of the variable-length data field in bytes.
Variable-length data: data that is specific to the Type.
Type Length Value (TLV) entries contain OPTIONAL information that may
be used by the node identified in the Destination Address (DA) of the
packet.
Each TLV has its own length, format, and semantic. The codepoint
allocated (by IANA) to each TLV Type defines both the format and the
semantic of the information carried in the TLV. Multiple TLVs may be
encoded in the same SRH.
The highest-order bit of the TLV type (bit 0) specifies whether or
not the TLV data of that type can change en route to the packet's
final destination:
0: TLV data does not change en route
1: TLV data does change en route
All TLVs specify their alignment requirements using an xn+y format.
The xn+y format is defined as per [RFC8200]. The SR source nodes use
the xn+y alignment requirements of TLVs and Padding TLVs when
constructing an SRH.
The Length field of the TLV is used to skip the TLV while inspecting
the SRH in case the node doesn't support or recognize the Type. The
Length defines the TLV length in octets, not including the Type and
Length fields.
The following TLVs are defined in this document:
Padding TLVs
HMAC TLV
Additional TLVs may be defined in the future.
2.1.1. Padding TLVs
There are two types of Padding TLVs, Pad1 and PadN, and the following
applies to both:
Padding TLVs are used for meeting the alignment requirement of the
subsequent TLVs.
Padding TLVs are used to pad the SRH to a multiple of 8 octets.
Padding TLVs are ignored by a node processing the SRH TLV.
Multiple Padding TLVs MAY be used in one SRH.
2.1.1.1. Pad1
Alignment requirement: none
0 1 2 3 4 5 6 7
+-+-+-+-+-+-+-+-+
| Type |
+-+-+-+-+-+-+-+-+
Type: 0
A single Pad1 TLV MUST be used when a single byte of padding is
required. A Pad1 TLV MUST NOT be used if more than one consecutive
byte of padding is required.
2.1.1.2. PadN
Alignment requirement: none
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length | Padding (variable) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
// Padding (variable) //
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Type: 4
Length: 0 to 5. The length of the Padding field in bytes.
Padding: Padding bits have no semantic. They MUST be set to 0 on
transmission and ignored on receipt.
The PadN TLV MUST be used when more than one byte of padding is
required.
2.1.2. HMAC TLV
Alignment requirement: 8n
The keyed Hashed Message Authentication Code (HMAC) TLV is OPTIONAL
and has the following format:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length |D| RESERVED |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| HMAC Key ID (4 octets) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| //
| HMAC (variable) //
| //
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
where:
Type: 5.
Length: The length of the variable-length data in bytes.
D: 1 bit. 1 indicates that the Destination Address verification is
disabled due to use of a reduced Segment List (see Section 4.1.1).
RESERVED: 15 bits. MUST be 0 on transmission.
HMAC Key ID: A 4-octet opaque number that uniquely identifies the
pre-shared key and algorithm used to generate the HMAC.
HMAC: Keyed HMAC, in multiples of 8 octets, at most 32 octets.
The HMAC TLV is used to verify that the SRH applied to a packet was
selected by an authorized party and to ensure that the segment list
is not modified after generation. This also allows for verification
that the current segment (by virtue of being in the authorized
Segment List) is authorized for use. The SR domain ensures that the
source node is permitted to use the source address in the packet via
ingress filtering mechanisms as defined in BCP 84 [RFC3704] or other
strategies as appropriate.
2.1.2.1. HMAC Generation and Verification
Local configuration determines when to check for an HMAC. This local
configuration is outside the scope of this document. It may be based
on the active segment at an SR Segment endpoint node, the result of
an Access Control List (ACL) that considers incoming interface, HMAC
Key ID, or other packet fields.
An implementation that supports the generation and verification of
the HMAC supports the following default behavior, as defined in the
remainder of this section.
The HMAC verification begins by checking that the current segment is
equal to the destination address of the IPv6 header. The check is
successful when either:
* HMAC D bit is 1 and Segments Left is greater than Last Entry, or
* HMAC Segments Left is less than or equal to Last Entry, and the
destination address is equal to Segment List[Segments Left].
The HMAC field is the output of the HMAC computation as defined in
[RFC2104], using:
* key: The pre-shared key identified by HMAC Key ID
* HMAC algorithm: Identified by the HMAC Key ID
* Text: A concatenation of the following fields from the IPv6 header
and the SRH, as it would be received at the node verifying the
HMAC:
- IPv6 header: Source address (16 octets)
- SRH: Last Entry (1 octet)
- SRH: Flags (1 octet)
- SRH: HMAC 16 bits following Length
- SRH: HMAC Key ID (4 octets)
- SRH: All addresses in the Segment List (variable octets)
The HMAC digest is truncated to 32 octets and placed in the HMAC
field of the HMAC TLV.
For HMAC algorithms producing digests less than 32 octets long, the
digest is placed in the lowest-order octets of the HMAC field.
Subsequent octets MUST be set to zero such that the HMAC length is a
multiple of 8 octets.
If HMAC verification is successful, processing proceeds as normal.
If HMAC verification fails, an ICMP error message (parameter problem,
error code 0, pointing to the HMAC TLV) SHOULD be generated (but rate
limited) and logged, and the packet SHOULD be discarded.
2.1.2.2. HMAC Pre-shared Key Algorithm
The HMAC Key ID field allows for the simultaneous existence of
several hash algorithms (SHA-256, SHA3-256 ... or future ones) as
well as pre-shared keys.
The HMAC Key ID field is opaque -- i.e., it has neither syntax nor
semantic except as an identifier of the right combination of pre-
shared key and hash algorithm.
At the HMAC TLV generating and verification nodes, the Key ID
uniquely identifies the pre-shared key and HMAC algorithm.
At the HMAC TLV generating node, the Text for the HMAC computation is
set to the IPv6 header fields and SRH fields as they would appear at
the verification node(s), not necessarily the same as the source node
sending a packet with the HMAC TLV.
Pre-Shared key rollover is supported by having two key IDs in use
while the HMAC TLV generating node and verifying node converge to a
new key.
The HMAC TLV generating node may need to revoke an SRH for which it
previously generated an HMAC. Revocation is achieved by allocating a
new key and key ID, then rolling over the key ID associated with the
SRH to be revoked. The HMAC TLV verifying node drops packets with
the revoked SRH.
An implementation supporting HMAC can support multiple hash
functions. An implementation supporting HMAC MUST implement SHA-2
[FIPS180-4] in its SHA-256 variant.
The selection of pre-shared key and algorithm and their distribution
is outside the scope of this document. Some options may include:
* setting these items in the configuration of the HMAC generating or
verifying nodes, either by static configuration or any SDN-
oriented approach
* dynamically using a trusted key distribution protocol such as
[RFC6407]
While key management is outside the scope of this document, the
recommendations of BCP 107 [RFC4107] should be considered when
choosing the key management system.
3. SR Nodes
There are different types of nodes that may be involved in segment
routing networks: SR source nodes that originate packets with a
segment in the destination address of the IPv6 header, transit nodes
that forward packets destined to a remote segment, and SR segment
endpoint nodes that process a local segment in the destination
address of an IPv6 header.
3.1. SR Source Node
A SR source node is any node that originates an IPv6 packet with a
segment (i.e., SRv6 SID) in the destination address of the IPv6
header. The packet leaving the SR source node may or may not contain
an SRH. This includes either:
* A host originating an IPv6 packet, or
* An SR domain ingress router encapsulating a received packet in an
outer IPv6 header, followed by an optional SRH.
It is out of the scope of this document to describe the mechanism
through which a segment in the destination address of the IPv6 header
and the Segment List in the SRH are derived.
3.2. Transit Node
A transit node is any node forwarding an IPv6 packet where the
destination address of that packet is not locally configured as a
segment or a local interface. A transit node is not required to be
capable of processing a segment or SRH.
3.3. SR Segment Endpoint Node
An SR segment endpoint node is any node receiving an IPv6 packet
where the destination address of that packet is locally configured as
a segment or local interface.
4. Packet Processing
This section describes SRv6 packet processing at the SR source,
Transit, and SR segment endpoint nodes.
4.1. SR Source Node
A source node steers a packet into an SR Policy. If the SR Policy
results in a Segment List containing a single segment, and there is
no need to add information to the SRH flag or add TLV; the DA is set
to the single Segment List entry, and the SRH MAY be omitted.
When needed, the SRH is created as follows:
The Next Header and Hdr Ext Len fields are set as specified in
[RFC8200].
The Routing Type field is set to 4.
The DA of the packet is set with the value of the first segment.
The first element of the SRH Segment List is the ultimate segment.
The second element is the penultimate segment, and so on.
The Segments Left field is set to n-1, where n is the number of
elements in the SR Policy.
The Last Entry field is set to n-1, where n is the number of
elements in the SR Policy.
TLVs (including HMAC) may be set according to their specification.
The packet is forwarded toward the packet's Destination Address
(the first segment).
4.1.1. Reduced SRH
When a source does not require the entire SID list to be preserved in
the SRH, a reduced SRH may be used.
A reduced SRH does not contain the first segment of the related SR
Policy (the first segment is the one already in the DA of the IPv6
header), and the Last Entry field is set to n-2, where n is the
number of elements in the SR Policy.
4.2. Transit Node
As specified in [RFC8200], the only node allowed to inspect the
Routing Extension Header (and therefore the SRH) is the node
corresponding to the DA of the packet. Any other transit node MUST
NOT inspect the underneath routing header and MUST forward the packet
toward the DA according to its IPv6 routing table.
When a SID is in the destination address of an IPv6 header of a
packet, it's routed through an IPv6 network as an IPv6 address.
SIDs, or the prefix(es) covering SIDs, and their reachability may be
distributed by means outside the scope of this document. For
example, [RFC5308] or [RFC5340] may be used to advertise a prefix
covering the SIDs on a node.
4.3. SR Segment Endpoint Node
Without constraining the details of an implementation, the SR segment
endpoint node creates Forwarding Information Base (FIB) entries for
its local SIDs.
When an SRv6-capable node receives an IPv6 packet, it performs a
longest-prefix-match lookup on the packet's destination address.
This lookup can return any of the following:
* A FIB entry that represents a locally instantiated SRv6 SID
* A FIB entry that represents a local interface, not locally
instantiated as an SRv6 SID
* A FIB entry that represents a nonlocal route
* No Match
4.3.1. FIB Entry Is a Locally Instantiated SRv6 SID
This document and section define a single SRv6 SID. Future documents
may define additional SRv6 SIDs. In such a case, the entire content
of this section will be defined in that document.
If the FIB entry represents a locally instantiated SRv6 SID, process
the next header chain of the IPv6 header as defined in Section 4 of
[RFC8200]. Section 4.3.1.1 describes how to process an SRH;
Section 4.3.1.2 describes how to process an upper-layer header or the
absence of a Next Header.
Processing this SID modifies the Segments Left and, if configured to
process TLVs, it may modify the "variable-length data" of TLV types
that change en route. Therefore, Segments Left is mutable, and TLVs
that change en route are mutable. The remainder of the SRH (Flags,
Tag, Segment List, and TLVs that do not change en route) are
immutable while processing this SID.
4.3.1.1. SRH Processing
S01. When an SRH is processed {
S02. If Segments Left is equal to zero {
S03. Proceed to process the next header in the packet,
whose type is identified by the Next Header field in
the routing header.
S04. }
S05. Else {
S06. If local configuration requires TLV processing {
S07. Perform TLV processing (see TLV Processing)
S08. }
S09. max_last_entry = ( Hdr Ext Len / 2 ) - 1
S10. If ((Last Entry > max_last_entry) or
S11. (Segments Left is greater than (Last Entry+1)) {
S12. Send an ICMP Parameter Problem, Code 0, message to
the Source Address, pointing to the Segments Left
field, and discard the packet.
S13. }
S14. Else {
S15. Decrement Segments Left by 1.
S16. Copy Segment List[Segments Left] from the SRH to the
destination address of the IPv6 header.
S17. If the IPv6 Hop Limit is less than or equal to 1 {
S18. Send an ICMP Time Exceeded -- Hop Limit Exceeded in
Transit message to the Source Address and discard
the packet.
S19. }
S20. Else {
S21. Decrement the Hop Limit by 1
S22. Resubmit the packet to the IPv6 module for transmission
to the new destination.
S23. }
S24. }
S25. }
S26. }
4.3.1.1.1. TLV Processing
Local configuration determines how TLVs are to be processed when the
Active Segment is a local SID defined in this document. The
definition of local configuration is outside the scope of this
document.
For illustration purposes only, two example local configurations that
may be associated with a SID are provided below.
Example 1:
For any packet received from interface I2
Skip TLV processing
Example 2:
For any packet received from interface I1
If first TLV is HMAC {
Process the HMAC TLV
}
Else {
Discard the packet
}
4.3.1.2. Upper-Layer Header or No Next Header
When processing the upper-layer header of a packet matching a FIB
entry locally instantiated as an SRv6 SID defined in this document:
IF (Upper-layer Header is IPv4 or IPv6) and
local configuration permits {
Perform IPv6 decapsulation
Resubmit the decapsulated packet to the IPv4 or IPv6 module
}
ELSE {
Send an ICMP parameter problem message to the Source Address and
discard the packet. Error code (4) "SR Upper-layer
Header Error", pointer set to the offset of the upper-layer
header.
}
A unique error code allows an SR source node to recognize an error in
SID processing at an endpoint.
4.3.2. FIB Entry Is a Local Interface
If the FIB entry represents a local interface and is not locally
instantiated as an SRv6 SID, the SRH is processed as follows:
If Segments Left is zero, the node must ignore the routing header
and proceed to process the next header in the packet, whose type
is identified by the Next Header field in the routing header.
If Segments Left is non-zero, the node must discard the packet and
send an ICMP Parameter Problem, Code 0, message to the packet's
Source Address, pointing to the unrecognized Routing Type.
4.3.3. FIB Entry Is a Nonlocal Route
Processing is not changed by this document.
4.3.4. FIB Entry Is a No Match
Processing is not changed by this document.
5. Intra-SR-Domain Deployment Model
The use of the SIDs exclusively within the SR domain and solely for
packets of the SR domain is an important deployment model.
This enables the SR domain to act as a single routing system.
This section covers:
* securing the SR domain from external attempts to use its SIDs
* using the SR domain as a single system with delegation between
components
* handling packets of the SR domain
5.1. Securing the SR Domain
Nodes outside the SR domain are not trusted: they cannot directly use
the SIDs of the domain. This is enforced by two levels of access
control lists:
1. Any packet entering the SR domain and destined to a SID within
the SR domain is dropped. This may be realized with the
following logic. Other methods with equivalent outcome are
considered compliant:
* Allocate all the SIDs from a block S/s
* Configure each external interface of each edge node of the
domain with an inbound infrastructure access list (IACL) that
drops any incoming packet with a destination address in S/s
* Failure to implement this method of ingress filtering exposes
the SR domain to source-routing attacks, as described and
referenced in [RFC5095]
2. The distributed protection in #1 is complemented with per-node
protection, dropping packets to SIDs from source addresses
outside the SR domain. This may be realized with the following
logic. Other methods with equivalent outcome are considered
compliant:
* Assign all interface addresses from prefix A/a
* At node k, all SIDs local to k are assigned from prefix Sk/sk
* Configure each internal interface of each SR node k in the SR
domain with an inbound IACL that drops any incoming packet
with a destination address in Sk/sk if the source address is
not in A/a.
5.2. SR Domain as a Single System with Delegation among Components
All intra-SR-domain packets are of the SR domain. The IPv6 header is
originated by a node of the SR domain and is destined to a node of
the SR domain.
All interdomain packets are encapsulated for the part of the packet
journey that is within the SR domain. The outer IPv6 header is
originated by a node of the SR domain and is destined to a node of
the SR domain.
As a consequence, any packet within the SR domain is of the SR
domain.
The SR domain is a system in which the operator may want to
distribute or delegate different operations of the outermost header
to different nodes within the system.
An operator of an SR domain may choose to delegate SRH addition to a
host node within the SR domain and delegate validation of the
contents of any SRH to a more trusted router or switch attached to
the host. Consider a top-of-rack switch T connected to host H via
interface I. H receives an SRH (SRH1) with a computed HMAC via some
SDN method outside the scope of this document. H classifies traffic
it sources and adds SRH1 to traffic requiring a specific Service
Level Agreement (SLA). T is configured with an IACL on I requiring
verification of the SRH for any packet destined to the SID block of
the SR domain (S/s). T checks and verifies that SRH1 is valid and
contains an HMAC TLV; T then verifies the HMAC.
An operator of the SR domain may choose to have all segments in the
SR domain verify the HMAC. This mechanism would verify that the SRH
Segment List is not modified while traversing the SR domain.
5.3. MTU Considerations
An SR domain ingress edge node encapsulates packets traversing the SR
domain and needs to consider the MTU of the SR domain. Within the SR
domain, well-known mitigation techniques are RECOMMENDED, such as
deploying a greater MTU value within the SR domain than at the
ingress edges.
Encapsulation with an outer IPv6 header and SRH shares the same MTU
and fragmentation considerations as IPv6 tunnels described in
[RFC2473]. Further investigation on the limitation of various
tunneling methods (including IPv6 tunnels) is discussed in
[INTAREA-TUNNELS] and SHOULD be considered by operators when
considering MTU within the SR domain.
5.4. ICMP Error Processing
ICMP error packets generated within the SR domain are sent to source
nodes within the SR domain. The invoking packet in the ICMP error
message may contain an SRH. Since the destination address of a
packet with an SRH changes as each segment is processed, it may not
be the destination used by the socket or application that generated
the invoking packet.
For the source of an invoking packet to process the ICMP error
message, the ultimate destination address of the IPv6 header may be
required. The following logic is used to determine the destination
address for use by protocol-error handlers.
* Walk all extension headers of the invoking IPv6 packet to the
routing extension header preceding the upper-layer header.
- If routing header is type 4 Segment Routing Header (SRH)
o The SID at Segment List[0] may be used as the destination
address of the invoking packet.
ICMP errors are then processed by upper-layer transports as defined
in [RFC4443].
For IP packets encapsulated in an outer IPv6 header, ICMP error
handling is as defined in [RFC2473].
5.5. Load Balancing and ECMP
For any interdomain packet, the SR source node MUST impose a flow
label computed based on the inner packet. The computation of the
flow label is as recommended in [RFC6438] for the sending Tunnel End
Point.
For any intradomain packet, the SR source node SHOULD impose a flow
label computed as described in [RFC6437] to assist ECMP load
balancing at transit nodes incapable of computing a 5-tuple beyond
the SRH.
At any transit node within an SR domain, the flow label MUST be used
as defined in [RFC6438] to calculate the ECMP hash toward the
destination address. If a flow label is not used, the transit node
would likely hash all packets between a pair of SR Edge nodes to the
same link.
At an SR segment endpoint node, the flow label MUST be used as
defined in [RFC6438] to calculate any ECMP hash used to forward the
processed packet to the next segment.
5.6. Other Deployments
Other deployment models and their implications on security, MTU,
HMAC, ICMP error processing, and interaction with other extension
headers are outside the scope of this document.
6. Illustrations
This section provides illustrations of SRv6 packet processing at SR
source, transit, and SR segment endpoint nodes.
6.1. Abstract Representation of an SRH
For a node k, its IPv6 address is represented as Ak, and its SRv6 SID
is represented as Sk.
IPv6 headers are represented as the tuple of (source,destination).
For example, a packet with source address A1 and destination address
A2 is represented as (A1,A2). The payload of the packet is omitted.
An SR Policy is a list of segments. A list of segments is
represented as <S1,S2,S3> where S1 is the first SID to visit, S2 is
the second SID to visit, and S3 is the last SID to visit.
(SA,DA) (S3,S2,S1; SL) represents an IPv6 packet with:
* Source Address SA, Destination Addresses DA, and next header SRH.
* SRH with SID list <S1,S2,S3> with SegmentsLeft = SL.
* Note the difference between the <> and () symbols. <S1,S2,S3>
represents a SID list where the leftmost segment is the first
segment. In contrast, (S3,S2,S1; SL) represents the same SID list
but encoded in the SRH Segment List format where the leftmost
segment is the last segment. When referring to an SR Policy in a
high-level use case, it is simpler to use the <S1,S2,S3> notation.
When referring to an illustration of detailed behavior, the
(S3,S2,S1; SL) notation is more convenient.
At its SR Policy headend, the Segment List <S1,S2,S3> results in SRH
(S3,S2,S1; SL=2) represented fully as:
Segments Left=2
Last Entry=2
Flags=0
Tag=0
Segment List[0]=S3
Segment List[1]=S2
Segment List[2]=S1
6.2. Example Topology
The following topology is used in examples below:
+ * * * * * * * * * * * * * * * * * * * * +
* [8] [9] *
| |
* | | *
[1]----[3]--------[5]----------------[6]---------[4]---[2]
* | | *
| |
* | | *
+--------[7]-------+
* *
+ * * * * * * * SR domain * * * * * * * +
Figure 1
* 3 and 4 are SR domain edge routers
* 5, 6, and 7 are all SR domain routers
* 8 and 9 are hosts within the SR domain
* 1 and 2 are hosts outside the SR domain
* The SR domain implements ingress filtering as per Section 5.1 and
no external packet can enter the domain with a destination address
equal to a segment of the domain.
6.3. SR Source Node
6.3.1. Intra-SR-Domain Packet
When host 8 sends a packet to host 9 via an SR Policy <S7,A9> the
packet is
P1: (A8,S7)(A9,S7; SL=1)
6.3.1.1. Reduced Variant
When host 8 sends a packet to host 9 via an SR Policy <S7,A9> and it
wants to use a reduced SRH, the packet is
P2: (A8,S7)(A9; SL=1)
6.3.2. Inter-SR-Domain Packet -- Transit
When host 1 sends a packet to host 2, the packet is
P3: (A1,A2)
The SR domain ingress router 3 receives P3 and steers it to SR domain
egress router 4 via an SR Policy <S7,S4>. Router 3 encapsulates the
received packet P3 in an outer header with an SRH. The packet is
P4: (A3,S7)(S4,S7; SL=1)(A1,A2)
If the SR Policy contains only one segment (the egress router 4), the
ingress router 3 encapsulates P3 into an outer header (A3,S4) without
SRH. The packet is
P5: (A3,S4)(A1,A2)
6.3.2.1. Reduced Variant
The SR domain ingress router 3 receives P3 and steers it to SR domain
egress router 4 via an SR Policy <S7,S4>. If router 3 wants to use a
reduced SRH, it encapsulates the received packet P3 in an outer
header with a reduced SRH. The packet is
P6: (A3,S7)(S4; SL=1)(A1,A2)
6.3.3. Inter-SR-Domain Packet -- Internal to External
When host 8 sends a packet to host 1, the packet is encapsulated for
the portion of its journey within the SR domain. From 8 to 3 the
packet is
P7: (A8,S3)(A8,A1)
In the opposite direction, the packet generated from 1 to 8 is
P8: (A1,A8)
At node 3, P8 is encapsulated for the portion of its journey within
the SR domain, with the outer header destined to segment S8. This
results in
P9: (A3,S8)(A1,A8)
At node 8, the outer IPv6 header is removed by S8 processing, then
processed again when received by A8.
6.4. Transit Node
Node 5 acts as transit node for packet P1 and sends packet
P1: (A8,S7)(A9,S7;SL=1)
on the interface toward node 7.
6.5. SR Segment Endpoint Node
Node 7 receives packet P1 and, using the logic in Section 4.3.1,
sends packet
P7: (A8,A9)(A9,S7; SL=0)
on the interface toward router 6.
6.6. Delegation of Function with HMAC Verification
This section describes how a function may be delegated within the SR
domain. In the following sections, consider a host 8 connected to a
top of rack 5.
6.6.1. SID List Verification
An operator may prefer to apply the SRH at source 8, while 5 verifies
that the SID list is valid.
For illustration purposes, an SDN controller provides 8 an SRH
terminating at node 9, with Segment List <S5,S7,S6,A9>, and HMAC TLV
computed for the SRH. The HMAC key ID and key associated with the
HMAC TLV is shared with 5. Node 8 does not know the key. Node 5 is
configured with an IACL applied to the interface connected to 8,
requiring HMAC verification for any packet destined to S/s.
Node 8 originates packets with the received SRH, including HMAC TLV.
P15: (A8,S5)(A9,S6,S7,S5;SL=3;HMAC)
Node 5 receives and verifies the HMAC for the SRH, then forwards the
packet to the next segment
P16: (A8,S7)(A9,S6,S7,S5;SL=2;HMAC)
Node 6 receives
P17: (A8,S6)(A9,S6,S7,S5;SL=1;HMAC)
Node 9 receives
P18: (A8,A9)(A9,S6,S7,S5;SL=0;HMAC)
This use of an HMAC is particularly valuable within an enterprise-
based SR domain [SRN].
7. Security Considerations
This section reviews security considerations related to the SRH,
given the SRH processing and deployment models discussed in this
document.
As described in Section 5, it is necessary to filter packets' ingress
to the SR domain, destined to SIDs within the SR domain (i.e.,
bearing a SID in the destination address). This ingress filtering is
via an IACL at SR domain ingress border nodes. Additional protection
is applied via an IACL at each SR Segment Endpoint node, filtering
packets not from within the SR domain, destined to SIDs in the SR
domain. ACLs are easily supported for small numbers of seldom
changing prefixes, making summarization important.
Additionally, ingress filtering of IPv6 source addresses as
recommended in BCP 38 [RFC2827] SHOULD be used.
7.1. SR Attacks
An SR domain implements distributed and per-node protection as
described in Section 5.1. Additionally, domains deny traffic with
spoofed addresses by implementing the recommendations in BCP 84
[RFC3704].
Full implementation of the recommended protection blocks the attacks
documented in [RFC5095] from outside the SR domain, including
bypassing filtering devices, reaching otherwise-unreachable Internet
systems, network topology discovery, bandwidth exhaustion, and
defeating anycast.
Failure to implement distributed and per-node protection allows
attackers to bypass filtering devices and exposes the SR domain to
these attacks.
Compromised nodes within the SR domain may mount the attacks listed
above along with other known attacks on IP networks (e.g., DoS/DDoS,
topology discovery, man-in-the-middle, traffic interception/
siphoning).
7.2. Service Theft
Service theft is defined as the use of a service offered by the SR
domain by a node not authorized to use the service.
Service theft is not a concern within the SR domain, as all SR source
nodes and SR segment endpoint nodes within the domain are able to
utilize the services of the domain. If a node outside the SR domain
learns of segments or a topological service within the SR domain,
IACL filtering denies access to those segments.
7.3. Topology Disclosure
The SRH is unencrypted and may contain SIDs of some intermediate SR
nodes in the path towards the destination within the SR domain. If
packets can be snooped within the SR domain, the SRH may reveal
topology, traffic flows, and service usage.
This is applicable within an SR domain, but the disclosure is less
relevant as an attacker has other means of learning topology, flows,
and service usage.
7.4. ICMP Generation
The generation of ICMPv6 error messages may be used to attempt
denial-of-service attacks by sending an error-causing destination
address or SRH in back-to-back packets. An implementation that
correctly follows Section 2.4 of [RFC4443] would be protected by the
ICMPv6 rate-limiting mechanism.
7.5. Applicability of AH
The SR domain is a trusted domain, as defined in [RFC8402], Sections
2 and 8.2. The SR source is trusted to add an SRH (optionally
verified as having been generated by a trusted source via the HMAC
TLV in this document), and segments advertised within the domain are
trusted to be accurate and advertised by trusted sources via a secure
control plane. As such, the SR domain does not rely on the
Authentication Header (AH) as defined in [RFC4302] to secure the SRH.
The use of SRH with AH by an SR source node and its processing at an
SR segment endpoint node are not defined in this document. Future
documents may define use of SRH with AH and its processing.
8. IANA Considerations
This document makes the following registrations in the "Internet
Protocol Version 6 (IPv6) Parameters" "Routing Types" subregistry
maintained by IANA:
+-------+------------------------------+---------------+
| Value | Description | Reference |
+=======+==============================+===============+
| 4 | Segment Routing Header (SRH) | This document |
+-------+------------------------------+---------------+
Table 1: SRH Registration
This document makes the following registrations in the "Type 4 -
Parameter Problem" message of the "Internet Control Message Protocol
version 6 (ICMPv6) Parameters" registry maintained by IANA:
+------+-----------------------------+
| Code | Name |
+======+=============================+
| 4 | SR Upper-layer Header Error |
+------+-----------------------------+
Table 2: SR Upper-layer Header
Error Registration
8.1. Segment Routing Header Flags Registry
This document describes a new IANA-managed registry to identify SRH
Flags Bits. The registration procedure is "IETF Review" [RFC8126].
The registry name is "Segment Routing Header Flags". Flags are 8
bits.
8.2. Segment Routing Header TLVs Registry
This document describes a new IANA-managed registry to identify SRH
TLVs. The registration procedure is "IETF Review". The registry
name is "Segment Routing Header TLVs". A TLV is identified through
an unsigned 8-bit codepoint value, with assigned values 0-127 for
TLVs that do not change en route and 128-255 for TLVs that may change
en route. The following codepoints are defined in this document:
+---------+--------------------------+---------------+
| Value | Description | Reference |
+=========+==========================+===============+
| 0 | Pad1 TLV | This document |
+---------+--------------------------+---------------+
| 1 | Reserved | This document |
+---------+--------------------------+---------------+
| 2 | Reserved | This document |
+---------+--------------------------+---------------+
| 3 | Reserved | This document |
+---------+--------------------------+---------------+
| 4 | PadN TLV | This document |
+---------+--------------------------+---------------+
| 5 | HMAC TLV | This document |
+---------+--------------------------+---------------+
| 6 | Reserved | This document |
+---------+--------------------------+---------------+
| 124-126 | Experimentation and Test | This document |
+---------+--------------------------+---------------+
| 127 | Reserved | This document |
+---------+--------------------------+---------------+
| 252-254 | Experimentation and Test | This document |
+---------+--------------------------+---------------+
| 255 | Reserved | This document |
+---------+--------------------------+---------------+
Table 3: Segment Routing Header TLVs Registry
Values 1, 2, 3, and 6 were defined in draft versions of this
specification and are Reserved for backwards compatibility with early
implementations and should not be reassigned. Values 127 and 255 are
Reserved to allow for expansion of the Type field in future
specifications, if needed.
9. References
9.1. Normative References
[FIPS180-4]
National Institute of Standards and Technology (NIST),
"Secure Hash Standard (SHS)", FIPS PUB 180-4, DOI 10.6028/
NIST.FIPS.180-4, August 2015,
<http://csrc.nist.gov/publications/fips/fips180-4/fips-
180-4.pdf>.
[IANA-SRHTLV]
IANA, "Segment Routing Header TLVs",
<https://www.iana.org/assignments/ipv6-parameters/>.
[RFC2104] Krawczyk, H., Bellare, M., and R. Canetti, "HMAC: Keyed-
Hashing for Message Authentication", RFC 2104,
DOI 10.17487/RFC2104, February 1997,
<https://www.rfc-editor.org/info/rfc2104>.
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119,
DOI 10.17487/RFC2119, March 1997,
<https://www.rfc-editor.org/info/rfc2119>.
[RFC2473] Conta, A. and S. Deering, "Generic Packet Tunneling in
IPv6 Specification", RFC 2473, DOI 10.17487/RFC2473,
December 1998, <https://www.rfc-editor.org/info/rfc2473>.
[RFC2827] Ferguson, P. and D. Senie, "Network Ingress Filtering:
Defeating Denial of Service Attacks which employ IP Source
Address Spoofing", BCP 38, RFC 2827, DOI 10.17487/RFC2827,
May 2000, <https://www.rfc-editor.org/info/rfc2827>.
[RFC3704] Baker, F. and P. Savola, "Ingress Filtering for Multihomed
Networks", BCP 84, RFC 3704, DOI 10.17487/RFC3704, March
2004, <https://www.rfc-editor.org/info/rfc3704>.
[RFC4107] Bellovin, S. and R. Housley, "Guidelines for Cryptographic
Key Management", BCP 107, RFC 4107, DOI 10.17487/RFC4107,
June 2005, <https://www.rfc-editor.org/info/rfc4107>.
[RFC4302] Kent, S., "IP Authentication Header", RFC 4302,
DOI 10.17487/RFC4302, December 2005,
<https://www.rfc-editor.org/info/rfc4302>.
[RFC5095] Abley, J., Savola, P., and G. Neville-Neil, "Deprecation
of Type 0 Routing Headers in IPv6", RFC 5095,
DOI 10.17487/RFC5095, December 2007,
<https://www.rfc-editor.org/info/rfc5095>.
[RFC6407] Weis, B., Rowles, S., and T. Hardjono, "The Group Domain
of Interpretation", RFC 6407, DOI 10.17487/RFC6407,
October 2011, <https://www.rfc-editor.org/info/rfc6407>.
[RFC6437] Amante, S., Carpenter, B., Jiang, S., and J. Rajahalme,
"IPv6 Flow Label Specification", RFC 6437,
DOI 10.17487/RFC6437, November 2011,
<https://www.rfc-editor.org/info/rfc6437>.
[RFC6438] Carpenter, B. and S. Amante, "Using the IPv6 Flow Label
for Equal Cost Multipath Routing and Link Aggregation in
Tunnels", RFC 6438, DOI 10.17487/RFC6438, November 2011,
<https://www.rfc-editor.org/info/rfc6438>.
[RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
May 2017, <https://www.rfc-editor.org/info/rfc8174>.
[RFC8200] Deering, S. and R. Hinden, "Internet Protocol, Version 6
(IPv6) Specification", STD 86, RFC 8200,
DOI 10.17487/RFC8200, July 2017,
<https://www.rfc-editor.org/info/rfc8200>.
[RFC8402] Filsfils, C., Ed., Previdi, S., Ed., Ginsberg, L.,
Decraene, B., Litkowski, S., and R. Shakir, "Segment
Routing Architecture", RFC 8402, DOI 10.17487/RFC8402,
July 2018, <https://www.rfc-editor.org/info/rfc8402>.
9.2. Informative References
[INTAREA-TUNNELS]
Touch, J. and M. Townsley, "IP Tunnels in the Internet
Architecture", Work in Progress, Internet-Draft, draft-
ietf-intarea-tunnels-10, 12 September 2019,
<https://tools.ietf.org/html/draft-ietf-intarea-tunnels-
10>.
[RFC4443] Conta, A., Deering, S., and M. Gupta, Ed., "Internet
Control Message Protocol (ICMPv6) for the Internet
Protocol Version 6 (IPv6) Specification", STD 89,
RFC 4443, DOI 10.17487/RFC4443, March 2006,
<https://www.rfc-editor.org/info/rfc4443>.
[RFC5308] Hopps, C., "Routing IPv6 with IS-IS", RFC 5308,
DOI 10.17487/RFC5308, October 2008,
<https://www.rfc-editor.org/info/rfc5308>.
[RFC5340] Coltun, R., Ferguson, D., Moy, J., and A. Lindem, "OSPF
for IPv6", RFC 5340, DOI 10.17487/RFC5340, July 2008,
<https://www.rfc-editor.org/info/rfc5340>.
[RFC8126] Cotton, M., Leiba, B., and T. Narten, "Guidelines for
Writing an IANA Considerations Section in RFCs", BCP 26,
RFC 8126, DOI 10.17487/RFC8126, June 2017,
<https://www.rfc-editor.org/info/rfc8126>.
[SRN] Lebrun, D., Jadin, M., Clad, F., Filsfils, C., and O.
Bonaventure, "Software Resolved Networks: Rethinking
Enterprise Networks with IPv6 Segment Routing", 2018,
<https://inl.info.ucl.ac.be/system/files/
sosr18-final15-embedfonts.pdf>.
Acknowledgements
The authors would like to thank Ole Troan, Bob Hinden, Ron Bonica,
Fred Baker, Brian Carpenter, Alexandru Petrescu, Punit Kumar Jaiswal,
David Lebrun, Benjamin Kaduk, Frank Xialiang, Mirja Kühlewind, Roman
Danyliw, Joe Touch, and Magnus Westerlund for their comments to this
document.
Contributors
Kamran Raza, Zafar Ali, Brian Field, Daniel Bernier, Ida Leung, Jen
Linkova, Ebben Aries, Tomoya Kosugi, Éric Vyncke, David Lebrun, Dirk
Steinberg, Robert Raszuk, Dave Barach, John Brzozowski, Pierre
Francois, Nagendra Kumar, Mark Townsley, Christian Martin, Roberta
Maglione, James Connolly, and Aloys Augustin contributed to the
content of this document.
Authors' Addresses
Clarence Filsfils (editor)
Cisco Systems, Inc.
Brussels
Belgium
Email: cfilsfil@cisco.com
Darren Dukes (editor)
Cisco Systems, Inc.
Ottawa
Canada
Email: ddukes@cisco.com
Stefano Previdi
Huawei
Italy
Email: stefano@previdi.net
John Leddy
Individual
United States of America
Email: john@leddy.net
Satoru Matsushima
SoftBank
Email: satoru.matsushima@g.softbank.co.jp
Daniel Voyer
Bell Canada
Email: daniel.voyer@bell.ca