INTAREA D.H. Daniel
Internet-Draft B.T. Bin
Intended status: Standards Track ZTE Corporation
Expires: 8 March 2024 D.Y. Dong
Beijing Jiaotong University
5 September 2023
Service Aware Network Framework
draft-huang-intarea-san-framework-01
Abstract
Cloud has been migrating from concentrated center sites to edge nodes
with responsive and agile services to the subscribers. This
industry-wide trend would be reasonably expected to continue into the
future which would enjoy geographically ubiquitous services. Rather
than transmitting service data streams to the stable and limited
service locations such as centered cloud sites, routing and
forwarding network will have to adapt to the emerging scenarios where
the service instances would be highly dynamic and distributed, and
further more, demand more fine-grained networking policies than the
current routing and forwarding scheme unaware of service SLA
requirements. This proposal is to demonstrate a framework under
which the above-mentioned requirements would be satisfied.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at https://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on 8 March 2024.
Copyright Notice
Copyright (c) 2023 IETF Trust and the persons identified as the
document authors. All rights reserved.
Daniel, et al. Expires 8 March 2024 [Page 1]
Internet-Draft Abbreviated Title September 2023
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents (https://trustee.ietf.org/
license-info) in effect on the date of publication of this document.
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document. Code Components
extracted from this document must include Revised BSD License text as
described in Section 4.e of the Trust Legal Provisions and are
provided without warranty as described in the Revised BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1. Requirements Language . . . . . . . . . . . . . . . . . . 4
2. Terminology . . . . . . . . . . . . . . . . . . . . . . . . . 4
3. SAN framework and its chief components . . . . . . . . . . . 5
4. Layer 4 of SAN framework . . . . . . . . . . . . . . . . . . 6
5. Layer 3 of SAN framework . . . . . . . . . . . . . . . . . . 7
5.1. SAN ingress . . . . . . . . . . . . . . . . . . . . . . . 7
5.2. SAN relay . . . . . . . . . . . . . . . . . . . . . . . . 8
5.3. SAN egress . . . . . . . . . . . . . . . . . . . . . . . 8
5.4. SAN control plane consideration . . . . . . . . . . . . . 8
5.4.1. Centralized control plane consideration . . . . . . . 9
5.4.2. Distributed control plane consideration . . . . . . . 9
5.4.3. Hybrid control plane consideration . . . . . . . . . 9
5.5. SAN user plane consideration . . . . . . . . . . . . . . 9
5.5.1. SIL encapsulation . . . . . . . . . . . . . . . . . . 10
5.5.2. SIL in forwarding and routing network . . . . . . . . 10
5.5.3. SIL-based routing . . . . . . . . . . . . . . . . . . 10
5.6. Hierarchical service routing architecture . . . . . . . . 11
5.6.1. Routing scheme in line with multiple service-related
resources granularity . . . . . . . . . . . . . . . . 12
5.6.2. Two-segment routing and forwarding . . . . . . . . . 12
5.6.3. Cross-domain computing routing and forwarding . . . . 13
5.6.4. Service traffic affinity . . . . . . . . . . . . . . 14
5.7. Logical sub-layer of service routing in forwarding and
routing network . . . . . . . . . . . . . . . . . . . . . 14
6. Governance and life cycle of service identification label . . 15
6.1. Originality and governance of SIL . . . . . . . . . . . . 15
6.2. Life cycle of SIL . . . . . . . . . . . . . . . . . . . . 15
7. An example of end-to-end SAN work flow . . . . . . . . . . . 15
7.1. Initiation and maintenance of rendering algorithm in SAN
system . . . . . . . . . . . . . . . . . . . . . . . . . 16
7.2. Configuration of a rendering algorithm (as SIL-RA) in SAN
forwarding and routing network . . . . . . . . . . . . . 17
7.3. Publication/Subscription of SIL-RA . . . . . . . . . . . 17
7.4. SIL-RA service data stream treatment at SAN forwarding and
routing network . . . . . . . . . . . . . . . . . . . . . 17
Daniel, et al. Expires 8 March 2024 [Page 2]
Internet-Draft Abbreviated Title September 2023
7.4.1. SIL-RA service data stream treatment at SAN
ingress . . . . . . . . . . . . . . . . . . . . . . . 18
7.4.2. SIL-RA service data stream treatment at SAN relay . . 18
7.4.3. SIL-RA service data stream treatment at SAN egress . 18
7.5. SIL-RA service data stream treatment at cloud site . . . 18
8. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 19
9. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 19
10. Security Considerations . . . . . . . . . . . . . . . . . . . 19
11. Informative References . . . . . . . . . . . . . . . . . . . 19
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 19
1. Introduction
When it comes to user data security and service responsiveness, it’s
imperative to migrate the cloud services and resources to the
locations with good proximity to the users who could reside anywhere
and launch service requests at any time in the ongoing and upcoming
industry scenarios. Therefore, the cloud services and resources has
been and would continue to be deployed in such a distributed way that
the services would be ubiquitous, and scheduled and requested
dynamically by various subscribers. Cloud and networking services
and resources operate more coherently with each other as more and
more services migrate into cloud. Network has to without gap delay
adapt as the cloud shift into a new distributed architecture. The
same service could be instantiated at multiple locations with
different networking and computing resources which would be updated
dynamically. Under this circumstance, the best service quality
should be guaranteed by both fine-grained networking and computing
policies.
This proposal introduces a light-weight service identification label
as an index in the user plane to enable the network to be highly
effectively aware of the dynamic requirements of various cloud
applications. The service identification label is designed to
purport to the fundamental and common services for which the service
qualities should be guaranteed by both fine-grained networking and
computing resources. Combined with an enhanced control plane, a
logical sub-layer of service function has been employed in this
framework to enable the network respond to the application’s
networking and computing demands in a more fine-grained and
intelligent way, which would bring significant benefits to all
parties involved in the network and cloud ecosystem while ensure the
framework to be compatible with the ongoing network architecture.
Daniel, et al. Expires 8 March 2024 [Page 3]
Internet-Draft Abbreviated Title September 2023
1.1. Requirements Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in [RFC2119].
2. Terminology
* SAN. Service Awareness Network
* SIL. Service Identification Label, a light-weight label designed
to indicate the fundamental and common service types
* SCMS. Service Control and Management System, an entity
responsible for SIL management and controlling which includes
materializing networking and computing policies in terms of SIL
and delivering them to the SAN forwarding and routing nodes.
* SAN ingress: routing node maintaining computing resource as well
as service status from remote cloud sites, and executing the
cross-site routing policies in terms of the aforementioned status
as well as the identification of computing service. SAN ingress
usually resides at the network edge and works as ingress of the
end to end service flow.
* SAN egress: routing node maintaining computing resource as well as
service status from the geographically local cloud sites and being
responsible for the last hop of the service flow towards the
computing service instance in the specific cloud site. SAN egress
usually resides at the network edge and works as egress of the end
to end service flow.
* SAN relay: routing node which is optionally aware of computing
resource and service status . SAN relay usually resides between
SAN ingress and SAN egress and works as ordinary routing nodes and
only gets involved in computing delivery when SAN ingress fails to
do so. In particular, when an end-to-end networking policy in
which SAN relay would be required to identify SIL with specific
routing and forwarding behavior is involved, SAN relay would
decapsulate and encapsulate SIL and execute the SIL-specific
policies.
* Global Service Resource and Service Status(GSRS) : General cloud
site status of the service-related resources and service which
consists of overall resource occupation and types of service
(algorithms, functions etc.) the specific cloud site provides.
GSRS is maintained at SAN ingress and expected to remain
relatively stable and change in slow frequency.
Daniel, et al. Expires 8 March 2024 [Page 4]
Internet-Draft Abbreviated Title September 2023
* Local Service Resource and Service Status(LSRS) : fine-grained
cloud site status of the service-related resource and service
which consists of status of each active service instance as well
as its parameters which could impact the way the instance would be
selected and visited by SAN egress. LSRS is maintained at SAN
egress and expected to stay quite dynamic and change in high
frequency.
* Service Instance(SI): an active instance of a SIL which resides in
a host usually purporting to a server, container or virtual
machine.
3. SAN framework and its chief components
An host address is request from DNS system to indicate the user’s
intention of it’s service destination as well as establish a service
connection under the conventional internet servivce architecture,
while SAN framework proposes a refined architecture under which
user’s intention is simply indicated by the service indentification
label regardless of the actual service destination. Therefore, the
center piece of the SAN framework is the light-weight service
identification label. SIL should be confined within a limited and
exhaustive service type space which only covers the indispensable and
fundamental networking and computing service blocks. SIL could be
component service which would be expected to be invoked by multiple
application parties, or explicitly specified sub-stream of the same
service data stream. A Service control and management system (SCMS)
which could be a standalone or an enhanced version of the existing
system. An entity with the authority of service and resource
provisioning and delivery takes control of registry, publishing,
authorization, authentication and policing of SIL within a closed
governance domain. The life cycle of SIL runs through end user
client, routing and forwarding network and cloud in terms of the end-
to-end service process, and will survive in the above-mentioned
closed governance domain permanently unless it’s withdrawn or updated
with the new service identification label norms.
SIL is designed under this reference framework as a sophisticated
interface between user and service as well as between service and
network and cloud. User client requests the service through SIL
encapsulated in user data packet header with customized networking
and computing provisioning guaranteed by SAN routing and forwarding
network which would index SIL with the corresponding networking and
computing policies and resources configured from SCMS. Cloud
governance system takes SIL as a public interface between user client
and the cloud service provisioning system.
Daniel, et al. Expires 8 March 2024 [Page 5]
Internet-Draft Abbreviated Title September 2023
End user subscribes the service from SCMS and service client carries
on materializing the service by instantiating an SIL into the service
packet. Upon arriving at networking edge, SIL is identified and
further enables the fine-grained networking service and computing-
based routing and scheduling. Along the routing path the service
data stream gets customized treatment in terms of SIL which is
inherently defined to bo domain independent. Figure 1 is the
reference framework of SAN. SAN domain reaches beyond conventional
forwarding and routing network with SIL presence in both service
client and server at end user devices and cloud sites respectively
which includes layer 4 and layer 3 from a network architecture
perspective. Nevertheless, SAN forwarding and routing domain would
be a predominant part of the entire reference framework.
+--------------------------------------------------------------+
| Service Control and Management System |
+----------------------------|---------------------------------+
|
Service Client | Service Server
+--+ Service connection upon SIL +--+
| |<------------------------|----------------------------->| |
| | | | |
| | +-------------------|------------------------+ | |
| |<----| Service Identification Label |---->| |
| | +---|---------------|------------------|-----+ | |
| | | | | | |
| | | | | | |
| | +----V------+ +----V------+ +-----V------+ | |
| |--->|SAN Ingress|-->| SAN Relay |---->| SAN Egress |--->| |
+--+ +-----------+ +-----------+ +------------+ +--+
| | SAN forwarding and routing domain | |
| |<--------------------------------->| |
| Service Identification domain |
|<-------------------------------------------------------->|
Figure 1
4. Layer 4 of SAN framework
From perspective of layer 4 which builds and maintains the service
connection for the application, end user quality of experience would
be significantly increased with a host-address irrelevant connection
that remains stable regardless of whether or not the application
server host shifts. It’s achieved by establishing the layer 4
connection with SIL which indicates only the service type without
semantics of server host location.
Daniel, et al. Expires 8 March 2024 [Page 6]
Internet-Draft Abbreviated Title September 2023
In particular, SIL could be encapsulated in layer 4 protocol-intact
way and layer 4 protocol-updated way. The former entails informing
the app client of an SIL when it comes to the process of informing it
of a destination address such as DNS inquiry process. Therefore,
layer 4 protocol remains intact by taking SIL as destination address
and proceeds as usual. SIL would be identified and treated with
fine-grained networking and computing policies of which an
addressable server host address would be selected and encapsulated in
the out layer routing header by SAN ingress. A mapping between SIL
and server host address could be made either by SAN egress or app
client and server. So far as the latter is concerned, a dedicated
sub-layer between layer 4 and layer 3 should be employed to
encapsulate SIL which is to be used by layer 4 protocol to establish
the service connection. The dedicated SIL sub-layer survives through
end user, SAN network and the cloud, and SAN nodes would index the
SIL from this sub-layer to execute the fine-grained service treatment
in terms of both networking and computing policies.
SIL structure designing as well as the dedicated sub-layer protocol
specification are out of scope of this proposal.
5. Layer 3 of SAN framework
Layer 3 SAN domain is responsible for the fine-grained treatment of
service flow referenced by SIL in terms of both networking and
computing requirements.
5.1. SAN ingress
SAN ingress extracts SIL from either the destination address field to
be encapsulated with SIL or the dedicated SIL sub-layer as
illustrated in section 4, and determines its fine-grained networking
policy as well as its next hop in light of which cloud site hosts the
satisfying service node. Under this particular circumstance, the
next hop could be either an SAN egress which connects directly to the
selected remote cloud site or the selected service server within the
local cloud site.
Meanwhile, SCMS delivers both fine-grained networking and computing
policies in terms of SIL to SAN ingress which maps the extracted SIL
to the corresponding policies and executes them upon the forwarding
plane. SCMS could be the combination of computing and networking
orchestrator and controller which logically should be two standalone
entities. The orchestrator part is responsible for the general
scheduling policies of computing and networking resources, such as at
which threshold a policy is triggered, while the controller generates
and delivers the SIL-indexed policies according to the scheduling
policies from the orchestrator. Under distributed routing scheme,
Daniel, et al. Expires 8 March 2024 [Page 7]
Internet-Draft Abbreviated Title September 2023
the computing and networking policies generated by the protocols
(such as BGP) within the SAN forwarding and routing node could also
be part of SCMS logically.
5.2. SAN relay
SAN relay is optionally designed to identify SIL and execute the
according service routing as well as networking policies when SAN
ingress alone could not do this. Particularly, SIL-indexed routing
status would not necessarily be maintained by SAN ingress once and
all, so SAN relay could play the role of recursive routing table
query. Also, SAN relay could coordinate with SAN ingress to execute
the networking policy in terms of SIL when necessary. Nevertheless,
SAN relay could be reduced to an ordinary forwarding and routing node
without knowledge of SIL when SAN ingress completes the service and
networking service policies.
5.3. SAN egress
When it comes to SAN egress, SIL could be extracted from the
destination address field when there is a tunnel encapsulation as an
outside header such as SRH (SRv6 Header), or from the dedicated SIL
header when the original SIL in the destination address field has
been replaced with a selected SAN egress address by SAN ingress. SAN
egress maps the SIL with the networking and computing policies from
SCMS and executes them upon the forwarding plane, the next hop would
be selected through the computing policy as a server hosting the
service or a proxy of the service. SAN egress always removes the
outside tunnel header and terminates the networking policy.
The networking and computing policy exchange with SCMS follows the
same process of SAN ingress other than the difference of
particularity of the policies generated and delivered.
5.4. SAN control plane consideration
As the conventional IP routing control plane scheme in place, SAN
control plane could also be deployed by centralized and distributed
scheme or combination of both with additional service status as well
as the polices. Service-related control plane only has impacts upon
SAN ingress and egress, and could logically be decoupled from the
conventional IP control plane.
Daniel, et al. Expires 8 March 2024 [Page 8]
Internet-Draft Abbreviated Title September 2023
5.4.1. Centralized control plane consideration
SIL-centered networking and computing resource and policy generation
and maintenance is the key feature of SAN control plane. Networking
resource and policy generally aligns with the existing scheme other
than the SIL-centered networking policy is differentiated in a fine-
grained granularity. As far as computing resource and policy is
concerned, LSRS’s volatility makes it infeasible to be maintained and
controlled in a centralized entity, GSRS is the chief computing
resource and service status information to be collected and managed
in the controller with regard to service stream delivery in routing
network architecture. Routing and forwarding policies from GSRS
calculated in the centralized controller apply only to the segment
between SAN ingress and egress, while the second segment routing
policy from SAN egress to the selected service instance in the cloud
site is determined by LSRS at SAN egress.
Hierarchically centralized control plane architecture would be
strongly recommended under the circumstances of nationwide networking
and computing management and scheduling.
5.4.2. Distributed control plane consideration
Networking resource is notified and updated through existing
distributed protocols (BGP/IGP etc.) and the SIL-centered networking
policy would be formulated as well. When it comes to computing
resource, GSRS is updated among the SAN edge routers which have been
connected in a mesh way that each pair of edge routers could exchange
GSRS to each other, while LSRS will be unidirectionally updated from
cloud site to the associated SAN edge router in which LSRS is
maintained and its update process is terminated.
Protocol consideration upon which GSRS and LSRS is updated is out of
the scope of this proposal and will be illustrated in forthcoming
draft.
5.4.3. Hybrid control plane consideration
In terms of the particularity of service-related resource updating
and notification, it would be more efficient to update the GSRS by a
distributed way than a centralized way in terms of routing request
and response in a limited network and cloud domain, but would be the
opposite case in a nationwide circumstance. This is how hybrid
control plane could be deployed in such a scheme that overall
optimization could be achieved.
5.5. SAN user plane consideration
Daniel, et al. Expires 8 March 2024 [Page 9]
Internet-Draft Abbreviated Title September 2023
5.5.1. SIL encapsulation
Service identification label is the predominant index across the
entire SAN framework ranging through user terminal, forwarding and
routing network and cloud with SIL working as the virtual
destination. Data plane determines the routing and forwarding
orientation with SIL by inquiring GSRS and LSRS at SAN ingress and
SAN egress respectively. SIL encapsulation could be achieved by
extending the existing packet header and also achieved by designing a
dedicated SIL sub-layer, which along with the specific structure of
SIL are out of the scope of this proposal and will be illustrated in
forthcoming draft.
5.5.2. SIL in forwarding and routing network
SAN ingress obtains SIL from either the destination address field or
the dedicated SIL sub-layer of the user packet header explicitly or
mapping from the traditional 5 tuples implicitly. Either way SIL
starts being indexed for networking and computing policies until it
arrives at SAN egress. SIL in the dedicated sub-layer would remain
intact in the user packet header, while SIL in the destination
address field would remain intact if SAN ingress employs a tunnel
header or could be replaced with a selected SAN egress address with a
dedicated SIL encapsulation in extension headers such as DoH, SRH,
HBH. In the case of absence of SIL in user packet header, SAN
ingress would generate and encapsulate SIL in extension headers by
combination of existing parameters from user packet header such as 5
tuples etc. Apart from the networking and computing policy execution
at SAN ingress and egress, SIL could be ignored by SAN relay and
without computing state maintenance unless the mapping between SIL
and GSRS fails or SIL fine-grained networking policy is involved.
5.5.3. SIL-based routing
As illustrated in section 3, SIL encapsulated in the headers and
maintained in GSRS and LSRS indicates an abstract service type rather
than a geographically explicit destination label, thus the routing
scheme based upon SIL is actually a two-part and two-layer process in
which SIL only indicates the routing intention of user’s requested
service type while routing does not actually materialize in
forwarding plane and the explicit routing destination of the two
segments would be determined by GSRS and LSRS respectively.
Therefore the actual routing falls within the traditional routing
scheme which remains as they are.
Daniel, et al. Expires 8 March 2024 [Page 10]
Internet-Draft Abbreviated Title September 2023
Apart from the indication of service routing intention, SIL could
also indicates a specific network serivice requirements by
associating the networking service policy in GSRS which would
therefore schedule the network resources such as an SR tunnel,
guaranteed bandwidth etc. at SAN ingress.
Therefore, GSRS and LSRS in control plane along with SIL
encapsulation in user plane enables an logical service routing sub-
layer which is able to be aware of the computing status from cloud
sites and forward the service flow in terms of networking services as
well as computing resources. Nevertheless, this logical sub-layer
remains predominantly at SAN ingress and egress nodes. There're
drafts such as [I-D.liu-dyncast-ps-usecases] and
[I-D.li-dyncast-architecture] which analyze the benefits of
computing-based routing and demonstrate an anycast-as-service-
identification solution.
5.6. Hierarchical service routing architecture
In addition to the existing networking resource and status sensing
scheme, SAN routing and forwarding network is designed specifically
to enable sensing the service-related resource and service status
from the cloud sites and routing the service flow according to both
network and computing metrics as illustrated in figure 2. The
architecture is a horizontal convergence of cloud and network, while
the latter maintains the converged resource status and thus is able
to achieve an end to end routing and forwarding policy from a
perspective of cloud and network resource. PE1 maintains GSRS with a
whole picture of the multiple cloud sites, and executes the routing
policy for the network segment between PE1 and PE2 or PE3, namely
between SAN ingress and egress, while PE2 maintains LSRS with a focus
picture of the cloud site where S1 resides, and establishes a
connection towards S1. S1 is an active instance of a specific
service type. On top of the role of SAN egress which maintains LSRS,
PE2 and PE3 also fulfill the role of SAN ingress which maintains GSRS
from neighboring cloud sites. P provides traditional routing and
forwarding functionality for computing service flow, and optionally
remains unaware of service-related status.
Daniel, et al. Expires 8 March 2024 [Page 11]
Internet-Draft Abbreviated Title September 2023
+----------+ +-------+
+------>|SAN egress|----->| SI |
| +----------+ +-------+
+-----------+ +-----------+ PE2 S1
|SAN ingress|--->| SAN relay|
+-----------+ +-----------+ PE3 S2
PE1 P | +----------+ +-------+
+------>|SAN egress|----->| SI |
+----------+ +-------+
|<--------- Networking domain --------->|<-Computing domain->|
Figure 2
5.6.1. Routing scheme in line with multiple service-related resources
granularity
Status updates of service-related resource and service in the cloud
sites extend in a quite broad range from relatively stable service
types and overall resource occupation to extremely dynamic capacity
changes as well as busy and idle cycle of service instances. It
would be a disaster to build all of the status updates in the network
layer which would bring overburdened and volatile routing tables.
It should be reasonable to divide the wide range of service-related
resource and services into different categories with differentiated
characteristics from routing perspective. GSRS and LSRS correspond
to cross-site domain and local site domain respectively, and GSRS
aggregates the service-related resource and service status with low
update frequency from multiple cloud sites while LSRS focuses only
upon the status with high frequency in the local sites. Under this
two-granularity scheme, service-related routing table of GSRS in the
SAN ingress remains in a position roughly as stable as the
traditional routing table, and the LSRS in the SAN egress maintains a
near synchronized state table of the highly dynamic updates of
service instances in the local cloud site. Nonetheless, LSRS
focusing upon a single and local cloud site is the normal case while
upon multiple sites should be exception if not impossible.
5.6.2. Two-segment routing and forwarding
When it comes to end to end service flow routing and forwarding,
there is an status information gap between GSRS and LSRS, therefore a
two-segment mechanism has to be in place in line with the two-
granularity routing scheme demonstrated in 5.6.1. As is illustrated
in figure 3, R1 as ingress determines the specific service flow’s
egress which turns out to be R2 according to policy calculation from
GSRS. In particular, the SIL from both in-band (user plane) and out-
Daniel, et al. Expires 8 March 2024 [Page 12]
Internet-Draft Abbreviated Title September 2023
band (control plane) is the only index for R1 to calculate and
determine the egress, it’s highly possible to make this egress
calculation in terms of both networking (bandwidth, latency etc) and
computing requirements. Nevertheless, the two SLA routing
optimization could be decoupled to such a degree that the traditional
routing algorithms could remain as they are. The convergence of the
SLA policies as well as the methods to make SAN ingress aware of the
two SLA would be illustrated by the example work flow of section 7.
Nevertheless, the specific solution is out of scope of this proposal.
+--------+ +--------+ +--------+ +--------+
| GSRS |--->| |--->| LSRS |--->| SI |
+--------+ +--------+ +--------+ +--------+
R1 R R2 S1
|<---------- GSRS segment ----->|<-LSRS segment--->|
Figure 3
When the service flow arrives at R2 which terminates the GSRS segment
routing and determines S1 which is the service instance selected
according to LSRS maintained at R2. Again SIL is the only index for
LSRS segment routing process.
5.6.3. Cross-domain computing routing and forwarding
Co-ordinated computing resource scheduling among multiple regions
which are usually connected by multiple network domains, is an
important part of intended scenarios with regard to why computing-
based scheduling and routing is proposed in the first place. The
two-segment routing and forwarding scheme illustrated in5.6.2 is a
typical use case of cross-domain service routing and forwarding and a
good building block for the full-domain scenario solution. Service
status information is brought into network domain to enable the
latter scheduling routing policies beyond network. However, a
particular scheme has to be put in place to ensure mild and
acceptable impacts upon the ongoing IP routing scheme. A consistent
SIL across terminal, network (multiple domains) and cloud along with
hierarchical SIL-indexed service-related resource and service status
which corresponds with different network domains, is the enhanced
full-domain routing and forwarding solution. Each domain maintains a
corresponding ervice-related resource and service status at its edge
node and makes the service-based routing for the domain-specific
segment which should be connected by the neighboring segments.
Daniel, et al. Expires 8 March 2024 [Page 13]
Internet-Draft Abbreviated Title September 2023
5.6.4. Service traffic affinity
SIL holds the only semantics of the service type that could be
deployed as multiple instances within specific cloud site or across
multiple cloud sites, SIL in the destination field is not explicit
enough for all of the service flow packets to be forwarded to a
specific host. Traffic affinity has to be guaranteed at both ingress
and egress. Once the egress is determined at SAN ingress, the
binding relationship between the egress and the service flow’s unique
identification (5-tuple or other specifically designed labels) is
maintained and the subsequent flow could be forwarded upon this
binding table. Likewise SAN egress maintains the binding
relationship between the service flow identification and the selected
service instance.
Traffic affinity could be guaranteed by mechanisms beyond routing
layer, but they will not be in the scope of this proposal.
5.7. Logical sub-layer of service routing in forwarding and routing
network
A SIL-indexed networking and computing policy state from SCMS is
maintained at the control plane in SAN forwarding and routing
network. The policy state regulates and guides the forwarding and
routing behaviors of SAN nodes on the basis of SIL as well as the
identification of the service stream. Namely, the fine-grained
networking and computing policies bring new abilities for forwarding
and routing network to be highly efficiently aware of the service
from perspectives of both networking and computing requirements.
Either SIL is originated from user client or SAN ingress, SIL’s
presence in the data plane is guaranteed by various encapsulation
solutions and could be extracted and indexed by any node for
execution of either networking policy, computing policy or both.
As for the networking connection services for which only network-
domain resources would be involved, SIL provides a fine-grained
interface between network and application which would be unavailable
otherwise. Under this particular circumstance, SIL-indexed
networking policy state maintenance actually aligns perfectly with
the existing scheme.
When it comes to the computing services for which both networking and
computing resources would be involved, SIL is a light-weight index
for the fine-grained policies, and the particular computing policies
are clearly decoupled with that of networking policies .
Daniel, et al. Expires 8 March 2024 [Page 14]
Internet-Draft Abbreviated Title September 2023
Therefore, a logical SIL sub-layer with both control and data plane
has been conceptually employed within SAN forwarding and routing
architecture. The sub-layer simply delivers the SIL specific
policies but would leave them for the existing routing layer to
execute the policies on a basis of service stream and packet.
6. Governance and life cycle of service identification label
6.1. Originality and governance of SIL
SIL is designated to indicate the fundamental and common service
types, and could be registered from both networking and computing
domain with regard to networking connection services and
comprehensive computing services respectively. The SIL templates
should be specified by the entity which is both technically able to
coordinate the SIL-indexed networking and computing resources and
services, or by public standardization organizations. SCMS publishes
the SIL which has been authenticated, authorized and configured with
networking and computing policies.
Application developer and operator subscribes SIL from SCMS and
integrates it into its application system, and the service client
initiates the service by encapsulating SIL into its IP protocol
headers. SIL template specification is out of scope of this
proposal.
6.2. Life cycle of SIL
Upon the registration and publication of the service, SIL is active
and available through the ecosystem of terminal, network and cloud
until it’s withdrawn and terminated by SCMS. Life cycle of SIL would
not terminate with a specific end of user service, and the same SIL
could be instantiated by multiple users simultaneously. SIL renders
the services as an effective interface between service and network
which is inherently absent in the decoupled internet protocol system.
7. An example of end-to-end SAN work flow
SIL is controlled and managed by SCMS as illustrated in section 6.1,
therefore SIL subscription and service agreement process between
service client and SCMS should be finished before the service client
initiates a service request and starts sending service data stream
while SIL would be encapsulated in layer 3 header and lives on
through the service terminal, SAN routing and forwarding network, and
service server in the cloud site. An example work flow of cloud game
application will be demonstrated under SAN reference framework. From
networking and computing resource perspectives, rendering algorithm
of real-time game situation data should be the dominant sub-service
Daniel, et al. Expires 8 March 2024 [Page 15]
Internet-Draft Abbreviated Title September 2023
of cloud game application. The work flow instantiates the mechanism
under which the rendering algorithm would be materialized by SAN.
7.1. Initiation and maintenance of rendering algorithm in SAN system
Rendering algorithm is identified as a fundamental and common service
which could be invoked by multiple parties, and at least one provider
has registered to SCMS and has been authenticated and verified by
SCMS for the service availability. An SIL might be allocated or
activated as SIL-RA which should be structured service identification
specifically and uniquely allocated for the rendering algorithm.
When it comes to the networking requirements of rendering algorithm,
SCMS verifies the corresponding networking resource availability as
well as capability and establishes a comprehensive SLA state for
rendering algorithm indexed by SIL-RA.
When a rendering algorithm provider registers/withdraws the rendering
algorithm service and the corresponding networking resources update
to a degree the service would not be able to be rendered as promised,
the computing and networking SLA state of SIL-RA should be updated
accordingly. The signal as well as data stream of SIL-RA is
illustrated as an example work flow in figure 4.
+--+ L4:SIL(rendering algorithm)-based service connection +--+
| |<-------------------------------------------------------->| |
| | +-------------------------------------------+ | |
| |<=====| SCMS |======>| |
| | +-#------------------#--------------------#-+ | |
| | # # # | |
| | # # # | |
| | +-v---------+ +--V------+ +------V---+ | |
| |----->|SAN Ingress|---->|SAN Relay|----->|SAN Egress|---->| |
+--+ +-----------+ +---------+ +----------+ +--+
Service Client Service Server
-------------> ===============> ###############>
service flow SIL management flow SIL policy config flow
Figure 4
The service client launches a cloud game application request to the
designated server in a centered cloud site, and a cloud game service
connection thus would be established, while the rendering algorithm
might be deployed in the edge cloud nodes for the sake of
responsiveness of the game service.
Daniel, et al. Expires 8 March 2024 [Page 16]
Internet-Draft Abbreviated Title September 2023
7.2. Configuration of a rendering algorithm (as SIL-RA) in SAN
forwarding and routing network
End-to-end latency determines both the user experience and the
feasibility of the service deployed in the cloud site. The latency
resulting from the service server and the network path depends on the
rendering algorithm capability along with the computing resources and
the networking delay respectively. Therefore, SCMS maps a networking
connection policy with 10 mini-seconds of latency and computing
policy with GPU to SIL-RA, and makes the networking and computing
policies configuration to the SAN forwarding and routing nodes such
as SAN ingress, SAN relay and SAN egress through network management
and control interfaces.
7.3. Publication/Subscription of SIL-RA
Upon completion of SIL-RA initiation and configuration, SIL-RA is
ready to be published to and subscribed by all of the interested
parties. The SIL-RA publish/subscribe process is more of a
commercial agreement than a technical exchange process. The
rendering algorithm service client initiates the service by sending
the service requests as well as data stream with SIL-RA encapsulated
in layer 3 header. The rendering algorithm service connection would
be based soly upon SIL-RA which could be indexed and mapped to a
computing node located in other edge cloud site, and thus the actual
service connection could be different from that of cloud game
application connection as illustrated in section 7.1.
7.4. SIL-RA service data stream treatment at SAN forwarding and routing
network
In the ongoing process of the real-time cloud game service, the
rendering algorithm service would be invoked simultaneously during
the entire life-cycle of the game service. Therefore, the rendering
algorithm would not be invoked as the initiation of the game service
and would also not reside in a same cloud site of the master game
service. The rendering algorithm service should be specifically
guaranteed by SAN with comprehensive networking and computing
policies configured for it. The game service client would initiate
an independent rendering algorithm service by encapsulating SIL-RA in
the data packet headers.
Daniel, et al. Expires 8 March 2024 [Page 17]
Internet-Draft Abbreviated Title September 2023
7.4.1. SIL-RA service data stream treatment at SAN ingress
Upon arrival of SIL-RA service data stream, SAN ingress extracts SIL-
RA from the designated field of layer 3 header and identifies the
type of the SIL-RA. SIL-RA availability as well as the supporting
computing resources among multiple servers has been maintained in the
service routing table, the SAN egress connecting to a proper SIL-RA
server would be selected.
As configured in 7.3, SIL-RA service stream should be transmitted by
a forwarding and routing policy with 10 mini-seconds latency between
SAN ingress and the selected SAN egress. The SIL-RA service data
stream would be guided into a designated networking path. As far as
bounded latency of routing network is concerned, a path-specific flow
label could be employed in line with the underling networking
technologies such as Detnet.
7.4.2. SIL-RA service data stream treatment at SAN relay
SIL-RA service data stream treatment with SIL-RA involved is actually
optional, it’s only necessary when SAN ingress relays the SIL-RA
service routing request to a SAN relay where the SIL-RA indexed
service routing table is homed or one or more SAN relay nodes has to
be involved to provide SIL-RA specific networking connection service.
7.4.3. SIL-RA service data stream treatment at SAN egress
As illustrated in section 5.5, SIL-RA indexed service routing table
would be aggregated with two class granularity at SAN ingress and SAN
egress respectively. So SIL-RA would be extracted from the
designated field and mapped with the LSRS table. The final hop
purporting to the selected rendering algorithm service server would
be determined by SAN egress and the service data stream would be
forwarded to the server or the server’s proxy accordingly.
SAN egress should be the terminating node of the networking
connection path and the the associated path header would be removed.
7.5. SIL-RA service data stream treatment at cloud site
SIL-RA encapsulation in layer 3 header would remain intact until the
data packet arrives at the cloud site. SIL-RA would be identified
and treated as both a service connection ID by the rendering
algorithm server and a rendering algorithm to be guided and scheduled
by the cloud system respectively .
Daniel, et al. Expires 8 March 2024 [Page 18]
Internet-Draft Abbreviated Title September 2023
8. Acknowledgements
To be added upon contributions, comments and suggestions.
9. IANA Considerations
This memo includes no request to IANA.
10. Security Considerations
SIL employment in SAN framework would bring security challenges for
user and application in the cloud sites as well as SAN forwarding and
routing network. SCMS is responsible for registry, authorization and
authentication of SIL and acts as the first check point for SIL. The
detailed specification of the security process of SCMS is out of
scope of this proposal. When it comes to SAN forwarding and routing
network, it’s imperative for the service gateway to execute security
policies with regard to SIL and the specific security solution would
be illustrated in a dedicated proposal.
11. Informative References
[I-D.li-dyncast-architecture]
Li, Y., "Dynamic-Anycast Architecture", February 2021,
.
[I-D.liu-dyncast-ps-usecases]
Liu, Peng., "Dynamic-Anycast (Dyncast) Use Cases and
Problem Statement", February 2021,
.
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119,
DOI 10.17487/RFC2119, March 1997,
.
Authors' Addresses
Daniel Huang
ZTE Corporation
Nanjing
Phone: +86 13770311052
Email: huang.guangping@zte.com.cn
Daniel, et al. Expires 8 March 2024 [Page 19]
Internet-Draft Abbreviated Title September 2023
Bin Tan
ZTE Corporation
Nanjing
Phone: +86 13918622159
Email: tan.bin@zte.com.cn
Dong Yang
Beijing Jiaotong University
Beijing
Email: dyang@bjtu.edu.cn
Daniel, et al. Expires 8 March 2024 [Page 20]