Using Crosswork Planning to pave the way for Segment Routing migration

6 minutes read

Overview

Many customers have deployed Segment Routing to date. The benefits of Segment Routing cannot be overstated. Segment routing eliminates the need to create forwarding state in the network, minimizes the number of protocols required for operations, thereby drastically simplifying the network. In other words, it brings enhanced flexibility and resiliency and operational simplification. The use of controllers such as Crosswork Network Controller provides another layer of abstraction by exposing easy to consume network APIs at the transport and services level, enabling the rollout of advanced traffic engineering use cases with fine grained control.

Service Providers and large enterprises considering the migration to Segment Routing as an end state have traditionally been concerned about maintaining the same quality of experience for their customers, especially those who have deployed full-meshed RSVP-TE auto bandwidth tunnels. Their main concerns are that without the deployment of auto bandwidth tunnels, the network would not be able to provide the same resiliency and responsiveness in dealing with changing network conditions.

Crosswork Planning provides capabilities for network model building and simulation. It offers predictive simulation for a variety of use cases, including routing protocol migration and network migration scenarios. In this tutorial, we provide a simple demonstration on how operators may leverage on Crosswork Planning to model the existing network which has been deployed with full-meshed RSVP-TE tunnels and if migrating this network to Segment Routing will provide the same performance and resiliency.

The methodology described in this tutorial has been applied on real-world network models with positive results. However, the outcome of these modelling exercises are unique to each customer’s networks and one needs to perform proper modelling and simulation before drawing the same conclusions on their networks. Additional considerations in migrating the network through intermediate states, e.g., co-existence of RSVP-TE and Segment Routing, are also outside the scope of this document. Due to customer confidentiality, this document uses modified sample network models (plan-file) for illustration purposes.

A Paradigm Shift

As customers transition to Segment Routing, we advocate a shift from the full-mesh RSVP-TE tunnel mindset. This transition promises significant benefits in terms of operational simplification. By embracing the use of IGP shortest path to handle the bulk of best-effort traffic and with proper capacity planning and optimization, full-mesh RSVP-TE tunnels should not be necessary for most customer scenarios. This is particularly true for customer networks designed with high levels of redundancy and proper capacity planning.

For traffic that requires special treatment, such as low-latency or avoidance of specific links, countries, or continents, we have the flexibility of using Segment Routing policies or FlexAlgo. These Segment Routing policies uses efficient SR-native algorithms that leverage the ECMP-awareness of SR and minimize the size of the resulting SID-list. They are designed for use cases that go beyond bandwidth considerations, providing a robust solution for handling exceptions.

We are mindful that there could be scenarios where capacity planning and optimization could not have anticipated (e.g., multiple links going down simultaneously, flash crowd, or unexpected failures occurring concurrently when nodes/links are taken out for maintenance). This is where tactical traffic engineering capabilities such as that provided by Local Congestion Mitigation (LCM) will help.

LCM is designed to be used on best-effort traffic, leaving traffic that is already in SR policies unaffected. The deployment of LCM allows us to divert the minimum amount of traffic from the congested link (to bring it below the congestion threshold) to a detour path for the duration of the congestion. LCM leverages spare capacity elsewhere to carry this traffic (potentially over a longer path), which would otherwise be discarded.

Methodology

In the modeling exercise, we aim to build an accurate network model with the current network with RSVP-TE auto bandwidth tunnels deployed. Crosswork Planning can assist in the network collection and model-building process. This would provide us with an accurate network model (plan file) complete with topology, LSPs, and traffic that is suitable for simulation studies.

Next, we leverage Crosswork Planning Design to perform various simulations to characterize this network and its failure modes. This may include using simulation analysis to simulate permutation of failures on selected objects such as nodes and links to provide the worst-case traffic utilization per link. The same analysis will also allow us to gain insights into which failure scenarios will lead to these worst-case traffic utilization.

In the example below, we load the plan file containing a model of the topology, LSPs, and traffic and perform a simulation analysis to obtain the worst-case interface utilization. The report shows that 5.88% of the failure cases will cause the interfaces to be oversubscribed.

RSVP with Sim Analysis

Next, we remove the RSVP-TE LSPs and rely on IGP routing. We performed a simulation analysis and found that now, 23.53% of the failures will cause the interfaces to be oversubscribed. This shows that the RSVP-TE LSPs do help lower the percentage of failure cases which causes the interfaces to be oversubscribed. However, this is only part of the story.

Another conclusion which can be drawn from the simulation analysis exercise might be the nature of failures which lead to worst case traffic utilization. If the failures are caused by failed link bundles for example, the probability of this happening would be lower as compared to a single port failure, provided the link bundle members are carried over diverse paths and terminate on different linecards on distributed routing platforms which support high availability.

RSVP with Sim Analysis

Crosswork Planning’s toolset encompasses various optimizers that can be used to improve the performance of networks under stressful conditions. We leverage IGP Metric Optimization to provide optimal link metrics in order to minimize the maximum interface utilization under failure scenarios.

RSVP with Sim Analysis

We perform another simulation analysis on the optimized network model and observe that now none of the failure cases will lead to any links being oversubscribed. This means that inspite of removing RSVP auto bandwidth tunnels, no link or mode failures will now lead to any links being oversubscribed and customers should not experience any degradation of service caused by link oversubscription.

RSVP with Sim Analysis

Very broadly speaking, some networks are designed with good redundancy (i.e. meshy topology) with multiple parallel paths allowing for equal cost multipath, and has sufficient capacity to deal with failure scenarios. Other networks may have very limited capacity and links and are not able to upgrade capacity in accordance to requirements, and have multiple links over capacity. In the latter case RSVP-TE may be serving more as a band-aid to resolve issues which could otherwise be handled with capacity planning and network design best practices.

In the Segment Routing space, toolsets such as Local Congestion Mitigation can help operators deal with these exceptions on a as needed basis, providing lifecycle management of tactical traffic engineering tunnels for the duration of the congestion event.

Conclusion

When planning for migration from full-mesh RSVP-TE to Segment Routing, a worth-while exercise would be to build an accurate model of the network, complete with topology, LSPs and traffic. This allows us to simulate how traffic would flow, and what possible issues we may face in the event of failures, or if the existing demands continue to grow.

Optimizations may be performed on this model to minimize the maximum interface utilization under failures scenarios. Simulations can be used to compare the worst case interface utilization for both RSVP-TE auto bandwidth and Segment Routing scenarios. Networks with sufficient capacity and resiliency should not see the removal of full meshed RSVP-TE auto bandwidth tunnels being worst off.

Crosswork Planning provides an intuitive, easy to use suite of tools for operators to proactively manage their network infrastructure and helps to provide enhanced visibility and insights, improved performance, reliability, and resiliency, and efficient capacity management.

For more information, please refer to the Crosswork Planning Installation Guide, Collection Setup and Administration Guide, and Design User Guide.

Leave a Comment