Highlights
    High Availability

High availability is critical at the customer-facing edge of carrier networks. Unlike core routers which are designed with alternative paths should a router fail, the edge is a single point of customer-facing connectivity. Any failure at the service provider edge can result in significant service disruption.

Though IP networks have become increasingly reliable, they are still prone to outages. With business-critical traffic traversing IP networks, any edge-facing router must be designed to minimize the effect of network outages through device stability, redundancy, distributed hardware design, software modularity and advanced recovery capabilities.

The ST-series was designed with the hardware and software reliability required in the world's largest service provider networks. Functionality is distributed to multiple line cards and Network Processor Blades (NPBs), eliminating a single point of failure. All common equipment is redundant and hot-swappable for hitless software upgrades and maximum uptime and reliability

High availability begins with carrier-class hardware. The NEBS-compliant ST includes distributed, redundant components, a requirement at the customer-facing service provider edge.

· Redundant Route Control Processor: The redundant Route Control Processors (RCPs) are the main controller for the ST, housing the system management software and the full suite of routing protocols supported by the ST. One RCP is designated as active, while the other is the standby RCP. In the event of an RCP failure, the secondary RCP immediately takes over, without requiring system reboot or reload of line cards

· Distributed Line Cards and Network Processor Blades: Functionality is distributed to multiple line cards and Network Processor Blades (NPBs), eliminating a single point of failure

· Redundant and Hot Swappable Common Equipment: All common equipment is redundant and hot-swappable for hitless software upgrades and maximum uptime and reliability

. Route Control Processor (RCP)
. Timing Control Module (TCM)
. System I/O (SIO)
. Optical Switching Fabric (OXF)
. Packet Switching Fabric (PXF)
. Cooling and Power systems

Graceful Restart

The ST-series supports the graceful restart protocol as defined by the IETF. Graceful restart alerts peering routers that the restarting router will continue to forward packets, despite going down temporarily. ST support of graceful restart of routing protocols (BGP, OSPF, IS-IS and LDP) enhances its ability to maintain uninterrupted packet forwarding when network stress forces a routing protocol restart. Routers without graceful restart will allow peers to detect if a session goes down and has to be restarted. The result is route re-computing and network-wide routing updates (also know as router flapping) which could result in packet loss.

Stateful Failover

Stateful failover maintains Layer 2 sessions to minimize network disruption in the event of an RCP failure. ST Layer 2 protocols supported include: PPP, HDLC, Ethernet with 802.1q VLAN, Frame Relay, ATM (SAR at OC-48c), X.86 Ethernet over SONET, MPLS over Ethernet and PPP. Stateful failover is critical at the edge, where large numbers of individual sessions are effected in the event of an RCP failure.

Fast Re-route

ST-series Fast Re-route technology enables diversion of traffic around points of failure along an MPLS path in tens of milliseconds. This is particularly critical due to the singularity of devices at the customer-facing edge and the increasing shift of business-critical traffic to MPLS-based networks.

Non-stop Forwarding

In the event a protocol crash or network instability affects the control plane, ST non-stop forwarding continues to forward packets. ST forwarding is distributed across multiple Network Processor Blades that perform all packet processing, Layer 2/3 forwarding, IP filtering, accounting and link management for the system, while the Route Control Processor is dedicated exclusively to routing. As a result, the network remains available in the event of control plane issues.

Software Scalability and Modularity

Designed in-house specifically to ensure the stability and scalability required for service provider edge networks, ShadeTree software includes multiple, independent processes to maximize efficiency and scalability, protect against system corruption and eliminate a single point of failure. This is critical to ensure stable and reliable operation under extreme network conditions.

Fault isolation and memory protection occurs between each software process and the system kernel to protect the kernel (which links all system process) and other processes from corruption in the event of a single process failure. This modular design ensures no one process can consume all CPU resources, while preventing a single process failure from initiating a chain of events that could dramatically impact system performance. Modular components include:

· Route Control Process (RCP): controls routing protocols and MPLS label allocation, maintains routing information, performs routing policy, manages processes on ST-series hardware and sends operational information to the CLI (command line interface) and the element manger

· Configuration Process: manages the configuration database and performs configuration operations. It responds to and propagates configuration change requests to the rest of the system

· SNMP Process: controls and manages all SNMP requests for ST operation monitoring via SNMP read access

· Command Line Interface (CLI): primary interface used to access, configure and monitor the ST via a console or remotely via telnet or secure shell. The CLI operates in operational and configuration mode

· Process Manager: monitors and verifies the state of all system processes. It automatically restarts processes if issues are discovered, and also is responsible for initially starting system processes

Conclusion

High availability is a critical requirement in any service provider edge device. ST-series high availability includes a distributed design and redundant, hot-swappable components to minimize the impact of individual component failure, as well as modular, in-house developed software. To assure maximum service provider uptime, ST-series advanced high availability features include graceful restart, stateful failover, fast reroute and non-stop forwarding.

 

Copyright 2006 ECI Telecom DND, Inc. Privacy Statement