IX System Network and Bandwidth Requirements
This chapter describes the network and bandwidth requirements to ensure the immersive video quality of the IX system. This chapter contains the following sections:
– Jitter
– Latency
Note The bandwidth requirements in this section are based on network testing in environments without jitter or packet loss. Due to the extremely high-definition of the video, a small amount of packet loss or jitter can cause noticeable disruption to the user experience. To maintain the real-time interaction of human conversations, latency must be kept to an absolute minimum. Networks that have not been configured with proper QoS or inefficient routing over the WAN may cause loss of video quality.
Quality of Service
This section briefly describes Quality of Service (QoS) considerations for the IX5000 and IX5200. The immersive quality of the IX system requires that QoS mechanisms available on Cisco switches and routers throughout the network be configured to reduce packet loss, latency and jitter. Redundant devices and network links that provide quick convergence after network failures or topology changes are also important to ensure a highly available infrastructure. The following sections describe the packet loss, jitter and latency requirements for the IX system.
For more detailed information, see the Cisco Collaboration System 11.x Solution Reference Network Designs (SRND) document at:
https://www.cisco.com/c/en/us/td/docs/voice_ip_comm/cucm/srnd/collab11/collab11.html
Packet Loss
Because the IX system is highly sensitive to packet loss, your network should be configured so that packet loss does not exceed a target of 0.05%. Cisco TelePresence codecs transmit at approximately 5 Mbps (max) per 1080pdisplay, which translates to more than 99% compression. The overall effect of packet loss is proportionally magnified, and dropping even one packet in 2000 (0.05% packet loss) becomes noticeable to end users.
For more detailed information about packet loss requirements for TelePresence systems, see the Cisco TelePresence Network Systems 1.1 Design Guide at:
https://www.cisco.com/c/dam/en/us/td/docs/solutions/TelePresence_Network_Systems_1-1_DG.pdf
Jitter
Understanding Jitter and Defining Jitter Thresholds
Jitter is defined as the difference between the time the video frame is expected to arrive at the endpoint, and the actual time that it arrives. There are two types of jitter: packet jitter, and video frame jitter. The video frames are enclosed in packets.
For packet jitter, you should follow the guidelines in the “Network Infrastructure” section of the Cisco Collaboration System 11.x Solution Reference Network Designs (SRND) . Currently the jitter level at the packet level is set at a maximum of 100 milliseconds (ms).
You should measure jitter at the video frame level (application layer) for Cisco TelePresence systems, rather than the packet level (network layer). A network with 0 ms of packet frame jitter can still have jitter at the video frame level if the RX buffer of the codec is overwhelmed with a large number of packets. Therefore, you should measure the arrival time of the entire video frame vs. the expected arrival time of that frame, based on the clock rate of video frame intervals. The clock rate for 30 fps is 33ms, while the clock rate for 60 fps is 16.5ms. Use the information in Table 1-1 as the guideline for jitter in your network.
Measure jitter from the Ethernet port of the system codec on the far end to the Ethernet port of the system codec on the near end.
|
|
|
|||
---|---|---|---|---|---|
Your system monitors jitter in the following ways:
- During a call, you can view the jitter information on the Touch device by tapping More > Status > Call Status and viewing the Jitter field. If the jitter value is Good, the jitter rate is lower than 125 ms. A value of Marginal represents jitter rates between 125 and 165 ms. A value of Poor represents jitter rates above 165 ms.
- You can monitor the jitter rate for your system by logging into the IX Administration Console for your system and navigating to Monitoring > Call Statistics, then clicking the AV Call Video Stream Statistics tab and checking the jitter rates in both the Transmit and Receive areas.
The jitter rates are color-coded. Jitter rates that are less than 125 ms is marked in green. Jitter rates between 125 and 165 ms are marked in yellow, and rates above 165 ms are marked in red.
Although no system actions are performed when jitter levels are exceeded, jitter at the video frame level is closely related to dropped video frames. If the RX buffer is exceeded, the system starts dropping frames. The Cisco TelePresence system changes the call quality based on the number of dropped frames.
- You can use the status bars on the IX system screen to monitor packet drop, which is closely related to jitter. For more information, see the “Checking IX Bandwidth Quality On the System Display” section.
Note The status bars are not shown on calls that use a Cisco TelePresence Server; however, the same steps are taken, whether or not the bars are shown.
How Your IX System Measures Jitter
The IX system measures jitter upon the arrival of each frame and reports the jitter based on per 10-second and per call averages. The jitter period report provides the jitter measurement for the last 10-second period. The jitter call report shows the average jitter measurement per call. You can access both reports in the Monitoring > Call Statistics page in the IX Administration interface.
Note The IX system measures jitter between video frames and not packet frames, which applies to network devices. A video frame is a compressed picture that is used to update the screen.
The IX system calculates jitter as the sum of the maximum deviation (both late and early packets) from the expected arrival time as given by the frame period. (A late packet is a packet delivered after the picture has been reassembled and sent to the display. These packets are not lost but have the same impact as lost packets because they cannot be used by the video decoder.) The IX system computes frame jitter based on the arrival time of the last packet of a frame.
For example, for a 30 fps video stream with a measurement period of 165 ms or 5 frames (instead of 10 seconds and 300 frames), the IX system performs 5 jitter calculations. The Jitter (Period) would be reported as 5 ms (or 1 ms per frame [5 ms/5 frames = 1 ms jitter per frame]).
Frame Expected Arrival Time(ms) = 0 33 66 99 132
Offset = 0 0 +4 0 -1
Max Late = 4ms (absolute value)
Min Late = 1ms (absolute value)
Jitter/Period = 5ms (for this 165ms period)
If there were only 2 jitter periods in this call, the first period jitter measurement would be 5 msecs and the second period jitter measurement would be 10 msecs. The Jitter/Call would be reported as 7.5 msecs.
Latency
The IX system, like other Cisco TelePresence systems, has a network latency target of 150ms. This target includes network flight time but does not include codec processing time. For more detailed information about latency requirements for TelePresence systems, see the Cisco TelePresence Network Systems 1.1 Design Guide at:
https://www.cisco.com/c/dam/en/us/td/docs/solutions/TelePresence_Network_Systems_1-1_DG.pdf
Bandwidth Management for the IX System
This section describes bandwidth management considerations for the IX5000 and IX5200. The following sections describe the enhanced bandwidth control mechanisms and how they are implemented on the IX5000 and IX5200:
- Bandwidth Management Overview
- Bandwidth Provisioning Guidelines
- Bandwidth Requirements for 1080p 60-fps Main Video
- High-Definition Presentation
- Required Configuration For HD Presentation
- Video Bandwidth Allocation Weights
Note This section was previously included in the “IX Software Features” section.
Bandwidth Management Overview
To provide the immersive video experience for the IX5000 and IX5200, the system requires careful planning for peak bandwidth requirements. Make sure to provision enough bandwidth from your service provider to account for total traffic requirements. The IX supports the following bandwidth management mechanisms:
Due to the nature of the video compression, the amount of TelePresence traffic may fluctuate with microbursts. Peak rates during a video conference generally occur at the following times:
- At the beginning of the call (or when a held call is resumed).
- When repair frames are generated (for example, recovering from packet loss).
- During periods of high motion in the video (for example, when participants all stand up or walk about the room).
- When there are details and high motion in the content, or when there is a complete scene change in the presentation video.
When microbursts happen, it may cause network packet loss and trigger repair frames to be created, which introduce further traffic variability in the bandwidth utilization profile. The IX system uses enhanced encoder rate control and packet pacing to maintain constant average bandwidth usage and reduce microburst in network data traffic. See the “Gradual Decoder Refresh (GDR)” section and the “Encoder Pacing” section for more information.
Gradual Decoder Refresh (GDR)
Gradual Decoder Refresh is used to replace IDR (Instantaneous Decoder Refresh) to fulfill frame repair requests. Because IDR frames are much larger than other frames, IDR is usually the source of microbursts in video traffic. GDR gradually refreshes the picture over several frames, providing a smoother, less bursty bitstream. The IX system supports GDR for H.264 Main video on point-to-point calls between the IX system and the following systems:
GDR is supported for H.264 point-to-point Presentation video on calls between an IX system and the following systems:
For more conceptual information about GDR, see the “Bandwidth Management” chapter of the Cisco Collaboration System 11.x Solution Reference Network Designs (SRND) document.
Encoder Pacing
Encoder pacing is a bandwidth management technique used to spread packets as evenly as possible, smoothing out the peaks of the bursts of bandwidth. Network packet pacing is used in the IX5000 after the video encoder.
Encoder pacing is supported in all call scenarios. For more conceptual information about encoder pacing, see the “Bandwidth Management” chapter of the Cisco Collaboration System 11.x Solution Reference Network Designs (SRND) document.
Microbursts are observed more frequently when bandwidth is measured in smaller intervals such as 100 milliseconds. Microbursts are observed less frequently when bandwidth is measured in intervals of one second or longer over the duration of the video call.
When measuring bandwidth statistics for how the IX system performs encoder pacing, note the following:
- Perform measurements at the network interface.
- Do not rely on statistics taken from the Touch 10 device or the Administrator user interface.
Statistics shown on the Touch 10 device show only instantaneous measurements on the encoder and decoder behavior, and the jitter buffer state. These statistics do not necessarily represent accurate measurements of the bandwidth used. These statistics should be interpreted as the bandwidth prior to the encoder pacing mechanism.
Bandwidth Provisioning Guidelines
To ensure the immersive video experience of the IX system, make sure to provision sufficient bandwidth from your service provider. Like other systems, the IX system requires bandwidth consumption headroom on top of average bandwidth usage to account for network overhead and microburst absorption.
Table 1-2 , Table 1-3 , Table 1-4 , and Table 1-5 show the recommended amount of bandwidth to provision from a service provider to handle different call scenarios.
The following information applies to the recommended bandwidth guidelines:
- The bandwidth provisioning guidelines add 20 percent over normal bandwidth requirements to account for Layer 2-to-Layer 4 and other overhead. The calculations also include allowance for bursts and microbursts.
- The guidelines shown are for best motion quality only. Bandwidth requirements for good and better motion quality are less.
For information on calculating bandwidth requirements, see the “Understanding How Endpoints Determine fps and Video Quality” section.
Table 1-2 shows the bandwidth provisioning guidelines for 1080p30 content over H.264.
|
|
||
---|---|---|---|
|
|
|
|
|
|
||
IX system to CTS or TX system1 |
|||
IX system to Cisco TelePresence Server2 |
|||
2.Cisco TelePresence Server supports “Better” motion quality only in TIP/MUX calls. |
Table 1-3 shows the bandwidth provisioning guidelines for 1080p5 content over H.264.
|
|
||
---|---|---|---|
|
|
|
|
|
|
||
IX system to CTS or TX system3 |
|||
|
Table 1-4 shows the bandwidth provisioning guidelines for 1080p30 content over H.265.
|
|
||
---|---|---|---|
|
|
|
|
|
|
||
Table 1-5 shows the bandwidth provisioning guidelines for 1080p5 content over H.265.
|
|
||
---|---|---|---|
|
|
|
|
|
|
||
Bandwidth Requirements for 1080p 60-fps Main Video
The Cisco TelePresence IX5000 System can send and receive main video at 60 fps (frames per second) with 1080p quality (1080p 60) during a point-to-point call. The following sections describe the 1080p 60-fps Main Video functionality:
Required Main Video Configuration
The following configuration is required to enable the 60-fps main video feature in your Cisco TelePresence environment:
– Set Main Display Frames Per Second in the Phone Configuration page to “60 fps main”. For more information, refer to the “Product Specific Configuration Layout Area” section of the Configuring Cisco Unified Communications Manager for the IX System document.
– (Optional) Set Video Bandwidth Allocation Weights on the Phone Configuration page appropriately. For more information about this parameter, see the “Video Bandwidth Allocation Weights” section.
After performing the required configuration, the IX endpoints can send and receive main video at a maximum frame rate of 60 fps.
Understanding How Endpoints Determine fps and Video Quality
During Cisco TelePresence call setup, the sending and receiving endpoints determine the fps (30 or 60 fps) and video quality (1080p or 720p) for the sent and received video streams.
The determination is made as a result of the following factors:
- The amount of Transport Independent Application Specific (TIAS) bandwidth that is negotiated between the sending and receiving endpoints.
The minimum amount of bandwidth is determined by the settings of Main Display Frames Per Second and Quality (Per Display) in the Phone Configuration page of Unified CM. Figure 1-1 and Table 1-6 show the bandwidth requirements for H.265 based on the Unified CM configuration for 30 fps and 60 fps calls. Figure 1-2 and Table 1-7 show the corresponding bandwidth requirements for H.264.
- The video compression standard (H.264 or H.265) that is used.
- The maximum frame size that the network and system can accommodate.
- The negotiation of the video resolution and frame rate (in fps) by the sending and receiving endpoints.
- The maximum bit rate allowed in the Region settings for your device in Unified CM. These settings are applied to the Device Pool, which in turn are applied to your device.
To find your region settings, log in to the Cisco Unified CM Administration GUI and navigate to System > Region. The maximum rate is the value shown in the Max Video Call Bit Rate (Includes Audio) field.
- The packet loss that is detected during a call. This loss is shown as status bars that appear on the main display. If the rate changes, the new rate is shown on the main display. Table 1-11 shows the bars and provides a description.
Table 1-6 and Table 1-7 show the required criteria in the first three columns of the table. The resulting video stream that can be sent is shown in the fourth column of the table.
If any factors do not meet the minimum requirements, the system attempts to send and receive video at the next lowest rate as shown in Table 1-10 .
For example, if the network cannot meet the minimum requirements to send a video stream with the maximum video quality of 1080p 60, the system attempts to negotiate a video stream of 720p 60. If the system cannot meet the requirements to send a video stream of 720p 60, it attempts to negotiate a video stream of 720p 30 as shown in Table 1-10 .
If a call is put on hold, then resumed, the amount of bandwidth is renegotiated using the same factors.
Note These rates are given per video stream, and do not include the presentation stream. Since the IX system has three video streams for the three displays, multiply this number by three. Then add the bandwidth required for the presentation given in Table 1-15 to obtain the required network bandwidth.
|
|
|
|
---|---|---|---|
|
|
|
|
---|---|---|---|
Figure 1-1 shows the bandwidth required when using the H.265 video compression format, while Figure 1-2 shows the bandwidth required when using the H.264 format.
Figure 1-1 H.265 Bandwidth Requirements Per Unified CM Quality (Per Display)
Figure 1-2 H.264 Bandwidth Requirements Per Unified CM Quality (Per Display)
Table 1-8 provides the bandwidth requirements for H.265 for 30 fps and 60-fps calls; Table 1-9 provides the same information for H.264.
|
|
||
---|---|---|---|
|
|
||
---|---|---|---|
|
|
---|---|
The 60 fps-capable Cisco TelePresence endpoints and device send the main video stream at 30 fps under the following circumstances:
- When in a call with an endpoint or device that supports a maximum frame rate of 30 fps.
- When in a call with a Cisco TelePresence endpoint that is registered with a Cisco Unified CM version that does not support 60 fps as a setting for Main Display Frames Per Second in the Phone Configuration page.
- When in a call with a Cisco TelePresence endpoint that is registered with a Unified CM version that supports 60 fps, but “30 fps main” is the setting for Main Display Frames Per Second in the Phone Configuration page.
Checking IX Bandwidth Quality On the System Display
When the IX software detects changes in network quality in the network, an icon is displayed on the main display screen. When connection quality reaches the poor state, the call is terminated.
Table 1-11 describes the call connection status icons that display on the main display.
Note Five, four, and three bars are reserved to show the video rates (1080p, 720p, or CIF) for the call. The remaining bars are reserved to show packet loss. A call dropping to one or two bars indicates that your network is having excessive packet loss at the rates shown in Table 1-11.
High-Definition Presentation
This section provides you with information about the supported presentation resolutions and presentation audio and video cables and includes the following sections:
HD Presentation Overview
Systems running IX software support high-definition (HD) presentations up to a maximum of 1080p resolution at 30 frames per second (1080p 30). The Cisco TelePresence IX5000 and IX5200 systems support the HD presentation feature.
Supported Presentation Devices and Tested Adapters
The presentation cable for the IX5000 system includes the following digital connectors: DisplayPort, Mini-DisplayPort, and HDMI.
Various third-party presentation devices are supported by connectors on the IX system’s presentation cable. Supported presentation devices include:
Note Some Apple devices show flickering on the screen when being used for presentation sharing. If you encounter this issue, change the resolution quality to a lower resolution.
If an adapter is required for VGA presentations, Cisco recommends the following tested adapter:
Resolution Support
The proportional relationship between the width and height (also known as aspect ratio) of the HD presentation signal is 16:9, whereas an analog presentation signal is 4:3.
Table 1-12 outlines the common resolutions shared by the presentation devices, the corresponding aspect ratios, and the presentation digital cables that support the resolutions. While the IX system supports all of the presentation resolutions indicated in the table, EDID (Extended Display Identification Data) only supports 1080p, 720p, and VGA.
|
|
---|---|
Note Some presentation resolutions do not take up the full space of the presentation display area. For example, if a laptop that is sending the presentation is set to a resolution of 1600x900, and the presentation display is set to a resolution of 1920x1080, the image shown on the display is 1600x900 pixels with a black border around it to make a total pixel size of 1920x1080.
To eliminate this black border around the presentation display when using a PC, open the PC’s Control Panel, navigate to the laptop’s Change Desktop Background, and change the picture position to “Fill.”
Required Configuration For HD Presentation
If both Binary Floor Control Protocol (BFCP) and TelePresence Interoperability Protocol (TIP) are negotiated for a call, TIP takes precedence, and BFCP is not used to control presentation. BFCP is the preferred protocol for controlling the presentation for systems that run IX software.
Table 1-13 describes the Unified CM configuration that enables HD presentation to function on IX endpoints. No additional configuration is required.
|
|
---|---|
For each IX endpoint, use the Standard SIP Profile (not the Standard SIP BFCP Profile) for the SIP profile, even if you use BFCP. |
If you specify the Standard SIP BFCP profile, calls might be dropped and BFCP might not function properly. |
Make sure that you check the Allow Presentation Sharing Using BFCP check box in the Protocol Specific Information area of the Phone Configuration window. |
For more information about configuring BFCP, refer to the “Configuring the BFCP over UDP Collaboration Feature” section of the Configuring Cisco Unified Communications Manager for the IX System document. |
In the Phone configuration, the default value of Video Bandwidth Allocation Weights is 8 Main/2 Presentation. Adjust this setting if desired. |
For more information about this parameter, see the “Video Bandwidth Allocation Weights” section. |
For more details about the Unified CM configuration, see the Configuring Cisco Unified Communications Manager for the IX System document.
Scaling HD Presentation Video Resolution
For HD presentation, the system running IX software that is sharing content can automatically change resolutions for the content being shared, such as a slide presentation, document, or video. Table 1-14 outlines common presentation device display resolutions, and the higher or lower resolutions to which content can be scaled and sent by the IX endpoint.
In general, HD presentation supports the following scaling schemes:
- Scaling to a higher resolution is not supported except for 1280x720 (720p), which can be scaled to XGA.
- Scaling to lower resolutions of XGA is supported.
Scaling is done to accommodate lower network bandwidth. Before the system running IX software scales the presentation, it lowers the frame rate of the presentation, which preserves the video clarity.
|
|
|||
---|---|---|---|---|
Bandwidth Requirements for the HD Presentation Feature
The resolutions and frame rates at which each type of video can be sent is based on the available bandwidth.
Table 1-8 and Table 1-9 display the bandwidths required for supported 60 fps main video send resolutions and frame rates, while Table 1-15 displays the bandwidth required for supported HD presentation send resolutions and frame rates.
Note Regardless of how much network bandwidth is actually used as described in Table 1-15, the IX system always negotiates the maximum amount of bandwidth for an HD presentation (4000Kbps).
|
(Kbps) |
(Kbps) |
(Kbps) |
(Kbps) |
---|---|---|---|---|
4.These bandwidth rates are the lowest possible rates required. The higher bandwidths are the recommended minimum bandwidth for best performance. |
For calls that support more than one presentation stream, the required bandwidth is multiplied for each additional stream. When two presentation streams are used in a P2P call between IX endpoints, the required bandwidth is doubled. For example, two presentation streams at 1080p60 would require 8 Mbps.
For more information about calculating bandwidth for video and presentation content, see the “Sample Bandwidth Calculations” section,
Tip If, in a limited bandwidth scenario, you want to send a presentation with a higher fps but a lower resolution, you can change the resolution of the presentation at the source of the presentation. For example, given a maximum rate of 1000 Kbps, if you are sending a 1920x1080 presentation at 5 fps, you can instead send 640x480at 30 fps by changing the resolution of your presentation to 640x480.
Multiple Presentation Streams
On an IX-to-IX point-to-point conference, you can share up to two presentation streams - for example, one presentation content source and one whiteboard source. If hosting a meeting locally with no video conferencing, you can share up to three presentation streams.
Video Bandwidth Allocation Weights
The Video Bandwidth Allocation Weights parameter allows you to balance the bandwidth ratio for main video and presentation video during a conference.
Use this feature when the amount of session bandwidth that is used by a Cisco TelePresence endpoint to send audio, main video, and presentation video media streams exceeds the amount of available session bandwidth.
You add this value in the Bandwidth Allocation Weights field in the Product Specific Configuration Layout Area of the Unified CM administration console.
The weight is based on a total number of 10m and the default value of this parameter is a weight of 8 for main video and a weight of 2 for presentation video (8 main / 2 presentation).
The Cisco TelePresence System IX5000 supports the bandwidth allocation feature.
The following values are supported for this feature. The first value is the weight for main video, and the second value is the weight for the presentation video.
8 main / 2 presentation (default)
Note If you see bandwidth allocation weights used in the System Operations Log, check the Unified CM region settings and ensure enough video bandwidth is allowed for video and immersive video calls. The Unified CM region settings frequently can trigger the bandwidth allocation weights, which can significantly reduce call quality.
Sample Bandwidth Calculations
Bandwidth for a full 1080p60 “best” quality call with a 1080p30 presentation would require the following bandwidth:
H.265: 8.1 Mbps main video + 4 Mbps (presentation video@1080p 30) = 12.1 Mbps
H.264: 13.5 Mbps main video + 4 Mbps (presentation video@1080p 30) = 17.5 Mbps
Note The main video rates are derived from the rate per display multiplied by the number of screens (three). For H.265, given the rates in Table 1-8, the rate is 2.7 Mbps per screen for 1080p60 (Good), and for H.264, given the rates in Table 1-9, the rate is 4.5 Mbps per screen for 1080p60 (Good). Multiplying these numbers gives you the video rates of 8.1 and 13.5 Mbps, respectively.
To fit the available bandwidth, the endpoint performs calculations based on these general formulas, which include values from the Video Bandwidth Allocation Weights parameter:
Session Video Bandwidth (SVB) = Total session bandwidth - Audio bandwidth
Main Video Weight (Mwt) = Configured weight for main video stream
Total Weight for main video (T_Mwt) = Number of Streams x Main Video weight
Presentation Video Weight (Pwt) = Configured weight for presentation video stream
Total Weight for presentation video (T_Pwt) = Number of streams x Presentation video weight
Total Weight (TW) = T_Mwt + T_Pwt
Formula to allocate main video bandwidth:
Formula to allocate presentation video bandwidth:
Given a total available network bandwidth of 6.0 megabits per second (Mbps), and a single presentation stream:
T_Mwt = (3 (number of main video streams) x 8) (Mwt) = 24
T_Pwt = (1 (number of presentation video streams) x 2) (Pwt) = 2
Allocated main video bandwidth = 6 x (24/26) = 5.53 Mbps (1.84 Mbps per display)
Allocated presentation video bandwidth = 6 x (2/26) = 0.47 Mbps
For H.265, the 1.84 Mbps per display is sufficient to support either 720p60 (better) or 1080p30 (good), per the rates in Table 1-8 , depending on whether the initial call was negotiated at 60 or 30 fps.
For H.264, the 1.84 Mbps per display is sufficient to support 720p30 (better), per the rates in Table 1-9 .
Both H.265 and H.264 encoding support a resolution stream of 720p @ 5 fps if receiving, and 640x480 @ 5 fps if sending, per the rates in Table 1-15 .
Note This example assumed a single presentation stream. Additional content streams change the calculation in that the available bandwidth stream is divided by 2 for each content.