SIP and H.323 Protocols: Architecture, Components, and Functionality
SIP: Session Initiation Protocol
SIP (Session Initiation Protocol) is a signaling protocol used at the application layer to create, modify, and terminate sessions with one or more participants. These sessions can include voice, video, data, and other forms of internet media.
IETF Architecture Protocols
The IETF architecture includes protocols such as:
- RTP and RTCP: Provide real-time delivery of media.
- RTSP: A real-time streaming protocol that provides a supply-demand mechanism in real time.
- SDP: Session Description Protocol, which provides a standard format for describing media exchange capabilities.
SIP Attributes
Key attributes of SIP include:
- Simplicity
- Efficiency
- Scalability
- Modularity
- Mobility support
- User programming
- Extensibility
System Components
The main system components are:
- User Agents (UA): Endpoint applications that send and receive SIP requests.
- Proxy Servers: Applications that receive requests from user agents and forward them towards the destination. They also maintain records to indicate routing directions.
Structure of SIP Messages
SIP messages consist of:
- Start Line
- Header
- Body
Common SIP methods include:
- INVITE: Starts a VoIP call.
- ACK: Acknowledges a SIP response.
- OPTIONS: Queries a server’s capabilities.
- BYE: Signals the end of a call.
H.323 Protocol
H.323 is an ITU-T specification for transmitting audio, video, and data over IP networks. It supports signaling and call control, media transport and control, bandwidth management, and point-to-point and multipoint conferencing.
H Series Recommendations
H.323 is part of the H Series recommendations, including H.320 for ISDN and H.324 for POTS.
Components and Protocols
Key components and protocols include:
- Signaling: H.225
- Media Control: H.245
- Audio Codecs: G.711, G.722, G.723, G.728, G.729
- Video Codecs: H.261, H.263
- Data Sharing: T.120
- Media Transport: RTP/RTCP
H.323 Elements
H.323 networks consist of:
- Terminals
- Gateways: Provide interfaces with ISDN or PSTN for interworking.
- Gatekeepers: Offer admission control and address translation services.
- MCUs: Multipoint Control Units that enable audio and video conferencing between two or more terminals and gateways.
Main Control Areas
The main control areas are:
- Signaling and Registration
- Admission and RAS (Registration, Admission, and Status)
- Call Control Signaling
- Media Control and Transport
RAS Protocol
RAS provides pre-call control in H.323 networks with gatekeepers, establishing a zone between endpoints and gatekeepers over an IP network.
H.245 Control
H.245 procedures and messages enable:
- Capability Exchange
- Master-Slave Determination
- Round-Trip Delay Measurement
- Logical Channel Signaling
Fast Connect Procedure
The procedures available for establishing media channels between endpoints are H.245 and Fast Connect.
H.245 Tunneling
H.245 Tunneling improves call connection time and resource allocation and synchronizes signaling and call control.
Codecs
Waveform Codecs
Waveform codecs rebuild the input signal without modeling the process that created it.
Source Codecs
Source codecs attempt to replicate the physical process of sound creation.
Hybrid Codecs
Hybrid codecs combine modeling and waveform analysis, employing strategies like MPE, RPE, and CELP.
Conversation Quality Factors
Factors affecting conversation quality include:
- Frame loss or codec errors
- Background noise
- Simultaneous speakers
- Non-conversational speech
- Multiple codec cycles
Audio and Signaling Paths
Audio Path
The audio path follows this structure: RTP, RTCP (12 bits) > UDP (8 bits) > IP (20 bits) > link > physical layer.
Signaling Path
The signaling path follows this structure: H.323 > RAS (admission, registration, status) > H.225 (setup Q.931) > H.245 (capability exchange, logical channel establishment, TCP channel establishment, A/V channel setup).
Common Codecs
Common codecs include:
- G.711/PCM/64
- G.726/ADPCM/32
- G.728/LDCELP/16
- G.729/CSACELP/8
- G.729A/CSACELP/8
- G.723.1/MPMLQ/6.3
- G.723.1/ACELP/5.3
Codec Selection Criteria
Criteria for selecting codecs include:
- Bit rate
- Algorithm delay
- Processing complexity
- MOS (Mean Opinion Score)
- Return signals that are not conversational
Voice over IP Calculations
Bytes voice/plot = (64000 * 0.020) / 8; +40 byte IP packets, +26 bytes eth frame; Bandwidth = bytes plot * ETHx8 / 0.020
VAD = Bandwidth