G.722.2 (AMR-WB) Codec: How Adaptive Multi-Rate Wideband Audio Works

Jun 12, 2026 Melissa Shannon

Have you ever noticed the difference between a phone call on an old landline and one on a modern smartphone? The voice sounds richer, clearer, and more natural. That shift isn't just better hardware; it's the result of smarter software compressing your voice data. At the heart of this improvement is G.722.2, also known as AMR-WB. It is the engine that makes high-quality voice communication possible across both wireless networks and internet protocols without eating up all your bandwidth.

Quick Summary / Key Takeaways

G.722.2 (AMR-WB) is a wideband audio codec standardized by ITU-T and 3GPP, offering superior voice quality compared to narrowband alternatives like G.711.
It operates at a fixed sampling rate of 16 kHz but adapts its bit rate dynamically across nine modes, ranging from 6.6 kbps to 23.85 kbps.
The codec uses ACELP technology to compress audio efficiently, making it ideal for VoIP, mobile networks (GSM/WCDMA), and teleconferencing.
Features like VAD (Voice Activity Detection) and DTX (Discontinuous Transmission) help save network resources during silence.
Unlike newer codecs like Opus, G.722.2 remains a mandatory standard in many cellular networks due to its reliability and widespread compatibility.

What Is G.722.2 (AMR-WB)?

To understand why G.722.2 matters, we first need to look at what it actually does. In simple terms, it is an algorithm that squeezes your spoken voice into small digital packets so they can travel over a network-whether that’s a 4G cell tower or a Wi-Fi router-and then unpacks them on the other end so the listener hears clear speech.

The "WB" in AMR-WB stands for Wideband. Traditional telephone systems, often called Plain Old Telephone Service (POTS), use a narrow frequency range of roughly 300 Hz to 3,400 Hz. This cuts off the higher frequencies that make voices sound crisp and distinct. G.722.2 expands this range significantly to 50 Hz to 7,000 Hz. By capturing these extra frequencies, the codec preserves the nuances of human speech, including breathiness and consonant clarity, which makes conversations feel less like talking through a tin can.

This technology was a game-changer when it arrived. Standardized by the ITU-T in July 2003 and earlier by 3GPP in 2001, it was the first codec designed to work seamlessly across both wireline (fixed-line) and wireless (mobile) services. Before this, engineers had to deal with different standards for different networks. G.722.2 bridged that gap, allowing a call to start on a mobile phone and transfer to a landline without losing quality or requiring complex translation layers.

How Does the Technology Work?

Under the hood, G.722.2 relies on a sophisticated method called Algebraic Code Excited Linear Prediction (ACELP). While that name sounds intimidating, the concept is straightforward. Instead of recording every single detail of the sound wave, ACELP creates a mathematical model of your vocal tract. It sends instructions on how to recreate the sound rather than the sound itself.

Here is the process in plain English:

Sampling: The microphone captures audio, and the codec samples it at 16,000 times per second (16 kHz). This is double the rate of traditional narrowband codecs, which sample at 8 kHz.
Analysis: The codec analyzes short chunks of audio, specifically frames lasting 20 milliseconds. It uses an analysis window of 384 samples to understand the structure of the speech within that tiny slice of time.
Compression: Using ACELP, it identifies the essential parameters needed to reconstruct the voice. It strips away redundant data and compresses the remaining information into a compact packet.
Transmission: These packets are sent over the network. Because the codec is adaptive, it can change how much data it sends based on current conditions.

The magic of G.722.2 lies in its adaptability. It doesn't stick to one size fits all. It offers nine distinct bit rates:

6.6 kbps
8.85 kbps
12.65 kbps
14.25 kbps
15.85 kbps
18.25 kbps
19.85 kbps
23.05 kbps
23.85 kbps

If the network is congested, the codec can drop to a lower bit rate like 6.6 kbps to ensure the call stays connected, albeit with slightly reduced quality. If the connection is strong, it jumps to 23.85 kbps for crystal-clear audio. This switching happens instantly at every 20-millisecond frame boundary, ensuring smooth transitions without audible glitches.

Cartoon robot filtering raw audio data into organized compressed packets.

Performance and Resource Usage

For developers and network engineers, knowing how much computing power a codec requires is crucial. G.722.2 is efficient, but it does demand resources, especially on older hardware.

Computational Requirements for G.722.2 Encoding and Decoding

Metric	Encoding (Processing Power)	Decoding (Processing Power)	Memory Footprint
C6xx/C7x Processors	63.8 - 93.4 MIPS	26.8 - 34.1 MIPS	Varies by implementation
MIPS32 24Ke Architecture	60 - 115 MIPS (depending on mode)	8 - 18 MIPS	~150 KB Program Memory (Encode+Decode)
Data Memory	29,138 bytes (Encode only)	24,588 bytes (Decode only)	32,500 bytes (Combined)

Note that encoding (compressing the audio) is always more computationally expensive than decoding (playing it back). On a typical mobile device, the battery impact is minimal because modern chips handle these calculations effortlessly. However, in embedded systems or IoT devices with limited processing power, engineers must carefully balance the bit rate against available MIPS (Million Instructions Per Second).

G.722.2 vs. Other VoIP Codecs

You might wonder why we don't just use the latest codec for everything. The answer lies in compatibility and specific use cases. Let's compare G.722.2 with its main competitors: G.711, G.729, and Opus.

Comparison of Common VoIP Codecs

Feature	G.711 (Narrowband)	G.729 (Low Bitrate)	G.722.2 (AMR-WB)	Opus (Modern Hybrid)
Bandwidth	300-3,400 Hz	300-3,400 Hz	50-7,000 Hz (Wideband)	Up to 20 kHz (Fullband)
Bit Rate	64 kbps (Fixed)	8 kbps (Fixed)	6.6-23.85 kbps (Adaptive)	6-510 kbps (Highly Adaptive)
Sampling Rate	8 kHz	8 kHz	16 kHz	8, 12, 16, 24, 48 kHz
Primary Use Case	Legacy PSTN, High Quality LAN	Bandwidth-constrained WAN	Mobile Networks (3G/4G), VoIP	WebRTC, Gaming, Streaming
Latency	Very Low (<1 ms)	Low (15 ms)	Medium (20 ms + jitter buffer)	Ultra-Low (configurable down to 5 ms)

G.711 is the gold standard for uncompressed quality but hogs bandwidth at 64 kbps. It sounds great on a local network but is wasteful over cellular connections. G.729 is the opposite-it’s incredibly efficient at 8 kbps but sacrifices high-frequency details, making voices sound robotic. Opus is the modern champion for internet applications like Zoom or Discord because it handles music and speech equally well with ultra-low latency. However, G.722.2 holds its ground because it is the mandatory standard for GSM and WCDMA networks. You cannot build a cellular modem without supporting it. It strikes the perfect balance between quality and efficiency for voice-centric mobile communications.

Key Features That Make It Reliable

Beyond raw compression, G.722.2 includes several features that enhance the user experience in real-world scenarios where networks are imperfect.

Voice Activity Detection (VAD)

When you stop talking, the codec detects the silence. Instead of sending empty packets, it pauses transmission. This saves significant bandwidth, especially in group calls where people talk intermittently.

Discontinuous Transmission (DTX)

Closely linked to VAD, DTX ensures that no data is sent during silent periods. This reduces network congestion and extends battery life on mobile devices since the radio transmitter doesn't have to stay active constantly.

Comfort Noise Generation (CNG)

Absolute silence on a call feels unnatural and can make users think the call dropped. CNG generates subtle background noise (like room hum) on the receiving end to mimic a real environment. This psychological trick maintains the perception of a stable connection even when no voice data is being transmitted.

Packet Loss Concealment (PLC)

In IP networks, packets sometimes get lost due to congestion. G.722.2 implementations often include PLC algorithms that predict what the missing audio should sound like based on previous frames. This prevents harsh clicks or gaps in conversation, smoothing out minor network issues.

Whimsical network scene showing adaptive data rates and silence detection.

Implementation and Standards

Because G.722.2 is an international standard, you don't have to write the compression algorithm from scratch. The ITU-T Recommendation G.722.2 includes Annex C, which provides reference C source code. This allows vendors to create compliant encoders and decoders quickly.

Major telecom equipment manufacturers and software providers offer optimized libraries. For example, companies like Adaptive Digital and Synopsys provide DSP-optimized versions that run efficiently on specific hardware architectures. These implementations support multi-channel operations, meaning a single server can handle thousands of concurrent calls without crashing.

The codec is also part of the CableLabs PacketCable 2.0 specification, ensuring it works not just on cellular towers but also on cable broadband voice services. This broad adoption means that if you build a VoIP app today, supporting G.722.2 is almost non-negotiable if you want interoperability with existing telephony infrastructure.

Frequently Asked Questions

Is G.722.2 still relevant in 2026?

Yes, absolutely. While newer codecs like Opus dominate internet-based video conferencing, G.722.2 remains the backbone of cellular voice communications. It is mandatory for GSM and WCDMA networks, which still serve billions of users globally. As long as traditional voice calls exist on mobile networks, G.722.2 will be required.

Can G.722.2 transmit music clearly?

Not really. G.722.2 is optimized for human speech, not music. Its wideband range (50-7000 Hz) captures vocal nuances well, but it lacks the full frequency spectrum (up to 20 kHz) needed for high-fidelity music reproduction. For music streaming, codecs like AAC or Opus are far superior.

What is the difference between AMR and AMR-WB?

AMR (Adaptive Multi-Rate) refers to the narrowband version used in early 3G networks, operating at 8 kHz sampling. AMR-WB (Wideband) is G.722.2, which operates at 16 kHz sampling. AMR-WB provides significantly better voice quality because it captures higher frequencies, making voices sound more natural.

Does G.722.2 require encryption?

The codec itself does not include encryption. It is purely a compression tool. To secure a call using G.722.2, you must wrap the compressed audio stream in a secure protocol like SRTP (Secure Real-Time Transport Protocol) or TLS (Transport Layer Security) depending on the signaling method (e.g., SIP over TLS).

Why does my VoIP call sound choppy even with G.722.2?

Choppiness is usually caused by network jitter or packet loss, not the codec itself. While G.722.2 has Packet Loss Concealment, severe network issues can overwhelm it. Ensure your network has low latency (<150 ms round trip) and sufficient bandwidth. Also, check if Voice Activity Detection is causing abrupt silences; adjusting VAD sensitivity can help.

Next Steps for Developers and Admins

If you are setting up a VoIP system, here is how to proceed:

Enable G.722.2 as a Primary Codec: Configure your SIP servers (like Asterisk or FreeSWITCH) to prefer G.722.2 for wideband calls, falling back to G.711 or G.729 for legacy compatibility.
Test Network Conditions: Use tools like Wireshark to monitor packet loss and jitter. G.722.2 performs best on stable connections with minimal latency.
Optimize Hardware: If deploying on embedded devices, verify that your processor has enough MIPS headroom for encoding, especially at higher bit rates.
Consider Hybrid Solutions: For modern web apps, consider implementing Opus alongside G.722.2 to support both high-quality internet media and seamless handoffs to cellular networks.

By understanding the strengths and limitations of G.722.2, you can build communication systems that deliver clear, reliable voice experiences regardless of the underlying network infrastructure.