ISO/IEC 23003-3:2020 Information technology - MPEG audio technologies - Part 3: Unified speech and audio coding

Standard number: ISO/IEC 23003-3:2020

Chinese title: 信息技术 MPEG音频技术 第3部分:统一语音和音频编码

English title: Information technology — MPEG audio technologies — Part 3: Unified speech and audio coding

Publication date: 2020-06

Scope

This document specifies a unified speech and audio codec which is capable of coding signals having an arbitrary mix of speech and audio content. The codec has a performance comparable to, or better than, the best known coding technology that might be tailored specifically to coding of either speech or general audio content. The codec supports single and multi-channel coding at high bitrates and provides perceptually transparent quality. At the same time, it enables very efficient coding at very low bitrates while retaining the full audio bandwidth.

This document incorporates several perceptually-based compression techniques developed in previous MPEG standards: perceptually shaped quantization noise, parametric coding of the upper spectrum region and parametric coding of the stereo sound stage. However, it combines these well-known perceptual techniques with a source coding technique: a model of sound production, specifically that of human speech.
