ISO/IEC 23003-2:2010 信息技术 MPEG音频技术 第2部分:三维空间音频对象编码(SAOC)
标准编号:ISO/IEC 23003-2:2010
中文名称:信息技术 MPEG音频技术 第2部分:三维空间音频对象编码(SAOC)
英文名称:Information technology — MPEG audio technologies — Part 2: Spatial Audio Object Coding (SAOC)
发布日期:2010-10
标准范围
ISO/IEC 23003-2:20 10规定了MPEG空间音频对象编码(SAOC)的参考模型:一种高效的参数编码技术,设计用于编码、传输和交互式渲染多个音频对象,以利用各种声道配置(单声道、立体声、5.1、耳机/双耳)进行回放。MPEG SAOC不是执行单个音频输入信号的离散编码,而是将音频信号的感知相关属性捕获到一组紧凑的参数中,这些参数用于从传输的下混信号合成灵活渲染的音频场景。MPEG SAOC扩展了MPEG环绕声,在用户可用的附加功能方面提供了几个显著的优点。它允许解码侧的用户交互式地控制多-不同类型的声音再现设置上每个单独音频对象的通道渲染。此外,MPEG SAOC继承了MPEG环绕声技术的许多优点,如(以向后兼容的方式)以不比其单声道或立体声下混所需的比特率高多少的比特率传输复杂的多对象音频内容。MPEG SAOC处理以计算高效的方式有效地重用MPEG环绕的多声道渲染功能。因此,MPEG SAOC技术可以直接用于扩展MPEG环绕声和升级用于立体声或单声道音频内容(电话会议系统、音乐下载、互联网流等)的现有分发基础设施,以实现音频内容的传送,同时保持与现有接收器的完全兼容性。渲染可以由最终用户交互地控制,并且独立于回放系统设置。MPEG SAOC的关键特征是:在解码器/接收器侧交互渲染音频对象;发送的SAOC比特流独立于扬声器(或耳机)配置;低功率处理模式(例如,用于便携式设备上的应用);低延迟处理模式(例如,用于通信应用);可灵活选择的比特率开销,允许从低比特率应用(如互联网流)到高质量应用(如音乐的自定义混音)的可伸缩性;它可以应用于使用任何编码方案的音频;向后兼容性:默认缩混始终适用于传统播放设备。
ISO/IEC 23003-2:2010 specifies the reference model of MPEG Spatial Audio Object Coding (SAOC): an efficient parametric coding technology designed to encode, transmit, and interactively render multiple audio objects for playback with various kinds of channel configurations (mono, stereo, 5.1, headphones/binaural). Rather than performing a discrete coding of the individual audio input signals, MPEG SAOC captures the perceptually relevant properties of audio signals into a compact set of parameters that are used to synthesize a flexibly rendered audio scene from a transmitted downmix signal.
MPEG SAOC extends MPEG Surround in a way that provides several significant advantages in terms of additional functionality available to users. It allows the user on the decoding side to interactively control the multi-channel rendering of each individual audio object on different kinds of sound reproduction setup. In addition, MPEG SAOC inherits many advantages of MPEG Surround technology, like transmission (in a backward compatible way) of complex multi-object audio content at bitrates not much higher than what is required for its mono or stereo downmix. MPEG SAOC processing effectively reuses the multi-channel rendering functionality of MPEG Surround in a computationally efficient manner. Therefore, MPEG SAOC technology can be directly used to extend MPEG Surround and upgrade existing distribution infrastructures for stereo or mono audio content (teleconferencing systems, music downloads, Internet streaming, etc.) towards the delivery of audio content while retaining full compatibility with existing receivers. Rendering can be interactively controlled by the end-user and is independent of the playback system setup.
Key features of MPEG SAOC are:
- interactive rendering of audio objects on the decoder/receiver side;
- transmitted SAOC bit stream is independent of loudspeaker (or headphones) configuration;
- low-power processing mode (e.g. for applications on portable devices);
- low-delay processing mode (e.g. for communication applications);
- flexibly selectable bitrate overhead, allowing scalability from low bitrate applications such as Internet streaming to high-quality applications such as custom remix of music;
- it can be applied upon audio using any coding scheme;
- backward compatibility: the default downmix is always available for legacy playback devices.
标准预览图


