ISO/IEC 23003-2:2010

Withdrawn (Edition 1, 2010)

New version available: ISO/IEC 23003-2:2018

Abstract

ISO/IEC 23003-2:2010 specifies the reference model of MPEG Spatial Audio Object Coding (SAOC): an efficient parametric coding technology designed to encode, transmit, and interactively render multiple audio objects for playback with various kinds of channel configurations (mono, stereo, 5.1, headphones/binaural). Rather than performing a discrete coding of the individual audio input signals, MPEG SAOC captures the perceptually relevant properties of audio signals into a compact set of parameters that are used to synthesize a flexibly rendered audio scene from a transmitted downmix signal.

MPEG SAOC extends MPEG Surround in a way that provides several significant advantages in terms of additional functionality available to users. It allows the user on the decoding side to interactively control the multi-channel rendering of each individual audio object on different kinds of sound reproduction setup. In addition, MPEG SAOC inherits many advantages of MPEG Surround technology, like transmission (in a backward compatible way) of complex multi-object audio content at bitrates not much higher than what is required for its mono or stereo downmix. MPEG SAOC processing effectively reuses the multi-channel rendering functionality of MPEG Surround in a computationally efficient manner. Therefore, MPEG SAOC technology can be directly used to extend MPEG Surround and upgrade existing distribution infrastructures for stereo or mono audio content (teleconferencing systems, music downloads, Internet streaming, etc.) towards the delivery of audio content while retaining full compatibility with existing receivers. Rendering can be interactively controlled by the end-user and is independent of the playback system setup.

Key features of MPEG SAOC are:

interactive rendering of audio objects on the decoder/receiver side;
transmitted SAOC bit stream is independent of loudspeaker (or headphones) configuration;
low-power processing mode (e.g. for applications on portable devices);
low-delay processing mode (e.g. for communication applications);
flexibly selectable bitrate overhead, allowing scalability from low bitrate applications such as Internet streaming to high-quality applications such as custom remix of music;
it can be applied upon audio using any coding scheme;
backward compatibility: the default downmix is always available for legacy playback devices.

General information

Status
: Withdrawn

Publication date
: 2010-10

Stage
: Withdrawal of International Standard [95.99]
Edition
: 1

Number of pages
: 130
Technical Committee :
ISO/IEC JTC 1/SC 29

ICS :
35.040.40
RSS updates

Life cycle

Now

Withdrawn

ISO/IEC 23003-2:2010
Stage: 95.99
Corrigenda

Correct the current edition; free; not included in the text of the existing standard.

Withdrawn

ISO/IEC 23003-2:2010/Cor 1:2012

Withdrawn

ISO/IEC 23003-2:2010/Cor 2:2014

Amendments

Provide additional content; available for purchase; not included in the text of the existing standard.

Withdrawn

ISO/IEC 23003-2:2010/Amd 1:2015

Withdrawn

ISO/IEC 23003-2:2010/Amd 2:2015

Withdrawn

ISO/IEC 23003-2:2010/Amd 3:2015

Withdrawn

ISO/IEC 23003-2:2010/Amd 4:2016

Withdrawn

ISO/IEC 23003-2:2010/Amd 5:2016
Revised by

Published

ISO/IEC 23003-2:2018

Got a question?

Check out our Help and Support

Abstract

General information

Life cycle

Now

ISO/IEC 23003-2:2010

Corrigenda

Amendments

Revised by

Got a question?