Designing a Robust MPEG H Audio Mastering Workflow Using Marquise Technologies’ MIST

The adoption of immersive and object‑based audio formats in broadcast and streaming environments has introduced new technical requirements for audio production and mastering systems. MPEG‑H Audio combines audio objects, scene descriptions, and interactive metadata into a single delivery format that must remain coherent and verifiable throughout the production chain. While content creation tools for immersive audio are now widely available, the downstream processes – assembly, inspection, validation, and preparation for delivery – require equally specialized technical workflows.
Header_MPEG-H_Marquise
Object‑based audio differs fundamentally from channel‑based approaches. In addition to audio signals, mastering systems must handle complex metadata structures that describe object behavior, interactivity, and rendering logic for different playback environments. Ensuring that these elements survive format conversions, packaging steps, and handover to playout or encoding systems is a non‑trivial task. Errors in metadata handling or validation can lead to unintended playback behavior or incompatibilities at distribution endpoints.
This article examines the technical requirements of MPEG‑H Audio mastering and describes how Marquise Technologies’ MIST addresses these challenges. The focus is on workflow design, supported formats, metadata handling, and validation processes required to prepare MPEG‑H Audio deliverables for professional broadcast and streaming applications.

“As immersive audio becomes mainstream, the challenge is no longer just creation – it’s  ensuring that complex audio objects and metadata remain fully intact and verifiable throughout the entire delivery chain,” says Laurence Stoll, Chief Executive Officer of Marquise Technologies.

Mastering Video and MPEG-H Audio
Marquise Technologies’ MIST is a mastering solution designed to simplify the creation and preparation of files containing both video and MPEG-H Audio. It allows operators to assemble video and audio, inspect immersive audio metadata, monitor audio behavior, and generate deliverables ready for playout or encoding.
By combining video and immersive audio assets in a single mastering environment, MIST ensures that object-based audio structures and metadata remain intact throughout the packaging process. The platform supports widely used production formats in professional MPEG-H Audio workflows, including:

  • MPEG-H Production Format (MPF) with metadata as MPEG-H control track
  • BWF/ADM and MXF/S-ADM compliant with the MPEG-H ADM profile

These formats allow audio objects, scene definitions, and interactive metadata to travel consistently across production systems, preserving creative intent while ensuring technical integrity.

Assembling the Master
WorkflowDiagram_Marquise_MIST
A typical mastering workflow begins with assembling the program timeline. In MIST, MPEG-H Audio content is automatically recognized, triggering the integrated MPEG-H Renderer.
At this stage, the system preserves all relevant metadata, including object information and interactive program elements. This approach allows content creators and post-production teams to maintain full control over the structure of the audio experience while preparing deliverables for broadcast or streaming platforms.

Inspecting and Monitoring Immersive Audio
Working with object-based audio requires precise inspection and validation of metadata and interactive structures. MIST provides advanced inspection tools that allow users to visualize:

  • MPEG-H metadata
  • Audio programs
  • Interactive elements available to the end user

This visibility is crucial for verifying the integrity of immersive mixes and ensuring that interactive features behave as intended.
Additionally, the platform offers comprehensive audio monitoring tools, including:

  • Loudness measurement
  • Signal distribution analysis
  • Level monitoring

 

Rendering and Playback
MIST enables synchronized playback of video and MPEG-H Audio content directly from the mastering timeline.
During playback, the audio is processed by the MPEG-H Renderer, reproducing how the immersive mix will be experienced on compatible playback devices.
This allows operators to verify object positioning, scene behavior, and interactive elements, while ensuring synchronization with the video content.
By replicating the final listening environment, teams can confirm both the creative intent and technical integrity of the immersive mix before exporting the final master.

Preparing Delivery Formats
Once the program is assembled and validated, MIST offers a wide range of export options tailored for broadcast and media distribution workflows including:

  • ST-2131 MXF AS-02 with ADM, designed for archiving
  • ST-2127-1 MXF AS-02 MGA with S-ADM, designed for playout systems, encoding workflows, and archiving
  • ISO BMFF QuickTime ProRes with MPEG-H Production Format, suited for content exchange and SDI-based playout systems and encoding workflows

Supporting multiple delivery formats ensures that immersive audio can be integrated seamlessly into existing media pipelines while maintaining compatibility with playout, encoding, and distribution systems.

Quality Control and Workflow Validation
Ensuring the quality and compliance of immersive audio deliverables after encoding is critical. Specialized quality control (QC) tools can be used to analyze and verify the rendered output, allowing users to:

  • Validate object-based and scene-based rendering behavior
  • Confirm that the mix behaves correctly across different listening environments such as stereo, 5.1, 7.1.4, and binaural
  • Detect potential audio issues including clipping, silence, or loudness violations
  • Generate detailed analysis reports in PDF or EBU-QC XML formats
  • Perform side-by-side comparisons between the original sources and the rendered output

Automated validation ensures that immersive audio experiences remain consistent, reliable, and compliant with industry standards across distribution platforms.

Enabling Next-Generation Audio Workflows
As immersive audio technologies expand across broadcast and streaming ecosystems, efficient and transparent workflows become increasingly important.
Solutions like MIST play a pivotal role by providing a centralized environment for assembling, inspecting, and validating video and MPEG-H Audio content.
By supporting industry-standard production formats, integrated monitoring tools, and flexible export options, MIST empowers broadcasters and content creators to confidently deliver high-quality immersive audio experiences for modern media distribution pipelines.
“MIST enables comprehensive quality control of both audio and video assets while ensuring that MPEG-H metadata is fully validated before final delivery. The ability to directly compare the rendered bitstream signal against the original master is a significant advancement, bringing a new level of confidence to the QC process”, says Yannik Grewe, Senior Manager – Media Technologies and Business Development at Fraunhofer IIS.  “MIST is a great addition to the MPEG-H ecosystem, and we are very pleased to have collaborated with Marquise Technologies to bring this capability to life.”
From a system design perspective, such single technical frameworks shift the focus of mastering from file preparation alone to end‑to‑end validation of audio behavior and metadata integrity. As immersive audio moves further into regular production use, this level of technical transparency and verifiability is increasingly relevant to deliver media that suits personal preferences of their target audience while maintaining artistic intent.

Contact

If you have any questions, suggestions, or if you need further information, please do not hesitate to contact us. For more information, please contact
audio-info@iis.fraunhofer.de
 or visit our website 
www.iis.fraunhofer.de/audio.