Publication type: Journal Articles

Abstract: Research on the interaction between movement and music often involves the analysis of multi-track audio, video streams, and sensor data. To facilitate such research, a framework is presented here that allows synchronization of multimodal data. A low-cost approach is proposed to synchronize streams by embedding ambient audio into each data stream. This effectively reduces the synchronization problem to audio-to-audio alignment. As part of the framework, a robust, computationally efficient audio-to-audio alignment algorithm is presented for reliable synchronization of embedded audio streams of varying quality. The algorithm uses audio fingerprinting techniques to measure offsets. It also identifies drift and dropped samples, which makes it possible to find a synchronization solution under such circumstances as well. The framework is evaluated with synthetic signals and a case study, showing millisecond-accurate synchronization.
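
To illustrate the idea of reducing synchronization to audio-to-audio alignment, here is a minimal sketch that estimates the offset between two streams sharing the same ambient audio. It is not the fingerprint-based algorithm described in the article: it swaps in a brute-force cross-correlation, which is simpler but slower, and it does not handle drift or dropped samples. The function name `estimate_offset`, the `max_lag` parameter, the 8000 Hz sample rate, and the synthetic noise signal are all assumptions made for this example.

```python
import random


def estimate_offset(reference, other, max_lag):
    """Return the lag (in samples) at which `other` best matches `reference`.

    A positive lag means `other` started recording later than `reference`,
    so that other[i] lines up with reference[i + lag].
    Hypothetical helper: plain cross-correlation, not audio fingerprinting.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Sum of products over the overlapping part of the two signals.
        score = 0.0
        for i, value in enumerate(other):
            j = i + lag
            if 0 <= j < len(reference):
                score += value * reference[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Synthetic "ambient audio": white noise embedded in both streams.
random.seed(0)
sample_rate = 8000  # Hz, assumed for illustration only
ambient = [random.uniform(-1.0, 1.0) for _ in range(2000)]

true_offset = 137               # second stream starts 137 samples later
stream_a = ambient              # e.g. audio embedded in a video stream
stream_b = ambient[true_offset:]  # e.g. audio embedded in a sensor stream

lag = estimate_offset(stream_a, stream_b, max_lag=300)
offset_ms = 1000.0 * lag / sample_rate
print(f"estimated offset: {lag} samples ({offset_ms:.3f} ms)")
```

Once the offset between the embedded audio tracks is known, the same shift can be applied to the video or sensor data recorded alongside each track. A single global offset like this one is only valid when the streams do not drift relative to each other; the article's algorithm addresses that case separately.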

Cite this article:
@Article{Six2015,
author="Six, Joren
and Leman, Marc",
title="Synchronizing multimodal recordings using audio-to-audio alignment",
journal="Journal on Multimodal User Interfaces",
year="2015",
month="Sep",
day="01",
volume="9",
number="3",
pages="223--229",
issn="1783-8738",
doi="10.1007/s12193-015-0196-1",
url="https://doi.org/10.1007/s12193-015-0196-1"
}
      