Publications

Partial lists of my my publications can be found in the research information system of HoGent and UGent. A list of my publications is also available on Google Scholar. Below a more complete list can be found.

Dissertation

Engineering systematic musicology: methods and services for computational and empirical music research
Joren Six
(2018) Phd Dissertation
Author version | Version of record | Further information | BibTeX

Download 'Engineering systematic musicology: methods and services for computational and empirical music research'

Journal Articles

Peak tibial accelerations in different foot strike patterns during level running: an independent investigation in different cohorts
Pieter Van den Berghe, Sander De Bock, Bastiaan Breine, Nicolas Horvais, Allison Gruber, Joren Six, Pierre Samozino, Marc Leman, Jean-Benoît Morin, Dirk De Clercq and Marlène Giandolini
(2024) Sports Biomechanics
Author version | Version of record | Further information | BibTeX

Download 'Peak tibial accelerations in different foot strike patterns during level running: an independent investigation in different cohorts'

Olaf: a lightweight, portable audio search system
Joren Six
(2023) Journal of Open Source Software
Author version | Version of record | Further information | BibTeX

Download 'Olaf: a lightweight, portable audio search system'

Cholinergic-related pupil activity reflects level of emotionality during motor performance
Vidal, M., Onderdijk, K. E., Aguilera, A. M., Six, J., Maes, P.-J., Fritz, T. H., & Leman, M.
(2023) European Journal of Neuroscience,
Author version | Version of record | BibTeX

Download 'Cholinergic-related pupil activity reflects level of emotionality during motor performance'

Panako: a scalable audio search system
Joren Six
(2022) Journal of Open Source Software
Author version | Version of record | Further information | BibTeX

Download 'Panako: a scalable audio search system'

Motor sequence learning in a goal-directed stepping task in persons with multiple sclerosis : a pilot study
Veldkamp, R., Moumddjian, L., Dun, K., Six, J., Vanbeylen, A., Kos, D., & Feys, P.
(2022) Annals of the New York Academy of Science
Author version | Version of record | BibTeX

Download 'Motor sequence learning in a goal-directed stepping task in persons with multiple sclerosis : a pilot study'

Embodied learning in multiple sclerosis using melodic, sound, and visual feedback : a potential rehabilitation approach.
Moumddjian, Lousin, Joren Six, Renee Veldkamp, Jenke Geys, Channa Van Der Linden, Mieke Goetschalckx, Johan Van Nieuwenhoven, Ilse Bosmans, Marc Leman, and Peter Feys
(2022) Annals of the New York Academy of Science
Author version | Version of record | BibTeX

Download 'Embodied learning in multiple sclerosis using melodic, sound, and visual feedback : a potential rehabilitation approach.'

Music-based biofeedback to reduce tibial shock in over-ground running: a proof-of-concept study
Pieter Van den Berghe, Valerio Lorenzoni, Rud Derie, Joren Six, Joeri Gerlo, Marc Leman & Dirk De Clercq
(2021) Scientific Reports
Author version | Version of record | Further information | BibTeX

Download 'Music-based biofeedback to reduce tibial shock in over-ground running: a proof-of-concept study'

The influence of performing gesture type on interpersonal musical timing, and the role of visual contact and tempo
Esther Coorevits, Pieter-Jan Maes, Joren Six, Marc Leman
(2020) Acta Psychologica
Author version | Version of record | BibTeX

Download 'The influence of performing gesture type on interpersonal musical timing, and the role of visual contact and tempo'

Synchronisation sensorimotrice et comportements non verbaux dans la maladie d’Alzheimer : l’influence du contexte social et musical
Matthieu Ghilain, Lise Hobeika, Loris Schiaratura, Micheline Lesaffre, Joren Six, Frank Desmet, Sylvain Clément and Séverine Samson
(2020) Gériatrie et Psychologie Neuropsychiatrie du Vieillissement.
Author version | Version of record | BibTeX

Download 'Synchronisation sensorimotrice et comportements non verbaux dans la maladie d’Alzheimer : l’influence du contexte social et musical'

Timing Markers of Interaction Quality During Semi-Hocket Singing
Alessandro Dell’Anna, Jeska Buhmann, Joren Six, Pieter-Jan Maes and Marc Leman
(2020) Frontiers in Neuroscience
Author version | Version of record | BibTeX

Download 'Timing Markers of Interaction Quality During Semi-Hocket Singing'

Validity and reliability of peak tibial accelerations as real-time measure of impact loading during over-ground rearfoot running at different speeds
Pieter Van den Berghe, Joren Six, Joeri Gerlo, Marc Leman, Dirk De Clercq
(2019) Journal of Biomechanics
Author version | Version of record | Further information | BibTeX

A Case for Reproducibility in MIR: Replication of ‘A Highly Robust Audio Fingerprinting System’
Joren Six, Federica Bressan and Marc Leman
(2018) Transactions of the International Society of Music Information Retrieval
Author version | Version of record | BibTeX

Download 'A Case for Reproducibility in MIR: Replication of ‘A Highly Robust Audio Fingerprinting System’'

Beyond documentation – The digital philology of interaction heritage
Marc Leman, Joren Six
(2018) Journal of New Music Research, Special edition on Digital Philology
Author version | Version of record | BibTeX

Download 'Beyond documentation – The digital philology of interaction heritage'

The SoundBike: musical sonification strategies to enhance cyclists’ spontaneous synchronization to external music
Pieter-Jan Maes, Valerio Lorenzoni and Joren Six
(2018) Journal on Multimodal User Interfaces
Author version | Version of record | BibTeX

Download 'The SoundBike: musical sonification strategies to enhance cyclists’ spontaneous synchronization to external music'

Embodied, Participatory Sense-Making in Digitally-Augmented Music Practices: Theoretical Principles and the Artistic Case “SoundBikes”
Pieter-Jan Maes, Valerio Lorenzoni, Bart Moens, Joren Six, Federica Bressan, Ivan Schepers and Marc Leman
(2018) Critical Arts South-North Cultural and Media Studies
Author version | Version of record | BibTeX

Download 'Embodied, Participatory Sense-Making in Digitally-Augmented Music Practices: Theoretical Principles and the Artistic Case “SoundBikes”'

Adopting a music-to-heart rate alignment strategy to measure the impact of music and music tempo on human heart rate
Edith Van Dyck, Joren Six , Esin Soyer, Marlies Denys, Ilka Bardijn, and Marc Leman
(2017) Musicae Scientiae
Author version | Version of record | BibTeX

Download 'Adopting a music-to-heart rate alignment strategy to measure the impact of music and music tempo on human heart rate'

Acoustical properties in Inhaling Singing: a case-study
Françoise Vanhecke, Mieke Moerman, Frank Desmet, Joren Six, Kristin Daemers, Godfried-Willem Raes, Marc Leman
(2017) Physics in Medicine
Author version | Version of record | BibTeX

Download 'Acoustical properties in Inhaling Singing: a case-study'

Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment
Joren Six and Marc Leman
(2015) Journal on Multimodal User Interfaces
Author version | Version of record | Further information | BibTeX

Download 'Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment'

Evaluation and Recommendation of Pulse and Tempo Annotation in Ethnic Music
Olmo Cornelis, Joren Six, Andre Holzapfel, and Marc Leman
(2013) Journal of New Music Research
Author version | Version of record | Further information | BibTeX

Download 'Evaluation and Recommendation of Pulse and Tempo Annotation in Ethnic Music'

Tarsos, a modular platform for precise pitch analysis of western and non-western music
Joren Six, Olmo Cornelis and Marc Leman
(2013) Journal of New Music Research. 42(2)
Author version | Version of record | Further information | BibTeX

Download 'Tarsos, a modular platform for precise pitch analysis of western and non-western music'

Book Chapters

Duplicate detection for for digital audio archive management: two case studies
Joren Six, Federica Bressan en Koen Renders
(2023) Advances in Speech and Music Technology
Author version | BibTeX

Download 'Duplicate detection for for digital audio archive management: two case studies'

Articles in peer reviewed conference proceedings

DiscStitch: towards audio-to-audio alignment with robustness to playback speed variabilities
Joren Six
(2022) ISMIR 2022 Late Breaking / Demo abstracts
Author version | Version of record | BibTeX

Download 'DiscStitch: towards audio-to-audio alignment with robustness to playback speed variabilities'

Panako 2.0 : updates for an acoustic fingerprinting system
Joren Six
(2022) Late Breaking Demo session of the 22st International Society for Music Information Retrieval Conference - ISMIR 2021
Author version | Version of record | Further information | BibTeX

Download 'Panako 2.0 : updates for an acoustic fingerprinting system'

BAF: an audio fingerprinting dataset for broadcast monitoring
Cortès, G., Ciurana, A., Molina, E., Miron, M., Meyers, O., Six, J., & Serra, X.
(2022) ISMIR 2022
Author version | BibTeX

Download 'BAF: an audio fingerprinting dataset for broadcast monitoring'

OLAF: Overly Lightweight Acoustic Fingerprinting
Joren Six
(2020) ISMIR 2020 Late Breaking / Demo abstracts
Author version | Version of record | Further information | BibTeX

Download 'OLAF: Overly Lightweight Acoustic Fingerprinting'

Automatic comparison of global children’s and adult songs
Shoichiro Sato, Joren Six, Peter Pfordresher, Shinya Fujii and Patrick Savage
(2019) Proceedings of the 9th Folk Music Analysis (FMA) conference
Author version | Version of record | Further information | BibTeX

Download 'Automatic comparison of global children’s and adult songs'

Automatic comparison of human music, speech, and bird song suggests uniqueness of human scales
Jiei Kuroyanagi, Shoichiro Sato, Meng-Jou Ho, Gakuto Chiba, Joren Six, Peter Pfordresher, Adam Tierney, Shinya Fujii and Patrick Savage
(2019) Proceedings of the 9th Folk Music Analysis (FMA) conference
Author version | Version of record | Further information | BibTeX

Download 'Automatic comparison of human music, speech, and bird song suggests uniqueness of human scales'

Automatic analysis of global music recordings suggests scale tuning universals
Meng-Jou Ho, Shoichiro Sato, Jiei Kuroyanagi, Joren Six, Steven Brown, Shinya Fujii, Patrick E Savage
(2018) Extended abstracts for the Late-Breaking Demo Session of the 19th International Society for Music Information
Author version | BibTeX

Download 'Automatic analysis of global music recordings suggests scale tuning universals'

Real-time music-based biofeedback to reduce impact loading during over-ground running
Pieter Van den Berghe, Valerio Lorenzoni, Joeri Gerlo, Bastiaan Breine , Rud Derie, Joren Six, Marc Leman and Dirk De Clercq
(2018) Proceedings on the 42nd American of Biomechanics Congress.
Author version | Further information | BibTeX

Applications of Duplicate Detection in Music Archives: from Metadata Comparison to Storage Optimisation
Joren Six, Federica Bressan and Marc Leman
(2018) Proceedings of the 14th Italian Research Conference on Digital Libraries (IRCDL 2018)
Author version | Version of record | BibTeX

Download 'Applications of Duplicate Detection in Music Archives: from Metadata Comparison to Storage Optimisation'

Applications of duplicate detection: linking meta-data and merging music archives – The experience of the IPEM historical archive of electronic music
Federica Bressan, Joren Six and Marc Leman
(2017) Proceedings of the 4th International Digital Libraries for Musicology workshop (DLfM 2017)
Author version | Version of record | BibTeX

Download 'Applications of duplicate detection: linking meta-data and merging music archives – The experience of the IPEM historical archive of electronic music'

A framework to provide fine-grained time-dependent context for active listening experiences
Joren Six and Marc Leman
(2017) Proceedings of AES Conference on Semantic Audio
Author version | Version of record | Further information | BibTeX

Download 'A framework to provide fine-grained time-dependent context for active listening experiences'

Regularity and asynchrony when tapping to tactile, auditory and combined pulses
Joren Six, Laura Arens, Hade Demoor, Thomas Kint and Marc Leman
(2017) Proceedings of the ESCOM conference
Author version | Further information | BibTeX

Download 'Regularity and asynchrony when tapping to tactile, auditory and combined pulses'

MIRchiving: Challenges and opportunities of connecting MIR research and digital music archives
Reinier de Valk, Anja Volk, Andre Holzapfel, Aggelos Pikrakis, Nadine Kroher, Joren Six
(2017) Proceedings of the 4th International Digital Libraries for Musicology workshop (DLfM 2017)
Author version | Version of record | BibTeX

Download 'MIRchiving: Challenges and opportunities of connecting MIR research and digital music archives'

The Deep History of Music Project
Armand Leroi, Matthias Mauch, Pat Savage, Emmanouil Benetor, Juan Bello, Maria Panteli, Joren Six, Tillman Weyde
(2015) Proceedings of the 5th Folk Music Analysis (FMA) conference
Author version | BibTeX

Download 'The Deep History of Music Project'

TarsosDSP, a Real-Time Audio Processing Framework in Java
Joren Six, Olmo Cornelis and Marc Leman
(2014) Proceedings of the Audio Engineering Society Conference: 53rd International Conference: Semantic Audio
Author version | Version of record | Further information | BibTeX

Download 'TarsosDSP, a Real-Time Audio Processing Framework in Java'

Panako – A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification
Joren Six and Marc Leman
(2014) Proceedings of the 15th ISMIR Conference (ISMIR 2014)
Author version | Version of record | BibTeX

Download 'Panako – A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification'

Peachnote Piano: Making MIDI instruments social and smart using Arduino, Android and Nodejs
Joren Six, Vladimir Viro
(2011) Demo Sessions of the 12th International Society for Music Information Retrieval Conference (ISMIR 2011)
Author version | Further information | BibTeX

Download 'Peachnote Piano: Making MIDI instruments social and smart using Arduino, Android and Nodejs'

Tarsos – a Platform to Explore Pitch Scales in Non-Western and Western Music
Joren Six and Olmo Cornelis
(2011) Proceedings of the 12th International Symposium on Music Information Retrieval
Author version | Version of record | Further information | BibTeX

Download 'Tarsos – a Platform to Explore Pitch Scales in Non-Western and Western Music'

Master's Thesis

Presentations, Discussions Guest Lectures, by Invitation

Panel discussion, 2012: Technological challenges for the computational modelling of the world’s musical heritage, Folk Music Analysis Conference 2012 – FMA 2012, organizers: Polina Proutskova and Emilia Gomez, Seville, Spain

Guest lecture, 2012: Non-western music and digital humanities, for: “Studies in Western Music History: Quantitative and Computational Approaches to Music History”, M.I.T., Boston, U.S.

Guest lecture, 2011: Presenting Tarsos, a software platform for pitch analysis. At: Electrical and Electronics Eng.Dept. IYTE, Izmir, Turkey

Workshop 2017:Computational Ethnomusicology – Methodologies for a new field Leiden, The Netherlands

Experience as Lecturer

A002301 (2016-2017) “Grondslagen van de muzikale acoustica en sonologie” – Theory and Practice sessions together with dr. Pieter-Jan Maes

Other Output

I am recognized as co-inventor on a Patent titled Low impact running WO/2020/002275

For research software see the software output page

~ Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment - In Journal on Multimodal User Interfaces

» By Joren on Thursday 06 August 2015

The article titled “Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment” by Joren Six and Marc Leman has been accepted for publication in the Journal on Multimodal User Interfaces. The article will be published later this year. It describes and tests a method to synchronize data-streams. Below you can find the abstract, pointers to the software under discussion and an author version of the article itself.

Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment
An Application of Acoustic Fingerprinting to Facilitate Music Interaction Research

Abstract: Research on the interaction between movement and music often involves analysis of multi-track audio, video streams and sensor data. To facilitate such research a framework is presented here that allows synchronization of multimodal data. A low cost approach is proposed to synchronize streams by embedding ambient audio into each data-stream. This effectively reduces the synchronization problem to audio-to-audio alignment. As a part of the framework a robust, computationally efficient audio-to-audio alignment algorithm is presented for reliable synchronization of embedded audio streams of varying quality. The algorithm uses audio fingerprinting techniques to measure offsets. It also identifies drift and dropped samples, which makes it possible to find a synchronization solution under such circumstances as well. The framework is evaluated with synthetic signals and a case study, showing millisecond accurate synchronization.

To read the article, consult the author version of Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment. The data-set used in the case study is available here. It contains a recording of balanceboard data, accelerometers, and two webcams that needs to be synchronized. The final publication is available at Springer via 10.1007/s12193-015-0196-1

The algorithm under discussion is included in Panako an audio fingerprinting system but is also available for download here. The SyncSink application has been packaged separately for ease of use.

To use the application start it with double click the downloaded SyncSink JAR-file. Subsequently add various audio or video files using drag and drop. If the same audio is found in the various media files a time-box plot appears, as in the screenshot below. To add corresponding data-files click one of the boxes on the timeline and choose a data file that is synchronized with the audio. The data-file should be a CSV-file. The separator should be ‘,’ and the first column should contain a time-stamp in fractional seconds. After pressing Sync a new CSV-file is created with the first column containing correctly shifted time stamps. If this is done for multiple files, a synchronized sensor-stream is created. Also, ffmpeg commands to synchronize the media files themselves are printed to the command line.

This work was supported by funding by a Methusalem grant from the Flemish Government, Belgium. Special thanks goes to Ivan Schepers for building the balance boards used in the case study. If you want to cite the article, use the following BiBTeX:

@article{six2015multimodal,
  author      = {Joren Six and Marc Leman},
  title       = {{Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment}},
  issn        = {1783-7677},
  volume      = {9},
  number      = {3},
  pages       = {223-229},
  doi         = {10.1007/s12193-015-0196-1},
  journal     = {{Journal of Multimodal User Interfaces}}, 
  publisher   = {Springer Berlin Heidelberg},
  year        = 2015
}

Synchronized streams in Sonic Visualizer. Here you can see two channel audio synchronized with accelerometer data (top, green) and balanceboard data (bottom, purple).
The synchronized data from the two webcams, accelerometer and balanceboard in ELAN. From top to bottom the synchronized streams are two video-streams, balance-board data (red), accelerometer-data (green) and audio (black).
Conceptual drawing used as a basis for the SyncSync application. A reference stream (blue) can be synchronized with streams one and two. It allows a workflow where streams are started and stopped (red) or start before the reference stream (green).
A microcontroller fitted with an electret microphone and a microSD card slot. It can record audio in real-time together with sensor data.
SyncSink Synchronize media files. A user-friendly interface to synchronize media and data files. First a reference media-file is added using drag-and-drop. The audio steam of the reference is extracted and plotted on a timeline as the topmost box. Subsequently other media-files are added. The offsets with respect to the reference are calculated and plotted. CSV-files with timestamps and data recorded in sync with a stream can be attached to a respective audio stream. Finally, after pressing Sync!, the data and media files are modified to be exactly in sync with the reference.
Multimodal recording system diagram. Each webcam has a microphone and is connected to the pc via USB. The dashed arrows represent analog signals. The balance board has four analog sensors but these are simplified to one connection in the schematic. The analog output of the microphones is also recorded through the DAQ. An analog accelerometer is connected with a microcontroller which also records audio.
Two streams of audio with fingerprints marked. Some fingerprints are present in both streams (green, O) while others are not (red, x). Matching fingerprints have the same offset, indicated by the dotted lines.

Research papers Attachments

2015.synchronized-recording.pdf, SyncSink-1.4.jar, and SyncDataset.zip

~ Audio Fingerprinting - Opportunities for digital musicology

» By Joren on Tuesday 25 November 2014

The 27th of November, 2014 a lecture on audio fingerprinting and its applications for digital musicology will be given at IPEM. The lecture introduces audio fingerprinting, explains an audio fingerprinting technique and then goes on to explain how such algorithm offers opportunities for large scale digital musicological applications. Here you can download the slides about audio fingerprinting and its opportunities for digital musicology.

With the explained audio fingerprinting technique a specific form of very reliable musical structure analysis can be done. Below, in the figure section, an example of repetitive structure in the song Ribs Out is shown. Another example is comparing edits or versions of songs. Below, also in the figure section, the radio edit of Daft Punk’s Get Lucky is compared with the original version. Audio synchronization using fingerprinting is another application that is actively used in the field of digital musicology to align audio with extracted features.

Since acoustic fingerprinting makes structure analysis very efficiently it can be applied on a large scale (20k songs). The figure below shows that identical repetition is something that has been used more and more since the mid 1970’s. The trend probably aligns with the amount of technical knowledge needed to ‘copy and paste’ a snippet of music.

Fig: How much identical repetition is used in music, over the years.

The Panako audio fingerprinting system was used to generate data for these case studies. The lecture and this post are partly inspired by a blog post by Paul Brossier.

Radio edit vs. original of Daft Punk's Get Lucky
Spectral peak Acoustic fingerprinting system
Structure in Ribs Out
How much identical repetition is used in a set of 20k songs.

Presentation Attachments

2014.11.27.aucoustic_fingerprinting.pdf, repetition_over_time.ods, music_structure_analysis_ribs_out.ods, and get_lucky_radio_vs_original.ods

~ ISMIR 2014 - Panako - A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification

» By Joren on Monday 27 October 2014

At ISMIR 2014 i will present a paper on a fingerprinting system. ISMIR is the annual conference of the International Society for Music Information Retrieval is the world’s leading interdisciplinary forum on accessing, analyzing, and organizing digital music of all sorts. This years instalment takes place in Taipei, Taiwan. My contribution is a paper titled Panako – A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification, it will be presented during a poster session the 27th of October.

This paper presents a scalable granular acoustic fingerprinting system. An acoustic fingerprinting system uses condensed representation of audio signals, acoustic fingerprints, to identify short audio fragments in large audio databases. A robust fingerprinting system generates similar fingerprints for perceptually similar audio signals. The system presented here is designed to handle time-scale and pitch modifications. The open source implementation of the system is called Panako and is evaluated on commodity hardware using a freely available reference database with fingerprints of over 30,000 songs. The results show that the system responds quickly and reliably on queries, while handling time-scale and pitch modifications of up to ten percent.

The system is also shown to handle GSM-compression, several audio effects and band-pass filtering. After a query, the system returns the start time in the reference audio and how much the query has been pitch-shifted or time-stretched with respect to the reference audio. The design of the system that offers this combination of features is the main contribution of this paper.

The system is available, together with documentation and information on how to reproduce the results from the ISMIR paper, on the Panako website. Also available for download is the Panako poster, Panako ISMIR paper and the Panako poster.

Fingerprint and modifications
General fingerprinter
Results after time stretching
Results after time scale modification
Results after pitch shifting

Research papers and Presentation Attachments

panako_poster_portrait.svg, panako_poster.pdf, ismir_2014_panako_fingerprinter.pdf, and panako_poster.png

~ TarsosDSP Paper and Presentation at AES 53rd International conference on Semantic Audio

» By Joren on Tuesday 24 December 2013

TarsosDSP will be presented at the AES 53rd International conference on Semantic Audio in London . During the conference both a presentation and demonstration of the paper TarsosDSP, a Real-Time Audio Processing Framework in Java, by Joren Six, Olmo Cornelis and Marc Leman, in Proceedings of the 53rd AES Conference (AES 53rd), 2014. From their website:

Semantic Audio is concerned with content-based management of digital audio recordings. The rapid evolution of digital audio technologies, e.g. audio data compression and streaming, the availability of large audio libraries online and offline, and recent developments in content-based audio retrieval have significantly changed the way digital audio is created, processed, and consumed. New audio content can be produced at lower cost, while also large audio archives at libraries or record labels are opening to the public. Thus the sheer amount of available audio data grows more and more each day. Semantic analysis of audio resulting in high-level metadata descriptors such as musical chords and tempo, or the identification of speakers facilitate content-based management of audio recordings. Aside from audio retrieval and recommendation technologies, the semantics of audio signals are also becoming increasingly important, for instance, in object-based audio coding, as well as intelligent audio editing, and processing. Recent product releases already demonstrate this to a great extent, however, more innovative functionalities relying on semantic audio analysis and management are imminent. These functionalities may utilise, for instance, (informed) audio source separation, speaker segmentation and identification, structural music segmentation, or social and Semantic Web technologies, including ontologies and linked open data.

This conference will give a broad overview of the state of the art and address many of the new scientific disciplines involved in this still-emerging field. Our purpose is to continue fostering this line of interdisciplinary research. This is reflected by the wide variety of invited speakers presenting at the conference.

The paper presents TarsosDSP, a framework for real-time audio analysis and processing. Most libraries and frameworks offer either audio analysis and feature extraction or audio synthesis and processing. TarsosDSP is one of a only a few frameworks that offers both analysis, processing and feature extraction in real-time, a unique feature in the Java ecosystem. The framework contains practical audio processing algorithms, it can be extended easily, and has no external dependencies. Each algorithm is implemented as simple as possible thanks to a straightforward processing pipeline. TarsosDSP’s features include a resampling algorithm, onset detectors, a number of pitch estimation algorithms, a time stretch algorithm, a pitch shifting algorithm, and an algorithm to calculate the Constant-Q. The framework also allows simple audio synthesis, some audio effects, and several filters. The Open Source framework is a valuable contribution to the MIR-Community and ideal fit for interactive MIR-applications on Android. The full paper can be downloaded TarsosDSP, a Real-Time Audio Processing Framework in Java

A BibTeX entry for the paper can be found below.


  1
2
3
4
5
6

  @inproceedings{six2014tarsosdsp,
  author      = {Joren Six and Olmo Cornelis and Marc Leman},
  title       = {{TarsosDSP, a Real-Time Audio Processing Framework in Java}},
  booktitle   = {{Proceedings of the 53rd AES Conference (AES 53rd)}}, 
  year        =  2014
}

Constant-Q
AES53
Samping
Pitch Shifting
Flanger

Research papers and Presentation Attachments

aes53_tarsos_dsp.pdf

~ Evaluation and Recommendation of Pulse and Tempo Annotation in Ethnic Music - In Journal Of New Music Research

» By Joren on Wednesday 09 October 2013

The journal paper Evaluation and Recommendation of Pulse and Tempo Annotation in Ethnic Music – In Journal Of New Music Research by Cornelis, Six, Holzapfel and Leman was published in a special issue about Computational Ethnomusicology of the Journal of New Music Research on the 20th of august 2013. Below you can find the abstract for the article, and the full text author version of the article itself.

Abstract: Large digital archives of ethnic music require automatic tools to provide musical content descriptions. While various automatic approaches are available, they are to a wide extent developed for Western popular music. This paper aims to analyze how automated tempo estimation approaches perform in the context of Central-African music. To this end we collect human beat annotations for a set of musical fragments, and compare them with automatic beat tracking sequences. We first analyze the tempo estimations derived from annotations and beat tracking results. Then we examine an approach, based on mutual agreement between automatic and human annotations, to automate such analysis, which can serve to detect musical fragments with high tempo ambiguity.

To read the full text you can either download Evaluation and Recommendation of Pulse ant Tempo Annotation in Ethnic Music, Author version. Or obtain the published version of Evaluation and Recommendation of Pulse ant Tempo Annotation in Ethnic Music, published version

Below the BibTex entry for the article is embedded.


  1
2
3
4
5
6
7
8
9
10

  @article{cornelis2013tempo_jnmr,
  author = {Olmo Cornelis, Joren Six, Andre Holzapfel, and Marc Leman},
  title = {{Evaluation and Recommendation of Pulse ant Tempo Annotation in Ethnic Music}},
  journal = {{Journal of New Music Research}},
  volume = {42},
  number = {2},
  pages = {131-149},
  year = {2013},
  doi = {10.1080/09298215.2013.812123}
}

Research papers Attachments

jnmr.cover.jpg and 2013.10.09.Tempo_annotation_in_ethnic_music-JNMR-Author_Version.pdf

~ Tarsos, a Modular Platform for Precise Pitch Analysis of Western and Non-Western Music - In Journal Of New Music Research

» By Joren on Thursday 22 August 2013

The journal paper Tarsos, a Modular Platform for Precise Pitch Analysis of Western and Non-Western Music by Six, Cornelis, and Leman was published in a special issue about Computational Ethnomusicology of the Journal of New Music Research on the 20th of august 2013. Below you can find the abstract for the article, and pointers to audio examples, the Tarsos software, and the author version of the article itself.

Abstract: This paper presents Tarsos, a modular software platform used to extract and analyze pitch organization in music. With Tarsos pitch estimations are generated from an audio signal and those estimations are processed in order to form musicologically meaningful representations. Tarsos aims to offer a flexible system for pitch analysis through the combination of an interactive user interface, several pitch estimation algorithms, filtering options, immediate auditory feedback and data output modalities for every step. To study the most frequently used pitches, a fine-grained histogram that allows up to 1200 values per octave is constructed. This allows Tarsos to analyze deviations in Western music, or to analyze specific tone scales that differ from the 12 tone equal temperament, common in many non-Western musics. Tarsos has a graphical user interface or can be launched using an API – as a batch script. Therefore, it is fit for both the analysis of individual songs and the analysis of large music corpora. The interface allows several visual representations, and can indicate the scale of the piece under analysis. The extracted scale can be used immediately to tune a MIDI keyboard that can be played in the discovered scale. These features make Tarsos an interesting tool that can be used for musicological analysis, teaching and even artistic productions.

To read the full text you can either download Tarsos, a Modular Platform for Precise Pitch Analysis of Western and Non-Western Music, Author version. Or obtain the published version of Tarsos, a Modular Platform for Precise Pitch Analysis of Western and Non-Western Music, published version

Ladrang Kandamanyura (slendro pathet manyura), is the name of the piece used in the article throughout section 2. The album on which the piece can be found is available at wergo. Below a thirty second fragment is embedded. You can also download the thirty second fragment to analyse it yourself.

Below the BibTex entry for the article is embedded.


  1
2
3
4
5
6
7
8
9
10
11
12

  @article{six2013tarsos_jnmr,
  author = {Six, Joren and Cornelis, Olmo and Leman, Marc},
  title = {Tarsos, a Modular Platform for Precise Pitch Analysis 
            of Western and Non-Western Music},
  journal = {Journal of New Music Research},
  volume = {42},
  number = {2},
  pages = {113-129},
  year = {2013},
  doi = {10.1080/09298215.2013.797999},
 URL = {http://www.tandfonline.com/doi/abs/10.1080/09298215.2013.797999}
}

Research papers Attachments

2013.08.20.tarsos_jnmr_author_version.pdf, jnmr.cover.jpg, and 08._Ladrang_Kandamanyura_10s-20s_up.wav

~ FMA 2013 - Computer Assisted Transcripton of Ethnic Music

» By Joren on Friday 14 June 2013

At the third international workshop on Folk Music Analysis we presented a poster titled Computer Assisted Transcription of Ethnic Music]. The workshop took place in Amsterdam, Netherlands, June 6 and 7, 2013.

In the extended abstract, also titled Computer Assisted Transcription of Ethnic Music, it is described how the Tarsos software program now has features aiding transcription. Tarsos is especially practical for ethnic music of which the tone scale is not known beforehand. The proceedings of FMA 2013 are available as well.

During the conference there also was an interesting panel on transcription. The following people participated: John Ashley Burgoyne, moderator (University of Amsterdam), Kofi Agawu (Princeton University), Dániel P. Biró (University of Victoria), Olmo Cornelis (University College Ghent, Belgium), Emilia Gómez (Universitat Pompeu Fabra, Barcelona), and Barbara Titus (Utrecht University). Some pictures can be found below.

Research papers Attachments

IMG_20130607_152620.jpg, FMA_2013.computer_assisted_transcription.pdf, A0-Poster.pdf, FMA_2013.computer_assisted_transcription.pdf, A0-Poster.jpg, and IMG_20130607_140442.jpg

~ CIM 2012 - Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means

» By Joren on Friday 31 August 2012

Logo Universiteit Utrecht What follows is about the Conference on Interdisciplinary Musicology and the 15th international Conference of the Gesellschaft fur Musikfoschung. First this text will give information about our contribution to CIM2012: Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means and then a number of highlights of the conference follow. The joint conference took place from the 4th to the 8th of september 2012.

In 2012, CIM will tackle the subject of History. Hosted by the University of Göttingen, whose one time music director Johann Nikolaus Forkel is widely regarded as one of the founders of modern music historiography, CIM12 aims to promote collaborations that provoke and explore new methods and methodologies for establishing, evaluating, preserving and communicating knowledge of music and musical practices of past societies and the factors implicated in both the preservation and transformation of such practices over time.

Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means

Our contribution ton CIM 2012 is titled Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means. The aim was to show how tone scales of the past, e.g. organ tuning, can be extracted and sonified. During the demo special attention was given to historic Central African tuning systems. The presentation I gave is included below and or available for download

Highlights

What follows are some personal highlights for the Conference on Interdisciplinary Musicology and the 15th international Conference of the Gesellschaft fur Musikfoschung. The joint conference took place from the 4th to the 8th of september 2012.

The work presented by Rytis Ambrazevicius et al. Modal changes in traditional Lithuanian singing: Diachronic aspect has a lot in common with our research, it was interesting to see their approach. Another highlight of the conference was the whole session organized by Klaus-Peter Brenner around Mbira music.

Rainer Polak gave a talk titled ‘Swing, Groove and Metre. Asymmetric Feels, Metric Ambiguity and Metric Transformation in African Musics’. He showed how research about rhythm in jazz research, music theory and empirical musicology ( amongst others) could be bridged and applied to ethnic music.

The overview Eleanore Selfridge-Field gave during her talk Between an Analogue Past and a Digital Future: The Evolving Digital Present was refreshing. She had a really clear view on all the different ways musicology and digital media can benifit from each-other.

From the concert programme I found two especially interesting: the lecture-performance by Margarete Maierhofer-Lischka and Frauke Aulbert of Lotofagos, a piece by Beat Furrer and Burdocks composed and performed by Christian Wolff and a bunch of enthusiastic students.

Research papers and Presentation Attachments

CIM12_Submission.pdf, Forkeljpg.jpg, and 2012.09.05-Revealing_and_listening_to_scales_from_the_past__tone_scale_analysis_of_archived_Central-African_music_using_computational_means..ppt

~ ICMC 2012 - Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition

» By Joren on Friday 31 August 2012

Logo Universiteit Utrecht At this years ICMC Conference, ICMC 2012 we presented a paper describing a way to experiment with tone scales and how to use Tarsos as a compositional tool. What follows are some pointers to the presentation, paper and to other interesting talks that were presented there.

ICMC 2012 was organized in Ljubljana from the 9 to 14 septembre and had a very dense program of talks, posters, presentations, demos and concerts.

Since 1974 the International Computer Music Conference has been the major international forum for the presentation of the full range of outcomes from technical and musical research, both musical and theoretical, related to the use of computers in music. This annual conference regularly travels the globe, with recent conferences in the Americas, Europe and Asia. This year we welcome the conference to Slovenia for the first time.

Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition

Our contribution to the conference was a paper titled Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition.

If you want to cite our work, this BibTeX entry is included for your convenience:


  1
2
3
4
5
6
7
8

  @inproceedings{cornelis2012sound_to_scale,
  author     = {Olmo Cornelis and Joren Six},
  title      = {{Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition}},
  booktitle  = {{Proceedings of the 2012 International Computer Music Conference,
               (ICMC 2012)}},
  year       = {2012},
  publisher = {The International Computer Music Association}
}

Program highlights

What follows are a number of pointers to my personal program highlights.

Verena Thomas presented two very well polished software tools. One to detect patterns in scores, called motifviewer and a tool to search in score databases in a multi-modal way. The Probado tool does score-to-audio alignment and much more.

Gibber is an impressive live-coding environment with an easy syntax. Since it is all done with javascript you can start playing with it immediately. Overtone Another live-coding environment, presented at the conference by Sam Aaron, was equally impressive. It is programmed using the Closure language.

At ICMC there were a number of tools to assist in composition. One of those is The Bach Project, by Andrea Agostini. Togheter with CatART by Diemo Swartz it forms a very expressive platform to work with sound, which was demonstrated by Aaron Einbond and Christopher Trapani in their paper titled Precise Pitch Control In Real Time Corpus-Based Concatenative Synthesis. Diemo Swartz presented work on Audio Mosaicing, it can be seen as a follow-up to AuidioGuild by Ben Hackbarth.

I also got to know the work by Thomas Grill, on his website a nice piece of software can be found a Python implementation of the Non Stationary Gabor Transform. Another software system I got to know is the functional signal processing programming language FAUST

My personal highlights of the concert programme include the works by Johannes Kreidler, Aura Pon, Daniel Mayer, Alexander Schubert and the remarkable performance by Dexter Ford. The concept behind Soundlog by Johannes Kretz was also interesting.

Presentation Attachments

ICMC_Logo.png, 2012.09.08-Sound_to_Scale_to_Sound__a_Setup_for_Microtonal_Exploration_and_Composition.odp, and icmc2012_submission_45.pdf

~ Analytical Approaches To World Music - Microtonal Scale Exploration in Central Africa

» By Joren on Wednesday 06 June 2012

At the 2012 AAWM conference we presented a way to explore tone scales in the music of Central Africa. Since the audience consisted of (ethno)musicologists, the main focus of the presentation was on the applicication part, the technical aspects were only briefly mentioned.

The extended abstract can be consulted: Towards the tangible: microtonal scale exploration in Central-African music

The conference program itself was very diverse and interesting.

Presentation Attachments

2012.05.11-Towards_the_Tangible_-_Microtonal_Scale_Exploration_in_Central-African_Music.odp and AAWM_abstract_short.doc

~ Guest Lecture at MIT - Ethnic Music Analysis: Challenges & Opportunities - Tarsos as a Case Study

» By Joren on Monday 07 May 2012

Thursday the 3th of May I gave a guest lecture titled ‘Ethnic Music Analysis: Challenges & Opportunities’ it featured Tarsos as a Case Study. The goal was to identify the difficulties when dealing with ethnic music and to show a possible approach, the approach implemented by Tarsos.

The invitation to give the guest lecture came from Michael Cuthbert who is one of the driving forces behind music21. The audience was a small group of double majors in both musicology and computer science: the ideal profile to gather useful feedback.

HoGent

2012.05.03-Ethnic_Music_Analysis.pdf and 2012.05.03-Ethnic_Music_Analysis.odp

Publications

Dissertation

Journal Articles

Book Chapters

Articles in peer reviewed conference proceedings

Master's Thesis

Presentations, Discussions Guest Lectures, by Invitation

Experience as Lecturer

Other Output

~ Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment - In Journal on Multimodal User Interfaces

Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment
An Application of Acoustic Fingerprinting to Facilitate Music Interaction Research

~ Audio Fingerprinting - Opportunities for digital musicology

~ ISMIR 2014 - Panako - A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification

~ TarsosDSP Paper and Presentation at AES 53rd International conference on Semantic Audio

~ Evaluation and Recommendation of Pulse and Tempo Annotation in Ethnic Music - In Journal Of New Music Research

~ Tarsos, a Modular Platform for Precise Pitch Analysis of Western and Non-Western Music - In Journal Of New Music Research

~ FMA 2013 - Computer Assisted Transcripton of Ethnic Music

~ CIM 2012 - Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means

Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means

Highlights

~ ICMC 2012 - Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition

Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition

Program highlights

~ Analytical Approaches To World Music - Microtonal Scale Exploration in Central Africa

~ Guest Lecture at MIT - Ethnic Music Analysis: Challenges & Opportunities - Tarsos as a Case Study

Previous blog posts

12-12-2011 ~ Kinderuniversiteit - Muziek onder de microscoop!

02-12-2011 ~ Software for Music Analysis

25-10-2011 ~ Tarsos at 'Study Day: Tuning and Temperament - Insitute of Musical Research, London'

25-10-2011 ~ Tarsos presentation at 'ISMIR 2011'

18-10-2011 ~ Tarsos at 'WASPAA 2011'

04-10-2011 ~ Bruikbare software voor muziekanalyse

26-09-2011 ~ PeachNote Piano at the ISMIR 2011 demo session

21-09-2011 ~ Simplify Collaboration on a LaTeX Documents with Dropbox and a Build Server

08-09-2011 ~ PeachNote Piano

22-08-2011 ~ Tarsos at 'ISMIR 2011'

27-05-2011 ~ Tarsos at 'IPEM Open House'

Publications

Dissertation

Journal Articles

Book Chapters

Articles in peer reviewed conference proceedings

Master's Thesis

Presentations, Discussions Guest Lectures, by Invitation

Experience as Lecturer

Other Output

Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment An Application of Acoustic Fingerprinting to Facilitate Music Interaction Research

Revealing and Listening to Scales From the Past; Tone Scale Analysis of Archived Central-African Music Using Computational Means

Highlights

Sound to Scale to Sound, a Setup for Microtonal Exploration and Composition

Program highlights

Previous blog posts

Synchronizing Multimodal Recordings Using Audio-To-Audio Alignment
An Application of Acoustic Fingerprinting to Facilitate Music Interaction Research