A multimodal approach for percussion music transcription from audio and video

Bernardo Marenco, Magdalena Fuentes, Florencia Lanzaro, Martín Rocamora, Alvaro Gómez

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This work proposes a multimodal approach for percussion music transcription from audio and video recordings. It is part of an ongoing research effort to develop tools for computer-aided analysis of Candombe drumming, a popular Afro-rooted rhythm from Uruguay. Several signal processing techniques are applied to automatically extract meaningful information from each source; for the video stream, this involves detecting certain relevant objects in the scene. The location of events is obtained from the audio signal, and this information is used to drive the processing of both modalities. The detected events are then classified by combining the information from each source in a feature-level fusion scheme. The experiments conducted yield promising results that show the advantages of the proposed method.
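The feature-level fusion step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the feature dimensions, the synthetic data, the two class labels, and the nearest-centroid classifier are all assumptions chosen only to show the general idea, namely that per-event feature vectors from each modality are joined into one vector before a single classifier is applied.

```python
import numpy as np

def fuse_features(audio_feats, video_feats):
    """Feature-level (early) fusion: z-score each modality, then
    concatenate the two feature vectors of every event."""
    def zscore(x):
        x = np.asarray(x, dtype=float)
        std = x.std(axis=0)
        std[std == 0] = 1.0  # avoid division by zero for constant features
        return (x - x.mean(axis=0)) / std
    return np.hstack([zscore(audio_feats), zscore(video_feats)])

def fit_centroids(X, y):
    """Compute one mean vector per class (stand-in classifier, an assumption)."""
    labels = np.unique(y)
    centroids = np.vstack([X[y == c].mean(axis=0) for c in labels])
    return labels, centroids

def predict(X, labels, centroids):
    """Assign each event to the class with the nearest centroid."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return labels[dists.argmin(axis=1)]

# Synthetic stand-in data: 40 detected events, two hypothetical stroke classes.
rng = np.random.default_rng(0)
n_events = 40
y = np.repeat(np.array([0, 1]), n_events // 2)
audio = rng.normal(loc=y[:, None] * 5.0, scale=1.0, size=(n_events, 3))  # hypothetical audio features
video = rng.normal(loc=y[:, None] * 5.0, scale=1.0, size=(n_events, 2))  # hypothetical video features

fused = fuse_features(audio, video)          # one 5-dimensional vector per event
labels, centroids = fit_centroids(fused, y)
pred = predict(fused, labels, centroids)
```

In this sketch the events are assumed to have been located beforehand (the abstract obtains them from the audio signal), so each row of `audio` and `video` refers to the same event. Normalizing each modality before concatenation keeps one source from dominating the distance computation simply because of its scale.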

Original language: English (US)
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Editors: Alvaro Pardo, Josef Kittler
Publisher: Springer Verlag
Pages: 92-99
Number of pages: 8
ISBN (Print): 9783319257501
State: Published - 2015
Event: 20th Iberoamerican Congress on Pattern Recognition, CIARP 2015 - Montevideo, Uruguay
Duration: Nov 9 2015 → Nov 12 2015

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9423
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 20th Iberoamerican Congress on Pattern Recognition, CIARP 2015
Country/Territory: Uruguay
City: Montevideo
Period: 11/9/15 → 11/12/15

Keywords

  • Machine learning applications
  • Multimodal signal processing
  • Music transcription
  • Percussion music
  • Sound classification

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
