Paper on multi-modal music classification accepted for ISMIR

I. Vatolkin and C. McKay: Stability of Symbolic Feature Group Importance in the Context of Multi-Modal Music Classification. Accepted for the Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR).

Abstract: Multi-modal music classification creates supervised models trained on features from different sources (modalities): the audio signal, the score, lyrics, album covers, expert tags, etc. A concept of “multi-group feature importance” not only helps to measure the individual relevance of features belonging to a feature type under investigation (such as the instruments present in a piece), but also serves to quantify the potential for further improving classification quality by adding features from other feature types or extracted from different kinds of sources, based on a multi-objective analysis of feature sets after evolutionary feature selection. In this study, we investigate the stability of feature group importance when different classification methods and different measures of classification quality are applied. Since musical scores are particularly helpful in deriving semantically meaningful, robust genre characteristics, we focus on the feature groups analyzed by the jSymbolic feature extraction software, which describe properties associated with instrumentation, basic pitch statistics, melody, chords, tempo, and other rhythmic aspects. These symbolic features are analyzed in the context of musical information drawn from five other modalities, and experiments are conducted involving two datasets, one small and one large. The results show that, although some feature groups can remain similarly important compared to others, differences can also be evident in various application cases, and can depend on the particular classifier and evaluation measure being used. Insights drawn from this type of analysis can potentially be helpful in effectively matching specific features or feature groups to particular classifiers and evaluation measures in future feature-based MIR research.
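
The formal definition of multi-group feature importance is given in the paper itself; as a rough illustration of the underlying hypervolume idea, the following Python sketch computes the 2-D hypervolume of a non-dominated front of (classification error, fraction of selected features) points and reads a feature group's importance as the hypervolume lost when that group is excluded from the search. The trade-off points, the reference point, and this simplified reading of the criterion are illustrative assumptions, not values or definitions taken from the paper.

```python
import numpy as np

def nondominated(points):
    """Keep only the non-dominated 2-D points (both objectives are minimized)."""
    pts = np.asarray(points, dtype=float)
    keep = []
    for p in pts:
        is_dominated = np.any(np.all(pts <= p, axis=1) & np.any(pts < p, axis=1))
        if not is_dominated:
            keep.append(p)
    return np.unique(np.array(keep), axis=0)

def hypervolume_2d(points, ref):
    """Area dominated by the non-dominated front with respect to a reference point."""
    front = nondominated(points)
    front = front[np.argsort(front[:, 0])]  # sort by the first objective
    hv, prev_y = 0.0, ref[1]
    for x, y in front:
        hv += (ref[0] - x) * (prev_y - y)
        prev_y = y
    return hv

# Made-up trade-off points (classification error, fraction of selected features),
# e.g. as collected from two evolutionary feature selection runs.
ref = np.array([1.0, 1.0])
front_all_groups = np.array([[0.12, 0.40], [0.15, 0.25], [0.22, 0.10]])
front_without_group = np.array([[0.18, 0.40], [0.21, 0.25], [0.30, 0.10]])

# Illustrative notion of group importance: hypervolume lost when the group is removed.
importance = hypervolume_2d(front_all_groups, ref) - hypervolume_2d(front_without_group, ref)
print(f"Estimated importance of the feature group: {importance:.3f}")
```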


SIGMA #53

The program of the upcoming 53rd SIGMA meeting, which takes place online on 30.06.2022, 14:00-16:00 (please send an email to igor.vatolkin [at] udo.edu if you wish to receive the Zoom link):

14:00-14:05 Welcome

14:05-14:35 Conference study
Mark Gotham: What if the 'when' implies the 'what'?

14:35-15:10 Master’s thesis (introduction)
Alexander Ostrop: Generation of Orchestra Pieces with Transformer Models

15:10-15:40 Research studies
Hauke Egermann: Predicting Listener Experience in Functional Music Settings: Research at the Intersection between Music Psychology and Computer Science 

15:40-16:00 Conferences and calls, miscellaneous, next meeting


SIGMA #52

The program of the upcoming 52nd SIGMA meeting, which takes place online on 31.03.2022, 14:00-16:10 (please send an email to igor.vatolkin [at] udo.edu if you wish to receive the Zoom link):

14:00-14:05 Welcome

14:05-14:35 Bachelor’s thesis (results)
Marcel Schrauder: Music genre classification with artificial neural networks

14:35-15:05 Master’s thesis (results)
Pia Eickhoff: Extended replication study on tone consolidation following Carl Stumpf

15:05-15:35 Master’s thesis (results)
Florian Scholz: Inclusion of different instrument bodies for robust training of neural networks for instrument recognition

15:35-16:05 Research plan for a PhD thesis
Fabian Ostermann: Artificial Intelligence as a tool for music composers

16:05-16:10 Conferences and calls, miscellaneous, next meeting


Paper on multi-modal music classification using six modalities published in TISMIR

I. Vatolkin and C. McKay: Multi-Objective Investigation of Six Feature Source Types for Multi-Modal Music Classification. Transactions of the International Society for Music Information Retrieval, 5(1), pp. 1–19, 2022.

Abstract: Every type of musical data (audio, symbolic, lyrics, etc.) has its limitations, and cannot always capture all relevant properties of a particular musical category. In contrast to more typical MIR setups where supervised classification models are trained on only one or two types of data, we propose a more diversified approach to music classification and analysis based on six modalities: audio signals, semantic tags inferred from the audio, symbolic MIDI representations, album cover images, playlist co-occurrences, and lyric texts. Some of the descriptors we extract from these data are low-level, while others encapsulate interpretable semantic knowledge that describes melodic, rhythmic, instrumental, and other properties of music. With the intent of measuring the individual impact of different feature groups on different categories, we propose two evaluation criteria based on “non-dominated hypervolumes”: multi-group feature “importance” and “redundancy”. Both of these are calculated after the application of a multi-objective feature selection strategy using evolutionary algorithms, with a novel approach to optimizing trade-offs between both “pure” and “mixed” feature subsets. These techniques permit an exploration of how different modalities and feature types contribute to class discrimination. We use genre classification as a sample research domain to which these techniques can be applied, and present exploratory experiments on two disjoint datasets of different sizes, involving three genre ontologies of varied class similarity. Our results highlight the potential of combining features extracted from different modalities, and can provide insight on the relative significance of different modalities and features in different contexts.
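
As a rough sketch of the kind of evolutionary multi-objective feature selection described above, the following Python example (NumPy and scikit-learn on synthetic data) evolves binary feature masks against two minimization objectives, cross-validated classification error and the fraction of selected features, and keeps a small Pareto archive of non-dominated subsets. The dataset, classifier, mutation rate, and evaluation budget are placeholders and do not reflect the paper's actual setup.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)

# Toy stand-in for a multi-modal feature matrix (e.g. columns from audio, symbolic, lyrics, ...).
X, y = make_classification(n_samples=300, n_features=40, n_informative=10, random_state=0)

def objectives(mask):
    """Two minimization objectives: cross-validated error and fraction of selected features."""
    if not mask.any():
        return np.array([1.0, 0.0])
    clf = RandomForestClassifier(n_estimators=50, random_state=0)
    accuracy = cross_val_score(clf, X[:, mask], y, cv=3).mean()
    return np.array([1.0 - accuracy, mask.mean()])

def dominates(a, b):
    """True if objective vector a dominates objective vector b."""
    return np.all(a <= b) and np.any(a < b)

# Minimal (1+1)-style evolutionary search that keeps a Pareto archive of feature subsets.
archive = []
mask = rng.random(X.shape[1]) < 0.5
for _ in range(30):
    child = mask.copy()
    flips = rng.random(child.size) < 0.1  # bit-flip mutation
    child[flips] = ~child[flips]
    f = objectives(child)
    if not any(dominates(g, f) for _, g in archive):
        archive = [(m, g) for m, g in archive if not dominates(f, g)]
        archive.append((child.copy(), f))
        mask = child
for m, f in archive:
    print(f"error={f[0]:.3f}  feature fraction={f[1]:.2f}  n_selected={int(m.sum())}")
```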


SIGMA #51

The program of the upcoming 51st SIGMA meeting, which takes place online on 10.01.2022, 10:00-12:00 (please send an email to igor.vatolkin [at] udo.edu if you wish to receive the Zoom link):

10:00-10:05 Welcome

10:05-10:35 Conference study
Johannes Gauer: Can spectral complexity reduction improve music perception in cochlear implant users?

10:35-11:05 Conference study
Igor Vatolkin: Music categorization with zygons and semantic features

11:05-11:45 Master’s thesis (results)
Philipp Ginsel: Distance measures for evolutionary approximation of audio data

11:45-12:00 Ongoing teaching courses, conferences and calls, miscellaneous, next meeting


Paper on EAR Drummer accepted for TISMIR special collection on AI and Musical Creativity

F. Ostermann, I. Vatolkin, and G. Rudolph: Evaluating Creativity in Automatic Reactive Accompaniment of Jazz Improvisation. Accepted for the Transactions of the International Society for Music Information Retrieval (TISMIR), special collection on AI and Musical Creativity.

Abstract: Music generating computer programs can support jazz musicians and students during performance and practice, for instance by providing accompaniment for solo improvisation. However, such software typically plays sequences of static precomposed snippets and does not react to the user. In that context, it is hardly possible to determine whether such a system has any of its own creative powers. Within the scope of a user study with 20 participants, we evaluate and compare the mobile application iReal Pro to our own system, the evolutionary automatic and reactive system called ‘EAR Drummer’ that generates drum patterns as accompaniment to jazz solo improvisation. It adapts its behaviour in real-time by heuristic rules based on music properties derived from the user’s melodies. The user-based evaluation is performed by following the standardised procedure for evaluating creative systems (SPECS). The analysis of the results is based on a Linear Mixed Effects Model to consider fixed and random effects on the survey data. The model reveals that our system outperforms iReal Pro in all of SPECS’s partial components of creativity and significantly outperforms it for 7 of those 14 components including variety, originality, emotional involvement, and social interaction. Further, it is characterised as “better” and “more interesting” in the user survey. A conflicting observation is that while 70% of the study participants tend to prefer our more “creative” system as support for stage performances, only 40% find it more suitable for practice. Further analysis addresses differences between user groups defined by their played instrument, age, and musical experience.
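
The statistical analysis can be illustrated with a minimal statsmodels sketch: a linear mixed effects model with the accompaniment system as a fixed effect and a random intercept per participant. The data below are synthetic placeholders standing in for SPECS-style ratings from 20 participants on 14 creativity components; neither the numbers nor the exact model specification are taken from the study.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)

# Synthetic stand-in: every participant rates both systems on each of the 14 SPECS components.
participants = np.repeat(np.arange(20), 2 * 14)
systems = np.tile(np.repeat(["iRealPro", "EARDrummer"], 14), 20)
components = np.tile(np.arange(14), 2 * 20)
ratings = rng.normal(loc=np.where(systems == "EARDrummer", 3.5, 3.0), scale=1.0)

df = pd.DataFrame({"participant": participants, "system": systems,
                   "component": components, "rating": ratings})

# Fixed effect of the system, random intercept per participant.
model = smf.mixedlm("rating ~ system", df, groups=df["participant"])
result = model.fit()
print(result.summary())
```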


Job offer in computational music theory

A PhD/PostDoc position is available within a new lab for computational approaches to music theory, analysis, and composition at the Institute of Music and Musicology, TU Dortmund.

Please see the full description (in German).


Paper on multi-modal music classification accepted for Entropy

The following paper was accepted for Entropy:

B. Wilkes, I. Vatolkin, and H. Müller: Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition


Job offer for a student assistant

A student assistant position (8 hours per week) is available at the Chair of Algorithm Engineering, Department of Computer Science, TU Dortmund, to assist with the software project "Music Informatics". Please see the full description (in German).


Job offers at Dortmund Systematic Musicology Lab

Two new positions are available in the newly established Dortmund Systematic Musicology Lab, Technische Universität Dortmund, Germany:

  1. Postdoctoral Research Fellow in Systematic Musicology / Music Psychology / Music Cognition, 100%, E-14, for 5 years
    http://www.egermann.net/?page_id=550
    Deadline: 18 October 2021.
  2. PhD candidate in Systematic Musicology / Music Psychology / Music Cognition, 50%, E-13, for 3 years
    http://www.egermann.net/?page_id=553
    Deadline: 18 October 2021.

For more information, please contact Prof. Dr. Hauke Egermann.
