papers | Ben Ma

2023

Sci. Rep.

Creating musical features using multi-faceted, multi-task encoders based on transformers

Timothy Greer, Xuan Shi, Benjamin Ma, and 1 more author

Scientific Reports, Jul 2023

Abs

Computational machine intelligence approaches have enabled a variety of music-centric technologies in support of creating, sharing and interacting with music content. A strong performance on specific downstream application tasks, such as music genre detection and music emotion recognition, is paramount to ensuring broad capabilities for computational music understanding and Music Information Retrieval. Traditional approaches have relied on supervised learning to train models to support these music-related tasks. However, such approaches require copious annotated data and still may only provide insight into one view of music—namely, that related to the specific task at hand. We present a new model for generating audio-musical features that support music understanding, leveraging self-supervision and cross-domain learning. After pre-training using masked reconstruction of musical input features using self-attention bidirectional transformers, output representations are fine-tuned using several downstream music understanding tasks. Results show that the features generated by our multi-faceted, multi-task, music transformer model, which we call M3BERT, tend to outperform other audio and music embeddings on several diverse music-related tasks, indicating the potential of self-supervised and semi-supervised learning approaches toward a more generalized and robust computational approach to modeling music. Our work can offer a starting point for many music-related modeling tasks, with potential applications in learning deep representations and enabling robust technology applications.

2022

USC

Multi-modal, Multi-task, Music BERT: A Context-Aware Music Encoder Based on Transformers

Timothy Greer, Xuan Shi, Benjamin Ma, and 1 more author

Jul 2022

2021

PLoS ONE

A computational lens into how music characterizes genre in film

Benjamin Ma, Timothy Greer, Dillon Knox, and 1 more author

PloS one, Jul 2021
CBMI

Loss Function Approaches for Multi-label Music Tagging

Dillon Knox, Timothy Greer, Benjamin Ma, and 3 more authors

In 2021 International Conference on Content-Based Multimedia Indexing (CBMI), Jul 2021
USC

Coordination or Dominance? An Investigation of Social Dynamics in Conversational Entrainment

Nikolaos Flemotomos, Benjamin Ma, and Raghuveer Peri

Jul 2021

2020

MediaEval

MediaEval 2020 Emotion and Theme Recognition in Music Task: Loss Function Approaches for Multi-label Music Tagging.

Dillon Knox, Timothy Greer, Benjamin Ma, and 3 more authors

In MediaEval, Jul 2020

2019

ACM

A multimodal view into music’s effect on human neural, physiological, and emotional experience

Timothy Greer, Benjamin Ma, Matthew Sachs, and 2 more authors

In Proceedings of the 27th ACM international conference on multimedia, Jul 2019
ACII

Predicting human-reported enjoyment responses in happy and sad music

Benjamin Ma, Timothy Greer, Matthew Sachs, and 3 more authors

In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII), Jul 2019
ICASSP

Learning shared vector representations of lyrics and chords in music

Timothy Greer, Karan Singla, Benjamin Ma, and 1 more author

In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jul 2019