publications

Show all

2025

Watcharasupat, Karn N.; Ding, Yiwei; Ma, T. Aleksandra; Seshadri, Pavan; Lerch, Alexander

Uncertainty Estimation in the Real World: A Study on Music Emotion Recognition Proceedings Article

In: Proceedings of the European Conference on Information Retrieval (ECIR), arXiv, Lucca, Italy, 2025.

Abstract | Links | BibTeX | Tags: Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

2024

Kim, Yonghyun; Lerch, Alexander

Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation Proceedings Article

In: Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), arXiv, San Francisco, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Ma, T. Aleksandra; Lerch, Alexander

Music auto-tagging in the long tail: A few-shot approach Proceedings Article

In: Proceedings of the AES Convention, New York, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, H.3.3

Han, Chaeyeon; Seshadri, Pavan; Ding, Yiwei; Posner, Noah; Koo, Bon Woo; Agrawal, Animesh; Lerch, Alexander; Guhathakurta, Subhrajit

Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors Journal Article

In: Urban Informatics, vol. 3, no. 1, pp. 22, 2024, ISSN: 2731-6963.

Abstract | Links | BibTeX | Tags: Active mobility, Audio-based, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multimedia, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Pedestrian, Sensors

Watcharasupat, Karn N.; Lerch, Alexander

A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), San Francisco, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Seshadri, Pavan; Han, Chaeyeon; Koo, Bon-Woo; Posner, Noah; Guhathakurta, Subhrajit; Lerch, Alexander

ASPED: An Audio Dataset for Detecting Pedestrians Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE), Seoul, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Watcharasupat, Karn N; Lerch, Alexander

Quantifying Spatial Audio Quality Impairment Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE), Seoul, 2024.

Links | BibTeX | Tags: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

2023

Ding, Yiwei; Lerch, Alexander

Audio Embeddings as Teachers for Music Classification Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Milan, Italy, 2023.

Abstract | Links | BibTeX | Tags: Computer Science - Information Retrieval, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

2022

Ma, Alison B; Lerch, Alexander

Representation Learning for the Automatic Indexing of Sound Effects Libraries Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Bangalore, IN, 2022, (arXiv:2208.09096 [cs, eess]).

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Vinay, Ashvala; Lerch, Alexander

Evaluating Generative Audio Systems and their Metrics Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Bangalore, IN, 2022, (arXiv:2209.00130 [cs, eess]).

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Kalbag, Vedant; Lerch, Alexander

Scream Detection in Heavy Metal Music Proceedings Article

In: Proceedings of the Sound and Music Computing Conference (SMC), Saint-Etienne, 2022.

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Hung, Yun-Ning; Lerch, Alexander

Feature-informed Embedding Space Regularization for Audio Classification Proceedings Article

In: Proceedings of the European Signal Processing Conference (EUSIPCO), Belgrade, Serbia, 2022.

Abstract | Links | BibTeX | Tags: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

2021

Watcharasupat, Karn N; Lerch, Alexander

Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes Proceedings Article

In: Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Online, 2021.

Abstract | Links | BibTeX | Tags: Computer Science - Information Retrieval, Computer Science - Information Theory, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Music Informatics Group