publications

125 entries « 1 of 3 »

2024

Kim, Yonghyun; Lerch, Alexander

Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation Proceedings Article

In: Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), arXiv, San Francisco, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Ma, T. Aleksandra; Lerch, Alexander

Music auto-tagging in the long tail: A few-shot approach Proceedings Article

In: Proceedings of the AES Convention, New York, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, H.3.3

Han, Chaeyeon; Seshadri, Pavan; Ding, Yiwei; Posner, Noah; Koo, Bon Woo; Agrawal, Animesh; Lerch, Alexander; Guhathakurta, Subhrajit

Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors Journal Article

In: Urban Informatics, vol. 3, no. 1, pp. 22, 2024, ISSN: 2731-6963.

Abstract | Links | BibTeX | Tags: Active mobility, Audio-based, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multimedia, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Pedestrian, Sensors

Watcharasupat, Karn N.; Lerch, Alexander

A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), San Francisco, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Ooi, Kenneth; Goh, Jessie; Lin, Hao-Weng; Ong, Zhen-Ting; Wong, Trevor; Watcharasupat, Karn N.; Lam, Bhan; Gan, Woon-Seng

Lion City Soundscapes: Modified Partitioning around Medoids for a Perceptually Diverse Dataset of Singaporean Soundscapes Journal Article

In: JASA Express Letters, vol. 4, no. 4, pp. 047402, 2024, ISSN: 2691-1191.

Links | BibTeX | Tags:

Ding, Yiwei; Han, Chaeyeon; Seshadri, Pavan; Koo, Bon Woo; Posner, Noah; Guhathakurta, Subhro; Lerch, Alexander

Toward audio-based sensing for pedestrian detection Journal Article

In: The Journal of the Acoustical Society of America, vol. 155, no. 3_Supplement, pp. A282, 2024, ISSN: 0001-4966.

Abstract | Links | BibTeX | Tags:

Ding, Yiwei; Lerch, Alexander

Embedding Compression for Teacher-to-Student Knowledge Transfer Proceedings Article

In: Proceedings of the International Conference on Acoustics Speech and Signal Processing (ICASSP) - Satellite Workshop Deep Neural Network Model Compression, Institute of Electrical and Electronics Engineers (IEEE), Seoul, Korea, 2024, (arXiv:2402.06761 [cs]).

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning

Seshadri, Pavan; Han, Chaeyeon; Koo, Bon-Woo; Posner, Noah; Guhathakurta, Subhrajit; Lerch, Alexander

ASPED: An Audio Dataset for Detecting Pedestrians Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE), Seoul, 2024.

Abstract | Links | BibTeX | Tags: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Watcharasupat, Karn N; Lerch, Alexander

Quantifying Spatial Audio Quality Impairment Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE), Seoul, 2024.

Links | BibTeX | Tags: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Ooi, Kenneth; Ong, Zhen-Ting; Watcharasupat, Karn N.; Lam, Bhan; Hong, Joo Young; Gan, Woon-Seng

ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes Journal Article

In: IEEE Transactions on Affective Computing, vol. 15, no. 1, pp. 105–120, 2024, ISSN: 1949-3045.

Links | BibTeX | Tags:

Watcharasupat, Karn N; Ooi, Kenneth; Lam, Bhan; Ong, Zhen-Ting; Jaratjarungkiat, Sureenate; Gan, Woon-Seng

Validating Thai Translations of Perceptual Soundscape Attributes: A Non-Procrustean Approach with a Procrustes Projection Journal Article

In: Applied Acoustics, 2024.

Links | BibTeX | Tags:

Liu, Shimiao; Lerch, Alexander

Enhancing Video Music Recommendation with Transformer-Driven Audio-Visual Embeddings Proceedings Article

In: Proceedings of the IEEE International Symposium on the Internet of Sounds (IS2), Erlangen, 2024.

Abstract | Links | BibTeX | Tags: Contrastive learning, Encoding, Fitting, Immersive experience, Internet, Labeling, Manuals, multi-modal, music, music recommendation, Recommender systems, trans-former, Transformers

2023

Lam, Bhan; Chieng, Julia; Ooi, Kenneth; Ong, Zhen Ting; Watcharasupat, Karn N.; Hong, Joo Young; Gan, Woon Seng

Crossing the Linguistic Causeway: Ethnonational Differences on Soundscape Attributes in Bahasa Melayu Journal Article

In: Applied Acoustics, vol. 214, 2023, ISSN: 1872910X.

Links | BibTeX | Tags:

Ding, Yiwei; Lerch, Alexander

Audio Embeddings as Teachers for Music Classification Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Milan, Italy, 2023.

Abstract | Links | BibTeX | Tags: Computer Science - Information Retrieval, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Knees, Peter; Lerch, Alexander

MILC 2023: 3rd Workshop on Intelligent Music Interfaces for Listening and Creation Proceedings Article

In: Companion Proceedings of the 28th International Conference on Intelligent User Interfaces, pp. 185–186, Association for Computing Machinery, Sydney, 2023, ISBN: 9798400701078.

Abstract | Links | BibTeX | Tags:

Chen, Hsin-Hung; Lerch, Alexander

Music Instrument Classification Reprogrammed Proceedings Article

In: Proceedings of the International Conference on Multimedia Modeling (MMM), Bergen, Norway, 2023.

Links | BibTeX | Tags:

Lerch, Alexander

An Introduction to Audio Content Analysis: Music Information Retrieval Tasks and Applications Book

2, Wiley-IEEE Press, Hoboken, N.J, 2023, ISBN: 978-1-119-89094-2.

Abstract | Links | BibTeX | Tags: analysis, audio, Audio content analysis, audio signal processing, Automatic Music Transcription, Computer sound processing, machine listening, Matlab, MIR, music analysis, music informatics, music information retrieval, Python

Lerch, Alexander

Audioinhaltsanalyse Book Section

In: Weinzierl, Stefan (Ed.): Handbuch der Audiotechnik, pp. 1–20, Springer Berlin Heidelberg, Berlin, Heidelberg, 2023, ISBN: 978-3-662-60357-4.

Abstract | Links | BibTeX | Tags: Audio content analysis, Grundfrequenzerkennung, music information retrieval, Musikklassifizierung, Musiktranskription, Tonarterkennung

Smith, Jason Brent; Vinay, Ashvala; Freeman, Jason

The Impact of Salient Musical Features in a Hybrid Recommendation System for a Sound Library Proceedings Article

In: Joint Proceedings of the ACM IUI Workshops (MILC), Sydney, 2023.

Abstract | Links | BibTeX | Tags:

Hung, Yun-Ning; Yang, Chao-Han Huck; Chen, Pin-Yu; Lerch, Alexander

Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE), Rhodes Island, Greece, 2023.

Links | BibTeX | Tags:

Lerch, Alexander

Grundlagen digitaler Audiosignale Book Section

In: Weinzierl, Stefan (Ed.): Handbuch der Audiotechnik, pp. 1–13, Springer Berlin Heidelberg, Berlin, Heidelberg, 2023, ISBN: 978-3-662-60357-4.

Abstract | Links | BibTeX | Tags:

Vinay, Ashvala; Lerch, Alexander

AQUATK: An Audio Quality Assessment Toolkit Proceedings Article

In: Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), International Society for Music Information Retrieval (ISMIR), Milan, 2023.

Links | BibTeX | Tags:

Watcharasupat, Karn N; Wu, Chih-Wei; Ding, Yiwei; Orife, Iroro; Hipple, Aaron J; Williams, Phillip A; Kramer, Scott; Lerch, Alexander; Wolcott, William

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation Journal Article

In: IEEE Open Journal of Signal Processing, pp. 1–9, 2023, ISSN: 2644-1322.

Abstract | Links | BibTeX | Tags:

Lam, Bhan; Ooi, Kenneth; Ong, Zhen-Ting; Wong, Trevor; Gan, Woon-Seng; Watcharasupat, Karn

Preliminary Investigation of the Short-Term in Situ Performance of an Automatic Masker Selection System Proceedings Article

In: Proceedings of the 52nd International Congress and Exposition on Noise Control Engineering, 2023.

Links | BibTeX | Tags:

Ong, Zhen-Ting; Ooi, Kenneth; Wong, Trevor; Lam, Bhan; Gan, Woon-Seng; Watcharasupat, Karn N.

Effect of Masker Selection Schemes on the Perceived Affective Quality of Soundscapes: A Pilot Study Proceedings Article

In: Proceedings of the 52nd International Congress and Exposition on Noise Control Engineering, 2023.

Links | BibTeX | Tags:

Ooi, Kenneth; Ong, Zhen-Ting; Lam, Bhan; Wong, Trevor; Gan, Woon-Seng; Watcharasupat, Karn

ARAUSv2: An Expanded Dataset and Multimodal Models of Affective Responses to Augmented Urban Soundscapes Proceedings Article

In: Proceedings of the 52nd International Congress and Exposition on Noise Control Engineering, 2023.

Links | BibTeX | Tags:

Ooi, Kenneth; Watcharasupat, Karn N; Lam, Bhan; Ong, Zhen-Ting; Gan, Woon-Seng

Autonomous Soundscape Augmentation with Multimodal Fusion of Visual and Participant-linked Inputs Proceedings Article

In: Proceedings of the 2023 International Conference on Acoustics, Speech, and Signal Processing, 2023.

Links | BibTeX | Tags:

2022

Hung, Yun-Ning; Wu, Chih-Wei; Orife, Iroro; Hipple, Aaron; Wolcott, William; Lerch, Alexander

A large TV dataset for speech and music activity detection Journal Article

In: EURASIP Journal on Audio, Speech, and Music Processing, vol. 2022, no. 1, pp. 21, 2022, ISSN: 1687-4722.

Abstract | Links | BibTeX | Tags: Dataset, Production TV audio, Speech and music activation detection

Ma, Alison B; Lerch, Alexander

Representation Learning for the Automatic Indexing of Sound Effects Libraries Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Bangalore, IN, 2022, (arXiv:2208.09096 [cs, eess]).

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Vinay, Ashvala; Lerch, Alexander

Evaluating Generative Audio Systems and their Metrics Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Bangalore, IN, 2022, (arXiv:2209.00130 [cs, eess]).

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Lerch, Alexander

libACA, pyACA, and ACA-Code: Audio Content Analysis in 3 Languages Journal Article

In: Software Impacts, pp. 100349, 2022, ISSN: 2665-9638.

Abstract | Links | BibTeX | Tags: Audio content analysis, C++, Matlab, music information retrieval, Python

Hung, Yun-Ning; Lerch, Alexander

Feature-informed Latent Space Regularization for Music Source Separation Miscellaneous

2022, (arXiv:2203.09132 [eess]).

Abstract | Links | BibTeX | Tags: Electrical Engineering and Systems Science - Audio and Speech Processing

Wang, Ju-Chiang; Hung, Yun-Ning; Smith, Jordan B. L.

To Catch A Chorus, Verse, Intro, or Anything Else: Analyzing a Song with Structural Functions Proceedings Article

In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 416–420, 2022, (ISSN: 2379-190X).

Abstract | Links | BibTeX | Tags: Location awareness, music, Music structure, segmentation, semantic labeling, Semantics, Signal processing, Signal processing algorithms, SpecTNT, Taxonomy, Transformer, Transformers

Watcharasupat, Karn N; Lee, Junyoung; Lerch, Alexander

Latte: Cross-framework Python Package for Evaluation of Latent-based Generative Models Journal Article

In: Software Impacts, pp. 100222, 2022, ISSN: 26659638.

Abstract | Links | BibTeX | Tags:

Kalbag, Vedant; Lerch, Alexander

Scream Detection in Heavy Metal Music Proceedings Article

In: Proceedings of the Sound and Music Computing Conference (SMC), Saint-Etienne, 2022.

Abstract | Links | BibTeX | Tags: Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Hung, Yun-Ning; Lerch, Alexander

Feature-informed Embedding Space Regularization for Audio Classification Proceedings Article

In: Proceedings of the European Signal Processing Conference (EUSIPCO), Belgrade, Serbia, 2022.

Abstract | Links | BibTeX | Tags: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Guo, Weian; Hua, Zhenyao; Kang, Zecheng; Li, Dongyang; Wang, Lei; Wu, Qidi; Lerch, Alexander

Deep Reinforcement Learning for Urban Multi-taxis Cruising Strategy Journal Article

In: Neural Computing and Applications, 2022, ISSN: 1433-3058.

Abstract | Links | BibTeX | Tags: Data-driven model, deep Q-learning network, Multi-taxis cruising, Urban transportation

Herre, Jürgen; Disch, Sascha; Lerch, Alexander

Quellcodierung Book Section

In: Weinzierl, Stefan (Ed.): Handbuch der Audiotechnik, pp. 1–23, Springer, Berlin, Heidelberg, 2022, ISBN: 978-3-662-60357-4.

Abstract | Links | BibTeX | Tags: Audiocodierung, Audiokomprimierung, Codec, MPEG, Psychoakustik, verlustfrei, verlustlos

Li, Dongyang; Wang, Lei; Li, Li; Guo, Weian; Wu, Qidi; Lerch, Alexander

A Large-Scale Multiobjective Particle Swarm Optimizer With Enhanced Balance of Convergence and Diversity Journal Article

In: IEEE Transactions on Cybernetics, pp. 1–12, 2022, ISSN: 2168-2275.

Abstract | Links | BibTeX | Tags: Complexity theory, Convergence, Cybernetics, diversity, Estimation, large-scale multiobjective optimization, multidimensional local sparseness, Optimization, Particle swarm optimization, particle swarm optimization (PSO), Weight measurement, weighted convergence factor (WCF)

Watcharasupat, Karn N.; Ooi, Kenneth; Lam, Bhan; Wong, Trevor; Ong, Zhen Ting; Gan, Woon Seng

Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain Journal Article

In: IEEE Signal Processing Letters, vol. 29, pp. 1749–1753, 2022, ISSN: 15582361.

Links | BibTeX | Tags:

2021

Watcharasupat, Karn N; Lerch, Alexander

Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes Proceedings Article

In: Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Online, 2021.

Abstract | Links | BibTeX | Tags: Computer Science - Information Retrieval, Computer Science - Information Theory, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing

Hung, Yun-Ning; Wichern, Gordon; Roux, Jonathan Le

Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision Proceedings Article

In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 46–50, 2021, (ISSN: 2379-190X).

Abstract | Links | BibTeX | Tags: audio source separation, Conferences, Instruments, music, music transcription, Particle separators, Source separation, Time-frequency analysis, Training, weakly-labeled data, weakly-supervised separation

Li, Dongyang; Wang, Lei; Lerch, Alexander; Wu, Qidi

An Adaptive Particle Swarm Optimizer with Decoupled Exploration and Exploitation for Large Scale Optimization Journal Article

In: Swarm and Evolutionary Computation, vol. 60, 2021, ISSN: 2210-6502.

Links | BibTeX | Tags:

Vinay, Ashvala; Lerch, Alexander; Leslie, Grace

Mind the Beat: Detecting Audio Onsets from EEG Recordings of Music Listening Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE), Toronto, Ontario, Canada, 2021.

Abstract | Links | BibTeX | Tags:

Seshadri, Pavan; Lerch, Alexander

Improving Music Performance Assessment with Contrastive Learning Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), pp. 8, Online, 2021.

Abstract | Links | BibTeX | Tags:

Pati, Ashis; Lerch, Alexander

Is Disentanglement Enough? On Latent Representations for Controllable Music Generation Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), pp. 8, Online, 2021.

Abstract | Links | BibTeX | Tags:

Gururani, Siddharth; Lerch, Alexander

Semi-Supervised Audio Classification with Partially Labeled Data Proceedings Article

In: Proceedings of the IEEE International Symposium on Multimedia (ISM), Institute of Electrical and Electronics Engineers (IEEE), online, 2021.

Links | BibTeX | Tags:

Lerch, Alexander; Knees, Peter

Machine Learning Applied to Music/Audio Signal Processing Journal Article

In: Electronics, vol. 10, no. 24, pp. 3077, 2021.

Abstract | Links | BibTeX | Tags: n/a

Hung, Yun-Ning; Watcharasupat, Karn N; Wu, Chih-Wei; Orife, Iroro; Li, Kelian; Seshadri, Pavan; Lee, Junyoung

AVASPEECH-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-occurence Proceedings Article

In: Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), pp. 3, Online, 2021.

Abstract | BibTeX | Tags:

2020

Huang, Jiawen; Hung, Yun-Ning; Pati, Ashis K; Gururani, Siddharth; Lerch, Alexander

Score-informed Networks for Music Performance Assessment Proceedings Article

In: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), International Society for Music Information Retrieval (ISMIR), Montreal, 2020.

Links | BibTeX | Tags:

125 entries « 1 of 3 »

Music Informatics Group