older projects

Assessment of Music Performances
Design and evaluation of features for characterizing (student) music performances, and creation of models that automatically assess these performances, detect errors, and give instantaneous feedback to the performer.

[show_more color=”#eeb211″]

Resources
Source repository: github
Publications:
– Wu, C.-W.; Gururani, S.; Laguna, C.; Pati, A.; Vidwans, A.; Lerch, A., Towards the Objective Assessment of Music Performances, Proceedings of the International Conference on Music Perception and Cognition (ICMPC), San Francisco, 2016
Contributors (current)
Siddharth Kumar Gururani, Chris Laguna, Ashis Pati, Amruta Jayant Vidwans, Chih-Wei Wu
Contributors (past)
Cian O’Brien, Yujia Yan, Ying Zhan

[/show_more]

Automatic Drum Transcription (PhD Project)
Automatic drum transcription in polyphonic mixtures of music using a signal-adaptive NMF-based method.
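
The idea of a partially fixed NMF can be sketched roughly as follows: some template columns (for example, drum spectra) stay fixed, while additional free columns adapt to the rest of the mixture. This is a minimal illustration, not the project's implementation; the function names, the Euclidean cost, and the multiplicative update rules are assumptions made for this sketch.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def partially_fixed_nmf(v, w_fixed, n_free, n_iter=200, seed=0, eps=1e-9):
    """Approximate v by [w_fixed | w_free] * h using multiplicative updates.

    Hypothetical sketch: only the free templates and the activations h are
    updated; the columns taken from w_fixed keep their values.
    """
    rng = random.Random(seed)
    n_bins, n_frames = len(v), len(v[0])
    n_fixed = len(w_fixed[0])
    # Fixed templates first, then randomly initialized free templates.
    w = [list(w_fixed[i]) + [rng.random() + eps for _ in range(n_free)]
         for i in range(n_bins)]
    h = [[rng.random() + eps for _ in range(n_frames)]
         for _ in range(n_fixed + n_free)]
    for _ in range(n_iter):
        # Update all activations.
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[k][t] * num[k][t] / (den[k][t] + eps) for t in range(n_frames)]
             for k in range(len(h))]
        # Update only the free template columns.
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        for i in range(n_bins):
            for k in range(n_fixed, n_fixed + n_free):
                w[i][k] *= num[i][k] / (den[i][k] + eps)
    return w, h
```

In such a setup, the rows of `h` that belong to the fixed templates could then be searched for peaks to obtain candidate drum onsets.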

[show_more color=”#eeb211″]

Resources
Source repository: github
Publications:
– Wu, C.-W.; Lerch, A., On Drum Playing Technique Detection in Polyphonic Mixtures, Proceedings of the International Conference on Music Information Retrieval (ISMIR), New York, 2016
– Wu, C.-W.; Lerch, A., Drum Transcription using Partially Fixed Non-Negative Matrix Factorization With Template Adaptation, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Malaga, 2015.
– Wu, C.-W.; Lerch, A., Drum Transcription using Partially Fixed Non-Negative Matrix Factorization, Proceedings of the European Signal Processing Conference (EUSIPCO), Nice, 2015.
Contributors
Chih-Wei Wu

[/show_more]

Audio Quality Enhancement (MS Project)
Web application that improves the audio quality of low-quality recordings, especially mobile phone recordings. Processing steps include detecting and correcting clipping (distortion), removing noise, normalizing loudness, and equalization. The REPAIR Web App allows users to upload low-quality audio and download the improved audio.
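
Two of the processing steps named above can be illustrated with a simple sketch. It is not the REPAIR algorithm: clipping is detected here as runs of samples at or near full scale, and RMS level serves as a crude stand-in for loudness. The function names, the threshold, and the minimum run length are assumptions.

```python
import math

def find_clipped_runs(samples, threshold=0.99, min_run=3):
    """Return (start, end) index pairs, end exclusive, for runs of at least
    min_run consecutive samples whose magnitude reaches the threshold."""
    runs, start = [], None
    for i, x in enumerate(samples):
        if abs(x) >= threshold:
            if start is None:
                start = i
        elif start is not None:
            if i - start >= min_run:
                runs.append((start, i))
            start = None
    # Close a run that extends to the end of the signal.
    if start is not None and len(samples) - start >= min_run:
        runs.append((start, len(samples)))
    return runs

def rms_normalize(samples, target_rms=0.1):
    """Scale the signal so that its RMS level equals target_rms."""
    if not samples:
        return []
    rms = math.sqrt(sum(x * x for x in samples) / len(samples))
    if rms == 0.0:
        return list(samples)
    gain = target_rms / rms
    return [x * gain for x in samples]
```

A declipping step would then replace the samples inside each detected run, for example by interpolating from the unclipped neighbors.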

[show_more color=”#eeb211″]

Resources
Web Application: REPAIR Web App
Source repository: github
Publications:
– Laguna, C., Master Project Report: A Web Application for Audio Quality Enhancement, MS Project Report, Georgia Institute of Technology, 2016
– Laguna, C.; Lerch, A., An Efficient Algorithm for Clipping Detection and Declipping Audio, Proceedings of the 141st AES Convention, Los Angeles, 2016
– Laguna, C.; Lerch, A., Client-Side Audio Declipping, Proceedings of the 2nd Web Audio Conference (WAC), Atlanta, 2016
Contributors
Chris Laguna

[/show_more]

Outlier detection in music datasets (Cooperation with Virginia Tech)
Unsupervised detection of anomalies in music datasets.

[show_more color=”#eeb211″]

Resources
Publications:
– Lu, Y.-C.; Wu, C.-W.; Lu, C.-T.; Lerch, A., Automatic Outlier Detection in Music Genre Datasets, Proceedings of the International Conference on Music Information Retrieval (ISMIR), New York, 2016
– Lu, Y.-C.; Wu, C.-W.; Lu, C.-T.; Lerch, A., An Unsupervised Approach to Anomaly Detection in Music Datasets, Proceedings of the ACM SIGIR Conference (SIGIR), Pisa, 2016
Contributors
Chih-Wei Wu

[/show_more]

Automatic Practice Logging (Semester Project)
Automatic identification of continuous recordings of musicians practicing their repertoire. The goal is a detailed description of what and where they practiced, which can be used by students and instructors to communicate about the countless hours spent practicing.

[show_more color=”#eeb211″]

Resources
Publications:
– Winters, R. M.; Gururani, S.; Lerch, A., Automatic Practice Logging: Introduction, Dataset & Preliminary Study, Proceedings of the International Conference on Music Information Retrieval (ISMIR), New York, 2016
Source repository: github
Contributors
R. Michael Winters, Siddharth Kumar Gururani

[/show_more]

Machine Listening Module (MS Project)
Machine listening provides a set of data with which music can be synthesized, modified, or sonified. Real-time audio feature extraction opens up new possibilities for interactive music, improvisation, and generative composition. To promote machine listening as a compositional tool, this project brings the technique to DIY embedded systems such as the Raspberry Pi, integrating machine listening with analog synthesizers in the Eurorack format.

[show_more color=”#eeb211″]

Resources
Source repository: github
Project Report:
– Latina, C., Machine Listening Eurorack Module, MS Project Report, Georgia Institute of Technology, 2016.
Contributors
Chris Latina

[/show_more]

Sample detection in Polyphonic Music
Sampling, the use of snippets or loops from existing songs or libraries in new music productions or mashups, is a common technique in many music genres. The goal of this project is to design an NMF-based algorithm that detects the presence of an audio sample in a set of tracks. The sample may be pitch-shifted or time-stretched, so the algorithm should ideally be robust against such manipulations.

[show_more color=”#eeb211″]

Resources
Contributors
Siddarth Kumar

[/show_more]

Web Resources for Audio Content Analysis
Online resources for tasks related to music information retrieval and machine learning, including MATLAB files, a list of datasets, and exercises.

[show_more color=”#eeb211″]

Resources
WWW: AudioContentAnalysis.org
Contributors
Alexander Lerch

[/show_more]

Other Projects

Application of MIR Techniques to Medical Signals
Based on the physionet.org challenge dataset for reducing false alarms in ECG and blood pressure signals, this project investigates MIR approaches for detecting alarm situations in the intensive care unit. Five types of alarms are detected: asystole, extreme bradycardia, extreme tachycardia, ventricular tachycardia, and ventricular flutter.

[show_more color=”#eeb211″]

Resources
Contributors
Amruta Vidwans

[/show_more]

Real-time speaker annotation in conference settings
Generating a transcript of a conference meeting requires not only the transcription of text but also assigning the text to specific speakers. This system is designed to detect an unknown number of speakers and assign text to these speakers in a real-time scenario.

[show_more color=”#eeb211″]

Resources
Source repository: github
Contributors
Avrosh Kumar

[/show_more]

Application for Vocal Training and Assessment using Real-Time Pitch Tracking
A cross-platform application for vocal training and evaluation using monophonic pitch tracking. The system is designed to take real-time voice input using the standard microphones available in most mobile devices. The assessment compares the input against reference vocal lessons in terms of pitch and timing accuracy. Real-time feedback is provided to the user in the form of a pitch contour plotted against the reference pitch to be sung.
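
As a rough illustration of monophonic pitch tracking and pitch-accuracy scoring, the sketch below estimates the fundamental frequency of one frame from its autocorrelation peak and expresses the deviation from a reference pitch in cents. The application's actual tracker is not described here; the function names, the autocorrelation method, and the search range are assumptions.

```python
import math

def estimate_f0(frame, sample_rate, f_min=80.0, f_max=500.0):
    """Estimate the fundamental frequency (Hz) of a monophonic frame by
    picking the lag with the largest autocorrelation value."""
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(len(frame) - 1, int(sample_rate / f_min))
    best_lag, best_value = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        value = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    # None signals that no positive autocorrelation peak was found.
    return sample_rate / best_lag if best_lag else None

def cents_error(f_sung, f_reference):
    """Deviation of the sung pitch from the reference pitch in cents."""
    return 1200.0 * math.log2(f_sung / f_reference)
```

Calling `estimate_f0` on successive frames yields a pitch contour, and `cents_error` against the reference note gives one possible per-frame accuracy measure.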

[show_more color=”#eeb211″]

Resources
Source repository: github
Project Report:
– Pati, A., An Application for Vocal Training and Evaluation using Real-time Monophonic Pitch Tracking, Technical Report, Georgia Tech, 2015.
Contributors
Ashis Pati

[/show_more]

Vocopter Singing Game
Vocopter is a mobile game adapted from the classic Copter game. Vocopter allows a playful approach to assess the accuracy of intonation.

[show_more color=”#eeb211″]

Resources
Source repository: github
Contributors
Rithesh Kumar

[/show_more]

Project Riyaaz
Riyaaz is an Urdu word which means devoted practice. The project aims at implementing an app that aids the practice of Indian classical vocal music. It requires the student to pass through a curriculum of exercises designed to strengthen their grasp of Swara (tonality, pitch) and Tala (rhythm). The interface provides real-time graphical feedback to help students improve their skills.

[show_more color=”#eeb211″]

Resources
Contributors
Milap Rane

[/show_more]

Audio-Adaptive Visual Animations of Paintings

Original oil on canvas painting: Dusan Malobabic

A painting is an expression frozen in time. It is the imagination of the viewer that paints the untold past and the future of the captured moment. This project is an attempt to induce movement in a painting, evoked by sounds or music. The idea is to extract various descriptors from music, for example onsets and tonal content, and map them to a function that processes an image to bring it to life as well as enhance the music listening experience.

[show_more color=”#eeb211″]

Resources
Contributors
Avrosh Kumar

[/show_more]

Automatic Audio-Lyrics Alignment
Automatic alignment of song lyrics to audio recordings at the line level. The alignment makes use of voice activity detection, pitch detection, and the detection of repeating structures.

[show_more color=”#eeb211″]

Resources
Contributors
Amruta Vidwans

[/show_more]

Genre-specific Key Profiles
Investigation of differences and commonalities of audio pitch class profiles of different musical genres.
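
A pitch class profile can be sketched as a 12-bin histogram that folds spectral energy from all octaves onto the twelve pitch classes. The example below assumes that spectral peaks are already available as (frequency, magnitude) pairs and that A4 is tuned to 440 Hz; the function names are hypothetical and do not come from the publication.

```python
import math

def pitch_class(frequency_hz, a4_hz=440.0):
    """Map a frequency to its pitch class (0 = C, ..., 9 = A, 11 = B)."""
    midi = 69 + 12 * math.log2(frequency_hz / a4_hz)
    return int(round(midi)) % 12

def pitch_class_profile(peaks):
    """Accumulate (frequency_hz, magnitude) pairs into a 12-bin profile
    normalized to sum to one."""
    profile = [0.0] * 12
    for frequency_hz, magnitude in peaks:
        if frequency_hz > 0:
            profile[pitch_class(frequency_hz)] += magnitude
    total = sum(profile)
    return [p / total for p in profile] if total > 0 else profile
```

Averaging such profiles over all tracks of a genre, after transposing to a common key, would give one possible genre-level profile to compare.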

[show_more color=”#eeb211″]

Resources
Publication: O’Brien, C.; Lerch, A., Genre-Specific Key Profiles, Proceedings of the International Computer Music Conference (ICMC), Denton, 2015.
Contributors
Cian O’Brien

[/show_more]

Supervised Feature Learning via Sparse Coding for Music Information Retrieval
Sparse coding allows features to be learned from a dataset in an unsupervised way. This project investigates how adding supervised training can improve the descriptiveness of the learned features.

[show_more color=”#eeb211″]

Resources
Thesis: smartech
Contributors
Cian O’Brien

[/show_more]

Real-time Onset Detection
Design of an onset detection algorithm suitable for real-time processing in a low-latency live input scenario.
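
To illustrate what a causal, frame-by-frame onset detector can look like, the sketch below flags a frame as an onset when its energy rises well above the average of the most recent frames. This simple energy-based scheme is an assumption chosen for illustration and is not the algorithm designed in this project; the class name and parameters are hypothetical.

```python
from collections import deque

class StreamingOnsetDetector:
    """Causal energy-based onset detector that sees one frame at a time."""

    def __init__(self, history=4, ratio=2.0, floor=1e-4):
        self.past = deque(maxlen=history)  # energies of recent frames
        self.ratio = ratio                 # required rise over the baseline
        self.floor = floor                 # ignore very quiet frames
        self.was_above = False             # suppress repeated detections

    def process(self, frame):
        """Return True if the frame starts a new onset."""
        energy = sum(x * x for x in frame) / len(frame)
        baseline = sum(self.past) / len(self.past) if self.past else 0.0
        above = energy > self.floor and energy > self.ratio * baseline
        is_onset = above and not self.was_above
        self.was_above = above
        self.past.append(energy)
        return is_onset
```

Because each decision uses only the current and earlier frames, the latency of such a detector is bounded by the frame length.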

[show_more color=”#eeb211″]

Contributors
Rithesh Kumar

[/show_more]

Predominant Instrument Recognition in Polyphonic Audio
Identification of a single predominant instrument per audio file using pitch features, timbre features and features extracted from short-time harmonics.

[show_more color=”#eeb211″]

Contributors
Chris Laguna

[/show_more]

Time-Domain Multi-Pitch Detection with Sparse Additive Modeling
Frame-level multi-pitch detection in the time domain with locally periodic kernel functions and sparsity constraints.

[show_more color=”#eeb211″]

Contributors
Yujia Yan

[/show_more]

Identification of live music performance via ambient audio content features
Automatic identification of recordings of live performance as opposed to studio recordings.

[show_more color=”#eeb211″]

Resources
Dataset: github
Contributors
Raja Raman

[/show_more]

Wiki tutorial for running SuperCollider on Raspberry Pi
Various tutorials on the installation and configuration of SuperCollider on a Raspberry Pi.

[show_more color=”#eeb211″]

Resources
WWW: Embedded Music Page
Contributors
Chris Latina

[/show_more]

Metric Learning for Music Discovery with Source and Target Playlists
Playlist generation for music exploration by defining sets of source songs and target songs and deriving a playlist through metric learning and boundary constraints.

[show_more color=”#eeb211″]

Resources
Slides: presentation
Contributors
Ying-Shu Kuo

[/show_more]

Audio Chord Detection Using Deep Learning
Improving audio chord detection by using a deep network to extract tonal features from the audio.

[show_more color=”#eeb211″]

Resources
Publication: Zhou, X.; Lerch, A., Chord Detection Using Deep Learning, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Malaga, 2015.
Contributors
Xinquan Zhou

[/show_more]