Using Weakly Aligned Score–audio Pairs to Train Deep Chroma Models for Cross-modal Music Retrieval

Abstract: Many music information retrieval tasks involve the comparison of a symbolic score representation with an audio recording. A typical strategy is to compare score–audio pairs based on a common mid-level representation, such as chroma features. Several recent studies have demonstrated the effectiveness of deep learning models that learn task-specific mid-level representations from temporally aligned training pairs. In practice, however, strongly aligned training data is often lacking, in particular for real-world scenarios. In our study, we use weakly aligned score–audio pairs for training, where only the beginning and end of a score excerpt are annotated in an audio recording, without aligned correspondences in between. To exploit such weakly aligned data, we employ the Connectionist Temporal Classification (CTC) loss to train a deep learning model for computing an enhanced chroma representation. We then apply this model to a cross-modal retrieval task, where we aim to find relevant audio recordings of Western classical music, given a short monophonic musical theme in symbolic notation as a query. We present systematic experiments that show the effectiveness of the CTC-based model for this theme-based retrieval task.
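
To illustrate why the CTC loss suits weakly aligned pairs, the following is a minimal sketch of the standard CTC forward recursion in plain Python. It is not the authors' implementation. The alphabet of twelve pitch classes plus one blank symbol (index 12), the function name `ctc_neg_log_likelihood`, and the input layout (a list of per-frame log-probability lists) are assumptions made for illustration only. The sketch shows that the loss needs only the label sequence of the score excerpt and the frame-wise network outputs, not the frame positions of the individual notes.

```python
import math

NEG_INF = float("-inf")


def logsumexp(*xs):
    """Numerically stable log(sum(exp(x))) that tolerates -inf inputs."""
    m = max(xs)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(x - m) for x in xs))


def ctc_neg_log_likelihood(log_probs, target, blank=12):
    """Negative log-likelihood of `target` under the CTC model.

    log_probs: list of T frames, each a list of log-probabilities over
               the assumed alphabet (12 pitch classes + blank).
    target:    list of pitch-class indices (0..11) without timing.
    """
    # Extended label sequence: blank, l1, blank, l2, ..., blank
    ext = [blank]
    for label in target:
        ext += [label, blank]
    num_states = len(ext)

    # Initialisation: a path may start with the first blank or the first label.
    alpha = [NEG_INF] * num_states
    alpha[0] = log_probs[0][ext[0]]
    if num_states > 1:
        alpha[1] = log_probs[0][ext[1]]

    # Recursion over the remaining frames.
    for t in range(1, len(log_probs)):
        new_alpha = [NEG_INF] * num_states
        for s in range(num_states):
            candidates = [alpha[s]]                  # stay in the same state
            if s >= 1:
                candidates.append(alpha[s - 1])      # advance by one state
            # Skip a blank only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                candidates.append(alpha[s - 2])
            new_alpha[s] = logsumexp(*candidates) + log_probs[t][ext[s]]
        alpha = new_alpha

    # Valid paths end in the last label or the final blank.
    if num_states > 1:
        total = logsumexp(alpha[-1], alpha[-2])
    else:
        total = alpha[-1]
    return -total


# Usage: two frames with uniform outputs, target is the single pitch class C (0).
uniform = [[math.log(1.0 / 13)] * 13 for _ in range(2)]
loss = ctc_neg_log_likelihood(uniform, [0])
```

In this toy case, three frame-level paths collapse to the target (C–C, C–blank, blank–C), so the loss sums over all of them instead of requiring one annotated alignment. In a training setup, the per-frame log-probabilities would come from the network, and a differentiable CTC implementation from a deep learning framework would be used in place of this sketch.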

Frank Zalkow, Meinard Müller

Similar Papers

Modelling Hierarchical Key Structure with Pitch Scapes

Robert Lieck, Martin Rohrmeier

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music, Computational music theory and musicology, Representations of music, MIR tasks, Automatic classification, Musical features and properties, Harmony, chords, and tonality, Structure, segmentation, and form

Music Fadernets: Controllable Music Generation Based on High-level Features via Low-level Feature Modelling

Hao Hao Tan, Dorien Herremans

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music, MIR tasks, Music synthesis and transformation

Metric Learning vs Classification for Disentangled Music Representation Learning

Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam

Keywords: MIR tasks, Similarity metrics, Automatic classification

Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining

Timothy Tsai, Kevin Ji

Keywords: Musical features and properties, Musical style and genre, Domain knowledge, Machine learning/Artificial intelligence for music, Representations of music, MIR fundamentals and methodology, Symbolic music processing, MIR tasks, Automatic classification

Unsupervised Disentanglement of Pitch and Timbre for Isolated Musical Instrument Sounds

Yin-Jyun Luo, Kin Wai Cheuk, Tomoyasu Nakano, Masataka Goto, Dorien Herremans

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music

Enhanced Audio Tagging via Multi- to Single-Modal Teacher-Student Mutual Learning

Yifang Yin, Harsh Shrivastava, Ying Zhang, Zhenguang Liu, Rajiv Ratn Shah, Roger Zimmermann

Learning with Noisy Correspondence for Cross-modal Matching

Zhenyu Huang, Guocheng Niu, Xiao Liu, Wenbiao Ding, Xinyan Xiao, Hua Wu, Xi Peng

Keywords: deep learning, language

Semantically Meaningful Attributes from Co-listen Embeddings for Playlist Exploration and Expansion

Ayush Patwari, Nicholas Kong, Jun Wang, Ullas Gargi, Michele Covell, Aren Jansen

Keywords: Applications, Music recommendation and playlist generation, MIR tasks, Automatic classification, Musical features and properties, Musical affect, emotion, and mood

AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features From Multi-Modal Embeddings

Pratik Mazumder, Pravendra Singh, Kranti Kumar Parida, Vinay P. Namboodiri

Modeling the Compatibility of Stem Tracks to Generate Music Mashups

Jiawen Huang, Ju-Chiang Wang, Jordan B. L. Smith, Xuchen Song, Yuxuan Wang

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords: Cross-Lingual Classification, sentiment classification, unsupervised system, classification

Training Noise-Robust Deep Neural Networks via Meta-Learning

Zhen Wang, Guosheng Hu, Qinghua Hu

Keywords: label noise, noise-robust learning, loss correction approach, noise transition matrix, meta-learning

Improving Polyphonic Music Models with Feature-rich Encoding

Omar A Peracha

Keywords: MIR tasks, Pattern matching and detection, Applications, Music composition, performance, and production

Neural data-to-text generation with LM-based text augmentation

Ernie Chang, Xiaoyu Shen, Dawei Zhu, Vera Demberg, Hui Su

Telling Left From Right: Learning Spatial Correspondence of Sight and Sound

Karren Yang, Bryan Russell, Justin Salamon

Keywords: audio-visual learning in video, self-supervision, video dataset, spatial audio, localization, spatialization, upmixing, source separation

Counterfactual Samples Synthesizing for Robust Visual Question Answering

Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, Yueting Zhuang

Keywords: visual question answering, counterfactual, debias, language bias, data augmentation, visual-and-language

The geometry of integration in text classification RNNs

Kyle Aitken, Vinay Ramasesh, Ankush Garg, Yuan Cao, David Sussillo, Niru Maheswaranathan

Keywords: interpretability, dynamical systems, reverse engineering, document classification, Recurrent neural networks

Connective Fusion: Learning Transformational Joining of Sequences with Application to Melody Creation

Taketo Akama

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music, Applications, Music composition, performance, and production

Multi-level Generative Models for Partial Label Learning with Non-random Label Noise

Yan Yan, Yuhong Guo

Keywords: Machine Learning, Classification, Weakly Supervised Learning

dMelodies: A Music Dataset for Disentanglement Learning

Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music, Representations of music, Evaluation, datasets, and reproducibility, Novel datasets and use cases, Reproducibility, MIR fundamentals and methodology, Symbolic music processing
