Automatic Rank-ordering of Singing Vocals with Twin-neural Network

11/10/2020

Automatic Rank-ordering of Singing Vocals with Twin-neural Network

Chitralekha Gupta, Lin Huang, Haizhou Li

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music, Applications, Music training and education, Evaluation, datasets, and reproducibility, Evaluation methodology, MIR fundamentals and methodology, Music signal processing, MIR tasks, Similarity metrics, Musical features and properties, Timbre, instrumentation, and voice

Abstract Paper Similar Papers

Abstract: When making judgements, humans are known to be better at choosing a preferred option amongst a small number of options, rather than giving an absolute ranking of all the options. This preference-based judgment rank-ordering method is called Best-Worst Scaling (BWS). Inspired by this concept, we propose a preference-based framework to generate a relative rank-ordering of singing vocals, and therefore, singers. We adopt a twin-neural network (Siamese) that learns to choose a preferred candidate in terms of singing quality between two inputs. With a few such pairwise comparisons, this method generates a relative rank-order of a complete list of singers. Additionally, we incorporate a knowledge-based musically-relevant pitch histogram representation, as a conditioning vector, to provide explicit musical information to the network. The experiments show that this method is able to reliably evaluate singing quality and rank-order singing vocals, independent of the song or the singer. The results suggest that the twin-neural network learns the underlying discerning properties relevant to singing quality, instead of being specific to the content of a song or singer.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ISMIR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

11/10/2020

Music Fadernets: Controllable Music Generation Based on High-level Features via Low-level Feature Modelling

HAO HAO TAN, Dorien Herremans

Keywords Paper

Domain knowledge, Machine learning/Artificial intelligence for music, MIR tasks, Music synthesis and transformation

0

0

0

0

4:18

22/11/2021

A cappella: Audio-visual Singing Voice Separation

Juan Felipe Montesinos, Venkatesh Shenoy Kadandale, Gloria Haro

Keywords Paper

audiovisual, audio-visual, source separation, singing, speech, graph, acappella

0

0

0

0

2:51

11/10/2020

Explaining Perceived Emotion Predictions in Music: an Attentive Approach

Sanga Chaki, Pranjal Doshi, Sourangshu Bhattacharya, Prof. Priyadarshi Patnaik

Keywords Paper

Musical features and properties, Musical affect, emotion, and mood, Applications, Music recommendation and playlist generation, Music retrieval systems, Domain knowledge, Machine learning/Artificial intelligence for music, MIR tasks, Automatic classification, Pattern matching and detection

0

0

0

0

3:15

02/11/2020

Model selection for deep audio source separation via clustering analysis

Alisa Liu, Prem Seetharaman, Bryan Pardo

Keywords Paper

0

0

0

0

12:12

11/10/2020

The Multiple Voices of Musical Emotions: Source Separation for Improving Music Emotion Recognition Models and Their Interpretability

Jacopo de Berardinis, Angelo Cangelosi, Eduardo Coutinho

Keywords Paper

Musical features and properties, Musical affect, emotion, and mood, Domain knowledge, Machine learning/Artificial intelligence for music, MIR tasks, Sound source separation

0

0

0

0

4:00

11/10/2020

Modeling Perception with Hierarchical Prediction: Auditory Segmentation with Deep Predictive Coding Locates Candidate Evoked Potentials in EEG

André Ofner, Sebastian Stober

Keywords Paper

Domain knowledge, Machine learning/Artificial intelligence for music, Cognitive MIR, Representations of music, Human-centered MIR, Personalization, MIR fundamentals and methodology, Multimodality, Musical features and properties, Rhythm, beat, tempo

0

0

0

0

3:38

04/07/2020

Information-Theoretic Probing for Linguistic Structure

Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay and
Ran Zmigrod, Adina Williams, Ryan Cotterell

Keywords Paper

Information-Theoretic Probing, NLP tasks, linguistic task, probing

0

0

0

0

10:30

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang and
Yueting Zhuang, zhen he, Zhuoye Ding, Yun Xiao, Bo Long

Keywords Paper

generative model

0

0

0

0

14:54

11/10/2020

Using Weakly Aligned Score–audio Pairs to Train Deep Chroma Models for Cross-modal Music Retrieval

Frank Zalkow, Meinard Müller

Keywords Paper

Applications, Music retrieval systems, Domain knowledge, Machine learning/Artificial intelligence for music, Representations of music, MIR fundamentals and methodology, Music signal processing, MIR tasks, Alignment, synchronization, and score following, Musical features and properties, Harmony, chords, and tonality

0

0

0

0

4:08

11/10/2020

Zero-shot Singing Voice Conversion

Shahan Nercessian

Keywords Paper

MIR tasks, Music synthesis and transformation, Domain knowledge, Machine learning/Artificial intelligence for music, Musical features and properties, Timbre, instrumentation, and voice

0

0

0

0

2:51

25/04/2020

Novice-AI Music Co-Creation via AI-Steering Tools for Deep Generative Models

Ryan Louie, Andy Coenen, Cheng Zhi Huang and
Michael Terry, Carrie Cai

Keywords Paper

human-ai interaction, generative deep neural networks, co-creation

0

0

0

0

13:20

25/07/2020

DVGAN: A minimax game for search result diversification combining explicit and implicit features

Jiongnan Liu, Zhicheng Dou, Xiaojie Wang and
Shuqi Lu, Ji-Rong Wen

Keywords Paper

generative adversarial network, search result diversification

0

0

0

0

12:46

14/06/2020

WaveletStereo: Learning Wavelet Coefficients of Disparity Map in Stereo Matching

Menglong Yang, Fangrui Wu, Wei Li

Keywords Paper

stereo matching, wavelet coefficients, inverse wavelet transform, supervised learning, deep representation, multi-scale features, multi-resolution cost volume, wavelet regression, disparity reconstruction, disparity refinement

0

0

0

0

1:01

06/12/2020

Gibbs Sampling with People

Peter Harrison, Raja Marjieh, Fede G Adolfi and
Pol van Rijn, Manuel Anglada-Tort, Ofer Tchernichovski, Pauline Larrouy-Maestri, Nori Jacoby

Keywords Paper

0

0

0

0

3:20

11/10/2020

Unsupervised Disentanglement of Pitch and Timbre for Isolated Musical Instrument Sounds

Yin-Jyun Luo, Kin Wai Cheuk, Tomoyasu Nakano and
Masataka Goto, Dorien Herremans

Keywords Paper

Domain knowledge, Machine learning/Artificial intelligence for music

0

0

0

0

4:08

19/08/2021

FedSpeech: Federated Text-to-Speech with Continual Learning

Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao

Keywords Paper

Natural Language Processing, Speech, Federated Learning, Privacy Preserving Data Mining

0

0

0

0

6:06

18/07/2021

Generalization Guarantees for Neural Architecture Search with Train-Validation Split

Samet Oymak, Mingchen Li, Mahdi Soltanolkotabi

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:16

16/11/2020

Towards More Accurate Uncertainty Estimation In Text Classification

Jianfeng He, Xuchao Zhang, Shuo Lei and
Zhiqian Chen, Fanglan Chen, Abdulaziz Alhamadani, Bei Xiao, ChangTien Lu

Keywords Paper

uncertainty classified, rectification, text classification, mix-up

0

0

0

0

11:45

02/02/2021

Matching on Sets: Conquer Occluded Person Re-identification Without Alignment

Mengxi Jia, Xinhua Cheng, Yunpeng Zhai and
Shijian Lu, Siwei Ma, Yonghong Tian, Jian Zhang

Keywords Paper

0

0

0

0

15:02

11/10/2020

Semantically Meaningful Attributes from Co-listen Embeddings for Playlist Exploration and Expansion

Ayush Patwari, Nicholas Kong, Jun Wang and
Ullas Gargi, Michele Covell, Aren Jansen

Keywords Paper

Applications, Music recommendation and playlist generation, MIR tasks, Automatic classification, Musical features and properties, Musical affect, emotion, and mood

0

0

0

0

4:00

11/10/2020

Score-informed Source Separation of Choral Music

Matan Gover, Philippe Depalle

Keywords Paper

MIR tasks, Sound source separation, Domain knowledge, Machine learning/Artificial intelligence for music, MIR fundamentals and methodology, Music signal processing

0

0

0

0

4:10

16/11/2020

The World is Not Binary: Learning to Rank with Grayscale Data for Dialogue Response Selection

Zibo Lin, Deng Cai, Yan Wang and
Xiaojiang Liu, Haitao Zheng, Shuming Shi

Keywords Paper

response selection, retrieval-based systems, learning-to-rank problem, learning-to-rank

0

0

0

0

12:03

19/08/2021

Multi-Scale Selective Feedback Network with Dual Loss for Real Image Denoising

Xiaowan Hu, Yuanhao Cai, Zhihong Liu and
Haoqian Wang, Yulun Zhang

Keywords Paper

Computer Vision, Computational Photography, Photometry, Shape from X, Deep Learning

0

0

0

0

9:52

02/02/2021

Classification with Strategically Withheld Data

Anilesh K. Krishnaswamy, Haoming Li, David Rein and
Hanrui Zhang, Vincent Conitzer

Keywords Paper

0

0

0

0

17:15

02/02/2021

Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis

Sang-Hoon Lee, Hyun-Wook Yoon, Hyeong-Rae Noh and
Ji-Hoon Kim, Seong-Whan Lee

Keywords Paper

0

0

0

0

14:19

18/07/2021

Learning de-identified representations of prosody from raw audio

Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

Keywords Paper

Applications, Audio and Speech Processing

0

0

0

0

4:37

11/10/2020

A Simple Method for User-driven Music Thumbnailing

Arianne N. van Nieuwenhuijsen, John Ashley Burgoyne, Frans Wiering, Mick Sneekes

Keywords Paper

MIR tasks, Music summarization, Applications, Music retrieval systems, Human-centered MIR, User behavior analysis and mining, user modeling

0

0

0

0

3:39

02/02/2021

Modeling the Compatibility of Stem Tracks to Generate Music Mashups

Jiawen Huang, Ju-Chiang Wang, Jordan B. L. Smith and
Xuchen Song, Yuxuan Wang

Keywords Paper

0

0

0

0

19:31

06/12/2021

Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems

Menoua Keshishian, Samuel Norman-Haignere, Nima Mesgarani

Keywords Paper

deep learning, machine learning

0

0

0

0

10:28

04/07/2020

Relabel the Noise: Joint Extraction of Entities and Relations via Cooperative Multiagents

Daoyuan Chen, Yaliang Li, Kai Lei, Ying Shen

Keywords Paper

entity extraction, re-labeling instances, extraction tasks, re-labeling instance

0

0

0

0

11:24

11/10/2020

Multiple F0 Estimation in Vocal Ensembles Using Convolutional Neural Networks

Helena Cuesta, Brian McFee, Emilia Gomez

Keywords Paper

MIR tasks, Music transcription and annotation, MIR fundamentals and methodology, Music signal processing, Musical features and properties, Melody and motives

0

0

0

0

4:07

26/04/2020

Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification

Yixiao Ge, Dapeng Chen, Hongsheng Li

Keywords Paper

Label Refinery, Unsupervised Domain Adaptation, Person Re-identification

0

0

0

0

5:03

05/01/2021

Boosting Monocular Depth With Panoptic Segmentation Maps

Faraz Saeedan, Stefan Roth

Keywords Paper

0

0

0

0

4:59

12/07/2020

Improving generalization by controlling label-noise information in neural network weights

Hrayr Harutyunyan, Kyle Reing, Greg Ver Steeg, Aram Galstyan

Keywords Paper

Supervised Learning

0

0

0

0

14:01

14/09/2020

Learning Gradient Boosted Multi-label Classification Rules

Michael Rapp, Eneldo Loza Mencía, Johannes Fürnkranz and
Vu-Linh Nguyen, Eyke Hüllermeier

Keywords Paper

multi-label classification, gradient boosting, rule learning

0

0

0

0

15:45

11/10/2020

"Butter Lyrics Over Hominy Grit": Comparing Audio and Psychology-based Text Features in MIR Tasks

Jaehun Kim, Andrew M. Demetriou, Sandy Manolios and
M. Stella Tavella, Cynthia C. S. Liem

Keywords Paper

MIR fundamentals and methodology, Lyrics and other textual data, web mining, and natural language , Applications, Music recommendation and playlist generation, Domain knowledge, Machine learning/Artificial intelligence for music, Evaluation, datasets, and reproducibility, MIR tasks, Automatic classification

0

0

0

0

3:55

11/10/2020

Joyful for You and Tender for Us: the Influence of Individual Characteristics and Language on Emotion Labeling and Classification

Juan S. Gómez-Cañón, Estefania Cano, Perfecto Herrera, Emilia Gomez

Keywords Paper

Musical features and properties, Musical affect, emotion, and mood, Domain knowledge, Cognitive MIR, Evaluation, datasets, and reproducibility, Annotation protocols, Evaluation methodology, Human-centered MIR, User-centered evaluation

0

0

0

0

3:38

14/06/2020

Deep Active Learning for Biased Datasets via Fisher Kernel Self-Supervision

Denis Gudovskiy, Alec Hodgkinson, Takuya Yamaguchi, Sotaro Tsukizawa

Keywords Paper

active learning, data bias, class imbalance, self-supervised learning, unsupervised learning, fisher kernel, fisher vectors, influence functions, density matching, image recognition

0

0

0

0

1:01

06/12/2020

Part-dependent Label Noise: Towards Instance-dependent Label Noise

Xiaobo Xia, Tongliang Liu, Bo Han and
Nannan Wang, Mingming Gong, Haifeng Liu, Gang Niu, Dacheng Tao, Masashi Sugiyama

Keywords Paper

0

0

0

0

3:00

06/12/2021

Towards Biologically Plausible Convolutional Networks

Roman Pogodin, Yash Mehta, Timothy Lillicrap, Peter E Latham

Keywords Paper

deep learning

0

0

0

0

5:15